TR2008-075

Regularized Non-Negative Matrix Factorization With Temporal Dependencies for Speech Denoising


    •  Wilson, K. W.; Raj, B.; Smaragdis, P., "Regularized Non-negative Matrix Factorization with Temporal Dependencies for Speech Denoising", Interspeech, September 2008.
      BibTeX Download PDF
      • @inproceedings{Wilson2008sep,
      • author = {Wilson, K. W. and Raj, B. and Smaragdis, P.},
      • title = {Regularized Non-negative Matrix Factorization with Temporal Dependencies for Speech Denoising},
      • booktitle = {Interspeech},
      • year = 2008,
      • month = sep,
      • url = {http://www.merl.com/publications/TR2008-075}
      • }
  • Research Areas:

    Multimedia, Speech & Audio


TR Image
A toy example showing the advantage of regularizing across frames. Each panel is a spectrogram, where the horizontal axis represents time and the vertical axis represents frequency.

We present a technique for denoising speech using temporally regularized nonnegative matrix factorization (NMF). In previous work [1], we used a regularized NMF update to impose structure within each audio frame. In this paper, we add frame-to-frame regularization across time and show that this additional regularization can also improve our speech denoising results. We evaluate our algorithm on a range of nonstationary noise types and outperform a state-of-the-art Wiener filter implementation.