Mitsubishi Electric Research Laboratories

Regularized Non-Negative Matrix Factorization With Temporal Dependencies for Speech Denoising

Citation:   Wilson, K. W.; Raj, B.; Smaragdis, P., "Regularized Non-negative Matrix Factorization with Temporal Dependencies for Speech Denoising", Interspeech , September 2008 (Single- and Multichannel Speech Enhancement II)
MERL Report:  TR2008-075
MERL Contact:   Kevin W. Wilson


A toy example showing the advantage of regularizing across frames. Each panel is a spectrogram, where the horizontal axis represents time and the vertical axis represents frequency.

We present a technique for denoising speech using temporally regularized nonnegative matrix factorization (NMF). In previous work [1], we used a regularized NMF update to impose structure within each audio frame. In this paper, we add frame-to-frame regularization across time and show that this additional regularization can also improve our speech denoising results. We evaluate our algorithm on a range of nonstationary noise types and outperform a state-of-the-art Wiener filter implementation.

 Read the full technical report (PDF: 101 kB)