Regularized Non-Negative Matrix Factorization With Temporal Dependencies for Speech Denoising
| Citation: | Wilson, K. W.; Raj, B.; Smaragdis, P., "Regularized Non-negative Matrix Factorization with Temporal Dependencies for Speech Denoising", Interspeech , September 2008 (Single- and Multichannel Speech Enhancement II) |
| MERL Report: | TR2008-075 |
| MERL Contact: | Kevin W. Wilson |
A toy example showing the advantage of regularizing across frames. Each panel is a spectrogram, where the horizontal axis represents time and the vertical axis represents frequency.
We present a technique for denoising speech using temporally regularized nonnegative matrix factorization (NMF). In previous work [1], we used a regularized NMF update to impose structure within each audio frame. In this paper, we add frame-to-frame regularization across time and show that this additional regularization can also improve our speech denoising results. We evaluate our algorithm on a range of nonstationary noise types and outperform a state-of-the-art Wiener filter implementation.
|
| Read the full technical report (PDF: 101 kB) |