Audio Separation
We are developing new approaches and algorithms to solve the problem of source separation. Although we focus on the problem of audio mixtures our work is directly applicable to multiple types of signals ranging from audiovisual, to biomedical/chemical, to vibrations and others. Our focus is to enable processing on non-clean data to not be influenced by interfering sources.
Background & Objective: As is often the case audio signals are captured with interference from other sources. Since most time-series algorithms are designed to work on a single source signal, this interference results into suboptimal performance. This problem becomes especially prevalent when dealing with systems that perform speech recognition, or when users need to evaluate noisy data (e.g. a noisy cell phone recording). Our objective is to investigate methods with which we can perform various tasks on multiple sound recordings as well as we can with single sound ones.
Technical Discussion: We have recently presented a sequence of papers which describe some new techniques for source separation based on latent model decompositions. We have fine-tuned our techniques to work on source separation problems and our results are very competitive with the state of the art.
Contact: Bent Schmidt-Nielsen
Publications:
Shashanka, M.V.S.; Raj, B.; Smaragdis, P., "Sparse Overcomplete Decomposition for Single Channel Speaker Separation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ISSN: 1520-6149, Vol. 2, pp. 11-641 - II-644, April 2007 (IEEE Xplore, TR2007-031)
Smaragdis, P., "Convolutive Speech Bases and their Application to Supervised Speech Separation", IEEE Transaction on Audio, Speech and Language Processing, ISSN: 1558-7916, Vol. 15, Issue 1, pp. 1-12, January 2007 (IEEE Xplore, TR2007-002)
Raj, B.; Shashanka, M.V.S.; Smaragdis, P., "Latent Dirichlet Decomposition for Single Channel Speaker Separation", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2006 (ICASSP 2006, TR2006-064)
Raj, B.; Smaragdis, P., "Latent Variable Decomposition of Spectrograms for Single Channel Speaker Separation", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 17-20, October 2005 (IEEE Xplore, TR2005-137)
Technology Area: Sensor and Data Systems
Modification Date: January 16, 2009
