TR2004-087

Reconstruction of Missing Features for Robust Speech Recognition


    •  Raj, B., Seltzer, M.L., Stern, R.M., "Reconstruction of Missing Features for Robust Speech Recognition", Speech Communication, Vol. 43, No. 4, pp. 275-296, September 2004.
      BibTeX TR2004-087 PDF
      • @article{Raj2004sep,
      • author = {Raj, B. and Seltzer, M.L. and Stern, R.M.},
      • title = {Reconstruction of Missing Features for Robust Speech Recognition},
      • journal = {Speech Communication},
      • year = 2004,
      • volume = 43,
      • number = 4,
      • pages = {275--296},
      • month = sep,
      • url = {https://www.merl.com/publications/TR2004-087}
      • }
  • Research Areas:

    Artificial Intelligence, Speech & Audio

Abstract:

Speech recognition systems perform poorly in the presence of corrupting noise. Missing feature methods attempt to compensate for the noise by removing noise corrupted components of spectrographic representation of noisy speech and performing recognition with the remaining reliable components. Conventional classifier-compensation methods modify the recognition system to work with the incomplete representations so obtained. This contrains them to perform recognition using spectrographic features which are known to be less optimal than cepstra. In this paper we present two missing-features algorithms that reconstruct complete spectrograms from incomplete noisy ones. Cepstral vectors can now be derived from the reconstructed spectrograms for recognition. The first algorithm uses MAP procedures to estimate corrupt components from their correlations with reliable components. The second algorithm clusters spectral vectors of clean speech. Corrupt components of noisy speech are estimated from the distrubition of the cluster that the analysis frame is identified with. Experiments show that, although conventional classifier-compensation methods are superior when recognition is performed with spectrographic features, cepstra derived from the reconstructed spectrograms result in better recogntion performance overall. The proposed methods are also less expensive computationally and do not require modification of the recognizer.

 

  • Related News & Events

    •  NEWS    Speech Communication: 2 publications by MERL researchers and others
      Date: September 12, 2004
      Where: Speech Communication
      Brief
      • The articles "Reconstruction of Missing Features for Robust Speech Recognition" by Raj, B., Seltzer, M.L. and Stern, R.M. and "A Bayesian Framework for Spectrographic Mask Estimation for Missing Feature Speech Recognition" by Seltzer, M.L., Raj, B. and Stern, R.M. were published in Speech Communication.
    •