TR2005-159

A Robust Voice Activity Detector Using an Acoustic Doppler Radar


    •  Hu, R., Raj, B., "A Robust Voice Activity Detector Using an Acoustic Doppler Radar", IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), November 2005, pp. 171-176.
      BibTeX:
      @inproceedings{Hu2005nov,
        author = {Hu, R. and Raj, B.},
        title = {A Robust Voice Activity Detector Using an Acoustic Doppler Radar},
        booktitle = {IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)},
        year = 2005,
        pages = {171--176},
        month = nov,
        url = {https://www.merl.com/publications/TR2005-159}
      }
  • Research Area:

    Speech & Audio

Abstract:

This paper describes a robust voice activity detector that uses an acoustic Doppler radar device. The sensor detects the dynamic status of the speaker's mouth. At the frequencies of operation, background noises are largely attenuated, rendering the device robust to external acoustic noise in most operating conditions. Unlike other non-acoustic sensors, the device need not be taped to the speaker, making it more acceptable in most situations. In this paper, various features computed from the sensor output are exploited for voice activity detection. The best set of features is selected based on robustness analysis. A support vector machine classifier is used to make the final speech/non-speech decision. Experimental results show that the proposed Doppler-based voice activity detector improves speech/non-speech classification accuracy over that obtained using speech alone. The most significant improvements occur in low signal-to-noise ratio (SNR) environments.
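
As a rough illustration of the final classification step described in the abstract, the sketch below trains a linear support vector machine to label frames as speech or non-speech. It is not the authors' implementation: the two per-frame features are synthetic, already-standardized stand-ins for features computed from the Doppler sensor output, the linear kernel and the stochastic subgradient training loop are assumptions, and the names `make_frames`, `train_linear_svm`, and `is_speech` are hypothetical.

```python
import random


def make_frames(n_per_class, seed=0):
    """Synthetic 2-D feature vectors (hypothetical stand-ins for
    features computed from the Doppler sensor output)."""
    rng = random.Random(seed)
    frames, labels = [], []
    for _ in range(n_per_class):
        # Assumed speech frames cluster around (+2, +2); label +1.
        frames.append([2.0 + rng.uniform(-0.5, 0.5),
                       2.0 + rng.uniform(-0.5, 0.5)])
        labels.append(+1)
        # Assumed non-speech frames cluster around (-2, -2); label -1.
        frames.append([-2.0 + rng.uniform(-0.5, 0.5),
                       -2.0 + rng.uniform(-0.5, 0.5)])
        labels.append(-1)
    return frames, labels


def train_linear_svm(frames, labels, lam=0.01, eta=0.01, epochs=50, seed=0):
    """Minimize the L2-regularized hinge loss by stochastic subgradient descent.

    A constant 1.0 is appended to each feature vector so that the last
    weight acts as a (regularized) bias term.
    """
    rng = random.Random(seed)
    data = [(x + [1.0], y) for x, y in zip(frames, labels)]
    w = [0.0] * len(data[0][0])
    for _ in range(epochs):
        rng.shuffle(data)
        for x, y in data:
            margin = y * sum(wj * xj for wj, xj in zip(w, x))
            # Subgradient step on the regularizer (weight shrinkage).
            w = [(1.0 - eta * lam) * wj for wj in w]
            # Subgradient step on the hinge loss, active only inside the margin.
            if margin < 1.0:
                w = [wj + eta * y * xj for wj, xj in zip(w, x)]
    return w


def is_speech(w, frame):
    """Return True if the frame falls on the speech side of the hyperplane."""
    score = sum(wj * xj for wj, xj in zip(w, frame + [1.0]))
    return score > 0.0


frames, labels = make_frames(100)
weights = train_linear_svm(frames, labels)
accuracy = sum(is_speech(weights, x) == (y > 0)
               for x, y in zip(frames, labels)) / len(labels)
print(f"training accuracy: {accuracy:.2f}")
```

The two synthetic clusters are deliberately well separated, so the example only shows the mechanics of a frame-level speech/non-speech decision and says nothing about the accuracy reported in the paper.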

 

  • Related News & Events

    •  NEWS    ASRU 2005: 2 publications by MERL researchers and others
      Date: November 28, 2005
      Where: IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
      Brief
      • The papers "A Robust Voice Activity Detector Using an Acoustic Doppler Radar" by Hu, R. and Raj, B. and "Reconstructing Spectral Vectors with Uncertain Spectrographic Masks for Robust Speech Recognition" by Raj, B. and Singh, R. were presented at the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).