Ultrasonic Sensing for Robust Speech Recognition
| Citation: |
Srinivasan, S.; Raj, B.; Ezzat, T., "Ultrasonic Sensing for Robust Speech Recognition", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), SP-P14.5, March 2010 (ICASSP 2010) |
| MERL Report: | TR2010-015 |
In this paper, we present our work using ultrasonic sensing of speech for digit recognition. First, a set of spectral ultrasonic features are developed and tuned in order to achieve optimal performance for the digit recognition task. Using these features, we demonstrate an overall accuracy of 33.00% on a digit recognition task using HMMs with recordings from 6 speakers. The results indicate that ultrasonic sensing of speech is viable, but that further work is needed to achieve word accuracies that match those of audio. Finally, experimental results are presented which demonstrate that fusing information from ultrasound and audio sources show marginal improvements over audio-only performances.