TR2008-014

Ultrasonic Doppler Sensor For Speaker Recognition

- Kalgaonkar, K., Raj, B., "Ultrasonic Doppler Sensor for Speaker Recognition", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 2008, pp. 4865-4868.
  BibTeX TR2008-014 PDF
  - @inproceedings{Kalgaonkar2008mar,
  - author = {Kalgaonkar, K. and Raj, B.},
  - title = {{Ultrasonic Doppler Sensor for Speaker Recognition}},
  - booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
  - year = 2008,
  - pages = {4865--4868},
  - month = mar,
  - issn = {1520-6149},
  - url = {https://www.merl.com/publications/TR2008-014}
  - }
Research Areas:

Artificial Intelligence, Speech & Audio

Abstract:

In this paper we present a novel use of an acoustic Doppler sonar for multi-modal speaker identification. An ultrasonic emitter directs a 40kHz tone toward the speaker. Reflections from the speaker\'s face are recorded as the speaker talks. The frequency of the tone is modified by the velocity of the facial structures it is reflected by. The received ultrasonic signal thus contains an entire spectrum of frequencies representing the set of all velocities of facial components. The pattern of frequencies in the reflected signal is observed to be typical of the speaker. The captured ultrasonic signal is synchronously analyzed with the corresponding voice signal to extract specific characteristics that can be used to identify the speaker. Experiments show that the information this can result in significant improvements in speaker identification accuracy both under clean conditions and in noise.

Related News & Events

NEWS ICASSP 2008: 4 publications by MERL researchers and others
Date: March 31, 2008
Where: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Brief
- The papers "Joint Tracking and Video Registration by Factorial Hidden Markov Models" by Mei, X. and Porikli, F., "Speech Denoising Using Nonnegative Matrix Factorization with Priors" by Wilson, K.W., Raj, B., Smaragdis, P. and Divakaran, A., "Ultrasonic Doppler Sensor for Speaker Recognition" by Kalgaonkar, K. and Raj, B. and "Sparse and Shift-Invariant Feature Extraction from Non-Negative Data" by Smaragdis, P., Raj, B. and Shashanka, M. were presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP).

Research Areas:

Abstract: