TR2007-058

Bandwidth Expansion with a Polya Urn Model


    •  Raj, B., Singh, R., Shashanka, M., Smaragdis, P., "Bandwidth Expansion with a Polya URN Model", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), April 2007, vol. 4, pp. IV597-IV600.
      @inproceedings{Raj2007apr,
        author = {Raj, B. and Singh, R. and Shashanka, M. and Smaragdis, P.},
        title = {Bandwidth Expansion with a Polya URN Model},
        booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
        year = 2007,
        volume = 4,
        pages = {IV597--IV600},
        month = apr,
        url = {https://www.merl.com/publications/TR2007-058}
      }
  • Research Area:

    Speech & Audio

Abstract:

We present a new statistical technique for estimating the high-frequency components (4-8 kHz) of speech signals from narrow-band (0-4 kHz) signals. The magnitude spectra of broadband speech are modeled as the outcome of a Polya urn process, which represents each spectrum as the histogram of the outcomes of several draws from a mixture-multinomial distribution over frequency indices. The multinomial distributions that compose this process are learnt from a corpus of broadband (0-8 kHz) speech. To estimate the high-frequency components of narrow-band speech, its spectra are likewise modeled as the outcome of draws from a mixture-multinomial process composed of the learnt multinomials, in which the counts of the higher-frequency indices have been obscured. The obscured high-frequency components are then estimated as the expected number of draws of their indices from the mixture multinomial. Experiments on band-limited signals derived from the WSJ corpus show that the proposed procedure accurately estimates the high-frequency components of these signals.
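
The estimation step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes the multinomials have already been learnt and are strictly positive, treats a single spectral frame, and uses a plain EM loop in which the hidden high-band counts are imputed from the current model at every iteration. The function name `estimate_high_band`, its parameters, and the toy numbers are hypothetical and chosen only for illustration.

```python
def estimate_high_band(low_counts, bases, n_iter=300):
    """Estimate mixture weights and expected high-band counts for one frame.

    low_counts: observed counts for the first len(low_counts) frequency bins.
    bases: list of K multinomials over all F bins (each sums to 1, and is
           assumed strictly positive so no division by zero occurs).
    """
    n_low = len(low_counts)
    n_bins = len(bases[0])
    k = len(bases)
    n_obs = float(sum(low_counts))
    weights = [1.0 / k] * k  # start from uniform mixture weights

    def mixture(w):
        # P(f) = sum_z P(z) P(f|z) for every frequency bin f
        return [sum(w[z] * bases[z][f] for z in range(k)) for f in range(n_bins)]

    def impute(model):
        # Expected total number of draws, given that n_obs of them fell in
        # the observed band, then the expected count for each hidden bin.
        total_draws = n_obs / sum(model[:n_low])
        return [total_draws * p for p in model[n_low:]]

    for _ in range(n_iter):
        model = mixture(weights)
        counts = list(low_counts) + impute(model)  # observed + expected hidden
        # Accumulate counts[f] * P(z|f) over bins, then renormalize.
        new = [
            sum(counts[f] * weights[z] * bases[z][f] / model[f] for f in range(n_bins))
            for z in range(k)
        ]
        norm = sum(new)
        weights = [v / norm for v in new]

    return weights, impute(mixture(weights))


# Toy example: two hypothetical multinomials over six bins, of which the
# first four form the observed narrow band.
bases = [
    [0.4, 0.3, 0.1, 0.1, 0.05, 0.05],
    [0.1, 0.1, 0.2, 0.2, 0.3, 0.1],
]
low_counts = [310, 240, 130, 130]
weights, high_est = estimate_high_band(low_counts, bases)
print(weights, high_est)
```

The toy counts were generated from 1000 draws with mixture weights 0.7 and 0.3, so the loop should settle near those weights. Imputing the hidden counts inside the loop matters: updating the weights from the observed bins alone would favour multinomials that place more of their mass in the low band.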

 

  • Related News & Events

    •  NEWS    ICASSP 2007: 4 publications by Anthony Vetro, Paris Smaragdis and others
      Date: April 15, 2007
      Where: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
      MERL Contact: Anthony Vetro
      Brief
      • The papers "Using Distributed Source Coding to Secure Fingerprint Biometrics" by Draper, S.C., Khisti, A., Martinian, E., Vetro, A. and Yedidia, J.S., "A Framework for Secure Speech Recognition" by Smaragdis, P. and Shashanka, M., "Sparse Overcomplete Decomposition for Single Channel Speaker Separation" by Shashanka, M.V.S., Raj, B. and Smaragdis, P. and "Bandwidth Expansion with a Polya URN Model" by Raj, B., Singh, R., Shashanka, M. and Smaragdis, P. were presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP).