TR2006-121

A Probabilistic Latent Variable Model for Acoustic Modeling


    •  Smaragdis, P.; Raj, B.; Shashanka, M., "A Probabilistic Latent Variable Model for Acoustic Modeling", Advances in Neural Information Processing Systems (NIPS), December 2006.
      BibTeX Download PDF
      • @inproceedings{Smaragdis2006dec,
      • author = {Smaragdis, P. and Raj, B. and Shashanka, M.},
      • title = {A Probabilistic Latent Variable Model for Acoustic Modeling},
      • booktitle = {Advances in Neural Information Processing Systems (NIPS)},
      • year = 2006,
      • month = dec,
      • url = {http://www.merl.com/publications/TR2006-121}
      • }
  • Research Areas:

    Multimedia, Speech & Audio


In this paper we describe a model developed for the analysis of acoustic spectra. Unlike decompositions techniques that can result in difficult to interpret results this model explicitly models spectra as distributions and extracts sets of additive and semantically useful components that facilitate a variety of applications ranging from source separation, denoising, music transcription and sound recognition. This model is probabilistic in nature and is easily extended to produce sparse codes, and discover transform invariant components which can be optimized for particular applications.