TR2013-097

Hierarchical and Coupled Non-negative Dynamical Systems with Application to Audio Modeling


    •  Simsekli, U., Le Roux, J., Hershey, J.R., "Hierarchical and Coupled Non-negative Dynamical Systems with Application to Audio Modeling", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), DOI: 10.1109/WASPAA.2013.6701891, October 2013, pp. 1-4.
      @inproceedings{Simsekli2013oct,
        author = {Simsekli, U. and {Le Roux}, J. and Hershey, J.R.},
        title = {Hierarchical and Coupled Non-negative Dynamical Systems with Application to Audio Modeling},
        booktitle = {IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
        year = 2013,
        pages = {1--4},
        month = oct,
        doi = {10.1109/WASPAA.2013.6701891},
        issn = {1931-1168},
        url = {https://www.merl.com/publications/TR2013-097}
      }
Abstract:

Many kinds of non-negative data, such as power spectra and count data, have been modeled using non-negative matrix factorization. Even though this modeling paradigm has yielded successful applications, it falls short when the data have certain hierarchical and temporal structure. In this study, we propose a novel dynamical system model that can handle these kinds of complex structures that often arise in non-negative data. We show that our model can be extended to handle heterogeneous data for data-driven regularization. We present convergence-guaranteed update rules for each latent factor. In order to assess the performance, we evaluate our model on the transcription of classical piano pieces, and show that it outperforms related models. We also illustrate that the performance can be further improved by making use of symbolic data.
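
For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of plain non-negative matrix factorization, not the hierarchical or coupled dynamical model proposed in the paper. It assumes the standard multiplicative updates for the generalized Kullback-Leibler divergence; the function names `nmf_kl` and `kl_divergence`, the random initialization, and the small constant `eps` are illustrative choices, not taken from the paper.

```python
import numpy as np


def kl_divergence(V, V_hat, eps=1e-12):
    """Generalized KL divergence between non-negative arrays V and V_hat."""
    return float(np.sum(V * np.log((V + eps) / (V_hat + eps)) - V + V_hat))


def nmf_kl(V, rank, n_iter=200, seed=0, eps=1e-12):
    """Approximate V (features x frames) as W @ H with non-negative W and H.

    Uses standard multiplicative updates for the generalized KL divergence.
    This is plain NMF only: it has no temporal or hierarchical structure.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    # Strictly positive random initialization (an illustrative assumption).
    W = rng.random((n_features, rank)) + eps
    H = rng.random((rank, n_frames)) + eps
    losses = []
    for _ in range(n_iter):
        # Update the dictionary W with H held fixed.
        R = V / (W @ H + eps)
        W *= (R @ H.T) / (H.sum(axis=1) + eps)
        # Update the activations H with W held fixed.
        R = V / (W @ H + eps)
        H *= (W.T @ R) / (W.sum(axis=0)[:, None] + eps)
        losses.append(kl_divergence(V, W @ H))
    return W, H, losses


if __name__ == "__main__":
    # Synthetic non-negative data standing in for a power spectrogram.
    rng = np.random.default_rng(1)
    V = rng.random((20, 5)) @ rng.random((5, 30))
    W, H, losses = nmf_kl(V, rank=5, n_iter=100)
    print("first loss:", losses[0], "last loss:", losses[-1])
```

In this sketch each column of `H` is fitted independently of its neighbours, which is the limitation the abstract points to when the data have temporal structure.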