Publications

12 / 2,796 publications found.


  •  Le Roux, J., Wisdom, S., Erdogan, H., Hershey, J., "SDR -- Half-Baked or Well Done?", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP.2019.8683855, May 2019.
  •  Erdogan, H., Hershey, J., Watanabe, S., Le Roux, J., "Deep recurrent networks for separation and recognition of single-channel speech in non-stationary background audio", Springer, ISBN: 978-3-319-64680-0, July 2018.
  •  Xiao, X., Watanabe, S., Erdogan, H., Mandel, M., Lu, L., Hershey, J., Seltzer, M., Chen, G., Zhang, Y., Yu, D., "Discriminative beamforming with phase aware neural networks for speech enhancement and recognition", New Era for Robust Speech Recognition: Exploiting Deep Learning, July 9, 2018.
  •  Meng, Z., Watanabe, S., Hershey, J.R., Erdogan, H., "Deep Long Short-Term Memory Adaptive Beamforming Networks for Multichannel Robust Speech Recognition", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 2017.
  •  Le Roux, J., Vincent, E., Erdogan, H., "Learning-Based Approaches to Speech Enhancement and Separation", Tech. Rep. TR2016-113, Interspeech Tutorials, September 2016.
  •  Erdogan, H., Hershey, J.R., Watanabe, S., Mandel, M., Le Roux, J., "Improved MVDR beamforming using single-channel mask prediction networks", Interspeech, DOI: 10.21437/Interspeech.2016-552, September 2016, pp. 1981-1985.
  •  Xiao, X., Watanabe, S., Erdogan, H., Lu, L., Hershey, J., Seltzer, M., Chen, G., Zhang, Y., Mandel, M., Yu, D., "Deep Beamforming Networks for Multi-Channel Speech Recognition", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP.2016.7472778, March 2016, pp. 5745-5749.
    @inproceedings{Xiao2016mar,
      author = {Xiao, X. and Watanabe, S. and Erdogan, H. and Lu, L. and Hershey, J. and Seltzer, M. and Chen, G. and Zhang, Y. and Mandel, M. and Yu, D.},
      title = {Deep Beamforming Networks for Multi-Channel Speech Recognition},
      booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
      year = 2016,
      pages = {5745--5749},
      month = mar,
      doi = {10.1109/ICASSP.2016.7472778},
      url = {https://www.merl.com/publications/TR2016-002}
    }
  •  Hori, T., Chen, Z., Erdogan, H., Hershey, J.R., Le Roux, J., Mitra, V., Watanabe, S., "The MERL/SRI System for the 3rd CHiME Challenge Using Beamforming, Robust Feature Extraction, and Advanced Speech Recognition", IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), DOI: 10.1109/ASRU.2015.7404833, December 2015, pp. 475-481.
    @inproceedings{Hori2015dec2,
      author = {Hori, T. and Chen, Z. and Erdogan, H. and Hershey, J.R. and {Le Roux}, J. and Mitra, V. and Watanabe, S.},
      title = {The MERL/SRI System for the 3rd CHiME Challenge Using Beamforming, Robust Feature Extraction, and Advanced Speech Recognition},
      booktitle = {IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)},
      year = 2015,
      pages = {475--481},
      month = dec,
      publisher = {IEEE},
      doi = {10.1109/ASRU.2015.7404833},
      url = {https://www.merl.com/publications/TR2015-135}
    }
  •  Chen, Z., Watanabe, S., Erdogan, H., Hershey, J.R., "Speech Enhancement and Recognition Using Multi-Task Learning of Long Short-Term Memory Recurrent Neural Networks", Interspeech, ISBN: 978-1-5108-1790-6, September 2015, vol. 1 of 5, p. 1278.
    @inproceedings{Chen2015sep,
      author = {Chen, Z. and Watanabe, S. and Erdogan, H. and Hershey, J.R.},
      title = {Speech Enhancement and Recognition Using Multi-Task Learning of Long Short-Term Memory Recurrent Neural Networks},
      booktitle = {Interspeech},
      year = 2015,
      volume = {1 of 5},
      pages = 1278,
      month = sep,
      isbn = {978-1-5108-1790-6},
      url = {https://www.merl.com/publications/TR2015-100}
    }
  •  Weninger, F.J., Erdogan, H., Watanabe, S., Vincent, E., Le Roux, J., Hershey, J.R., Schuller, B.W., "Speech Enhancement with LSTM Recurrent Neural Networks and Its Application to Noise-Robust ASR", Latent Variable Analysis and Signal Separation Conference (LVA), DOI: 10.1007/978-3-319-22482-4_11, ISBN: 978-3-319-22482-4, August 2015, vol. 9237, pp. 91-99.
    @inproceedings{Weninger2015aug,
      author = {Weninger, F.J. and Erdogan, H. and Watanabe, S. and Vincent, E. and {Le Roux}, J. and Hershey, J.R. and Schuller, B.W.},
      title = {Speech Enhancement with LSTM Recurrent Neural Networks and Its Application to Noise-Robust ASR},
      booktitle = {Latent Variable Analysis and Signal Separation Conference (LVA)},
      year = 2015,
      volume = 9237,
      pages = {91--99},
      month = aug,
      doi = {10.1007/978-3-319-22482-4_11},
      isbn = {978-3-319-22482-4},
      url = {https://www.merl.com/publications/TR2015-094}
    }
  •  Erdogan, H., Hershey, J.R., Watanabe, S., Le Roux, J., "Phase-Sensitive and Recognition-Boosted Speech Separation Using Deep Recurrent Neural Networks", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP.2015.7178061, April 2015, pp. 708-712.
    @inproceedings{Erdogan2015apr,
      author = {Erdogan, H. and Hershey, J.R. and Watanabe, S. and {Le Roux}, J.},
      title = {Phase-Sensitive and Recognition-Boosted Speech Separation Using Deep Recurrent Neural Networks},
      booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
      year = 2015,
      pages = {708--712},
      month = apr,
      publisher = {IEEE},
      doi = {10.1109/ICASSP.2015.7178061},
      url = {https://www.merl.com/publications/TR2015-031}
    }
  •  Topkaya, I.S., Erdogan, H., Porikli, F., "Detecting and Tracking Unknown Number of Objects with Dirichlet Process Mixture Models and Markov Random Fields", International Symposium on Visual Computing (ISVC), July 2013.
    @inproceedings{Topkaya2013jul,
      author = {Topkaya, I.S. and Erdogan, H. and Porikli, F.},
      title = {Detecting and Tracking Unknown Number of Objects with Dirichlet Process Mixture Models and Markov Random Fields},
      booktitle = {International Symposium on Visual Computing (ISVC)},
      year = 2013,
      month = jul,
      url = {https://www.merl.com/publications/TR2013-078}
    }