Publications

41 / 2,443 publications found.


  •  Ochiai, T., Watanabe, S., Katagiri, S., "Does speech enhancement work with end-to-end ASR objectives?: Experimental analysis of multichannel end-to-end ASR", IEEE International Workshop on Machine Learning for Signal Processing (MLSP), DOI: 10.1109/JSTSP.2017.2764276, Vol. 11, No. 8, pp. 1274-1288, October 2017.
  •  Tachioka, Y., Narita, T., Miura, I., Uramoto, T., Monta, N., Uenohara, S., Furuya, K., Watanabe, S., Le Roux, J., "Coupled initialization of multi-channel non-negative matrix factorization based on spatial and spectral information", Interspeech, August 2017.
  •  Ochiai, T., Watanabe, S., Hori, T., Hershey, J.R., "Multichannel End-to-end Speech Recognition", International Conference on Machine Learning (ICML), August 2017.
  •  Ochiai, T., Watanabe, S., Hori, T., Hershey, J.R., "Multichannel End-to-end Speech Recognition", Tech. Rep. TR2017-035, Mitsubishi Electric Research Laboratories, Cambridge, MA, March 2017.
    @techreport{MERL_TR2017-035,
      author = {Ochiai, T. and Watanabe, S. and Hori, T. and Hershey, J.R.},
      title = {Multichannel End-to-end Speech Recognition},
      institution = {MERL - Mitsubishi Electric Research Laboratories},
      address = {Cambridge, MA 02139},
      number = {TR2017-035},
      month = mar,
      year = 2017,
      url = {http://www.merl.com/publications/TR2017-035/}
    }
  •  Chen, S., Tian, D., Feng, C., Vetro, A., Kovacevic, J., "Contour-Enhanced Resampling of 3D Point Clouds Via Graphs", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 2017.
  •  Watanabe, S., Hori, T., Le Roux, J., Hershey, J.R., "Student-Teacher Network Learning with Enhanced Features", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 2017.
  •  Vincent, E., Watanabe, S., Nugraha, A.A., Barker, J., Marxer, R., "An analysis of environment, microphone and data simulation mismatches in robust speech recognition", Computer Speech & Language, DOI: 10.1016/j.csl.2016.11.005, December 2016.
  •  Delcroix, M., Watanabe, S., "Recent Advances in Distant Speech Recognition", Tech. Rep. TR2016-115, Interspeech Tutorials, September 2016.
  •  Erdogan, H., Hershey, J.R., Watanabe, S., Mandel, M., Le Roux, J., "Improved MVDR beamforming using single-channel mask prediction networks", Interspeech, DOI: 10.21437/Interspeech.2016-552, September 2016, pp. 1981-1985.
  •  Isik, Y., Le Roux, J., Chen, Z., Watanabe, S., Hershey, J.R., "Single-Channel Multi-Speaker Separation using Deep Clustering", Interspeech, DOI: 10.21437/Interspeech.2016-1176, September 2016, pp. 545-549.
  •  Le Roux, J., Vincent, E., Erdogan, H., "Learning-Based Approaches to Speech Enhancement and Separation", Tech. Rep. TR2016-113, Interspeech Tutorials, September 2016.
  •  Kamilov, U., "Parallel Proximal Methods for Total Variation Minimization", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP.2016.7472568, March 2016, pp. 4697-4701.
    @inproceedings{Kamilov2016mar1,
      author = {Kamilov, U.},
      title = {Parallel Proximal Methods for Total Variation Minimization},
      booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
      year = 2016,
      pages = {4697--4701},
      month = mar,
      doi = {10.1109/ICASSP.2016.7472568},
      url = {http://www.merl.com/publications/TR2016-007}
    }
  •  Xiao, X.; Watanabe, S.; Erdogan, H.; Lu, L.; Hershey, J.; Seltzer, M.; Chen, G.; Zhang, Y.; Mandel, M.; Yu, D., "Deep Beamforming Networks for Multi-Channel Speech Recognition", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP.2016.7472778, March 2016, pp. 5745-5749.
    @inproceedings{Xiao2016mar,
      author = {Xiao, X. and Watanabe, S. and Erdogan, H. and Lu, L. and Hershey, J. and Seltzer, M. and Chen, G. and Zhang, Y. and Mandel, M. and Yu, D.},
      title = {Deep Beamforming Networks for Multi-Channel Speech Recognition},
      booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
      year = 2016,
      pages = {5745--5749},
      month = mar,
      doi = {10.1109/ICASSP.2016.7472778},
      url = {http://www.merl.com/publications/TR2016-002}
    }
  •  Barker, J.; Marxer, R.; Vincent, E.; Watanabe, S., "The Third 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines", IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), DOI: 10.1109/ASRU.2015.7404837, December 2015, pp. 504-511.
    @inproceedings{Barker2015dec,
      author = {Barker, J. and Marxer, R. and Vincent, E. and Watanabe, S.},
      title = {The Third 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines},
      booktitle = {IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)},
      year = 2015,
      pages = {504--511},
      month = dec,
      publisher = {IEEE},
      doi = {10.1109/ASRU.2015.7404837},
      url = {http://www.merl.com/publications/TR2015-136}
    }
  •  Hori, T.; Chen, Z.; Erdogan, H.; Hershey, J.R.; Le Roux, J.; Mitra, V.; Watanabe, S., "The MERL/SRI System for the 3rd CHiME Challenge Using Beamforming, Robust Feature Extraction, and Advanced Speech Recognition", IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), DOI: 10.1109/ASRU.2015.7404833, December 2015, pp. 475-481.
    @inproceedings{Hori2015dec2,
      author = {Hori, T. and Chen, Z. and Erdogan, H. and Hershey, J.R. and {Le Roux}, J. and Mitra, V. and Watanabe, S.},
      title = {The MERL/SRI System for the 3rd CHiME Challenge Using Beamforming, Robust Feature Extraction, and Advanced Speech Recognition},
      booktitle = {IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)},
      year = 2015,
      pages = {475--481},
      month = dec,
      publisher = {IEEE},
      doi = {10.1109/ASRU.2015.7404833},
      url = {http://www.merl.com/publications/TR2015-135}
    }
  •  Hsiao, R.; Ma, J.; Hartmann, W.; Karafiat, M.; Grezl, F.; Burget, L.; Szoke, I.; Cernocky, J.; Watanabe, S.; Chen, Z.; Mallidi, S.H.; Hermansky, H.; Tsakalidis, S.; Schwartz, R., "Robust Speech Recognition in Unknown Reverberant and Noisy Conditions", IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), DOI: 10.1109/ASRU.2015.7404841, December 2015, pp. 533-538.
    @inproceedings{Hsiao2015dec,
      author = {Hsiao, R. and Ma, J. and Hartmann, W. and Karafiat, M. and Grezl, F. and Burget, L. and Szoke, I. and Cernocky, J. and Watanabe, S. and Chen, Z. and Mallidi, S.H. and Hermansky, H. and Tsakalidis, S. and Schwartz, R.},
      title = {Robust Speech Recognition in Unknown Reverberant and Noisy Conditions},
      booktitle = {IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)},
      year = 2015,
      pages = {533--538},
      month = dec,
      publisher = {IEEE},
      doi = {10.1109/ASRU.2015.7404841},
      url = {http://www.merl.com/publications/TR2015-138}
    }
  •  Abdelaziz, A.H.; Watanabe, S.; Hershey, J.R.; Vincent, E.; Kolossa, D., "Uncertainty Propagation Through Deep Neural Networks", Interspeech, ISBN: 978-1-5108-1790-6, September 2015, vol. 1 of 5, pp. 3561.
    @inproceedings{Abdelaziz2015sep,
      author = {Abdelaziz, A.H. and Watanabe, S. and Hershey, J.R. and Vincent, E. and Kolossa, D.},
      title = {Uncertainty Propagation Through Deep Neural Networks},
      booktitle = {Interspeech},
      year = 2015,
      volume = {1 of 5},
      pages = 3561,
      month = sep,
      isbn = {978-1-5108-1790-6},
      url = {http://www.merl.com/publications/TR2015-098}
    }
  •  Chen, Z.; Watanabe, S.; Erdogan, H.; Hershey, J.R., "Speech Enhancement and Recognition Using Multi-Task Learning of Long Short-Term Memory Recurrent Neural Networks", Interspeech, ISBN: 978-1-5108-1790-6, September 2015, vol. 1 of 5, pp. 1278.
    @inproceedings{Chen2015sep,
      author = {Chen, Z. and Watanabe, S. and Erdogan, H. and Hershey, J.R.},
      title = {Speech Enhancement and Recognition Using Multi-Task Learning of Long Short-Term Memory Recurrent Neural Networks},
      booktitle = {Interspeech},
      year = 2015,
      volume = {1 of 5},
      pages = 1278,
      month = sep,
      isbn = {978-1-5108-1790-6},
      url = {http://www.merl.com/publications/TR2015-100}
    }
  •  Tachioka, Y.; Watanabe, S., "Uncertainty Training and Decoding Methods of Deep Neural Networks Based on Stochastic Representation of Enhanced Features", Interspeech, ISBN: 978-1-5108-1790-6, September 2015, vol. 1 of 5, pp. 3541.
    @inproceedings{Tachioka2015sep,
      author = {Tachioka, Y. and Watanabe, S.},
      title = {Uncertainty Training and Decoding Methods of Deep Neural Networks Based on Stochastic Representation of Enhanced Features},
      booktitle = {Interspeech},
      year = 2015,
      volume = {1 of 5},
      pages = 3541,
      month = sep,
      isbn = {978-1-5108-1790-6},
      url = {http://www.merl.com/publications/TR2015-099}
    }
  •  Weninger, F.J.; Erdogan, H.; Watanabe, S.; Vincent, E.; Le Roux, J.; Hershey, J.R.; Schuller, B.W., "Speech Enhancement with LSTM Recurrent Neural Networks and Its Application to Noise-Robust ASR", Latent Variable Analysis and Signal Separation Conference (LVA), DOI: 10.1007/978-3-319-22482-4_11, ISBN: 978-3-319-22482-4, August 2015, vol. 9237, pp. 91-99.
    @inproceedings{Weninger2015aug,
      author = {Weninger, F.J. and Erdogan, H. and Watanabe, S. and Vincent, E. and {Le Roux}, J. and Hershey, J.R. and Schuller, B.W.},
      title = {Speech Enhancement with LSTM Recurrent Neural Networks and Its Application to Noise-Robust ASR},
      booktitle = {Latent Variable Analysis and Signal Separation Conference (LVA)},
      year = 2015,
      volume = 9237,
      pages = {91--99},
      month = aug,
      doi = {10.1007/978-3-319-22482-4_11},
      isbn = {978-3-319-22482-4},
      url = {http://www.merl.com/publications/TR2015-094}
    }