Bret Harsham

Bret Harsham
  • Biography

    Before joining MERL in 2001, Bret worked at Dragon Systems on handheld and automotive speech products. At MERL, he works on research projects in the area of speech and multimodal applications, with a focus on effectiveness and usability. Past research projects have included work on multi-user touch interfaces and the safety & usability of in-car speech applications.

  • Recent News & Events


    See All News & Events for Bret
  • Research Highlights

  • MERL Publications

    •  Hori, T., Wang, W., Koji, Y., Hori, C., Harsham, B.A., Hershey, J., "Adversarial Training and Decoding Strategies for End-to-end Neural Conversation Models", Computer Speech and Language, DOI: 10.1016/j.csl.2018.08.006, Vol. 54, pp. 122-139, December 2018.
      BibTeX TR2018-161 PDF
      • @article{Hori2018dec2,
      • author = {Hori, Takaaki and Wang, Wen and Koji, Yusuke and Hori, Chiori and Harsham, Bret A. and Hershey, John},
      • title = {Adversarial Training and Decoding Strategies for End-to-end Neural Conversation Models},
      • journal = {Computer Speech and Language},
      • year = 2018,
      • volume = 54,
      • pages = {122--139},
      • month = dec,
      • doi = {10.1016/j.csl.2018.08.006},
      • url = {https://www.merl.com/publications/TR2018-161}
      • }
    •  Wang, W., Koji, Y., Harsham, B.A., Hori, T., Hershey, J.R., "Sequence Adversarial Training and Minimum Bayes Risk Decoding for End-to-end Neural Conversation Models", Dialog System Technology Challenges, December 2017.
      BibTeX TR2017-180 PDF
      • @inproceedings{Wang2017dec,
      • author = {Wang, Wen and Koji, Yusuke and Harsham, Bret A. and Hori, Takaaki and Hershey, John R.},
      • title = {Sequence Adversarial Training and Minimum Bayes Risk Decoding for End-to-end Neural Conversation Models},
      • booktitle = {Dialog System Technology Challenges},
      • year = 2017,
      • month = dec,
      • url = {https://www.merl.com/publications/TR2017-180}
      • }
    •  Hori, C., Hori, T., Lee, T.-Y., Zhang, Z., Harsham, B.A., Sumi, K., Marks, T.K., Hershey, J.R., "Attention-Based Multimodal Fusion for Video Description", IEEE International Conference on Computer Vision (ICCV), DOI: 10.1109/ICCV.2017.450, October 2017.
      BibTeX TR2017-156 PDF
      • @inproceedings{Hori2017oct,
      • author = {Hori, Chiori and Hori, Takaaki and Lee, Teng-Yok and Zhang, Ziming and Harsham, Bret A. and Sumi, Kazuhiko and Marks, Tim K. and Hershey, John R.},
      • title = {Attention-Based Multimodal Fusion for Video Description},
      • booktitle = {IEEE International Conference on Computer Vision (ICCV)},
      • year = 2017,
      • month = oct,
      • doi = {10.1109/ICCV.2017.450},
      • url = {https://www.merl.com/publications/TR2017-156}
      • }
    •  Hori, T., Wang, H., Hori, C., Watanabe, S., Harsham, B.A., Le Roux, J., Hershey, J.R., Koji, Y., Jing, Y., Zhu, Z., Aikawa, T., "Dialog State Tracking with Attention-based Sequence-to-sequence Learning", IEEE Workshop on Spoken Language Technology (SLT), DOI: 10.1109/SLT.2016.7846317, December 2016, pp. 552-558.
      BibTeX TR2016-163 PDF
      • @inproceedings{Hori2016dec,
      • author = {Hori, Takaaki and Wang, Hai and Hori, Chiori and Watanabe, Shinji and Harsham, Bret A. and Le Roux, Jonathan and Hershey, John R. and Koji, Yusuke and Jing, Yi and Zhu, Zhaocheng and Aikawa, Takeyuki},
      • title = {Dialog State Tracking with Attention-based Sequence-to-sequence Learning},
      • booktitle = {IEEE Workshop on Spoken Language Technology (SLT)},
      • year = 2016,
      • pages = {552--558},
      • month = dec,
      • doi = {10.1109/SLT.2016.7846317},
      • url = {https://www.merl.com/publications/TR2016-163}
      • }
    •  Hori, C., Watanabe, S., Hori, T., Harsham, B.A., Hershey, J.R., Koji, Y., Fujii, Y., Furumoto, Y., "Driver Confusion Status Detection Using Recurrent Neural Networks", IEEE International Conference on Multimedia and Expo (ICME), DOI: 10.1109/ICME.2016.7552966, July 2016.
      BibTeX TR2016-088 PDF
      • @inproceedings{Hori2016jul,
      • author = {Hori, Chiori and Watanabe, Shinji and Hori, Takaaki and Harsham, Bret A. and Hershey, John R. and Koji, Yusuke and Fujii, Youichi and Furumoto, Yuki},
      • title = {Driver Confusion Status Detection Using Recurrent Neural Networks},
      • booktitle = {IEEE International Conference on Multimedia and Expo (ICME)},
      • year = 2016,
      • month = jul,
      • doi = {10.1109/ICME.2016.7552966},
      • url = {https://www.merl.com/publications/TR2016-088}
      • }
    See All Publications for Bret
  • Videos

  • MERL Issued Patents

    • Title: "Method for using a Multi-Scale Recurrent Neural Network with Pretraining for Spoken Language Understanding Tasks"
      Inventors: Watanabe, Shinji; Luan, Yi; Harsham, Bret A.
      Patent No.: 9,607,616
      Issue Date: Mar 28, 2017
    • Title: "Actions Prediction for Hypothetical Driving Conditions"
      Inventors: Harsham, Bret A.; Hershey, John R.; Le Roux, Jonathan; Nikovski, Daniel N.; Esenther, Alan W.
      Patent No.: 9,434,389
      Issue Date: Sep 6, 2016
    • Title: "Method and System for Autonomously Delivering Information to Drivers"
      Inventors: Nikovski, Daniel N.; Harsham, Bret A.; Hershey, John R.; Brinkman, Dirk
      Patent No.: 9,305,306
      Issue Date: Apr 5, 2016
    • Title: "Determining Word Sequence Constraints for Low Cognitive Speech Recognition"
      Inventors: Harsham, Bret A.; Hershey, John R.
      Patent No.: 9,196,246
      Issue Date: Nov 24, 2015
    • Title: "Method and System for Dynamically Adapting user Interfaces in Vehicle Navigation Systems to Minimize Interaction Complexity"
      Inventors: Nikovski, Daniel N.; Hershey, John R.; Harsham, Bret A.; Le Roux, Jonathan
      Patent No.: 9,170,119
      Issue Date: Oct 27, 2015
    • Title: "System and Method for Recognizing Speech"
      Inventors: Harsham, Bret A.; Hershey, John R.
      Patent No.: 9,159,317
      Issue Date: Oct 13, 2015
    • Title: "Method for Indexing for Retrieving Documents Using Particles"
      Inventors: Ramakrishnan, Bhiksha R.; Gouvea, Evandro B.; Harsham, Bret A.; Schmidt-Nielsen, Bent K.; Weinberg, Garrett L.
      Patent No.: 8,229,921
      Issue Date: Jul 24, 2012
    • Title: "Method for Interacting With Users of Speech Recognition Systems"
      Inventors: Schmidt-Nielsen, Bent K.; Weinberg, Garrett L.; Ramakrishnan, Bhiksha R.; Harsham, Bret A.
      Patent No.: 7,917,368
      Issue Date: Mar 29, 2011
    See All Patents for MERL