Chiori Hori

Chiori Hori
  • Biography

    Chiori has been a member of MERL's research team since 2015. Her work is focused on spoken dialog and audio visual scene-aware dialog technologies toward human-robot communications. She's on the editorial board of "Computer Speech and Language" and is a technical committee member of "Speech and Language Processing Group" of IEEE Signal Processing Society. Prior to joining MERL, Chiori spent 8 years at Japan's National Institute of Information and Communication Technology (NICT), where she held the position of Research Manager of the Spoken Language Communication Laboratory. She also spent time researching at Carnegie Mellon and the NTT Communication Science Laboratories, prior to NICT.

  • Recent News & Events


    See All News & Events for Chiori
  • Research Highlights

  • Internships with Chiori

    • SA1358: Multimodal AI

      MERL is looking for an intern to work on fundamental research in the area of audiovisual semantic understanding for scene-aware dialog technologies by combining end-to-end dialog and video scene understanding technologies. The intern will collaborate with MERL researchers to derive and implement new models, conduct experiments, and prepare results for high impact publication. The ideal candidate would be a senior Ph.D. student with experience in one or more of video captioning/description, end-to-end conversation modeling and natural language processing including practical machine learning algorithms with related programming skills. The duration of the internship is expected to be 3-6 months.

    See All Internships at MERL
  • MERL Publications

    •  Cherian, A., Wang, J., Hori, C., Marks, T., "Spatio-Temporal Ranked-Attention Networks for Video Captioning", IEEE Winter Conference on Applications of Computer Vision (WACV), February 2020.
      BibTeX Download PDFAbout TR2020-016
      • @inproceedings{Cherian2020feb,
      • author = {Cherian, Anoop and Wang, Jue and Hori, Chiori and Marks, Tim},
      • title = {Spatio-Temporal Ranked-Attention Networks for Video Captioning},
      • booktitle = {IEEE Winter Conference on Applications of Computer Vision (WACV)},
      • year = 2020,
      • month = feb,
      • url = {https://www.merl.com/publications/TR2020-016}
      • }
    •  Hori, C., Cherian, A., Marks, T., Hori, T., "Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog", Interspeech, September 2019, pp. 1886-1890.
      BibTeX Download PDFAbout TR2019-097
      • @inproceedings{Hori2019sep,
      • author = {Hori, Chiori and Cherian, Anoop and Marks, Tim and Hori, Takaaki},
      • title = {Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog},
      • booktitle = {Interspeech},
      • year = 2019,
      • pages = {1886--1890},
      • month = sep,
      • publisher = {ISCA},
      • url = {https://www.merl.com/publications/TR2019-097}
      • }
    •  Alamri, H., Cartillier, V., Das, A., Wang, J., Lee, S., Anderson, P., Essa, I., Parikh, D., Batra, D., Cherian, A., Marks, T.K., Hori, C., "Audio-Visual Scene-Aware Dialog", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
      BibTeX Download PDFAbout TR2019-048
      • @inproceedings{Alamri2019jun,
      • author = {Alamri, Huda and Cartillier, Vincent and Das, Abhishek and Wang, Jue and Lee, Stefan and Anderson, Peter and Essa, Irfan and Parikh, Devi and Batra, Dhruv and Cherian, Anoop and Marks, Tim K. and Hori, Chiori},
      • title = {Audio-Visual Scene-Aware Dialog},
      • booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
      • year = 2019,
      • month = jun,
      • url = {https://www.merl.com/publications/TR2019-048}
      • }
    •  Hori, C., Alamri, H., Wang, J., Wichern, G., Hori, T., Cherian, A., Marks, T.K., Cartillier, V., Lopes, R., Das, A., Essa, I., Batra, D., Parikh, D., "End-to-End Audio Visual Scene-Aware Dialog Using Multimodal Attention-Based Video Features", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), DOI: 10.1109/ICASSP.2019.8682583, May 2019.
      BibTeX Download PDFAbout TR2019-016
      • @inproceedings{Hori2019may2,
      • author = {Hori, Chiori and Alamri, Huda and Wang, Jue and Wichern, Gordon and Hori, Takaaki and Cherian, Anoop and Marks, Tim K. and Cartillier, Vincent and Lopes, Raphael and Das, Abhishek and Essa, Irfan and Batra, Dhruv and Parikh, Devi},
      • title = {End-to-End Audio Visual Scene-Aware Dialog Using Multimodal Attention-Based Video Features},
      • booktitle = {IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
      • year = 2019,
      • month = may,
      • doi = {10.1109/ICASSP.2019.8682583},
      • url = {https://www.merl.com/publications/TR2019-016}
      • }
    •  d’Haro, L.F., Banchs, R., Hori, C., Li, H., "Automatic Evaluation of End-to-End Dialog Systems with Adequacy-Fluency Metrics", Special issue on DSTC6 in Computer Speech and Langauge, DOI: 10.1016/j.csl.2018.12.004, Vol. 55, pp. 200-215, March 2019.
      BibTeX Download PDFAbout TR2018-195
      • @article{dHaro2019mar,
      • author = {d’Haro, Luis Fernando and Banchs, Rafael and Hori, Chiori and Li, Haizhou},
      • title = {Automatic Evaluation of End-to-End Dialog Systems with Adequacy-Fluency Metrics},
      • journal = {Special issue on DSTC6 in Computer Speech and Langauge},
      • year = 2019,
      • volume = 55,
      • pages = {200--215},
      • month = mar,
      • doi = {10.1016/j.csl.2018.12.004},
      • url = {https://www.merl.com/publications/TR2018-195}
      • }
    See All Publications for Chiori
  • MERL Issued Patents

    • Title: "Method and System for Multi-Modal Fusion Model"
      Inventors: Hori, Chiori; Hori, Takaaki; Hershey, John R.; Marks, Tim
      Patent No.: 10,417,498
      Issue Date: Sep 17, 2019
    • Title: "Method and System for Training Language Models to Reduce Recognition Errors"
      Inventors: Hori, Takaaki; Hori, Chiori; Watanabe, Shinji; Hershey, John R.
      Patent No.: 10,176,799
      Issue Date: Jan 8, 2019
    • Title: "Method and System for Role Dependent Context Sensitive Spoken and Textual Language Understanding with Neural Networks"
      Inventors: Hori, Chiori; Hori, Takaaki; Watanabe, Shinji; Hershey, John R.
      Patent No.: 9,842,106
      Issue Date: Dec 12, 2017
    See All Patents for MERL