TR2004-145

Towards Context-Based Visual Feedback Recognition for Embodied Agents


    •  Louis-Philippe Morency, Candace L. Sidner, Trevor Darrell, "Towards Context-Based Visual Feedback Recognition for Embodied Agents", Tech. Rep. TR2004-145, Mitsubishi Electric Research Laboratories, Cambridge, MA, April 2005.
      BibTeX:
      @techreport{MERL_TR2004-145,
        author = {Louis-Philippe Morency and Candace L. Sidner and Trevor Darrell},
        title = {Towards Context-Based Visual Feedback Recognition for Embodied Agents},
        institution = {MERL - Mitsubishi Electric Research Laboratories},
        address = {Cambridge, MA 02139},
        number = {TR2004-145},
        month = apr,
        year = 2005,
        url = {https://www.merl.com/publications/TR2004-145/}
      }
Abstract:

Head pose and gesture offer several key conversational grounding cues and are used extensively in face-to-face interaction among people. We investigate how contextual information can improve visual recognition of feedback gestures during interactions with embodied conversational agents. We present a visual recognition model that integrates cues from the spoken dialogue of an embodied agent with direct observation of a user's head pose. In preliminary experiments using a discriminative framework, contextual information improved the performance of head nod detection.