TR2004-145

Towards Context-Based Visual Feedback Recognition for Embodied Agents

- Louis-Philippe Morency, Candace L. Sidner, Trevor Darrell, "Towards Context-Based Visual Feedback Recognition for Embodied Agents", Tech. Rep. TR2004-145, Mitsubishi Electric Research Laboratories, Cambridge, MA, April 2005.
  BibTeX TR2004-145 PDF
  - @techreport{MERL_TR2004-145,
  - author = {Louis-Philippe Morency, Candace L. Sidner, Trevor Darrell},
  - title = {Towards Context-Based Visual Feedback Recognition for Embodied Agents},
  - institution = {MERL - Mitsubishi Electric Research Laboratories},
  - address = {Cambridge, MA 02139},
  - number = {TR2004-145},
  - month = apr,
  - year = 2005,
  - url = {https://www.merl.com/publications/TR2004-145/}
  - }

Abstract:

Head pose and gesture offer several key conversational grounding cues and are used extensively in face-to-face interaction among people. We investigate how contextural information can improve visual recognition of feedback gestures during interactions with embodied conversational agents. We present a visual recognition model that integrates cues from the spoken dialogue of an embodied agent with direct observation of a user\'s head pose. In preliminary experiments using a discriminative framework, contextual information improved the performance of head nod detection.