TR2005-107

Contextual Recognition of Head Gestures


    •  Louis-Philippe Morency, Candace L. Sidner, Christopher Lee, Trevor Darrell, "Contextual Recognition of Head Gestures", Tech. Rep. TR2005-107, Mitsubishi Electric Research Laboratories, Cambridge, MA, October 2005.
      BibTeX TR2005-107 PDF
      • @techreport{MERL_TR2005-107,
      • author = {Louis-Philippe Morency, Candace L. Sidner, Christopher Lee, Trevor Darrell},
      • title = {Contextual Recognition of Head Gestures},
      • institution = {MERL - Mitsubishi Electric Research Laboratories},
      • address = {Cambridge, MA 02139},
      • number = {TR2005-107},
      • month = oct,
      • year = 2005,
      • url = {https://www.merl.com/publications/TR2005-107/}
      • }
Abstract:

Head pose and gesture offer several key conversational grounding cues and are used extensively in face-to-face interaction among people. We investigate how dialog context from an embodied conversational agent (ECA) can improve visual recognition of user gestures. We present a recogntion framework which (1) extracts contextual features from an ECA\'s dialog manager, (2) computes a predicition of head nod and head shakes, and (3) integrates the contextual predictions with the visual observation of a vision-based head gesture recognizer. We found a subset of lexical, punctuation and timing features that are easily available in most ECA architectures and can be used to learn how to predict user feedback. Using a discriminative approach to contextual prediction and multi-modal integration, we were able to improve the performancae of head gesture detection even when the topic of the test set was significantly different than the training set.