Mitsubishi Electric Research Laboratories

Learning Normal Activity and Detecting Anomalies

At present, you attend to the computer. In the future, the computer should attend to you. This computer vision system watches you and learns your normal work habits and basic activities. It can summarize your day, adjust your environment for different of activity, or signal an alarm when something unusual or disruptive happens.

Background & Objective:  We are interested in the use of computer vision to "keep an eye on things" while you're busy or away. This application, a personal security device, might be used to keep an eye on a child at play, a hospital patient, or an elderly parent.

Technical Discussion:  The basic approach is to learn a probabilistic model of the scene dynamics. The demonstration system extracts a motion-based feature vector from each video frame, then learns a summary of signal dynamics in the form of a hidden Markov model. An entropy-minimization algorithm estimates both model structure and parameters, yielding a small, sparse, accurate model that generalizes very well to new video. Due to its sparsity, the states in the model are highly correlated with meaningful partitions of the video, e.g., activities. The resulting model reads like a flowchart of a normal day's work activities. We can use the model to detect when you are in particular states, e.g., working at the whiteboard, and have the computer adjust your environment appropriately, e.g., brightening the lights. The system is also very good at detecting anomalous activity-the laboratory prototype proved quite adept at detecting coffee buzz an hour after several espressos.

Publications:
Brand, M.E., "Structure Learning in Conditional Probability Models via an Eutropic Prior and Parameter Extinction", Neural Computation Journal, Vol. 11, No. 5, pp. 1155-1182, July 1999 (The MIT Press, TR1998-018)

Brand, M.E., "Pattern Discovery via Entropy Minimization", Uncertainty 99: International Workshop on Artificial Intelligence and Statistics, January 1999 (AISTATS 1999, TR1998-021)

Technical Reports:
TR1997-025 Learning concise models of human activity from ambient video via a structure-inducing M-step estimator

Technology Areas:
Computer Vision
Artificial Intelligence

Modification Date:  July 9, 2001