TR2003-101

Unsupervised Discovery of Multilevel Statistical Video Structures Using Hierarchical Hidden Markov Models


    •  Xie, L.; Chang, S.-F.; Divakaran, A.; Sun, H., "Unsupervised Discovery of Multilevel Statistical Video Structures Using Hierarchical Hidden Markov Models", IEEE International Conference on Multimedia and Expo (ICME), July 2003, vol. 3, pp. 29-32.
      BibTeX Download PDF
      • @inproceedings{Xie2003jul,
      • author = {Xie, L. and Chang, S.-F. and Divakaran, A. and Sun, H.},
      • title = {Unsupervised Discovery of Multilevel Statistical Video Structures Using Hierarchical Hidden Markov Models},
      • booktitle = {IEEE International Conference on Multimedia and Expo (ICME)},
      • year = 2003,
      • volume = 3,
      • pages = {29--32},
      • month = jul,
      • url = {http://www.merl.com/publications/TR2003-101}
      • }
  • MERL Contact:
  • Research Areas:

    Digital Video, Multimedia


Structure elements in a time sequence (e.g. video) are repetitive segments with consistent deterministic or stochastic characteristics. While most existing work in detecting structurs follow a supervised paradigm, we propose a fully unsupervised statistical solution in this paper. We present a unified approach to structure discovery from long video sequences as simultaneously finding the statistical descriptions of structure and locating segments that matches the descriptions. We model the multilevel statistical structure as hierarchical hidden Markov models, and present efficient algorithms for learning both the parameters and the model structure. When tested on a specific domain, soccer video, the unsupervised learning scheme achieves very promising results: it automatically discovers the statistical descriptions of high-level structures, and at the same time achieves even slightly better accuracy in detecting discovered structures in unlabelled videos than a supervised approach designed with domain knowledge and trained with comparable hidden Markov models.