TR2003-101

Unsupervised Discovery of Multilevel Statistical Video Structures Using Hierarchical Hidden Markov Models


    •  Xie, L., Chang, S.-F., Divakaran, A., Sun, H., "Unsupervised Discovery of Multilevel Statistical Video Structures Using Hierarchical Hidden Markov Models", IEEE International Conference on Multimedia and Expo (ICME), July 2003, vol. 3, pp. 29-32.
      BibTeX TR2003-101 PDF
      • @inproceedings{Xie2003jul,
      • author = {Xie, L. and Chang, S.-F. and Divakaran, A. and Sun, H.},
      • title = {Unsupervised Discovery of Multilevel Statistical Video Structures Using Hierarchical Hidden Markov Models},
      • booktitle = {IEEE International Conference on Multimedia and Expo (ICME)},
      • year = 2003,
      • volume = 3,
      • pages = {29--32},
      • month = jul,
      • url = {https://www.merl.com/publications/TR2003-101}
      • }
  • MERL Contact:
  • Research Area:

    Digital Video

Abstract:

Structure elements in a time sequence (e.g. video) are repetitive segments with consistent deterministic or stochastic characteristics. While most existing work in detecting structurs follow a supervised paradigm, we propose a fully unsupervised statistical solution in this paper. We present a unified approach to structure discovery from long video sequences as simultaneously finding the statistical descriptions of structure and locating segments that matches the descriptions. We model the multilevel statistical structure as hierarchical hidden Markov models, and present efficient algorithms for learning both the parameters and the model structure. When tested on a specific domain, soccer video, the unsupervised learning scheme achieves very promising results: it automatically discovers the statistical descriptions of high-level structures, and at the same time achieves even slightly better accuracy in detecting discovered structures in unlabelled videos than a supervised approach designed with domain knowledge and trained with comparable hidden Markov models.

 

  • Related News & Events

    •  NEWS    ICME 2003: 7 publications by Chia Shen, Anthony Vetro, Ajay Divakaran and Huifang Sun
      Date: July 6, 2003
      Where: IEEE International Conference on Multimedia and Expo (ICME)
      MERL Contacts: Anthony Vetro; Huifang Sun
      Brief
      • The papers "Multi-Camera Calibration, Object Tracking and Query Generation" by Porikli, F.M. and Divakaran, A., "Unsupervised Discovery of Multilevel Statistical Video Structures Using Hierarchical Hidden Markov Models" by Xie, L., Chang, S.-F., Divakaran, A. and Sun, H., "FGS Enhancement Layer Truncation with Minimized Intra-Frame Quality Variation" by Zhou, J., Shao, H.-R., Shen, C. and Sun, M.-T., "Object-Based Coding for Long-Term Archive of Surveillance Video" by Vetro, A., Haga, T., Sumi, K. and Sun, H., "Rate Allocation for FGS-Coded Video Using Composite Rate-Distortion Analysis" by Cheng, H., Zhang, X.M., Shi, Y.Q., Vetro, A. and Sun, H., "Audio Events Detection Based Highlights Extraction from Baseball, Golf and Soccer Games in a Unified Framework" by Xiong, Z., Radhakrishnan, R., Divakaran, A. and Huang, T.S. and "Comparing MFCC and MPEG-7 Audio Features for Feature Extraction, Maximum Likelihood HMM and Entropic Prior HMM for Sports Audio Classification" by Xiong, Z., Radhakrishnan, R., Divakaran, A. and Huang, T.S. were presented at the IEEE International Conference on Multimedia and Expo (ICME).
    •