TR2004-007

Video Mining Using Combinations of Unsupervised and Supervised Learning Techniques


    •  Divakaran, A., Miyaraha, K., Peker, K.A., Radhakrishnan, R., Xiong, Z., "Video Mining Using Combinations of Unsupervised and Supervised Learning Techniques", SPIE Conference on Storage and Retrieval for Multimedia Databases, January 2004, vol. 5307, pp. 235-243.
      BibTeX TR2004-007 PDF
      • @inproceedings{Divakaran2004jan,
      • author = {Divakaran, A. and Miyaraha, K. and Peker, K.A. and Radhakrishnan, R. and Xiong, Z.},
      • title = {Video Mining Using Combinations of Unsupervised and Supervised Learning Techniques},
      • booktitle = {SPIE Conference on Storage and Retrieval for Multimedia Databases},
      • year = 2004,
      • volume = 5307,
      • pages = {235--243},
      • month = jan,
      • url = {https://www.merl.com/publications/TR2004-007}
      • }
  • Research Area:

    Digital Video

Abstract:

We discuss the meaning and significance of the video mining problem, and present our work on some aspects of video mining. A simple definition of video mining is unsupervised discovery of patterns in audio-visual content. Such purely unsupervised discovery is readily applicable to video surveillance as well as to consumer video browsing applications. We interpret video mining as content-adaptive or blind content processing, in which the first stage is content characterization and the second stage is event discovery based on the characterization obtained in stage 1. We discuss the target applications and find that using a purely unsupervised approach is too computationally complex and generally unmanageable to be implemented on our product platform. We then describe various combinations of unsupervised and supervised learning techniques that help discover patterns that are useful to the end-user of the application. We target consumer video browsing applications such as commercial message detection, sports highlights extraction etc. We employ both audio and video features. We find that supervised audio classification combined with unsupervised unusual event discovery enables accurate supervised detection of desired events. Our techniques are computationally simple and robust to common variations in production styles etc.

 

  • Related News & Events

    •  NEWS    SPIE Conference on Storage and Retrieval for Multimedia Databases 2004: 2 publications by Ajay Divakaran and others
      Date: January 20, 2004
      Where: SPIE Conference on Storage and Retrieval for Multimedia Databases
      Brief
      • The papers "Audio-Visual Event Detection Based on Mining of Semantic Audio-Visual Labels" by Goh, K.-S., Miyahara, K., Radhakrishan, R., Xiong, Z. and Divakaran, A. and "Video Mining Using Combinations of Unsupervised and Supervised Learning Techniques" by Divakaran, A., Miyaraha, K., Peker, K.A., Radhakrishnan, R. and Xiong, Z. were presented at the SPIE Conference on Storage and Retrieval for Multimedia Databases.
    •