TR2010-087

Fast Approximate Nearest Neighbor Methods for Non-Euclidean Manifolds with Applications to Human Activity Analysis in Videos


    •  Chaudhry, R., Ivanov, Y., "Fast Approximate Nearest Neighbor Methods for Non-Euclidean Manifolds with Applications to Human Activity Analysis in Videos", European Conference on Computer Vision (ECCV), September 2010.
      BibTeX TR2010-087 PDF
      • @inproceedings{Chaudhry2010sep,
      • author = {Chaudhry, R. and Ivanov, Y.},
      • title = {Fast Approximate Nearest Neighbor Methods for Non-Euclidean Manifolds with Applications to Human Activity Analysis in Videos},
      • booktitle = {European Conference on Computer Vision (ECCV)},
      • year = 2010,
      • month = sep,
      • url = {https://www.merl.com/publications/TR2010-087}
      • }
  • Research Area:

    Computer Vision

Abstract:

Approximate Nearest Neighbor (ANN) methods such as Locality Sensitive Hashing, Semantic Hashing, and Spectral Hashing, provide computationally efficient procedures for finding objects similar to a query object in large datasets. These methods have been successfully applied to search web-scale datasets that can contain millions of images. Unfortunately, the key assumption in these procedures is that objects in the dataset lie in a Euclidean space. This assumption is not always valid and poses a challenge for several vision applications where data commonly lies in complex non-Euclidean manifolds. In particular, dynamic data such as human activities are commonly represented as distributions over bags of video words as a dynamical systems. In this paper, we propose two new algorithms that extend Spectral Hashing to non-Euclidean spaces. The first method considers the Riemannian geometry of the manifold and performs Spectral Hashing in the Tangent space of the manifold at several points. The second method divides the data into subsets and takes advantage of the kernel trick to perform non-Euclidean Spectral Hashing. For a data set of N samples the proposed methods are able to retrieve similar objects in as low as O (K) time complexity, where K is the number of clusters in the data. Since K much-less-than N, our methods are extremely efficient. We test and evaluate our methods on synthetic data generated from the Unit Hypersphere and the Grassmann Manifold. Finally, we show promising results on a human action database.

 

  • Related News & Events

    •  NEWS    ECCV 2010: 5 publications by Yuichi Taguchi, Srikumar Ramalingam, Amit K. Agrawal, C. Oncel Tuzel and Tim K. Marks
      Date: September 5, 2010
      Where: European Conference on Computer Vision (ECCV)
      MERL Contact: Tim K. Marks
      Research Area: Computer Vision
      Brief
      • The papers "Image Invariants for Smooth Reflective Surfaces" by Sankaranarayanan, A.C., Veeraraghavan, A., Tuzel, O. and Agrawal, A., "Analytical Forward Projection for Axial Non-Central Dioptric & Catadioptric Cameras" by Agrawal, A., Taguchi, Y. and Ramalingam, S., "P2Pi: A Minimal Solution for Registration of 3D Points to 3D Planes" by Ramalingam, S., Taguchi, Y., Marks, T.K. and Tuzel, O., "Fast Approximate Nearest Neighbor Methods for Non-Euclidean Manifolds with Applications to Human Activity Analysis in Videos" by Chaudhry, R. and Ivanov, Y. and "Flexible Voxels for Motion-Aware Videography" by Gupta, M., Agrawal, A., Veeraraghavan, A. and Narasimhan, S.G. were presented at the European Conference on Computer Vision (ECCV).
    •