Anoop Cherian

Anoop Cherian
  • Biography

    Anoop was a postdoctoral researcher in the LEAR group at Inria from 2012-2015 where his research was on the estimation and tracking of human poses in videos. From 2015-2017, he was a Research Fellow at the Australian National University, where he worked on the problem of recognizing human activities in video sequences. Anoop is the recipient of the Best Student Paper award at the Intl. Conference on Image Processing in 2012. Currently, his research focus is on modeling the semantics of video data.

  • Internships with Anoop

    • CV1287: Human Activity Prediction

      MERL is looking for a self-motivated intern to work on problems related to human action recognition and anticipation. The ideal candidate would be a PhD student with a strong mathematical background in machine learning and computer vision. The candidate must have prior experience in using deep learning on human poses (2D/3D). Working knowledge of generative adversarial networks and deep learning methods for video understanding will be a plus. Proficiency in Python and flexibility in using diverse deep learning software (TensorFlow, Pytorch, Keras, etc.) is expected. The internship is for 3 months with flexible start date.

    • CV1288: Visual Reasoning and Question Answering

      MERL is looking for a self-motivated intern to work on problems at the intersection of video understanding and visual question answering. The ideal candidate would be a senior year (>=3) PhD student with a strong mathematical background in machine learning and computer vision and who has published at least one paper in a top-tier machine learning or computer vision venue (NIPS/CVPR/ECCV/ICCV/ICML/PAMI etc.). The candidate must have prior experience in using deep learning methods for video understanding (such as action recognition, human pose estimation and tracking, etc.) and language models (such as in visual question answering or captioning). Working knowledge of generative adversarial networks will be a plus. Proficiency in Python and flexibility in using different deep learning software (TensorFlow, Pytorch, Keras, etc.) is expected. The internship is for 3-6 months with flexible start date.

    • CV1294: Multimodal Learning

      MERL is looking for a self-motivated intern to work on problems at the intersection of video understanding, audio/speech recognition, and language models. The ideal candidate would be a PhD student with a strong mathematical background in machine learning and computer vision. The candidate must have prior experience in using deep learning methods for video understanding (such as action recognition, human pose estimation and tracking, etc.) and language models (such as in visual question answering or captioning). Knowledge of deep learning for speech recognition and experience with generative adversarial networks will be a plus. Proficiency in Python and flexibility in using different deep learning software (TensorFlow, Pytorch, Keras, etc.) is expected. The intern is expected to collaborate with computer vision and speech teams at MERL to develop algorithms and prepare manuscripts for scientific publications. The internship is for 3 months with flexible start date.

    See All Internships at MERL
  • MERL Publications

    •  Wang, J., Anoop, C., "Discriminative Subspace Pooling for Action Recognition", Workshop on Perceptual Organization in Computer Vision as part of the European Conference on Computer Vision (ECCV), September 2018.
    •  Wang, J., Anoop, C., "Learning Discriminative Video Representations Using Adversarial Perturbations", European Conference on Computer Vision (ECCV), September 2018.
    •  Hori, C., Alamri, H., Wang, J., Wichern, G., Hori, T., Cherian, A., Marks, T.K., Cartillier, V., Lopes, R., Das, A., Essa, I., Batra, D., Parikh, D., "End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features", arXiv, July 13, 2018.
      BibTeX Download PDFAbout TR2018-085
      • @techreport{MERL_TR2018-085,
      • author = {Hori, C. and Alamri, H. and Wang, J. and Wichern, G. and Hori, T. and Cherian, A. and Marks, T.K. and Cartillier, V. and Lopes, R. and Das, A. and Essa, I. and Batra, D. and Parikh, D.},
      • title = {End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features},
      • institution = {MERL - Mitsubishi Electric Research Laboratories},
      • address = {Cambridge, MA 02139},
      • number = {TR2018-085},
      • month = jul,
      • year = 2018,
      • url = {http://www.merl.com/publications/TR2018-085/}
      • }
    •  Alamri, H., Cartillier, V., Lopes, R., Das, A., Wang, J., Essa, I., Batra, D., Parikh, D., Cherian, A., Marks, T.K., Hori, C., "Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7", arXiv, July 12, 2018.
      BibTeX Download PDFAbout TR2018-069
      • @techreport{MERL_TR2018-069,
      • author = {Alamri, H. and Cartillier, V. and Lopes, R. and Das, A. and Wang, J. and Essa, I. and Batra, D. and Parikh, D. and Cherian, A. and Marks, T.K. and Hori, C.},
      • title = {Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7},
      • institution = {MERL - Mitsubishi Electric Research Laboratories},
      • address = {Cambridge, MA 02139},
      • number = {TR2018-069},
      • month = jul,
      • year = 2018,
      • url = {http://www.merl.com/publications/TR2018-069/}
      • }
    •  Santacruz, R., Fernando, B., Cherian, A., Gould, S, "Neural Algebra of Classifiers", Tech. Rep. TR2018-033, Mitsubishi Electric Research Laboratories, Cambridge, MA, March 2018.
      BibTeX Download PDFAbout TR2018-033
      • @techreport{MERL_TR2018-033,
      • author = {Santacruz, R. and Fernando, B. and Cherian, A. and Gould, S},
      • title = {Neural Algebra of Classifiers},
      • institution = {MERL - Mitsubishi Electric Research Laboratories},
      • address = {Cambridge, MA 02139},
      • number = {TR2018-033},
      • month = mar,
      • year = 2018,
      • url = {http://www.merl.com/publications/TR2018-033/}
      • }
  • Other Publications

    •  Cherian, Anoop; Fernando, Basura; Harandi, Mehrtash; Gould, Stephen, "Generalized rank pooling for activity recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017.
      BibTeX
      • @Inproceedings{cherian2017generalized,
      • author = {Cherian, Anoop and Fernando, Basura and Harandi, Mehrtash and Gould, Stephen},
      • title = {Generalized rank pooling for activity recognition},
      • booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      • year = 2017
      • }
    •  Cruz, Rodrigo Santa; Fernando, Basura; Cherian, Anoop; Gould, Stephen, "DeepPermNet: Visual Permutation Learning", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017.
      BibTeX
      • @Inproceedings{cruz2017deeppermnet,
      • author = {Cruz, Rodrigo Santa and Fernando, Basura and Cherian, Anoop and Gould, Stephen},
      • title = {DeepPermNet: Visual Permutation Learning},
      • booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      • year = 2017
      • }
    •  Cherian, Anoop; Morellas, Vassilios; Papanikolopoulos, Nikolaos, "Bayesian nonparametric clustering for positive definite matrices", IEEE transactions on pattern analysis and machine intelligence, Vol. 38, No. 5, pp. 862-874, 2016.
      BibTeX
      • @Article{cherian2016bayesian,
      • author = {Cherian, Anoop and Morellas, Vassilios and Papanikolopoulos, Nikolaos},
      • title = {Bayesian nonparametric clustering for positive definite matrices},
      • journal = {IEEE transactions on pattern analysis and machine intelligence},
      • year = 2016,
      • volume = 38,
      • number = 5,
      • pages = {862--874},
      • publisher = {IEEE}
      • }
    •  Koniusz, Piotr; Cherian, Anoop, "Sparse coding for third-order super-symmetric tensor descriptors with application to texture recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 5395-5403.
      BibTeX
      • @Inproceedings{koniusz2016sparse,
      • author = {Koniusz, Piotr and Cherian, Anoop},
      • title = {Sparse coding for third-order super-symmetric tensor descriptors with application to texture recognition},
      • booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      • year = 2016,
      • pages = {5395--5403}
      • }
    •  Koniusz, Piotr; Cherian, Anoop; Porikli, Fatih, "Tensor representations via kernel linearization for action recognition from 3D skeletons", European Conference on Computer Vision, 2016, pp. 37-53.
      BibTeX
      • @Inproceedings{koniusz2016tensor,
      • author = {Koniusz, Piotr and Cherian, Anoop and Porikli, Fatih},
      • title = {Tensor representations via kernel linearization for action recognition from 3D skeletons},
      • booktitle = {European Conference on Computer Vision},
      • year = 2016,
      • pages = {37--53},
      • organization = {Springer}
      • }
    •  Cherian, Anoop; Sra, Suvrit; Morellas, Vassilios; Papanikolopoulos, Nikolaos, "Efficient nearest neighbors via robust sparse hashing", IEEE Transactions on Image Processing, Vol. 23, No. 8, pp. 3646-3655, 2014.
      BibTeX
      • @Article{cherian2014efficient,
      • author = {Cherian, Anoop and Sra, Suvrit and Morellas, Vassilios and Papanikolopoulos, Nikolaos},
      • title = {Efficient nearest neighbors via robust sparse hashing},
      • journal = {IEEE Transactions on Image Processing},
      • year = 2014,
      • volume = 23,
      • number = 8,
      • pages = {3646--3655},
      • publisher = {IEEE}
      • }
    •  Cherian, Anoop; Mairal, Julien; Alahari, Karteek; Schmid, Cordelia, "Mixing body-part sequences for human pose estimation", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp. 2353-2360.
      BibTeX
      • @Inproceedings{cherian2014mixing,
      • author = {Cherian, Anoop and Mairal, Julien and Alahari, Karteek and Schmid, Cordelia},
      • title = {Mixing body-part sequences for human pose estimation},
      • booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      • year = 2014,
      • pages = {2353--2360}
      • }
    •  Cherian, Anoop, "Nearest neighbors using compact sparse codes", Proceedings of the 31st International Conference on Machine Learning (ICML-14), 2014, pp. 1053-1061.
      BibTeX
      • @Inproceedings{cherian2014nearest,
      • author = {Cherian, Anoop},
      • title = {Nearest neighbors using compact sparse codes},
      • booktitle = {Proceedings of the 31st International Conference on Machine Learning (ICML-14)},
      • year = 2014,
      • pages = {1053--1061}
      • }
    •  Cherian, Anoop; Sra, Suvrit, "Riemannian sparse coding for positive definite matrices", European conference on computer vision, 2014, pp. 299-314.
      BibTeX
      • @Inproceedings{cherian2014riemannian,
      • author = {Cherian, Anoop and Sra, Suvrit},
      • title = {Riemannian sparse coding for positive definite matrices},
      • booktitle = {European conference on computer vision},
      • year = 2014,
      • pages = {299--314},
      • organization = {Springer}
      • }
    •  Cherian, Anoop; Sra, Suvrit; Banerjee, Arindam; Papanikolopoulos, Nikolaos, "Jensen-bregman logdet divergence with application to efficient similarity search for covariance matrices", IEEE transactions on pattern analysis and machine intelligence, Vol. 35, No. 9, pp. 2161-2174, 2013.
      BibTeX
      • @Article{cherian2013jensen,
      • author = {Cherian, Anoop and Sra, Suvrit and Banerjee, Arindam and Papanikolopoulos, Nikolaos},
      • title = {Jensen-bregman logdet divergence with application to efficient similarity search for covariance matrices},
      • journal = {IEEE transactions on pattern analysis and machine intelligence},
      • year = 2013,
      • volume = 35,
      • number = 9,
      • pages = {2161--2174},
      • publisher = {IEEE}
      • }
    •  Cherian, Anoop; Morellas, Vassilios; Papanikolopoulos, Nikolaos; Bedros, Saad J, "Dirichlet process mixture models on symmetric positive definite matrices for appearance clustering in video surveillance applications", Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, 2011, pp. 3417-3424.
      BibTeX
      • @Inproceedings{cherian2011dirichlet,
      • author = {Cherian, Anoop and Morellas, Vassilios and Papanikolopoulos, Nikolaos and Bedros, Saad J},
      • title = {Dirichlet process mixture models on symmetric positive definite matrices for appearance clustering in video surveillance applications},
      • booktitle = {Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on},
      • year = 2011,
      • pages = {3417--3424},
      • organization = {IEEE}
      • }
    •  Cherian, Anoop; Sra, Suvrit; Banerjee, Arindam; Papanikolopoulos, Nikolaos, "Efficient similarity search for covariance matrices via the Jensen-Bregman LogDet divergence", Computer Vision (ICCV), 2011 IEEE International Conference on, 2011, pp. 2399-2406.
      BibTeX
      • @Inproceedings{cherian2011efficient,
      • author = {Cherian, Anoop and Sra, Suvrit and Banerjee, Arindam and Papanikolopoulos, Nikolaos},
      • title = {Efficient similarity search for covariance matrices via the Jensen-Bregman LogDet divergence},
      • booktitle = {Computer Vision (ICCV), 2011 IEEE International Conference on},
      • year = 2011,
      • pages = {2399--2406},
      • organization = {IEEE}
      • }
    •  Sra, Suvrit; Cherian, Anoop, "Generalized dictionary learning for symmetric positive definite matrices with application to nearest neighbor retrieval", Machine Learning and Knowledge Discovery in Databases, pp. 318-332, 2011.
      BibTeX
      • @Article{sra2011generalized,
      • author = {Sra, Suvrit and Cherian, Anoop},
      • title = {Generalized dictionary learning for symmetric positive definite matrices with application to nearest neighbor retrieval},
      • journal = {Machine Learning and Knowledge Discovery in Databases},
      • year = 2011,
      • pages = {318--332},
      • publisher = {Springer}
      • }
    •  Cherian, Anoop; Morellas, Vassilios; Papanikolopoulos, Nikolaos, "Accurate 3D ground plane estimation from a single image", Robotics and Automation, 2009. ICRA'09. IEEE International Conference on, 2009, pp. 2243-2249.
      BibTeX
      • @Inproceedings{cherian2009accurate,
      • author = {Cherian, Anoop and Morellas, Vassilios and Papanikolopoulos, Nikolaos},
      • title = {Accurate 3D ground plane estimation from a single image},
      • booktitle = {Robotics and Automation, 2009. ICRA'09. IEEE International Conference on},
      • year = 2009,
      • pages = {2243--2249},
      • organization = {IEEE}
      • }