Kuan-Chuan Peng

Phone: 617-621-7576
Email:

Position:
Research / Technical Staff

Principal Research Scientist
Education:
Ph.D., Cornell University, 2016
Research Areas:
External Links:
- Google Scholar

Biography

Before joining MERL, he was a Research Scientist (2016-2018) and Staff Scientist (2019) at Siemens Corporate Technology. His PhD research focuses on solving abstract tasks in computer vision using convolutional neural networks. In addition to his PhD, he received a bachelor's degree in Electrical Engineering and an MS degree in Computer Science and Information Engineering from National Taiwan University in 2009 and 2012 respectively. His research interests include incremental learning, developing practical solutions given biased or scarce data, and fundamental computer vision and machine learning problems.
Recent News & Events
- NEWS MERL Papers and Workshops at CVPR 2025
  Date: June 11, 2025 - June 15, 2025
  Where: Nashville, TN, USA
  MERL Contacts: Matthew Brand; Moitreya Chatterjee; Anoop Cherian; Michael J. Jones; Toshiaki Koike-Akino; Jing Liu; Suhas Lohit; Tim K. Marks; Pedro Miraldo; Kuan-Chuan Peng; Pu (Perry) Wang; Ye Wang
  Research Areas: Artificial Intelligence, Computer Vision, Machine Learning, Signal Processing, Speech & Audio
  Brief
  - MERL researchers are presenting 2 conference papers, co-organizing two workshops, and presenting 7 workshop papers at the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025 conference, which will be held in Nashville, TN, USA from June 11-15, 2025. CVPR is one of the most prestigious and competitive international conferences in the area of computer vision. Details of MERL contributions are provided below:
    
    Main Conference Papers:
    
    1. "UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing" by Y.H. Lai, J. Ebbers, Y. F. Wang, F. Germain, M. J. Jones, M. Chatterjee
    
    This work deals with the task of weakly‑supervised Audio-Visual Video Parsing (AVVP) and proposes a novel, uncertainty-aware algorithm called UWAV towards that end. UWAV works by producing more reliable segment‑level pseudo‑labels while explicitly weighting each label by its prediction uncertainty. This uncertainty‑aware training, combined with a feature‑mixup regularization scheme, promotes inter‑segment consistency in the pseudo-labels. As a result, UWAV achieves state‑of‑the‑art performance on two AVVP datasets across multiple metrics, demonstrating both effectiveness and strong generalizability.
    
    Paper: https://www.merl.com/publications/TR2025-072
    
    2. "TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection" by Y. G. Jung, J. Park, J. Yoon, K.-C. Peng, W. Kim, A. B. J. Teoh, and O. Camps.
    
    This work tackles unsupervised anomaly detection in complex scenarios where normal data is noisy and has an unknown, imbalanced class distribution. Existing models face a trade-off between robustness to noise and performance on rare (tail) classes. To address this, the authors propose TailSampler, which estimates class sizes from embedding similarities to isolate tail samples. Using TailSampler, they develop TailedCore, a memory-based model that effectively captures tail class features while remaining noise-robust, outperforming state-of-the-art methods in extensive evaluations.
    
    paper: https://www.merl.com/publications/TR2025-077
    
    MERL Co-Organized Workshops:
    
    1. Multimodal Algorithmic Reasoning (MAR) Workshop, organized by A. Cherian, K.-C. Peng, S. Lohit, H. Zhou, K. Smith, L. Xue, T. K. Marks, and J. Tenenbaum.
    
    Workshop link: https://marworkshop.github.io/cvpr25/
    
    2. The 6th Workshop on Fair, Data-Efficient, and Trusted Computer Vision, organized by N. Ratha, S. Karanam, Z. Wu, M. Vatsa, R. Singh, K.-C. Peng, M. Merler, and K. Varshney.
    
    Workshop link: https://fadetrcv.github.io/2025/
    
    Workshop Papers:
    
    1. "FreBIS: Frequency-Based Stratification for Neural Implicit Surface Representations" by N. Sawada, P. Miraldo, S. Lohit, T.K. Marks, and M. Chatterjee (Oral)
    
    With their ability to model object surfaces in a scene as a continuous function, neural implicit surface reconstruction methods have made remarkable strides recently, especially over classical 3D surface reconstruction methods, such as those that use voxels or point clouds. Towards this end, we propose FreBIS - a neural implicit‑surface framework that avoids overloading a single encoder with every surface detail. It divides a scene into several frequency bands and assigns a dedicated encoder (or group of encoders) to each band, then enforces complementary feature learning through a redundancy‑aware weighting module. Swapping this frequency‑stratified stack into an off‑the‑shelf reconstruction pipeline markedly boosts 3D surface accuracy and view‑consistent rendering on the challenging BlendedMVS dataset.
    
    paper: https://www.merl.com/publications/TR2025-074
    
    2. "Multimodal 3D Object Detection on Unseen Domains" by D. Hegde, S. Lohit, K.-C. Peng, M. J. Jones, and V. M. Patel.
    
    LiDAR-based object detection models often suffer performance drops when deployed in unseen environments due to biases in data properties like point density and object size. Unlike domain adaptation methods that rely on access to target data, this work tackles the more realistic setting of domain generalization without test-time samples. We propose CLIX3D, a multimodal framework that uses both LiDAR and image data along with supervised contrastive learning to align same-class features across domains and improve robustness. CLIX3D achieves state-of-the-art performance across various domain shifts in 3D object detection.
    
    paper: https://www.merl.com/publications/TR2025-078
    
    3. "Improving Open-World Object Localization by Discovering Background" by A. Singh, M. J. Jones, K.-C. Peng, M. Chatterjee, A. Cherian, and E. Learned-Miller.
    
    This work tackles open-world object localization, aiming to detect both seen and unseen object classes using limited labeled training data. While prior methods focus on object characterization, this approach introduces background information to improve objectness learning. The proposed framework identifies low-information, non-discriminative image regions as background and trains the model to avoid generating object proposals there. Experiments on standard benchmarks show that this method significantly outperforms previous state-of-the-art approaches.
    
    paper: https://www.merl.com/publications/TR2025-058
    
    4. "PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector" by K. Li, T. Zhang, K.-C. Peng, and G. Wang.
    
    This work addresses challenges in 3D object detection for autonomous driving by improving the fusion of LiDAR and camera data, which is often hindered by domain gaps and limited labeled data. Leveraging advances in foundation models and prompt engineering, the authors propose PF3Det, a multi-modal detector that uses foundation model encoders and soft prompts to enhance feature fusion. PF3Det achieves strong performance even with limited training data. It sets new state-of-the-art results on the nuScenes dataset, improving NDS by 1.19% and mAP by 2.42%.
    
    paper: https://www.merl.com/publications/TR2025-076
    
    5. "Noise Consistency Regularization for Improved Subject-Driven Image Synthesis" by Y. Ni., S. Wen, P. Konius, A. Cherian
    
    Fine-tuning Stable Diffusion enables subject-driven image synthesis by adapting the model to generate images containing specific subjects. However, existing fine-tuning methods suffer from two key issues: underfitting, where the model fails to reliably capture subject identity, and overfitting, where it memorizes the subject image and reduces background diversity. To address these challenges, two auxiliary consistency losses are porposed for diffusion fine-tuning. First, a prior consistency regularization loss ensures that the predicted diffusion noise for prior (non- subject) images remains consistent with that of the pretrained model, improving fidelity. Second, a subject consistency regularization loss enhances the fine-tuned model’s robustness to multiplicative noise modulated latent code, helping to preserve subject identity while improving diversity. Our experimental results demonstrate the effectiveness of our approach in terms of image diversity, outperforming DreamBooth in terms of CLIP scores, background variation, and overall visual quality.
    
    paper: https://www.merl.com/publications/TR2025-073
    
    6. "LatentLLM: Attention-Aware Joint Tensor Compression" by T. Koike-Akino, X. Chen, J. Liu, Y. Wang, P. Wang, M. Brand
    
    We propose a new framework to convert a large foundation model such as large language models (LLMs)/large multi- modal models (LMMs) into a reduced-dimension latent structure. Our method uses a global attention-aware joint tensor decomposition to significantly improve the model efficiency. We show the benefit on several benchmark including multi-modal reasoning tasks.
    
    paper: https://www.merl.com/publications/TR2025-075
    
    7. "TuneComp: Joint Fine-Tuning and Compression for Large Foundation Models" by T. Koike-Akino, X. Chen, J. Liu, Y. Wang, P. Wang, M. Brand
    
    To reduce model size during post-training, compression methods, including knowledge distillation, low-rank approximation, and pruning, are often applied after fine- tuning the model. However, sequential fine-tuning and compression sacrifices performance, while creating a larger than necessary model as an intermediate step. In this work, we aim to reduce this gap, by directly constructing a smaller model while guided by the downstream task. We propose to jointly fine-tune and compress the model by gradually distilling it to a pruned low-rank structure. Experiments demonstrate that joint fine-tuning and compression significantly outperforms other sequential compression methods.
    
    paper: https://www.merl.com/publications/TR2025-079
- NEWS MERL Papers and Workshops at AAAI 2025
  Date: February 25, 2025 - March 4, 2025
  Where: The Association for the Advancement of Artificial Intelligence (AAAI)
  MERL Contacts: Ankush Chakrabarty; Toshiaki Koike-Akino; Jing Liu; Kuan-Chuan Peng; Diego Romeres; Ye Wang
  Research Areas: Artificial Intelligence, Machine Learning, Optimization
  Brief
  - MERL researchers presented 2 conference papers, 2 workshop papers, and co-organized 1 workshop at the AAAI 2025 conference, which was held in Philadelphia from Feb. 25 to Mar. 4, 2025. AAAI is one of the most prestigious and competitive international conferences in artificial intelligence (AI). Details of MERL contributions are provided below.
    
    - AAAI Papers in Main Tracks:
    
    1. "Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage" by M.R.U. Rashid, J. Liu, T. Koike-Akino, Y. Wang, and S. Mehnaz. [Oral Presentation]
    
    This work proposes a novel unlearning-based model poisoning method that amplifies privacy breaches during fine-tuning. Extensive empirical studies show the proposed method’s efficacy on both membership inference and data extraction attacks. The attack is stealthy enough to bypass detection based defenses, and differential privacy cannot effectively defend against the attacks without significantly impacting model utility.
    
    Paper: https://www.merl.com/publications/TR2025-017
    
    2. "User-Preference Meets Pareto-Optimality: Multi-Objective Bayesian Optimization with Local Gradient Search" by J.H.S. Ip, A. Chakrabarty, A. Mesbah, and D. Romeres. [Poster Presentation]
    
    This paper introduces a sample-efficient multi-objective Bayesian optimization method that integrates user preferences with gradient-based search to find near-Pareto optimal solutions. The proposed method achieves high utility and reduces distance to Pareto-front solutions across both synthetic and real-world problems, underscoring the importance of minimizing gradient uncertainty during gradient-based optimization. Additionally, the study introduces a novel utility function that respects Pareto dominance and effectively captures diverse user preferences.
    
    Paper: https://www.merl.com/publications/TR2025-018
    
    - AAAI Workshop Papers:
    
    1. "Quantum Diffusion Models for Few-Shot Learning" by R. Wang, Y. Wang, J. Liu, and T. Koike-Akino.
    
    This work presents the quantum diffusion model (QDM) as an approach to overcome the challenges of quantum few-shot learning (QFSL). It introduces three novel algorithms developed from complementary data-driven and algorithmic perspectives to enhance the performance of QFSL tasks. The extensive experiments demonstrate that these algorithms achieve significant performance gains over traditional baselines, underscoring the potential of QDM to advance QFSL by effectively leveraging quantum noise modeling and label guidance.
    
    Paper: https://www.merl.com/publications/TR2025-025
    
    2. "Quantum Implicit Neural Compression", by T. Fujihashi and T., Koike-Akino.
    
    This work introduces a quantum counterpart of implicit neural representation (quINR) which leverages the exponentially rich expressivity of quantum neural networks to improve the classical INR-based signal compression methods. Evaluations using some benchmark datasets show that the proposed quINR-based compression could improve rate-distortion performance in image compression compared with traditional codecs and classic INR-based coding methods.
    
    Paper: https://www.merl.com/publications/TR2025-024
    
    - AAAI Workshops Contributed by MERL:
    
    1. "Scalable and Efficient Artificial Intelligence Systems (SEAS)"
    
    K.-C. Peng co-organized this workshop, which offers a timely forum for experts to share their perspectives in designing and developing robust computer vision (CV), machine learning (ML), and artificial intelligence (AI) algorithms, and translating them into real-world solutions.
    
    Workshop link: https://seasworkshop.github.io/aaai25/index.html
    
    2. "Quantum Computing and Artificial Intelligence"
    
    T. Koike-Akino served a session chair of Quantum Neural Network in this workshop, which focuses on seeking contributions encompassing theoretical and applied advances in quantum AI, quantum computing (QC) to enhance classical AI, and classical AI to tackle various aspects of QC.
    
    Workshop link: https://sites.google.com/view/qcai2025/
See All News & Events for Kuan-Chuan
Research Highlights
- Robust Machine Learning
MERL Publications
- Yang, C.-A., Peng, K.-C., Yeh, R., "Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts", IEEE International Conference on Computer Vision (ICCV), October 2025.
  BibTeX TR2025-124 PDF Video Data Presentation
  - @inproceedings{Yang2025oct,
  - author = {{{Yang, Chiao-An and Peng, Kuan-Chuan and Yeh, Raymond}}},
  - title = {{{Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts}}},
  - booktitle = {IEEE International Conference on Computer Vision (ICCV)},
  - year = 2025,
  - month = oct,
  - url = {https://www.merl.com/publications/TR2025-124}
  - }
- Peng, K.-C., "Joint Training of Image Generator and Detector for Road Defect Detection", arXiv, September 2025.
  BibTeX arXiv
  - @article{Peng2025sep,
  - author = {Peng, Kuan-Chuan},
  - title = {{Joint Training of Image Generator and Detector for Road Defect Detection}},
  - journal = {arXiv},
  - year = 2025,
  - month = sep,
  - url = {https://arxiv.org/abs/2509.03465}
  - }
- Xiang, X., Peng, K.-C., Lohit, S., Jones, M.J., Zhang, J., "Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes", arXiv, August 2025.
  BibTeX arXiv
  - @article{Xiang2025aug,
  - author = {Xiang, Xinhao and Peng, Kuan-Chuan and Lohit, Suhas and Jones, Michael J. and Zhang, Jiawei},
  - title = {{Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes}},
  - journal = {arXiv},
  - year = 2025,
  - month = aug,
  - url = {https://arxiv.org/abs/2508.16812}
  - }
- Hegde, D., Lohit, S., Peng, K.-C., Jones, M.J., Patel, V.M., "Multimodal 3D Object Detection on Unseen Domains", IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshop, June 2025, pp. 2499-2509.
  BibTeX TR2025-078 PDF
  - @inproceedings{Hegde2025jun,
  - author = {Hegde, Deepti and Lohit, Suhas and Peng, Kuan-Chuan and Jones, Michael J. and Patel, Vishal M.},
  - title = {{Multimodal 3D Object Detection on Unseen Domains}},
  - booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshop},
  - year = 2025,
  - pages = {2499--2509},
  - month = jun,
  - url = {https://www.merl.com/publications/TR2025-078}
  - }
- Jung, Y.G., Park, J., Yoon, J., Peng, K.-C., Kim, W., Teoh, A.B.J., Camps, O., "TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Isola, P. and Kjellström, H. and Lepetit, V. and Li, F. and Su, H. and Tang, S., Eds., DOI: 10.1109/CVPR52734.2025.02378, June 2025, pp. 25539-25548.
  BibTeX TR2025-077 PDF Video Presentation
  - @inproceedings{Jung2025jun,
  - author = {{{Jung, Yoon G. and Park, Jaewoo and Yoon, Jaeho and Peng, Kuan-Chuan and Kim, Wonchul and Teoh, Andrew B. J. and Camps, Octavia}}},
  - title = {{{TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection}}},
  - booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  - year = 2025,
  - editor = {Isola, P. and Kjellström, H. and Lepetit, V. and Li, F. and Su, H. and Tang, S.},
  - pages = {25539--25548},
  - month = jun,
  - publisher = {IEEE},
  - doi = {10.1109/CVPR52734.2025.02378},
  - issn = {2575-7075},
  - isbn = {979-8-3315-4364-8},
  - url = {https://www.merl.com/publications/TR2025-077}
  - }
See All MERL Publications for Kuan-Chuan
Other Publications
- Prithviraj Dhar, Rajat Vikram Singh, Kuan-Chuan Peng, Ziyan Wu and Rama Chellappa, "Learning without Memorizing", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
  BibTeX
  - @Inproceedings{Dhar_CVPR19,
  - author = {Dhar, Prithviraj and Singh, Rajat Vikram and Peng, Kuan-Chuan and Wu, Ziyan and Chellappa, Rama},
  - title = {Learning without Memorizing},
  - booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  - year = 2019
  - }
- Kunpeng Li, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst and Yun Fu, "Guided Attention Inference Network", IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019.
  BibTeX
  - @Article{Li_TPAMI19,
  - author = {Li, Kunpeng and Wu, Ziyan and Peng, Kuan-Chuan and Ernst, Jan and Fu, Yun},
  - title = {Guided Attention Inference Network},
  - journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
  - year = 2019,
  - publisher = {IEEE}
  - }
- Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram Singh, Bo Liu and Dimitris N. Metaxas, "Sharpen Focus: Learning with Attention Separability and Consistency", IEEE International Conference on Computer Vision (ICCV), 2019.
  BibTeX
  - @Inproceedings{Wang_ICCV19,
  - author = {Wang, Lezi and Wu, Ziyan and Karanam, Srikrishna and Peng, Kuan-Chuan and Singh, Rajat Vikram and Liu, Bo and Metaxas, Dimitris N.},
  - title = {Sharpen Focus: Learning with Attention Separability and Consistency},
  - booktitle = {IEEE International Conference on Computer Vision (ICCV)},
  - year = 2019
  - }
- Yunye Gong, Srikrishna Karanam, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst and Peter C. Doerschuk, "Learning Compositional Visual Concepts with Mutual Consistency", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
  BibTeX
  - @Inproceedings{Gong_CVPR18,
  - author = {Gong, Yunye and Karanam, Srikrishna and Wu, Ziyan and Peng, Kuan-Chuan and Ernst, Jan and Doerschuk, Peter C.},
  - title = {Learning Compositional Visual Concepts with Mutual Consistency},
  - booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  - year = 2018
  - }
- Kunpeng Li, Ziyan Wu, Kuan-Chuan Peng, Jan Ernst and Yun Fu, "Tell Me Where to Look: Guided Attention Inference Network", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
  BibTeX
  - @Inproceedings{Li_CVPR18,
  - author = {Li, Kunpeng and Wu, Ziyan and Peng, Kuan-Chuan and Ernst, Jan and Fu, Yun},
  - title = {Tell Me Where to Look: Guided Attention Inference Network},
  - booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  - year = 2018
  - }
- Kuan-Chuan Peng, Ziyan Wu and Jan Ernst, "Zero-Shot Deep Domain Adaptation", European Conference on Computer Vision (ECCV), 2018.
  BibTeX
  - @Inproceedings{Peng_ECCV18,
  - author = {Peng, Kuan-Chuan and Wu, Ziyan and Ernst, Jan},
  - title = {Zero-Shot Deep Domain Adaptation},
  - booktitle = {European Conference on Computer Vision (ECCV)},
  - year = 2018
  - }
- Kuan-Chuan Peng, Tsuhan Chen, Amir Sadovnik and Andrew C. Gallagher, "Where Do Emotions Come from? Predicting the Emotion Stimuli Map", IEEE International Conference on Image Processing (ICIP), 2016.
  BibTeX
  - @Inproceedings{Peng_ICIP16,
  - author = {Peng, Kuan-Chuan and Chen, Tsuhan and Sadovnik, Amir and Gallagher, Andrew C.},
  - title = {Where Do Emotions Come from? Predicting the Emotion Stimuli Map},
  - booktitle = {IEEE International Conference on Image Processing (ICIP)},
  - year = 2016
  - }
- Kuan-Chuan Peng and Tsuhan Chen, "Toward Correlating and Solving Abstract Tasks Using Convolutional Neural Networks", IEEE Winter Conference on Applications of Computer Vision (WACV), 2016.
  BibTeX
  - @Inproceedings{Peng_WACV16,
  - author = {Peng, Kuan-Chuan and Chen, Tsuhan},
  - title = {Toward Correlating and Solving Abstract Tasks Using Convolutional Neural Networks},
  - booktitle = {IEEE Winter Conference on Applications of Computer Vision (WACV)},
  - year = 2016
  - }
- Kuan-Chuan Peng, Tsuhan Chen, Amir Sadovnik and Andrew C. Gallagher, "A Mixed Bag of Emotions: Model, Predict, and Transfer Emotion Distributions", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
  BibTeX
  - @Inproceedings{Peng_CVPR15,
  - author = {Peng, Kuan-Chuan and Chen, Tsuhan and Sadovnik, Amir and Gallagher, Andrew C.},
  - title = {A Mixed Bag of Emotions: Model, Predict, and Transfer Emotion Distributions},
  - booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  - year = 2015
  - }
- Kuan-Chuan Peng and Tsuhan Chen, "Cross-layer Features in Convolutional Neural Networks for Generic Classification Tasks", IEEE International Conference on Image Processing (ICIP), 2015.
  BibTeX
  - @Inproceedings{Peng_ICIP15,
  - author = {Peng, Kuan-Chuan and Chen, Tsuhan},
  - title = {Cross-layer Features in Convolutional Neural Networks for Generic Classification Tasks},
  - booktitle = {IEEE International Conference on Image Processing (ICIP)},
  - year = 2015
  - }
- Kuan-Chuan Peng and Tsuhan Chen, "A Framework of Extracting Multi-scale Features Using Multiple Convolutional Neural Network", IEEE International Conference on Multimedia and Expo (ICME), 2015.
  BibTeX
  - @Inproceedings{Peng_ICME15,
  - author = {Peng, Kuan-Chuan and Chen, Tsuhan},
  - title = {A Framework of Extracting Multi-scale Features Using Multiple Convolutional Neural Network},
  - booktitle = {IEEE International Conference on Multimedia and Expo (ICME)},
  - year = 2015
  - }
- Kuan-Chuan Peng, Kolbeinn Karlsson, Tsuhan Chen, Dongqing Zhang and Hong Heather Yu, "A Framework of Changing Image Emotion Using Emotion Prediction", IEEE International Conference on Image Processing (ICIP), 2014.
  BibTeX
  - @Inproceedings{Peng_ICIP14,
  - author = {Peng, Kuan-Chuan and Karlsson, Kolbeinn and Chen, Tsuhan and Zhang, Dongqing and Yu, Hong Heather},
  - title = {A Framework of Changing Image Emotion Using Emotion Prediction},
  - booktitle = {IEEE International Conference on Image Processing (ICIP)},
  - year = 2014
  - }
- Kuan-Chuan Peng and Tsuhan Chen, "Incorporating Cloud Distribution in Sky Representation", IEEE International Conference on Computer Vision (ICCV), 2013.
  BibTeX
  - @Inproceedings{Peng_ICCV13,
  - author = {Peng, Kuan-Chuan and Chen, Tsuhan},
  - title = {Incorporating Cloud Distribution in Sky Representation},
  - booktitle = {IEEE International Conference on Computer Vision (ICCV)},
  - year = 2013
  - }
Software & Data Downloads
Videos

[ICCV 2025] Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts

[CVPR 2025] TailedCore: Few-Shot Sampling for Unsupervised Long-Tail Noisy Anomaly Detection

[NeurIPS 2024] Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

[ECCV 2024] Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection

Are Deep Neural Networks SMARTer than Second Graders?
MERL Issued Patents
- Title: "Method and System for Zero-Shot Cross Domain Video Anomaly Detection"
  Inventors: Peng, Kuan-Chuan; Aich, Abhishek
  Patent No.: 12,315,242
  Issue Date: May 27, 2025
- Title: "Contactless Elevator Service for an Elevator Based on Augmented Datasets"
  Inventors: Sahinoglu, Zafer; Peng, Kuan-Chuan; Sullivan, Alan; Yerazunis, William S.
  Patent No.: 12,071,323
  Issue Date: Aug 27, 2024
See All Patents for MERL