TR2026-029

MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions


Abstract:

Real-world scenes often feature multiple humans interacting with multiple objects in ways that are causal, goal-oriented, or cooperative. Yet existing 3D human-object interaction (HOI) benchmarks consider only a fraction of these complex interactions. To close this gap, we present MMHOI – a large-scale, Multi-human Multi-object Interaction dataset consisting of images from 12 everyday scenarios. MMHOI offers complete 3D shape and pose annotations for every person and object, along with labels for 78 action categories and 14 interaction-specific body parts, providing a comprehensive testbed for next-generation HOI research. Building on MMHOI, we present MMHOI-Net, an end-to-end transformer-based neural network for jointly estimating human–object 3D geometries, their interactions, and associated actions. A key innovation in our framework is a structured dual-patch representation for modeling objects and their interactions, combined with action recognition to enhance the interaction prediction. Experiments on MMHOI and the recently proposed CORE4D datasets demonstrate that our approach achieves state-of-the-art performance in multi-HOI modeling, excelling in both accuracy and reconstruction quality. The MMHOI dataset is available at https://zenodo.org/records/17711786.
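
To make the annotation structure described above concrete, the following is a minimal Python sketch of what a per-image MMHOI record could look like. All class and field names here are hypothetical illustrations inferred from the abstract (per-person and per-object 3D shape and pose, 78 action categories, 14 interaction-specific body parts, 12 scenarios); they are not the dataset's actual schema, which is documented in the Zenodo release.

    # Hypothetical sketch of a per-image MMHOI annotation record.
    # Field names are illustrative; consult the dataset release for the real schema.
    from dataclasses import dataclass, field

    NUM_ACTION_CLASSES = 78      # action categories reported for MMHOI
    NUM_BODY_PART_LABELS = 14    # interaction-specific body parts

    @dataclass
    class PersonAnnotation:
        pose: list[float]        # 3D body pose parameters
        shape: list[float]       # 3D body shape parameters
        action_ids: list[int]    # indices into the 78 action categories
        contact_parts: list[int] # indices into the 14 interaction body parts

    @dataclass
    class ObjectAnnotation:
        category: str            # object class name
        pose: list[float]        # 6-DoF object pose (rotation + translation)
        mesh_path: str           # path to the object's 3D shape

    @dataclass
    class MMHOISample:
        image_path: str
        scenario: str            # one of the 12 everyday scenarios
        persons: list[PersonAnnotation] = field(default_factory=list)
        objects: list[ObjectAnnotation] = field(default_factory=list)

A record of this shape would let one image carry any number of people and objects, each with full 3D annotations, which is what distinguishes the multi-human, multi-object setting from single-pair HOI benchmarks.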

 


  • Related Publication

  • Kogashi, K., Cherian, A., Kuo, M.-Y. J., "MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions", arXiv, December 2025.
    BibTeX:
    @article{Kogashi2025dec,
      author = {Kogashi, Kaen and Cherian, Anoop and Kuo, Meng-Yu Jennifer},
      title = {{MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions}},
      journal = {arXiv},
      year = 2025,
      month = dec,
      url = {https://arxiv.org/abs/2510.07828}
    }