TR2026-011

Learning Non-prehensile Manipulation with Force and Vision Feedback Using Optimization-based Demonstrations


    •  Shirai, Y., Ota, K., Jha, D.K., Romeres, D., "Learning Non-prehensile Manipulation with Force and Vision Feedback Using Optimization-based Demonstrations", IEEE Robotics and Automation Letters, January 2026.
      BibTeX TR2026-011 PDF Video
      • @article{Shirai2026jan,
      • author = {Shirai, Yuki and Ota, Kei and Jha, Devesh K. and Romeres, Diego},
      • title = {{Learning Non-prehensile Manipulation with Force and Vision Feedback Using Optimization-based Demonstrations}},
      • journal = {IEEE Robotics and Automation Letters},
      • year = 2026,
      • month = jan,
      • url = {https://www.merl.com/publications/TR2026-011}
      • }
  • MERL Contacts:
  • Research Area:

    Robotics

Abstract:

Non-prehensile manipulation is challenging due to complex contact interactions between objects, the environment, and robots. Model-based approaches can efficiently generate complex trajectories of robots and objects under contact constraints. However, they tend to be sensitive to model inaccuracies and require access to privileged information (e.g., object mass, size, pose), making them less suitable for novel objects. In contrast, learning-based approaches are typically more robust to modeling errors but require large amounts of data. In this paper, we bridge these two approaches to propose a framework for learning closed- loop non-prehensile manipulation. By leveraging computationally efficient Contact-Implicit Trajectory Optimization (CITO), we design demonstration-guided deep Reinforcement Learning (RL), leading to sample-efficient learning. We also present a sim- to-real transfer approach using a privileged training strategy, enabling the robot to perform non-prehensile manipulation using only proprioception, vision, and force sensing without access to privileged information. Our method is evaluated on several tasks, demonstrating that it can successfully perform zero-shot sim-to- real transfer.

 

  • Related Video

  • Related Publication

  •  Shirai, Y., Ota, K., Jha, D.K., Romeres, D., "Sim-to-Real Contact-Rich Pivoting via Optimization-Guided RL with Vision and Touch", Embodied World Models for Decision Making, NeurIPS Workshop, December 2025.
    BibTeX TR2025-169 PDF Video
    • @inproceedings{Shirai2025dec,
    • author = {Shirai, Yuki and Ota, Kei and Jha, Devesh K. and Romeres, Diego},
    • title = {{Sim-to-Real Contact-Rich Pivoting via Optimization-Guided RL with Vision and Touch}},
    • booktitle = {NeurIPS 2025 Workshop on Embodied World Models for Decision Making},
    • year = 2025,
    • month = dec,
    • url = {https://www.merl.com/publications/TR2025-169}
    • }