TR2016-041

Data-Driven Anytime Algorithms for Motion Planning with Safety Guarantees

- Jha, D.K., Zhu, M., Wang, Y., Ray, A., "Data-Driven Anytime Algorithms for Motion Planning with Safety Guarantees", American Control Conference (ACC), DOI: 10.1109/ACC.2016.7526565, July 2016, pp. 5716-5721.
  BibTeX:
  @inproceedings{Jha2016jul,
    author = {Jha, Devesh K. and Zhu, Minghui and Wang, Yebin and Ray, Asok},
    title = {{Data-Driven Anytime Algorithms for Motion Planning with Safety Guarantees}},
    booktitle = {American Control Conference (ACC)},
    year = 2016,
    pages = {5716--5721},
    month = jul,
    doi = {10.1109/ACC.2016.7526565},
    url = {https://www.merl.com/publications/TR2016-041}
  }
MERL Contact:
- Yebin Wang
Research Areas:

Control, Robotics

Abstract:

This paper presents a learning-based (i.e., data-driven) approach to motion planning of robotic systems. It is motivated by controller synthesis problems for safety-critical systems, where an accurate estimate of the uncertainties (e.g., unmodeled dynamics, disturbances) can improve the performance of the system. The state space of the system is built by sampling from the state set as well as the input set of the underlying system. The robust adaptive motion planning problem is modeled as a learning-based approach-evasion differential game, in which a machine learning algorithm updates the statistical estimates of the uncertainties from system observations. The system begins with a conservative estimate of the uncertainty set to ensure safety of the underlying system, and the robustness constraints are relaxed as better estimates of the unmodeled uncertainty become available. The estimates from the machine learning algorithm are used to refine the controller estimates in an anytime fashion. We show that the values of the game converge to the optimal values under known disturbance, provided that the statistical estimates of the uncertainty converge. Using confidence intervals for the unmodeled disturbance, estimated by the machine learning estimator during the transient learning phase, we are able to guarantee safety of the robotic system with the proposed algorithms during the transient phase.
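
The idea of starting from a conservative uncertainty set and relaxing it as confidence intervals tighten can be illustrated with a minimal sketch. This is not the paper's estimator, which the abstract does not specify. It assumes a scalar disturbance with an unknown constant mean, observations confined to a prior range `[-d_max, d_max]`, and a Hoeffding confidence interval; the names `disturbance_interval`, `robust_margin`, `d_max`, and `delta` are hypothetical.

```python
import math


def disturbance_interval(samples, d_max, delta=0.05):
    """Confidence interval for the mean disturbance (illustrative assumption).

    Assumes every observation lies in [-d_max, d_max]. By Hoeffding's
    inequality the true mean lies within `radius` of the sample mean with
    probability at least 1 - delta. The result is clipped to the prior
    range, so it is never less conservative than the prior allows.
    """
    if not samples:
        # No data yet: fall back to the conservative prior set.
        return (-d_max, d_max)
    n = len(samples)
    mean = sum(samples) / n
    # Hoeffding radius for observations with range (b - a) = 2 * d_max.
    radius = 2.0 * d_max * math.sqrt(math.log(2.0 / delta) / (2.0 * n))
    return (max(-d_max, mean - radius), min(d_max, mean + radius))


def robust_margin(interval):
    """Worst-case disturbance magnitude a planner would have to tolerate."""
    lo, hi = interval
    return max(abs(lo), abs(hi))


if __name__ == "__main__":
    d_max = 1.0
    for n in (0, 10, 1000):
        observations = [0.1] * n  # hypothetical noise-free observations
        margin = robust_margin(disturbance_interval(observations, d_max))
        print(n, round(margin, 3))
```

With no data the margin equals the prior bound, and it shrinks as observations accumulate, which mirrors the relaxation of the robustness constraints described above. An anytime planner would re-solve with the current margin whenever it is updated.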

Related News & Events

NEWS MERL makes a strong showing at the American Control Conference
Date: July 6, 2016 - July 8, 2016
Where: American Control Conference (ACC)
MERL Contacts: Scott A. Bortoff; Petros T. Boufounos; Stefano Di Cairano; Abraham Goldsmith; Christopher R. Laughman; Daniel N. Nikovski; Arvind Raghunathan; Yebin Wang; Avishai Weiss
Research Areas: Control, Dynamical Systems, Machine Learning
Brief
- The premier American Control Conference (ACC) takes place in Boston July 6-8. This year MERL researchers will present a record 20 papers(!) at ACC, with several contributions, especially in autonomous vehicle path planning and in Model Predictive Control (MPC) theory and applications, including manufacturing machines, electric motors, satellite station keeping, and HVAC. Other important themes developed in MERL's presentations concern adaptation, learning, and optimization in control systems.
