TR2020-164

Active Learning for Estimating Reachable Sets for Systems with Unknown Dynamics


    •  Chakrabarty, A., Raghunathan, A., Di Cairano, S., Danielson, C., "Active Learning for Estimating Reachable Sets for Systems with Unknown Dynamics", IEEE Transactions on Cybernetics, DOI: 10.1109/​TCYB.2020.3000966, December 2020.
      BibTeX TR2020-164 PDF
      • @article{Chakrabarty2020dec,
      • author = {Chakrabarty, Ankush and Raghunathan, Arvind and Di Cairano, Stefano and Danielson, Claus},
      • title = {Active Learning for Estimating Reachable Sets for Systems with Unknown Dynamics},
      • journal = {IEEE Transactions on Cybernetics},
      • year = 2020,
      • month = dec,
      • doi = {10.1109/TCYB.2020.3000966},
      • url = {https://www.merl.com/publications/TR2020-164}
      • }
  • MERL Contacts:
  • Research Areas:

    Control, Machine Learning, Optimization

Abstract:

This paper presents a data-driven method for computing reachable sets where active learning is used to reduce the computational burden. Set-based methods used to estimate reachable sets typically do not scale well with state-space dimension, or rely heavily on the existence of a model. If such a model is not available, it is simple to generate state trajectory data by numerically simulating black-box oracles of systems (whose dynamics are unknown) from sampled initial conditions. Using these data samples, the estimation of reachable sets can be posed as a classification problem, wherein active learning (AL) can intelligently select samples that are most informative and least similar to previously labeled samples. By exploiting submodularity, the actively learned samples can be selected efficiently, with bounded sub-optimality. Our proposed framework is illustrated by estimating the domains of attractions of model predictive controllers (MPCs) and reinforcement learners. We also consider a scenario where there are two oracles that differ with respect to evaluation costs and labeling accuracy. We propose a framework to reduce the dependency of the expensive oracle in labeling samples using disagreement-based active learning (DBAL). The potential of the DBAL algorithm is demonstrated on a solver selection problem for real-time MPC