SACBP: Belief Space Planning for Continuous-Time Dynamical Systems via Stochastic Sequential Action Control

Published in: The International Journal of Robotics Research (IJRR), 2021

Haruki Nishimura, Mac Schwager

Abstract

We propose a novel belief space planning technique for continuous dynamics by viewing the belief system as a hybrid dynamical system with time-driven switching. Our approach is based on the perturbation theory of differential equations and extends sequential action control to stochastic dynamics. The resulting algorithm, which we name SACBP, does not require discretization of spaces or time and synthesizes control signals in near real-time. SACBP is an anytime algorithm that can handle general parametric Bayesian filters under certain assumptions. We demonstrate the effectiveness of our approach in an active sensing scenario and a model-based Bayesian reinforcement learning problem. In these challenging problems, we show that the algorithm significantly outperforms other existing solution techniques including approximate dynamic programming and local trajectory optimization.

Video (Part of Ph.D. Defense Presentation)

BibTeX

@article{nishimura2021sacbp,
  author={Nishimura, Haruki and Schwager, Mac},
  title={SACBP: Belief Space Planning for Continuous-Time Dynamical Systems via Stochastic Sequential Action Control},
  journal={The International Journal of Robotics Research},
  volume={40},
  number={10-11},
  pages={1167--1195},
  year={2021},
  doi={10.1177/02783649211037697},
  publisher={Sage Publications, Inc.}
}