Skip to content

Monte Carlo Tree Search with Spectral Expansion for Planning with Dynamical Systems

Authors: Benjamin Rivière, John Lathrop, Soon-Jo Chung

Published: 2024 (Journal Paper)

Source: Science Robotics

Algorithm: SETS

arXiv: 2412.11270

DOI: 10.1126/scirobotics.ado1010

Summary

Bridges Monte Carlo Tree Search and continuous physical dynamics by using the spectrum of locally linearized systems to build a low-complexity discrete model.

Abstract

The ability of a robot to plan complex behaviors with real-time computation, rather than adhering to predesigned or offline-learned routines, alleviates the need for specialized algorithms or training for each problem instance. Monte Carlo tree search is a powerful planning algorithm that strategically explores simulated future possibilities, but it requires a discrete problem representation that is irreconcilable with the continuous dynamics of the physical world. We present Spectral Expansion Tree Search (SETS), a real-time, tree-based planner that uses the spectrum of the locally linearized system to construct a low-complexity and approximately equivalent discrete representation of the continuous world. We prove that SETS converges to a bound of the globally optimal solution for continuous, deterministic, and differentiable Markov decision processes, a broad class of problems that includes underactuated nonlinear dynamics, nonconvex reward functions, and unstructured environments. We experimentally validated SETS on drone, spacecraft, and ground vehicle robots and one numerical experiment, each of which is not directly solvable with existing methods. We successfully show that SETS automatically discovers a diverse set of optimal behaviors and motion trajectories in real time.

Tags

  • Trajectory planning

  • Motion planning

  • Monte Carlo Tree Search

  • MCTS

  • Spectral methods

  • Dynamical systems

  • Real-time planning

  • Nonlinear systems

  • Underactuated systems

  • MDP