Skip to content

Synthesis and Stabilization of Complex Behaviors through Online Trajectory Optimization

Authors: Yuval Tassa, Tom Erez, Emanuel Todorov

Published: 2012 (Conference Paper)

Source: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Algorithm: iLQR

DOI: 10.1109/IROS.2012.6386025

Summary

Refines iLQR with key algorithmic improvements now standard in modern implementations: state Hessian regularization, improved feedforward gain line search, and adaptive regularization scheduling. Clearer presentation than the original Li & Todorov paper.

Abstract

We present an online trajectory optimization method and software platform applicable to complex humanoid robots performing challenging tasks such as getting up from an arbitrary pose on the ground and recovering from large disturbances using dexterous acrobatic maneuvers. The resulting behaviors, illustrated in the attached video, are computed only 7x slower than real time, on a standard PC. The video also shows results on the acrobot problem, planar swimming and one-legged hopping. These simpler problems can already be solved in real time, without pre-computing anything.

Tags

  • Trajectory optimization

  • Iterative linear quadratic regulator

  • Differential dynamic programming

  • iLQR

  • DDP

  • Nonlinear control

  • Online optimization

  • Synthesis

  • Stabilization

  • Behaviors