Robust Learning-Based Control via Bootstrapped Multiplicative Noise¶
Authors: Benjamin Gravell, Tyler Summers
Published: 2020 (Conference Paper)
Source: Conference on Learning for Dynamics and Control (L4DC)
arXiv: 2002.10069
Summary¶
Assembles a pipeline for automatically designing controllers from data that perform well throughout the data acquisition and system operation timeline. Uncertainty is quantified with a statistical bootstrap and designed against by using a multiplicative noise framework to achieve robustness. The aim is to achieve both good robustness in the low-data short-term transient as well as good performance in the high-data long-term steady state condition.
Abstract¶
Despite decades of research and recent progress in adaptive control and reinforcement learning, there remains a fundamental lack of understanding in designing controllers that provide robustness to inherent non-asymptotic uncertainties arising from models estimated with finite, noisy data. We propose a robust adaptive control algorithm that explicitly incorporates such non-asymptotic uncertainties into the control design. The algorithm has three components: (1) a least-squares nominal model estimator; (2) a bootstrap resampling method that quantifies non-asymptotic variance of the nominal model estimate; and (3) a non-conventional robust control design method using an optimal linear quadratic regulator (LQR) with multiplicative noise. A key advantage of the proposed approach is that the system identification and robust control design procedures both use stochastic uncertainty representations, so that the actual inherent statistical estimation uncertainty directly aligns with the uncertainty the robust controller is being designed against. We show through numerical experiments that the proposed robust adaptive controller can significantly outperform the certainty equivalent controller on both expected regret and measures of regret risk.
Links¶
Primary
Standard
Alternate
Tags¶
-
Control
-
Optimal control
-
Robust control
-
Adaptive control
-
Reinforcement learning
-
Model-based
-
System identification
-
Multiplicative noise
-
Stochastic parameters
-
Bootstrap
-
Regret