End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks
California Institute of Technology · University of Michigan
Abstract
Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and one main reason is the absence of safety guarantees during the learning process. Real world systems would realistically fail or break before an optimal controller can be learned. To address this issue, we propose a controller architecture that combines (1) a model-free RL-based controller with (2) model-based controllers utilizing control barrier functions (CBFs) and (3) online learning of the unknown system dynamics, in order to ensure safety during learning. Our general framework leverages the success of RL algorithms to learn high-performance controllers, while the CBF-based controllers both guarantee safety…
Citation impact
- FWCI
- 122.20
- Percentile
- 100%
- References
- 39
Authors
4Topics & keywords
- Reinforcement learning
- Computer science
- Inverted pendulum
- Controller (irrigation)
- Process (computing)
- Gaussian process
- Artificial intelligence
- Machine learning
- Peace, Justice and strong institutions