End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks

Cheng, Richard; Orosz, Gábor; Murray, Richard M.; Burdick, Joel W.

doi:10.1609/aaai.v33i01.33013387

articleProceedings of the AAAI Conference on Artificial IntelligenceJul 17, 2019DIAMOND OA

End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks

RCRichard Cheng GOGábor Orosz RMRichard M. Murray JWJoel W. Burdick

California Institute of Technology · University of Michigan

Indexed incrossref

Abstract

Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and one main reason is the absence of safety guarantees during the learning process. Real world systems would realistically fail or break before an optimal controller can be learned. To address this issue, we propose a controller architecture that combines (1) a model-free RL-based controller with (2) model-based controllers utilizing control barrier functions (CBFs) and (3) online learning of the unknown system dynamics, in order to ensure safety during learning. Our general framework leverages the success of RL algorithms to learn high-performance controllers, while the CBF-based controllers both guarantee safety…

Citation impact

527

total citations

FWCI: 122.20
Percentile: 100%
References: 39

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Reinforcement learning
Computer science
Inverted pendulum
Controller (irrigation)
Process (computing)
Gaussian process
Artificial intelligence
Machine learning

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.