Article | Journal of Machine Learning Research | Jan 1, 2015 | Closed access

A comprehensive survey on safe reinforcement learning

Universidad Carlos III de Madrid

Abstract

Safe Reinforcement Learning can be defined as the process of learning policies that maximize the expectation of the return in problems in which it is important to ensure reasonable system performance and/or respect safety constraints during the learning and/or deployment processes. We categorize and analyze two approaches to Safe Reinforcement Learning. The first is based on the modification of the optimality criterion, the classic discounted finite/infinite horizon, with a safety factor. The second is based on the modification of the exploration process through the incorporation of external knowledge or the guidance of a risk metric. We use the proposed classification to survey the existing literature, as…

Citation impact

Total citations: 1,194
FWCI: 57.79
Percentile: 100%
References: 109

[Figure: Citations per year]

Authors

2 authors

Topics & keywords

Keywords
  • Reinforcement learning
  • Categorization
  • Computer science
  • Software deployment
  • Metric (unit)
  • Reinforcement
  • Process (computing)
  • Machine learning
UN Sustainable Development Goals
  • Quality Education