Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)

Avraham, Natan,; Roni, Stern,; Meir, Kalech,

doi:10.4230/oasics.dx.2024.16

preprintarXiv (Cornell University)Jul 20, 2017GREEN OA

Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)

NANatan, AvrahamSRStern, RoniKMKalech, Meir

Ben-Gurion University of the Negev

Indexed inarxivdatacite

Abstract

Due to the safety risks and training sample inefficiency, it is often preferred to develop controllers in simulation. However, minor differences between the simulation and the real world can cause a significant sim-to-real gap. This gap can reduce the effectiveness of the developed controller. In this paper, we examine a case study of transferring an octorotor reinforcement learning controller from simulation to the real world. First, we quantify the effectiveness of the real-world transfer by examining safety metrics. We find that although there is a noticeable (around 100%) increase in deviation in real flights, this deviation may not be considered unsafe, as it will be within > 2m safety corridors. Then, we…

Citation impact

11,298

total citations

FWCI: 570.62
Percentile: 100%
References: 11

Citations per year

Authors

3

NA
Natan, AvrahamCorresponding
Ben-Gurion University of the Negev
SR
Stern, Roni
Ben-Gurion University of the Negev
KM
Kalech, Meir
Ben-Gurion University of the Negev

Topics & keywords

Topics

Keywords

Computer science
Optimization algorithm
Algorithm
Mathematical optimization
Mathematics

No related works found for this paper.