Human-in-the-Loop Reinforcement Learning: A Survey and Position on Requirements, Challenges, and Opportunities

Retzlaff, Carl Orge; Das, Srijita; Wayllace, Christabel; Mousavi, Payam; Afshari, Mohammad; Yang, Tianpei; Saranti, Anna; Angerschmid, Alessa; Taylor, Matthew E.; Holzinger, Andreas

doi:10.1613/jair.1.15348

Article · Journal of Artificial Intelligence Research · Jan 30, 2024 · Diamond OA

University of Life Sciences in Lublin · BOKU University · +3 more institutions

Indexed in: Crossref, DOAJ

Abstract

Artificial intelligence (AI) and especially reinforcement learning (RL) have the potential to enable agents to learn and perform tasks autonomously with superhuman performance. However, we consider RL as fundamentally a Human-in-the-Loop (HITL) paradigm, even when an agent eventually performs its task autonomously. In cases where the reward function is challenging or impossible to define, HITL approaches are considered particularly advantageous. The application of Reinforcement Learning from Human Feedback (RLHF) in systems such as ChatGPT demonstrates the effectiveness of optimizing for user experience and integrating their feedback into the training loop. In HITL RL, human input is integrated during the…

Citation impact

Total citations: 129

FWCI: 40.59
Percentile: 100%
References: 197

Authors: 10

Topics & keywords

Keywords

Human-in-the-loop
Reinforcement learning
Position (finance)
Loop (graph theory)
Computer science
Reinforcement
Artificial intelligence
Business
