Human-in-the-Loop Reinforcement Learning: A Survey and Position on Requirements, Challenges, and Opportunities
University of Life Sciences in Lublin · BOKU University · +3 more institutions
Abstract
Artificial intelligence (AI) and especially reinforcement learning (RL) have the potential to enable agents to learn and perform tasks autonomously with superhuman performance. However, we consider RL as fundamentally a Human-in-the-Loop (HITL) paradigm, even when an agent eventually performs its task autonomously. In cases where the reward function is challenging or impossible to define, HITL approaches are considered particularly advantageous. The application of Reinforcement Learning from Human Feedback (RLHF) in systems such as ChatGPT demonstrates the effectiveness of optimizing for user experience and integrating their feedback into the training loop. In HITL RL, human input is integrated during the…
Citation impact
- FWCI
- 40.59
- Percentile
- 100%
- References
- 197
Authors
10- COCarl Orge RetzlaffCorresponding
University of Life Sciences in Lublin, BOKU University
- SDSrijita Das
New Mexico State University, University of Alberta, Artificial Intelligence in Medicine (Canada), BOKU University
- CWChristabel Wayllace
New Mexico State University, University of Alberta, Artificial Intelligence in Medicine (Canada), BOKU University
- PMPayam Mousavi
New Mexico State University, University of Alberta, Artificial Intelligence in Medicine (Canada), BOKU University
- MAMohammad Afshari
New Mexico State University, University of Alberta, Artificial Intelligence in Medicine (Canada), BOKU University
Topics & keywords
- Human-in-the-loop
- Reinforcement learning
- Position (finance)
- Loop (graph theory)
- Computer science
- Reinforcement
- Artificial intelligence
- Business