Safety Guardrails for LLM-Enabled Robots
University of Pennsylvania · Carnegie Mellon University
Abstract
Although the integration of large language models (LLMs) into robotics has unlocked transformative capabilities, it has also introduced significant safety concerns, ranging from average-case LLM errors (e.g., hallucinations) to adversarial jailbreaking attacks, which can produce harmful robot behavior in real-world settings. Traditional robot safety approaches do not address the contextual vulnerabilities of LLMs, and current LLM safety approaches overlook the physical risks posed by robots operating in real-world environments. To ensure the safety of LLM-enabled robots, we propose RoboGuard, a two-stage guardrail architecture. RoboGuard first contextualizes pre-defined safety rules by grounding them in the…
Citation impact
- FWCI
- 69.14
- Percentile
- 100%
- References
- 0
Authors
5Topics & keywords
- Robot
- Robotics
- Reliability (semiconductor)
- Adversarial system
- Transformative learning
- Ground
- System safety
- Affordable and clean energy