Distributed Negative Feedback Optimization for Multi-Agent AI
Indexed indatacite
Abstract
Existing AI safety research focuses predominantly on controlling the outputs of individual agents through guardrails, alignment training, and constitutional constraints. We argue that this single-agent paradigm has a fundamental blind spot: an agent cannot reliably detect errors in its own reasoning, just as an open-loop control system cannot correct disturbances it does not measure. We propose Distributed Negative Feedback Optimization (DNFO), a framework that applies the century-old principle of negative feedback from control engineering to multi-agent AI systems. DNFO recasts multi-agent collaboration as a closed-loop control problem, where each agent's output is structurally verified by independent…
Citation impact
10
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Too recent for citation history.
Authors
1Topics & keywords
Topics
Keywords
- Liveness
- Redundancy (engineering)
- Imperfect
- Control (management)
- Mutual exclusion
- Multi-agent system
- Control system
- Automotive industry
UN Sustainable Development Goals
- Reduced inequalities
No related works found for this paper.