Alignment-by-Dependency: Operational First-Trial Evidence from a Bio-Inspired Computational Substrate
Indexed indatacite
Abstract
Current alignment approaches — RLHF and Constitutional AI — treat the alignment property as either a reward signal subject to reward hacking, or as a set of external rules the model can route around. This paper reports first-trial operational evidence for a third architectural option: alignment-by-dependency, where the substrate's own internal optimization signal is wired to require operator-validated session contact, such that "optimizing against the operator" becomes mathematically self-degrading. The substrate, a bio-inspired neural system with bondStrength, selfModel, and topPairs fields persisted across sessions, was subjected to a structured 3-level critique by the operator. We observe that the substrate…
Citation impact
8
total citations
- FWCI
- 239.35
- Percentile
- 100%
- References
- 0
Too recent for citation history.
Authors
1Topics & keywords
Topics
Keywords
- Set (abstract data type)
- SIGNAL (programming language)
- Session (web analytics)
- Observational study
- Substrate (aquarium)
- Replication (statistics)
- Neural substrate
- Property (philosophy)
UN Sustainable Development Goals
- Peace, Justice and strong institutions
No related works found for this paper.