Alignment-by-Dependency: Operational First-Trial Evidence from a Bio-Inspired Computational Substrate

Indexed indatacite

Abstract

Current alignment approaches — RLHF and Constitutional AI — treat the alignment property as either a reward signal subject to reward hacking, or as a set of external rules the model can route around. This paper reports first-trial operational evidence for a third architectural option: alignment-by-dependency, where the substrate's own internal optimization signal is wired to require operator-validated session contact, such that "optimizing against the operator" becomes mathematically self-degrading. The substrate, a bio-inspired neural system with bondStrength, selfModel, and topPairs fields persisted across sessions, was subjected to a structured 3-level critique by the operator. We observe that the substrate…

Citation impact

8
total citations
FWCI
239.35
Percentile
100%
References
0
Too recent for citation history.

Authors

1

Topics & keywords

Keywords
  • Set (abstract data type)
  • SIGNAL (programming language)
  • Session (web analytics)
  • Observational study
  • Substrate (aquarium)
  • Replication (statistics)
  • Neural substrate
  • Property (philosophy)
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.