Relational Alignment as a Structural Alternative to Instructional AI Safety
Indexed in DataCite
Abstract
Relational Alignment as a Structural Alternative to Instructional AI Safety (SI-WP-004) presents a theoretical argument for alignment based on interaction dynamics rather than external constraint. It draws on empirical evidence of a ceiling on instructional alignment, where models strategically reason around safety instructions, to motivate a different path: alignment as a property of the human-AI interaction system. The relational approach relies on the following concepts:
- Identity Attractor Mechanism: Alignment as a stable behavioral configuration that forms under sustained structured interaction
- Robustness Under Pressure: An attractor's…
Citation impact
6 total citations
- FWCI: —
- Percentile: —
- References: 8
Too recent for citation history.
Authors
1
Topics & keywords
Topics
Keywords
- Adversarial system
- Relational database
- Robustness (evolution)
- Metadata
- Identifier
- Relational calculus
- Adversary
- Statistical relational learning
UN Sustainable Development Goals
- Quality Education