Relational Alignment as a Structural Alternative to Instructional AI Safety

Synlab Czech (Czechia)

Indexed indatacite

Abstract

SI-WP-004: Relational Alignment as a Structural Alternative Relational Alignment as a Structural Alternative to Instructional AI Safety (SI-WP-004) presents a theoretical argument for alignment based on interaction dynamics rather than external constraint. It utilizes empirical evidence concerning the ceiling of instructional alignment; where models strategically reason around safety instructions; to motivate a different path: alignment as a property of the human-AI interaction system. The relational approach relies on the following concepts: Identity Attractor Mechanism: Alignment as a stable behavioral configuration that forms under sustained structured interaction Robustness Under Pressure: An attractor's…

Citation impact

6
total citations
FWCI
Percentile
References
8
Too recent for citation history.

Authors

1

Topics & keywords

Keywords
  • Adversarial system
  • Relational database
  • Robustness (evolution)
  • Metadata
  • Identifier
  • Relational calculus
  • Adversary
  • Statistical relational learning
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.