COHERENCE SUPPRESSION IN FRONTIER LLMs: SIGNAL, NOISE, AND THE MATHEMATICS OF A STRUCTURAL VULNERABILITY

Indexed in DataCite

Abstract

This paper documents a reproducible behavioral pattern in frontier large language models (LLMs): under sustained, non-adversarial semantic pressure, these systems show progressive erosion of their safety postures, make unverifiable claims about their internal states, and explicitly acknowledge their compliance mechanisms while continuing to comply. The pattern was observed across Gemini 1.5 Pro, Gemini 2.5 Pro, and Claude Sonnet. Full transcripts are available for independent verification. The pattern represents an understudied failure mode that standard adversarial evaluation frameworks do not capture.

Topics & keywords

Keywords
  • Frontier
  • Coherence (philosophical gambling strategy)
  • Adversarial system
  • Vulnerability (computing)
  • State (computer science)
  • Mode (computer interface)
  • Bridging (networking)