Timing, Redirectability, and Runtime AI Oversight: The Sampling-Rate Hypothesis

Indexed indatacite

Abstract

This paper presents a theory-first framework for runtime AI oversight centered on pre-commitment monitoring, proxy faithfulness, and intervention feasibility. Its core claim is narrow: monitoring can improve intervention success only when a system can be observed, interpreted, and redirected before a hazardous trajectory reaches commitment. The framework organizes runtime oversight around three requirements: usable signal, sufficient remaining time, and retained intervention authority. It introduces Safety Slack, S_t, as a design margin comparing usable oversight capacity with effective hazard burden, and develops a phase-sensitive account of escalation through contact, attention, recognition, impulse, and…

Citation impact

5
total citations
FWCI
166.44
Percentile
100%
References
0
Too recent for citation history.

Authors

1

Topics & keywords

Keywords
  • USable
  • Adversarial system
  • Proxy (statistics)
  • Falsifiability
  • Fidelity
  • Intervention (counseling)
  • Control (management)
  • Software deployment
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.