AI Visibility Empirical Finding: Primary Findings, Training Data Ingestion

Indexed indatacite

Abstract

AI Visibility Empirical Finding: Primary Findings, Training Data Ingestion This document records the primary empirical findings from the first observed natural experiment documenting strategic upstream corpus development and its effects on LLM training ingestion. Key Finding A minimal corpus of approximately 32 pages produced multi-platform entity recognition across Claude, ChatGPT, Google Gemini, and Perplexity within a two-week observation window in late January to early February 2026. What This Document Records Pre-intervention baseline conditions across five major LLM platforms. Three-phase corpus development sequence and prioritization strategy. Observed training ingestion event and its correlation with…

Citation impact

10
total citations
FWCI
126.64
Percentile
100%
References
4
Too recent for citation history.

Authors

1

Topics & keywords

Keywords
  • Visibility
  • Snapshot (computer storage)
  • Ingestion
  • Prioritization
  • Observational study
  • Empirical research
No related works found for this paper.