Adaptive Robust Watermarking for Large Language Models via Dynamic Token Embedding Perturbation

Zeng, Ziyang; Lin, Han; Zhang, S; Wang, B.

doi:10.1109/access.2026.3653833

articleIEEE AccessJan 1, 2026GOLD OA

Adaptive Robust Watermarking for Large Language Models via Dynamic Token Embedding Perturbation

ZZZiyang ZengHLHan LinSZS Zhang BWB. Wang

New York University · University of Wisconsin–Madison · +2 more institutions

Indexed incrossrefdoaj

Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities in generating high-quality text, raising significant concerns regarding copyright protection and content provenance verification. However, most existing watermarking techniques rely on uniform perturbation or rule-based token biasing schemes, which exhibit critical vulnerabilities under adversarial attacks such as paraphrasing, translation, and content truncation, often failing to maintain detection reliability in real-world deployment scenarios. To address these challenges, this paper introduces a novel context-aware robust watermarking framework that dynamically adjusts watermark embedding strength according to contextual semantic…

Citation impact

9

total citations

FWCI: 221.37
Percentile: 100%
References: 23

Too recent for citation history.

Authors

4

ZZ
Ziyang ZengCorresponding
New York University
HL
Han Lin
University of Wisconsin–Madison
SZ
S Zhang
University of California, Berkeley
BW
B. Wang
University of Southern California

Topics & keywords

Topics

Keywords

Digital watermarking
Watermark
Robustness (evolution)
Security token
Embedding
Scalability
Scrambling
Language model

No related works found for this paper.