Code of "Sirens' Whisper: Inaudible Near-Ultrasonic Jailbreaks of Speech-Driven LLMs"
Huazhong University of Science and Technology
Abstract
## Introduction Large language models (LLMs) are increasingly deployed in voice interfaces such as smartphones, smart speakers, and in-vehicle systems, which broadens the attack surface to the acoustic front end. **SWhisper (Sirens’ Whisper)** is the first inaudible near-ultrasonic jailbreak targeting LLM-based voice assistants. By encoding malicious prompts into near-ultrasound carriers, SWhisper exploits microphone nonlinearities to demodulate signals into audible speech. These signals are imperceptible to humans but consistently transcribed by off-the-shelf speech recognizers and forwarded to LLMs. Key aspects include: - Decoupling audio from semantics: prompts are semantically fluent, short, and…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 0
Authors
1- GXGao, XiuyongCorresponding
Huazhong University of Science and Technology
Topics & keywords
- Adversarial system
- Computer science
- Suffix
- Ingenuity
- Language model
- Artificial intelligence
- Computer security
- Theoretical computer science
- Peace, Justice and strong institutions