preprintarXiv (Cornell University)Jul 27, 2023GREEN OA

Code of "Sirens' Whisper: Inaudible Near-Ultrasonic Jailbreaks of Speech-Driven LLMs"

GXGao, Xiuyong

Huazhong University of Science and Technology

Indexed inarxivdatacite

Abstract

## Introduction Large language models (LLMs) are increasingly deployed in voice interfaces such as smartphones, smart speakers, and in-vehicle systems, which broadens the attack surface to the acoustic front end. **SWhisper (Sirens’ Whisper)** is the first inaudible near-ultrasonic jailbreak targeting LLM-based voice assistants. By encoding malicious prompts into near-ultrasound carriers, SWhisper exploits microphone nonlinearities to demodulate signals into audible speech. These signals are imperceptible to humans but consistently transcribed by off-the-shelf speech recognizers and forwarded to LLMs. Key aspects include: - Decoupling audio from semantics: prompts are semantically fluent, short, and…

Citation impact

184
total citations
FWCI
Percentile
References
0
Citations per year

Authors

1
  • GX
    Gao, XiuyongCorresponding

    Huazhong University of Science and Technology

Topics & keywords

Keywords
  • Adversarial system
  • Computer science
  • Suffix
  • Ingenuity
  • Language model
  • Artificial intelligence
  • Computer security
  • Theoretical computer science
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.

Funding