Code of "Sirens' Whisper: Inaudible Near-Ultrasonic Jailbreaks of Speech-Driven LLMs"

Xiuyong, Gao,

doi:10.48550/arxiv.2307.15043

preprintarXiv (Cornell University)Jul 27, 2023GREEN OA

Code of "Sirens' Whisper: Inaudible Near-Ultrasonic Jailbreaks of Speech-Driven LLMs"

GXGao, Xiuyong

Huazhong University of Science and Technology

Indexed inarxivdatacite

Abstract

## Introduction Large language models (LLMs) are increasingly deployed in voice interfaces such as smartphones, smart speakers, and in-vehicle systems, which broadens the attack surface to the acoustic front end. **SWhisper (Sirens’ Whisper)** is the first inaudible near-ultrasonic jailbreak targeting LLM-based voice assistants. By encoding malicious prompts into near-ultrasound carriers, SWhisper exploits microphone nonlinearities to demodulate signals into audible speech. These signals are imperceptible to humans but consistently transcribed by off-the-shelf speech recognizers and forwarded to LLMs. Key aspects include: - Decoupling audio from semantics: prompts are semantically fluent, short, and…

Citation impact

184

total citations

FWCI: —
Percentile: —
References: 0

Citations per year

Authors

1

GX
Gao, XiuyongCorresponding
Huazhong University of Science and Technology

Topics & keywords

Topics

Keywords

Adversarial system
Computer science
Suffix
Ingenuity
Language model
Artificial intelligence
Computer security
Theoretical computer science

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.