Not What You've Signed Up For: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
Saarland University · Helmholtz Center for Information Security
Abstract
Large Language Models (LLMs) are increasingly being integrated into applications, with versatile functionalities that can be easily modulated via natural language prompts. So far, it was assumed that the user is directly prompting the LLM. But, what if it is not the user prompting? We show that LLM-Integrated Applications blur the line between data and instructions and reveal several new attack vectors, using Indirect Prompt Injection, that enable adversaries to remotely (i.e., without a direct interface) exploit LLM-integrated applications by strategically injecting prompts into data likely to be retrieved at inference time. We derive a comprehensive taxonomy from a computer security perspective to broadly…
Citation impact
- FWCI
- 53.89
- Percentile
- 100%
- References
- 10
Authors
6Topics & keywords
- Computer science
- Exploit
- Computer security
- Software deployment
- Interface (matter)
- Software engineering
- Operating system