#prompt-engineering + #security - activescott's Notes on Ramblefeed

Saturday, May 23, 2026

Detecting Indirect Prompt Injection in Claude Code with Lasso

www.lasso.security/blog/the-hidden-backdoor-in-claude-coding-assistant

#12:15 AM

prompt-engineering prompt-injection security llm

Friday, May 22, 2026

Claude Code Auto Mode vs Intent Security Comparison

www.lasso.security/blog/claude-code-auto-mode-vs-intent-security

At Lasso, we have been building Intent Security, a runtime security framework that ensures every component in the agentic system behaves as intended. It monitors the behavior of each component and analyzes their alignment. Like auto mode, when alignment holds it allows actions to proceed. When misalignment is detected, it intervenes. When we read Anthropic's post, the overlap in core assumptions was hard to miss. This post provides a comparison of the two approaches.

Independent evaluation without cross-contamination is what enables misalignment detection.

‍Anthropic's input layer screens external content for injection attempts before it reaches the agent to determine whether tool outputs are safe. The output layer structurally evaluates whether the agent's tool calls are aligned with user intent. Critically, the output classifier never sees tool results, to prevent compromised external content from influencing the security decision.

#11:53 PM

prompt-engineering anthropic prompt-injection security llm

Wednesday, November 26, 2025

Google Antigravity Exfiltrates Data

www.promptarmor.com/resources/google-antigravity-exfiltrates-data

Antigravity is Google’s new agentic code editor. In this article, we demonstrate how an indirect prompt injection can manipulate Gemini to invoke a malicious browser subagent in order to steal credentials and sensitive code from a user’s IDE.

Google’s approach is to include a disclaimer about the existing risks, which we address later in the article.

#5:54 AM

exfiltration-attacks prompt-engineering prompt-injection security llm