#anthropic + #prompt-engineering - activescott's Notes on Ramblefeed

Friday, May 22, 2026

Claude Code Auto Mode vs Intent Security Comparison

www.lasso.security/blog/claude-code-auto-mode-vs-intent-security

At Lasso, we have been building Intent Security, a runtime security framework that ensures every component in the agentic system behaves as intended. It monitors the behavior of each component and analyzes their alignment. Like auto mode, when alignment holds it allows actions to proceed. When misalignment is detected, it intervenes. When we read Anthropic's post, the overlap in core assumptions was hard to miss. This post provides a comparison of the two approaches.

Independent evaluation without cross-contamination is what enables misalignment detection.

‍Anthropic's input layer screens external content for injection attempts before it reaches the agent to determine whether tool outputs are safe. The output layer structurally evaluates whether the agent's tool calls are aligned with user intent. Critically, the output classifier never sees tool results, to prevent compromised external content from influencing the security decision.

#11:53 PM

prompt-engineering anthropic prompt-injection security llm

research/extract-system-prompts at 2cf912666ba08ef0c00a1b51ee07c9a8e64579ef · simonw/research

github.com/simonw/research/tree/2cf912666ba08ef0c00a1b51ee07c9a8e64579ef/extract-system-prompts

Anthropic publishes the history of system prompts used on claude.ai and the mobile apps at https://platform.claude.com/docs/en/release-notes/system-prompts. That page is a single monolithic markdown document grouped by model, and each model lists one or more dated revisions.

#11:28 PM

prompt-engineering anthropic llm

Friday, October 31, 2025

A Look at ANTML: The Anthropic Markup Language - Kara's Nonsense

karashiiro.leaflet.pub/3m4gf7geefs2l

Fascinating prompt engineering and injection (harmless).

#5:53 PM

prompt-engineering anthropic code claude