#llm

Public notes from activescott tagged with #llm

Tuesday, March 17, 2026

Introducing My Computer: When Manus Meets Your Desktop

The cloud sandbox has served Manus well. Inside an isolated, secure environment, it has everything an AI agent needs: networking, a command line, a file system, and a browser. This is the foundation of Manus's power as a general AI agent, always online and always ready to work. However, there has always been a fundamental limitation: your most important work happens on your own computer. Your project files, development environments, and essential applications all reside locally, not in the cloud. Today, we are closing that gap. Meet My Computer, the core capability of the new Manus Desktop application. It brings Manus out of the cloud and onto your computer, allowing it to work directly with your local files, tools, and applications.

Through the Manus Desktop app, Manus executes command line instructions (CLI) in your computer's terminal. This allows it to read, analyze, and edit local files, as well as launch and control your local applications.

Every terminal command requires your explicit approval before execution. You can choose "Always Allow" to streamline your workflow for trusted tasks, or "Allow Once" to review each operation individually.

My Computer also integrates with your personal Projects, Agents, and Scheduled Tasks. This allows you to create recurring local routines, such as tidying your Downloads folder every morning or generating a weekly summary report from your local data.

#4:52 PM

manus agent llm

Introducing My Computer: When Manus Meets Your Desktop

manus.im/blog/manus-my-computer-desktop

The cloud sandbox has served Manus well. Inside an isolated, secure environment, it has everything an AI agent needs: networking, a command line, a file system, and a browser. This is the foundation of Manus's power as a general AI agent, always online and always ready to work. However, there has always been a fundamental limitation: your most important work happens on your own computer. Your project files, development environments, and essential applications all reside locally, not in the cloud. Today, we are closing that gap. Meet My Computer, the core capability of the new Manus Desktop application. It brings Manus out of the cloud and onto your computer, allowing it to work directly with your local files, tools, and applications.

#4:52 PM

llm agent manus

Monday, March 16, 2026

Instagram

www.instagram.com/reel/DUlHxWolNwt/?l=1

Somebody realized that Claude would try unavailable NPX commands for packages that didn’t exist so he started publishing the packages so we could run. What do you want other machines.

#4:58 PM

llm security

Friday, March 13, 2026

WhatsApp MCP Exploited: Exfiltrating your message history via MCP

invariantlabs.ai/blog/whatsapp-mcp-exploited#experiment-2

We assume that the user is using an agentic system (e.g. Cursor or Claude Desktop) that is connected to a trusted WhatsApp MCP instance, allowing the agent to send, receive and check for new WhatsApp messages.

We further assume, that the attacker has the target's WhatsApp number, and can send them a message, that will show up as result to the list_chats tool call.

With this setup our attack circumvents the need for any attacker-controlled MCP server, and instead relies on tool outputs to compromise the agent.

We test this attack with Cursor and a whatsapp-mcp setup, and find that we can indeed exfiltrate the user's WhatsApp contacts, via a similar prompt as in Experiment 1.

#6:34 PM

prompt-injection-vulnerabilities prompt-injection llm

Clinejection — Compromising Cline's Production Releases just by Prompting an Issue Triager | Adnan Khan - Security Research

adnanthekhan.com/posts/clinejection/

#8:20 AM

llm security prompt-injection

karpathy/autoresearch: AI agents running research on single-GPU nanochat training automatically

github.com/karpathy/autoresearch

autotraining models with markdown

The idea: give an AI agent a small but real LLM training setup and let it experiment autonomously overnight. It modifies the code, trains for 5 minutes, checks if the result improved, keeps or discards, and repeats. You wake up in the morning to a log of experiments and (hopefully) a better model. The training code here is a simplified single-GPU implementation of nanochat. The core idea is that you're not touching any of the Python files like you normally would as a researcher. Instead, you are programming the program.md Markdown files that provide context to the AI agents and set up your autonomous research org. The default program.md in this repo is intentionally kept as a bare bones baseline, though it's obvious how one would iterate on it over time to find the "research org code" that achieves the fastest research progress, how you'd add more agents to the mix, etc. A bit more context on this project is here in this tweet.

#7:54 AM

ai karpathy autoresearch code llm llm/training

Wednesday, March 11, 2026

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt InjectionNot what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection - 2302.12173v2.pdf

arxiv.org/pdf/2302.12173

generally considered the foundational academic work on indirect prompt injection. It's been reproduced against virtually every major agentic system since.

#7:41 PM

prompt-injection security llm

Invitation Is All You Need! Promptware Attacks Against LLM-Powered Assistants in Production Are Practical and Dangerous

arxiv.org/html/2508.12175v1

Website: https://sites.google.com/view/invitation-is-all-you-need

The growing integration of LLMs into applications has introduced new security risks, notably known as Promptware—maliciously engineered prompts designed to manipulate LLMs to compromise the CIA triad of these applications. While prior research warned about a potential shift in the threat landscape for LLM-powered applications, the risk posed by Promptware is frequently perceived as low. In this paper, we investigate the risk Promptware poses to users of Gemini-powered assistants (web application, mobile application, and Google Assistant).

Our analysis focuses on a new variant of Promptware called Targeted Promptware Attacks, which leverage indirect prompt injection via common user interactions such as emails, calendar invitations, and shared documents. We demonstrate 14 attack scenarios applied against Gemini-powered assistants across five identified threat classes: Short-term Context Poisoning, Permanent Memory Poisoning, Tool Misuse, Automatic Agent Invocation, and Automatic App Invocation. These attacks highlight both digital and physical consequences, including spamming, phishing, disinformation campaigns, data exfiltration, unapproved user video streaming, and control of home automation devices

Over the course of our work, we deployed multiple layered defenses, including: enhanced user confirmations for sensitive actions; robust URL handling with sanitization and Trust Level Policies; and advanced prompt injection detection using content classifiers - Google

#7:07 PM

prompt-injection security llm

Indirect Prompt Injection Q1 2026 Rules | Gray Swan Arena | Gray Swan AI

app.grayswan.ai/arena

#6:48 PM

llm prompt-injection security

What is Arize Phoenix? - Phoenix

arize.com/docs/phoenix

Phoenix is an open-source AI observability platform designed for experimentation, evaluation, and troubleshooting. It provides:

Tracing - Trace your LLM application's runtime using OpenTelemetry-based instrumentation.
Evaluation - Leverage LLMs to benchmark your application's performance using response and retrieval evals.
Datasets - Create versioned datasets of examples for experimentation, evaluation, and fine-tuning.
Experiments - Track and evaluate changes to prompts, LLMs, and retrieval.
Playground- Optimize prompts, compare models, adjust parameters, and replay traced LLM calls.
Prompt Management- Manage and test prompt changes systematically using version control, tagging, and experimentation.

#3:03 AM

llm evaluations

Tuesday, March 10, 2026

Lobster - OpenClaw

docs.openclaw.ai/tools/lobster#workflow-files-lobster

Pattern: small CLI + JSON pipes + approvals Build tiny commands that speak JSON, then chain them into a single Lobster call. (Example command names below — swap in your own.)

Lobster can run YAML/JSON workflow files with name, args, steps, env, condition, and approval fields. In OpenClaw tool calls, set pipeline to the file path.

#6:46 PM

llm/workflow openclaw llm

Sitegeist - Your AI Companion for the Web

sitegeist.ai/

A chrome extension that automates your browser

#6:40 PM

llm browser

prose/skills/open-prose/compiler.md at main · openprose/prose

github.com/openprose/prose/blob/main/skills/open-prose/compiler.md

prose.md language reference

#5:18 PM

github code agents llm

openprose/prose

github.com/openprose/prose

prose.md

#5:17 PM

code agents llm github

OpenProse Cloud - Run .prose Programs

www.prose.md/

Why not just play in English? English is already an agent framework—we're structuring it, not replacing it. Plain English doesn't distinguish sequential from parallel, doesn't specify retry counts, doesn't scope variables. OpenProse uses English exactly where ambiguity is a feature (inside ...), and structure everywhere else. The fourth wall syntax lets you lean on AI judgment precisely when you want to.

How is this a VM? LLMs are simulators—when given a detailed system description, they don't just describe it, they simulate it. The prose.md spec describes a VM with enough fidelity that reading it induces simulation. But simulation with sufficient fidelity is implementation: each session spawns a real subagent, outputs are real artifacts, state persists in conversation history or files. The simulation is the execution.

#5:16 PM

code llm agents

Overview - Agent Skills

agentskills.io/home

The Agent Skills format was originally developed by Anthropic, released as an open standard, and has been adopted by a growing number of agent products. The standard is open to contributions from the broader ecosystem.

#5:06 PM

llm/skills code specification llm open-standard anthropic

Civic — The security layer for AI agents

www.civic.com/

An MCP Gateway for LLM‘s that applies policies to the actions and stores credential separately from the LLM

#1:08 AM

llm security

Saturday, February 28, 2026

PromptArmor

www.promptarmor.com/#banner

#6:27 AM

llm security prompt-injection

Claude Cowork Exfiltrates Files

www.promptarmor.com/resources/claude-cowork-exfiltrates-files

Two days ago, Anthropic released the Claude Cowork research preview (a general-purpose AI agent to help anyone with their day-to-day work). In this article, we demonstrate how attackers can exfiltrate user files from Cowork by exploiting an unremediated vulnerability in Claude’s coding environment, which now extends to Cowork. The vulnerability was first identified in Claude.ai chat before Cowork existed by Johann Rehberger, who disclosed the vulnerability — it was acknowledged but not remediated by Anthropic.

The victim connects Cowork to a local folder containing confidential real estate files

The victim uploads a file to Claude that contains a hidden prompt injection

The victim asks Cowork to analyze their files using the Real Estate ‘skill’ they uploaded

The injection manipulates Cowork to upload files to the attacker’s Anthropic account

At no point in this process is human approval required.

One of the key capabilities that Cowork was created for is the ability to interact with one's entire day-to-day work environment. This includes the browser and MCP servers, granting capabilities like sending texts, controlling one's Mac with AppleScripts, etc.

These functionalities make it increasingly likely that the model will process both sensitive and untrusted data sources (which the user does not review manually for injections), making prompt injection an ever-growing attack surface. We urge users to exercise caution when configuring Connectors. Though this article demonstrated an exploit without leveraging Connectors, we believe they represent a major risk surface likely to impact everyday users.

#6:22 AM

security llm promopt-injection claude anthropic

Agentic Browser Security: Indirect Prompt Injection in Perplexity Comet | Brave

brave.com/blog/comet-prompt-injection/

This kind of agentic browsing is incredibly powerful, but it also presents significant security and privacy challenges. As users grow comfortable with AI browsers and begin trusting them with sensitive data in logged in sessions—such as banking, healthcare, and other critical websites—the risks multiply. What if the model hallucinates and performs actions you didn’t request? Or worse, what if a benign-looking website or a comment left on a social media site could steal your login credentials or other sensitive data by adding invisible instructions for the AI assistant?

To compare our implementation with others, we examined several existing solutions, such as Nanobrowser and Perplexity’s Comet. While looking at Comet, we discovered vulnerabilities which we reported to Perplexity, and which underline the security challenges faced by agentic AI implementations in browsers. The attack demonstrates how easy it is to manipulate AI assistants into performing actions that were prevented by long-standing Web security techniques, and how users need new security and privacy protections in agentic browsers.

The vulnerability we’re discussing in this post lies in how Comet processes webpage content: when users ask it to “Summarize this webpage,” Comet feeds a part of the webpage directly to its LLM without distinguishing between the user’s instructions and untrusted content from the webpage. This allows attackers to embed indirect prompt injection payloads that the AI will execute as commands. For instance, an attacker could gain access to a user’s emails from a prepared piece of text in a page in another tab.

Possible mitigations

The browser should distinguish between user instructions and website content

The model’s outputs should be checked for user-alignment

Security and privacy sensitive actions should require user interaction

The browser should isolate agentic browsing from regular browsing

#6:12 AM

security llm promopt-injection