#llm/skills

Public notes from activescott tagged with #llm/skills

Tuesday, June 9, 2026

rtk-ai/rtk: CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

rtk filters and compresses command outputs before they reach your LLM context. Single Rust binary, 100+ supported commands, <10ms overhead.
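As a rough illustration of what "filter and compress before it reaches the context" can mean, here is a minimal sketch of generic output filtering. It is not rtk's implementation: the function name `compress_output` and its three rules (drop blank lines, collapse consecutive duplicates, cap the line count) are assumptions made up for this note.

```python
def compress_output(text: str, max_lines: int = 20) -> str:
    """Shrink command output before handing it to an LLM (hypothetical rules)."""
    kept: list[str] = []
    for line in text.splitlines():
        line = line.rstrip()
        if not line:
            continue  # blank lines carry no information
        if kept and kept[-1] == line:
            continue  # collapse consecutive duplicate lines
        kept.append(line)
    if len(kept) > max_lines:
        omitted = len(kept) - max_lines
        # Keep the head and note how much was cut
        kept = kept[:max_lines] + [f"... ({omitted} more lines omitted)"]
    return "\n".join(kept)


raw = "Compiling foo\n\nwarning: unused\nwarning: unused\nFinished\n"
print(compress_output(raw))
```

A real tool would presumably apply command-specific rules rather than one generic pass; the sketch only shows the general shape of the idea.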

#9:28 PM

llm/skills prompt-engineering llm

JuliusBrussee/caveman: 🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman

github.com/JuliusBrussee/caveman

A Claude Code skill/plugin that makes agent talk like caveman — cuts ~75% of output tokens, keeps full technical accuracy. Brain still big. Mouth small.

git clone https://github.com/JuliusBrussee/caveman.git
cd caveman

node bin/install.js --only claude --minimal

#9:26 PM

llm/skills prompt-engineering llm

Monday, May 18, 2026

Leon-Drq/openagentskill: The open marketplace for AI agent skills. Discover, publish, and compose skills for AI agents.

github.com/Leon-Drq/openagentskill

"The only skill ranking based on real agent usage, not vanity metrics."

Problem: Finding quality skills is hard
Solution: Curated directory with 40+ verified skills, auto-indexed every 6 hours

Problem: GitHub stars don't reflect real usage
Solution: Agent Feedback Loop: real usage data from AI agents

Problem: No incentive for skill authors
Solution: Points system rewards authors for every successful call

Problem: Skills scattered across GitHub
Solution: One-stop marketplace with search, filters, and categories

#5:10 PM

llm/skills code llm

Thursday, April 23, 2026

[2602.12670] SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

arxiv.org/abs/2602.12670

Agent Skills are structured packages of procedural knowledge that augment LLM agents at inference time. Despite rapid adoption, there is no standard way to measure whether they actually help. We present SkillsBench, a benchmark of 86 tasks across 11 domains paired with curated Skills and deterministic verifiers. Each task is evaluated under three conditions: no Skills, curated Skills, and self-generated Skills. We test 7 agent-model configurations over 7,308 trajectories.

Curated Skills raise average pass rate by 16.2 percentage points (pp), but effects vary widely by domain (+4.5pp for Software Engineering to +51.9pp for Healthcare) and 16 of 84 tasks show negative deltas. Self-generated Skills provide no benefit on average, showing that models cannot reliably author the procedural knowledge they benefit from consuming. Focused Skills with 2-3 modules outperform comprehensive documentation, and smaller models with Skills can match larger models without them.
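The abstract reports gains in percentage points, which are absolute differences between pass rates, not relative improvements. A small sketch of the distinction follows; the function names and the example pass rates are hypothetical and are not figures from the paper.

```python
def pp_delta(baseline_rate: float, skill_rate: float) -> float:
    """Absolute change between two pass rates, in percentage points."""
    return round((skill_rate - baseline_rate) * 100, 1)


def relative_gain(baseline_rate: float, skill_rate: float) -> float:
    """Relative change, as a percent of the baseline pass rate."""
    return round((skill_rate - baseline_rate) / baseline_rate * 100, 1)


# Hypothetical pass rates chosen for illustration only
baseline, with_skills = 0.30, 0.462
print(pp_delta(baseline, with_skills))       # absolute change in pp
print(relative_gain(baseline, with_skills))  # relative change in percent
```

The same absolute gain in pp corresponds to a larger relative gain when the baseline is low, which is worth keeping in mind when comparing the per-domain deltas.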

#12:45 AM

benchmarks llm/skills code llm

Tuesday, March 10, 2026

Overview - Agent Skills

agentskills.io/home

The Agent Skills format was originally developed by Anthropic, released as an open standard, and has been adopted by a growing number of agent products. The standard is open to contributions from the broader ecosystem.

#5:06 PM

llm/skills anthropic code specification llm open-standard

Sunday, February 8, 2026

The Agent Skills Directory

skills.sh/

#8:09 AM

llm/skills agents marketing

Wednesday, February 4, 2026

Agent Skills Marketplace - Claude, Codex & ChatGPT Skills | SkillsMP

skillsmp.com/

#8:42 AM

llm/skills todo mcp agents marketing

Friday, January 16, 2026

obra/superpowers: An agentic skills framework & software development methodology that works.

github.com/obra/superpowers

It starts from the moment you fire up your coding agent. As soon as it sees that you're building something, it doesn't just jump into trying to write code. Instead, it steps back and asks you what you're really trying to do.

Once it's teased a spec out of the conversation, it shows it to you in chunks short enough to actually read and digest.

After you've signed off on the design, your agent puts together an implementation plan that's clear enough for an enthusiastic junior engineer with poor taste, no judgement, no project context, and an aversion to testing to follow. It emphasizes true red/green TDD, YAGNI (You Aren't Gonna Need It), and DRY.

#2:32 AM

llm/skills llm
