Remix.run Logo
Skills are quietly becoming the unit of agent knowledge
9 points by latand6 2 days ago | 9 comments

In the last few months agent skills went from a niche Claude Code feature to something every major runtime supports. Anthropic has an official skills repo. OpenAI shipped skills in Codex with a built-in skill-creator. Karpathy talks about "everything is skill issue" and describes writing skills as curricula for agents [1]. The format is converging: a folder with a SKILL.md, optional scripts, optional reference files.

What changed is that the models got good enough to follow written instructions reliably. A skill is just a tested workflow in markdown that the agent reads and follows instead of improvising. You can also bundle scripts that the agent runs during the workflow, which covers what most people use lightweight MCP servers for, except the agent can read the script source and extend it.

Karpathy talks about an "economy of agents" and says we should stop writing HTML docs for humans and start writing markdown docs for agents [1]. Anthropic just shipped a skill-creator that benchmarks whether a skill still works after model updates. There are already tens of thousands of community skills on GitHub.

Distribution still feels early. Most useful skills are tiny. A markdown file, maybe one script. Useful enough to keep reusing, but not something anyone turns into a proper GitHub repo with a README and install instructions. So they stay on one machine.

I have been writing skills for my own agents for a while and I keep running into this. The format works. Moving them between machines or handing one to someone else does not.

Curious if others are hitting the same wall or if there are approaches I am missing.

[1] https://www.youtube.com/watch?v=kwSVtQ7dziU (Karpathy on No Briars podcast, skills discussion around 1:03:40)

dmppch an hour ago | parent | next [-]

The distribution problem is harder than it looks because it's actually a composition problem in disguise. A single skill is trivially shareable — zip it, gist it, whatever.

But in practice you end up with skills that depend on other skills, or a skill that assumes specific instructions are already loaded, conflicting skills, versioning and supply chain issues - and suddenly you need dependency resolution.

I've built a package-manager approach for this (APM - github.com/microsoft/apm) and the thing that surprised me most was how quickly even small teams end up with config sprawl - and how much a manifest that travels with the project helps.

The "too small for a repo" thing is real, but one pattern that works is having a monorepo per dev team or org with all skills and building jointly over there.

latand6 3 minutes ago | parent [-]

Yeah, I've built my own skill-package manager as well btw! Then it clicked and I hyperfocused for a whole week and vibecoded a skill marketplace haha. Because the selling of a skill seems to sound like a new idea. going to write a show HN about that maybe

MarcelinoGMX3C 3 hours ago | parent | prev | next [-]

Thinking about what manzanarama mentioned regarding checking skills into a repo, that's exactly how I look at it. To me, these skills are just specialized configuration. We've been versioning configuration for decades because it's critical to reproducibility and maintainability.

The "cognitive load" problem latand6 raised for reviewing every skill is real. That's where you integrate security and quality gates – treat these skills like any other software artifact. You wouldn't manually review every line of every dependency, so automate the validation here too.

SirensOfTitan 11 hours ago | parent | prev | next [-]

Most of this discourse feels like some kind of religious ritual built on the foundation of authority bias. Where is the evidence that skills improves performance over any other methodology outside of the fact of its nascent popularity?

I do agree with Jacques Ellul in Technological Society that technique precedes science, and that's certainly the case with LLMs; however, this whole industry waves off rigorous validation in favor of personal anecdotes ("it feels more productive to me!" "they didn't study after Opus 4.5 was released").

rbitr 16 hours ago | parent | prev | next [-]

Will skills replace MCP servers eventually?

brazukadev 3 hours ago | parent [-]

How would it replace something that is not used widespread yet? I think both skills and MCP already have similar levels of adoption.

manzanarama a day ago | parent | prev | next [-]

What do you think about checking the skills directly into the repo where they are useful?

latand6 a day ago | parent [-]

Yeah, that's the default way to approach it, but the cognitive load is still a problem. Manually reviewing every single skill I might need just for one task is tedious. GitHub stars won't give you any useful signal. I actually started building a product that solves this.

moomoo11 15 hours ago | parent | prev [-]

How are skills different from Having the agent create todo-the-ticket.md (with me) that contain the work scope and steps.

That’s what I do for each ticket.