Skill Creator
The meta-skill for building Claude skills. Create, test, A/B compare, and benchmark skills. Skills 2.0 features: structured evals, A/B testing, trigger-description optimizer, and retirement dates for skills that become obsolete as the base model improves.
- Build a custom automation recipe for your team's unique workflow
- Test and compare two versions of a skill to see which works better
- Benchmark your custom skill's accuracy with structured evaluations
Essential for anyone building custom Claude workflows. A/B testing lets you measure whether a skill actually improves output quality vs. baseline Claude.
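As a rough illustration of what an A/B comparison against baseline Claude measures, the sketch below summarizes paired scores for the same eval prompts run with and without a skill. The `compare_ab` function, the score scale, and the sample numbers are all hypothetical; this is not skill-creator's actual output format or scoring method.

```python
from statistics import mean

def compare_ab(baseline_scores, skill_scores):
    """Summarize a paired A/B comparison over the same eval prompts.

    Both lists are hypothetical grader scores, one entry per prompt,
    in the same prompt order.
    """
    if len(baseline_scores) != len(skill_scores) or not baseline_scores:
        raise ValueError("need two non-empty score lists of equal length")
    # Per-prompt improvement of the skill run over the baseline run.
    diffs = [s - b for b, s in zip(baseline_scores, skill_scores)]
    wins = sum(d > 0 for d in diffs)
    losses = sum(d < 0 for d in diffs)
    return {
        "n": len(diffs),
        "mean_delta": mean(diffs),
        "wins": wins,
        "losses": losses,
        "ties": len(diffs) - wins - losses,
    }

# Made-up scores (0 to 10) for five eval prompts, for illustration only.
baseline = [6, 7, 5, 8, 4]
with_skill = [8, 7, 9, 7, 6]
result = compare_ab(baseline, with_skill)
print(result)
```

A positive `mean_delta` with more wins than losses suggests the skill helps on these prompts; with only a handful of prompts, treat the result as a hint, not proof.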
Developers and teams building custom Claude Code skills. Also useful for optimizing existing skills.
https://github.com/anthropics/claude-plugins-official
By Anthropic
How to Get It
claude plugins install anthropic-skills/skill-creator
Tip: Paste this into a Claude Code conversation. Verify that the command matches your Claude Code version.
Trust Signals: Personally Tested
Data & Access
Reviewer notes
Personally Tested review. These are observations, not a security certification.
Very powerful for power users. Tier 2 — most clients won't need this themselves, but it's valuable for consultants building client-specific workflows.
Anthropic official. Creates files but doesn't execute arbitrary code.
2026-04-02: LIMITATION: Per Anthropic's guide, skills under 2,000 tokens perform best. skill-creator doesn't enforce this limit, so generated skills can creep past the threshold and degrade performance.
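Since the limit is not enforced, a manual size check on a generated skill may help. The sketch below is a minimal example that assumes roughly four characters per token, which is a crude heuristic for English text and not a real tokenizer; the function names and the sample text are hypothetical.

```python
TOKEN_BUDGET = 2000    # threshold cited in the reviewer note above
CHARS_PER_TOKEN = 4    # assumption: rough average for English text, not a real tokenizer

def estimate_tokens(text):
    """Rough token estimate: character count divided by 4, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)

def check_skill_size(text, budget=TOKEN_BUDGET):
    """Report whether the estimated token count stays under the budget."""
    estimated = estimate_tokens(text)
    return {"estimated_tokens": estimated, "within_budget": estimated < budget}

# Hypothetical skill text, for illustration only.
sample_skill = "Use this skill when the user asks for a weekly status report. " * 40
report = check_skill_size(sample_skill)
print(report)
```

To check a real skill, read the skill file's contents into a string and pass it to `check_skill_size`. Because the estimate is approximate, leave some margin below the threshold.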
Data shown here comes from public APIs and automated scanning. Reviewer notes reflect one person's experience. This is not a security certification or legal recommendation. Always evaluate tools according to your own organization's policies.