What reviewing 1,174 Claude tools actually looks like.
The Claude Observatory tracks 1,853 tools — skills, MCP servers, hooks, patterns, and workflows. 1,174 of them have reached a verdict. This page is the ledger: every number below is queried from the review database the moment the site builds, so it can't go stale and it can't be massaged.
- Approved: Published with a grade, a review depth, and caveats where they're due.
- Rejected: Didn't clear the bar. They stay in the database; the no's are part of the record.
- Security hold: Frozen pending a security question we couldn't resolve. Not listed until it is.
Every tool, every status
679 tools are still in the pipeline — evaluating or staged for triage. They don't get listed until they get a verdict.
- Approved · 879 · 47.4%
- Rejected · 290 · 15.7%
- Security hold · 5 · 0.3%
- Evaluating · 677 · 36.5%
- Staging · 2 · 0.1%
Of the 1,174 tools that reached a verdict, 25.1% didn't make it (295 tools: 290 rejected plus 5 on security hold), roughly one in four. A catalog that approves everything isn't a review; it's a directory.
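The percentages in this section follow directly from the published status counts. As a quick check, here is a minimal Python sketch that recomputes them. The names are illustrative and this is not the site's actual build code; it also assumes that security holds count as verdicts alongside approvals and rejections, which is what the 1,174 total implies.

```python
# Status counts as published in the list above.
STATUS_COUNTS = {
    "Approved": 879,
    "Rejected": 290,
    "Security hold": 5,
    "Evaluating": 677,
    "Staging": 2,
}

# Assumption: these three statuses are the ones that count as a verdict,
# inferred from the 1,174 figure rather than stated as a rule in the report.
VERDICT_STATUSES = ("Approved", "Rejected", "Security hold")


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


total = sum(STATUS_COUNTS.values())
verdicts = sum(STATUS_COUNTS[s] for s in VERDICT_STATUSES)
not_approved = verdicts - STATUS_COUNTS["Approved"]

# Each status as a share of every tracked tool.
shares = {status: share(n, total) for status, n in STATUS_COUNTS.items()}

# Tools with a verdict that were not approved, as a share of all verdicts.
non_approval_rate = share(not_approved, verdicts)

if __name__ == "__main__":
    for status, pct in shares.items():
        print(f"{status}: {STATUS_COUNTS[status]} ({pct}%)")
    print(f"Reached a verdict: {verdicts} of {total}")
    print(f"Did not make it: {not_approved} ({non_approval_rate}%)")
```

Note the two different denominators: the status list divides by all tracked tools, while the "didn't make it" figure divides only by tools that reached a verdict.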
Approval is not endorsement
Each approved tool carries a letter grade from its latest evaluation — a weighted score across client readiness, breadth of use, reliability, and security.
Grade C is the biggest bucket at 67.6% of approvals. Only 25 tools have earned an A. That's the honest shape of this ecosystem right now: mostly usable, rarely exceptional.
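The report names the four dimensions behind each grade but does not publish their weights or the letter cutoffs. Purely to illustrate how a weighted score can map to a letter, here is a sketch in Python. The equal weights, the 0-100 scale, the cutoffs, and the existence of grades below C are all hypothetical placeholders, not the Observatory's actual rubric.

```python
# HYPOTHETICAL weights: the report lists these four dimensions but not
# how much each one counts. Equal weights are used only for illustration.
WEIGHTS = {
    "client_readiness": 0.25,
    "breadth_of_use": 0.25,
    "reliability": 0.25,
    "security": 0.25,
}

# HYPOTHETICAL cutoffs on a 0-100 scale, listed from highest to lowest.
CUTOFFS = [(90, "A"), (80, "B"), (70, "C"), (60, "D")]


def weighted_score(scores: dict[str, float]) -> float:
    """Combine per-dimension scores (0-100) into one weighted score."""
    missing = WEIGHTS.keys() - scores.keys()
    if missing:
        raise ValueError(f"missing dimensions: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


def letter_grade(score: float) -> str:
    """Map a weighted score to the first cutoff it meets, else 'F'."""
    for floor, letter in CUTOFFS:
        if score >= floor:
            return letter
    return "F"


if __name__ == "__main__":
    example = {
        "client_readiness": 80,
        "breadth_of_use": 70,
        "reliability": 75,
        "security": 75,
    }
    score = weighted_score(example)
    print(score, letter_grade(score))
```

Under these made-up numbers, a tool that is solid but unremarkable on every dimension lands in the middle band, which is the pattern the grade distribution describes.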
How closely each tool was examined
Not every review is the same review, and pretending otherwise would be dishonest. Every listing on the site discloses its depth.
Only 13 tools have been personally tested end to end; 76.7% are scanned. Hands-on testing is the scarcest resource in this catalog — which is exactly why we label it.
Where the catalog leans
Approved tools across the eight domains we track. The skew is real, so we show it.
Code & development alone is 44.5% of everything approved. The Claude tool ecosystem still builds mostly for developers — a gap worth knowing about if you're shopping for anything else.
What the factual record shows
Alongside reviews, the pipeline collects factual signals from GitHub and npm for 866 of the 879 approved tools. Three that matter:
728 approved tools have an identifiable license. The rest — you're deploying on trust.
724 approved tools show recent maintainer activity as of the latest scan.
Across all approved tools' registry records at last scan. Absence of a CVE is not a security guarantee — see review depth above.
The paper trail
Trust-signal scans for approved tools run continuously: the oldest current scan dates to April 6, 2026, the newest to July 3, 2026. Tools that fail on re-review get downgraded or moved to the not-recommended list — the grade you see is the latest, not the best.
How to cite this
This page is built to be referenced. Cite it as:
Matthews, A. "The Dataset Report: What reviewing 1,174 Claude tools actually looks like." Value Alignment Consulting, July 3, 2026. https://valuealignmentconsulting.com/dataset-report
Numbers regenerate from the live review database on every site build, so figures on this page move as the catalog grows. The methodology is documented on the evaluation guide and about pages.