webclaw
Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.
- Feed competitor pricing pages into an LLM for market analysis
- Extract job listings from career sites for talent sourcing
- Harvest product reviews and ratings for sentiment analysis
Extracts structured data from web pages at scale without external dependencies or API calls. Runs locally, reducing latency and eliminating third-party service costs.
For teams building LLM pipelines that need to ingest and normalize web content reliably without relying on cloud extraction services.
https://github.com/0xMassi/webclaw
By 0xMassi
How to Get It
claude plugins install 0xMassi/webclaw
Tip: Paste this into a Claude Code conversation. Verify that the command matches your Claude Code version.
Trust Signals Auto-scanned
Community Pulse Active
Discussed on Hacker News, Reddit
- I open-sourced a web scraper in Rust that hit 120 stars in 4 days, no browser, T — Reddit · 136 pts
- Show HN: Sukhoi – A flexible and extensible Webcrawler in Python — Hacker News · 131 pts
- Building blocks of a scalable webcrawler. — Hacker News · 80 pts
25 mentions across 2 sources
Reviewer notes
Auto-scanned review. These are observations, not a security certification.
Scored from trust signals (evidence-eval-v1): 1,285 GitHub stars; contributors unknown; last commit 0d ago; license AGPL-3.0.
Things to check
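- License (AGPL-3.0) is a strong copyleft license. It permits commercial use, but modified versions, including those offered to users over a network, must make their source available under the same license. Check with your legal team.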
Data shown here comes from public APIs and automated scanning. Reviewer notes reflect one person's experience. This is not a security certification or legal recommendation. Always evaluate tools according to your own organization's policies.