webclaw
Extracts structured data from web pages at scale without external dependencies or API calls.
Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust. CLI, REST API, and MCP server.
- Feed competitor pricing pages into LLM for market analysis
- Extract job listings from career sites for talent sourcing
- Harvest product reviews and ratings for sentiment analysis
Extracts structured data from web pages at scale without external dependencies or API calls. Runs locally, reducing latency and eliminating third-party service costs.
Teams building LLM pipelines that need to ingest and normalize web content reliably without relying on cloud extraction services.
https://github.com/0xMassi/webclaw
By 0xMassi
How to Get It
claude plugins install 0xMassi/webclaw
Tip: Paste this into a Claude Code conversation. Verify command matches your Claude Code version.
Trust Signals Auto-scanned
Community Pulse Active
Discussed on Hacker News, Reddit
- webclaw hit almost 400 GitHub stars in 8 days here's what it does and what's nex — Reddit · 19 pts
- We open-sourced the TLS fingerprinting stack behind webclaw — here's how browser — Reddit · 18 pts
- webclaw MCP server, 10 tools for web extraction, runs locally — Reddit · 15 pts
3 mentions across 1 sources
Reviewer notes
Auto-scanned review. These are observations, not a security certification.
Scored from trust signals (evidence-eval-v1): 1,285 GitHub stars; contributors unknown; last commit 0d ago; license AGPL-3.0.
Things to check
- License (AGPL-3.0) may restrict commercial use. Check with your legal team.
How to evaluate tools before deploying →
Data shown here comes from public APIs and automated scanning. Reviewer notes reflect one person's experience. This is not a security certification or legal recommendation. Always evaluate tools according to your own organization's policies.
Evaluation
Scored from trust signals (evidence-eval-v1): 1,285 GitHub stars; contributors unknown; last commit 0d ago; license AGPL-3.0.