BrowseFull catalogOutcomesSolve a specific problemRolesStack by teamTrustFilter by risk tier
← Back to Claude Tool Reviews

bifrost

Skill Development Recommended

Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.

3,724 starsApache-2.0 (commercial OK)FreeQuick setup

Enterprise AI gateway latency and reliability are make-or-break for production deployments — Bifrost delivers sub-100 microsecond overhead at 5k RPS with adaptive load balancing, making it 50x faster than LiteLLM for high-throughput environments.

Platform engineering teams running multi-model AI infrastructure at scale who need a high-performance gateway with guardrails and failover across 1000+ models.

Claude Code Claude Cowork Claude Chat

https://github.com/maximhq/bifrost

By maximhq

How to Get It

Option 1: Claude Desktop App (Code Mode)Click the + button next to the prompt box → PluginsAdd plugin. Search and click Install. Skills work in Claude Code only.
Option 2: Paste into Claude CodeCopy the command below and paste it into your conversation. Claude will install it.
Command
claude plugins install maximhq/bifrost

Tip: Paste this into a Claude Code conversation. Verify command matches your Claude Code version.

CostFree

Trust Signals Reviewed

Stars3,724Contributors77Last updated2026-04-13LicenseApache-2.0 (OK for commercial use)Known CVEsNone foundSources: GitHub Advisory Database + OSV.dev · Scanned 2026-04-13 · scanner v1

Community Pulse Active

Discussed on Hacker News, Reddit

67 mentions across 2 sources

Reviewer notes

Reviewed review. These are observations, not a security certification.

Scored from trust signals (evidence-eval-v1): 3,724 GitHub stars; 77 contributors; last commit 37d ago; license Apache-2.0.

2026-05-10: Bifrost is worth evaluating if you're running multi-model inference at genuine scale and LiteLLM's latency is showing up in your P99s — the sub-100µs overhead claim holds up in high-concurrency scenarios where gateway tax actually matters. The Go-based architecture and adaptive load balancer give you real failover across providers without the Python overhead that plagues LiteLLM at volume. Tradeoff: you're adding another service to operate, and if your traffic is under a few hundred RPS, the performance delta won't justify the ops burden — stick with LiteLLM or a managed gateway instead.

How to evaluate tools before deploying →

Data shown here comes from public APIs and automated scanning. Reviewer notes reflect one person's experience. This is not a security certification or legal recommendation. Always evaluate tools according to your own organization's policies.

Evaluation

Ease of Use
4/5
Versatility
5/5
Reliability
5/5
Security
4/5
Overall score4.50 / 5.00 RecommendedEvaluatedMay 2026
Scored from trust signals (evidence-eval-v1): 3,724 GitHub stars; 77 contributors; last commit 37d ago; license Apache-2.0.

← Back to Claude Tool Reviews

Rolling Claude out in your org? Let's talk.

Start a conversation →