# Fairvisor vs. LiteLLM

## The Situation
LiteLLM is an LLM proxy: it provides model routing, a unified API across providers, and provider abstraction.
Fairvisor is a drop-in LLM endpoint: change your base URL and API key format (no SDK code changes) and enforcement is applied to your traffic transparently. It applies rate limits, cost budgets, loop controls, and policy actions before requests reach upstream models.
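The drop-in claim can be illustrated with a plain HTTP request: only the base URL and key change, while the request body stays OpenAI-shaped. The endpoint URL and key format below are placeholder assumptions for illustration, not Fairvisor's documented values.

```python
import json
import urllib.request

# Hypothetical values -- substitute your real Fairvisor endpoint and key.
FAIRVISOR_BASE_URL = "https://fairvisor.example.com/v1"  # was: https://api.openai.com/v1
FAIRVISOR_API_KEY = "fv-example-key"                     # was: sk-...

def build_chat_request(prompt: str) -> urllib.request.Request:
    """Build an OpenAI-compatible chat request; only the base URL and key differ."""
    body = json.dumps({
        "model": "gpt-4o-mini",
        "messages": [{"role": "user", "content": prompt}],
    }).encode()
    return urllib.request.Request(
        f"{FAIRVISOR_BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {FAIRVISOR_API_KEY}",
            "Content-Type": "application/json",
        },
    )

req = build_chat_request("hello")
```

Everything except the two constants is unchanged from a direct-to-provider call, which is what makes the swap a configuration change rather than a code change.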
## Comparison
| Capability | Fairvisor | LiteLLM |
|---|---|---|
| Primary function | Drop-in LLM endpoint with enforcement (rate limits, budgets, protection) | LLM proxy (unified API, model routing) |
| Drop-in deployment | Yes — change base URL + API key format, no SDK changes | Yes as a proxy, but routing-focused, not enforcement-first |
| Runtime | Lua/OpenResty-based edge runtime | Python runtime |
| Rate limiting | Token bucket + cost-based + TPM/TPD | Basic rate limiting |
| Loop detection | Yes | No |
| Circuit breaker | Yes | No |
| Streaming enforcement | Mid-stream termination | Pass-through |
| Multi-model routing | No (not a proxy) | Yes (100+ models) |
| Cost tracking | Enforcement + attribution | Tracking only |
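The "token bucket + cost-based" row can be sketched in a few lines: each request spends its estimated cost (e.g. prompt tokens) rather than a flat count of one, so expensive requests drain the budget faster. This is a minimal illustration of the technique, not Fairvisor's actual implementation (which the table notes runs in Lua/OpenResty).

```python
import time

class CostAwareTokenBucket:
    """Token bucket where each request spends its estimated cost, not 1."""

    def __init__(self, capacity: float, refill_per_sec: float):
        self.capacity = capacity
        self.tokens = capacity          # start full
        self.refill_per_sec = refill_per_sec
        self.updated = time.monotonic()

    def allow(self, cost: float) -> bool:
        # Refill proportionally to elapsed time, capped at capacity.
        now = time.monotonic()
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.updated) * self.refill_per_sec)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False

bucket = CostAwareTokenBucket(capacity=100, refill_per_sec=10)
```

A 60-token request succeeds against a full 100-token bucket; a second one immediately after is rejected until the bucket refills.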
## When to Use Fairvisor
- You want enforcement without touching application code — drop-in endpoint, change base URL only.
- You need cost budgets, staged actions, and loop prevention at the edge.
- You need policy decisions at edge speed before provider calls.
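One way to picture the loop-prevention bullet above: hash each incoming request body and trip when the same hash repeats too often within a sliding window. The window size, threshold, and hashing scheme here are assumptions for illustration, not Fairvisor's actual algorithm.

```python
import hashlib
from collections import deque

class LoopDetector:
    """Flag a caller stuck in a loop: trip when the same request hash
    appears `threshold` times within the last `window` requests."""

    def __init__(self, window: int = 10, threshold: int = 3):
        self.recent = deque(maxlen=window)  # sliding window of request hashes
        self.threshold = threshold

    def check(self, request_body: str) -> bool:
        digest = hashlib.sha256(request_body.encode()).hexdigest()
        self.recent.append(digest)
        return self.recent.count(digest) >= self.threshold

detector = LoopDetector()
```

On the third identical request the detector trips, at which point an enforcement layer could reject or throttle the caller instead of forwarding the call upstream.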
## When to Use LiteLLM
- You need multi-model routing and unified LLM API abstraction.
- You need provider fallback and routing strategy features.
- You mainly need proxy functionality, not advanced enforcement.
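The fallback bullet above boils down to trying providers in priority order and moving on when one fails. This is a generic sketch of the idea, not LiteLLM's API; provider names and callables are placeholders.

```python
def route_with_fallback(prompt, providers):
    """Try (name, callable) providers in priority order; return the first
    success, or raise after every provider has failed."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:
            errors.append((name, exc))
    raise RuntimeError(f"all providers failed: {errors}")

def primary(prompt):
    raise TimeoutError("primary provider is down")

name, reply = route_with_fallback(
    "hi", [("primary", primary), ("backup", lambda p: p.upper())]
)
```

Here the primary provider raises, so the router silently falls through to the backup; a real proxy would add retries, latency-aware ordering, and per-provider health tracking on top of this loop.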
## Use Them Together
- Put Fairvisor in front as the enforcement layer.
- Keep LiteLLM for model routing and provider abstraction.
- Combined flow: Your App -> [Fairvisor Edge] -> [LiteLLM Proxy] -> OpenAI / Claude / Gemini