Changelog
What’s new in Gateway-LLM
Curated release notes. For the unfiltered git log, see the GitHub releases.
- v1.4.02026-04-22feature
Free-first routing mode + simulated multi-provider in the live demo
- New `free-first` router mode: simple → free Llama, medium → gpt-4o-mini, complex → gpt-4o.
- Live router on the homepage now routes across 8 models (5 OpenAI + 3 simulated).
- Routed-model badges in the live router (Free / Cheap / Fast / Smart / Flagship / Reliable).
- v1.3.02026-03-31improvement
Faster classifier, tighter cache defaults
- Classifier hot path rewritten — sub-microsecond decisions even on long prompts.
- Default semantic cache threshold raised from 0.91 to 0.93 (fewer false positives in production).
- New Prometheus series `gatewayllm_router_failover_total{from,to}`.
- v1.2.02026-02-28feature
Anthropic + Gemini deployment translation upgrades
- Anthropic tool-call translation now handles deeply nested schemas correctly.
- Gemini streaming chunk format converted to OpenAI shape on the server (no client changes).
- New provider type `openai_compatible` for fine-tuned / self-hosted endpoints.
- v1.1.02026-01-15feature
Virtual API keys: budgets, scoped routers, signed receipts
- `monthly_budget_usd` per key with hard 429 ceiling.
- Per-key router strategy override (`router.force_strategy`).
- Ed25519-signed audit receipts now ship in the OSS binary, not just the Custom plan.
- v1.0.02025-12-01feature
1.0 — public release
- Apache 2.0 licensed.
- OpenAI-compatible HTTP API across 8 providers.
- Smart routing with prompt-complexity classifier.
- Virtual API keys, RPM/TPM limits, exact + semantic cache.
- Prometheus, OpenTelemetry, Datadog, Langfuse exporters.
Want the next release in your inbox?
Email mitshawtechnologies@gmail.com — we’ll add you to the announce list. One email per release. No marketing.
Read the blog →