Engineering notes and product updates
Notes from the team building Kunavo's AI API gateway — model picks, pricing decisions, Stripe and SDK plumbing.
- Review·May 24, 2026·7 min read
Image models shootout — Nano Banana Pro vs Flux 2 Pro vs Seedream V4
Three top 1K image models, 60 prompts, honest verdicts. Best text rendering, best photo realism, best illustration — plus per-image cost on Kunavo. Pick the right one for the right use case.
- Engineering·May 24, 2026·9 min read
Engineering a 99.95% SLO for an AI API gateway — failover, watchdogs, and the boring stuff
How we hit 99.95% monthly uptime across 12 upstreams: weighted candidate routing with real-time weight decay, an aggressive 50ms first-byte watchdog, transactional balance reservations, and the operational scaffolding that matters more than the dispatcher.
- Guide·May 24, 2026·8 min read
Anthropic prompt caching — cut 90% off your input bill in 30 minutes
The full picture: how cache_control works, OpenAI-style automatic caching, the 4-breakpoint pattern for agent loops, what silently breaks caching, and how to verify your hit rate is non-zero. Includes Kunavo cache rates per model.
- Guide·May 23, 2026·6 min read
Migrating from OpenAI to Kunavo in 10 minutes — Python, Node, LangChain, Vercel AI SDK
Four flavors of OpenAI integration, the one-line change each needs to start running through Kunavo, and a smoke-test that costs less than a cent.
- Playbook·May 23, 2026·8 min read
AI API cost optimization — five techniques that actually cut the bill
Prompt caching, model tiering, output caps, parallelism, retry hygiene — with runnable code for each and realistic per-technique savings ranges. Stack them and you cut 70–80%.
- Guide·May 23, 2026·6 min read
Calling Claude with the OpenAI SDK — change one line, keep your codebase
Anthropic's SDK is great, but the ecosystem standardized on OpenAI's. Here's how to call Claude Opus 4.7, Sonnet 4.6 and Haiku 4.5 with the unmodified OpenAI Python and Node SDKs — streaming, tool use, vision included.
- Guide·May 23, 2026·7 min read
Veo 3 and Sora API quickstart — text-to-video and image-to-video in five minutes
First Veo 3 and Sora video generation calls through an OpenAI-style API: text-to-video, image-to-video with first/last frame control, file uploads, and a production-ready Python and Node example. No waitlist, no per-provider billing.
- Announcement·May 23, 2026·4 min read
Launching Kunavo — one OpenAI-compatible API for every frontier model, 30% under official price
Why we built a Stripe-native AI API gateway: lower than upstream pricing, one bill across Claude, Gemini, GPT-Image, Veo, and a $2 free credit to get you started.