Point your OpenAI SDK at one endpoint. CodeBurst sends each request to the best available model across NVIDIA, Mistral, Google, Groq, Cohere and more — and fails over the instant a provider degrades. Your app never sees the outage.
base_url = "https://codeburst.ai/api/v1"
Most gateways just proxy your request to one provider. When that provider rate-limits or goes down, you get the error. CodeBurst is built around the assumption that providers fail — and routes around it for you.
Every model is a fallback chain, not a single endpoint. The instant a provider rate-limits or degrades, CodeBurst reroutes mid-request to the next healthy lane. Nightly health probes keep the chains fresh, so a dead provider is dropped before it ever reaches you.
Don't pick a model — pick a job. codeburst-agent for tool-calling, codeburst-vision for images, codeburst-compress for context, codeburst-swarm for hard reasoning. Each alias routes to the model that's measurably best at that task.
Standard /v1/chat/completions. Change your base_url and key — keep your existing OpenAI SDK, your code, your tooling. Swap models by changing one string, never your integration.
Bring your own provider keys and pay the provider directly — CodeBurst adds zero per-token markup. Or hand us the keys and let managed routing pick the best model per request. Your bill, your control.
NVIDIA NIM, Mistral, Google Gemini, Groq, Cohere, SambaNova, Cerebras, Cloudflare and more — behind one endpoint. New providers slot into the routing layer without a single change to your app.
Beyond failover: multi-model voting, debate, and size-aware context routing ship as first-class aliases. Reach for them with a model name — no orchestration code on your side.
If your app already speaks the OpenAI API, you're done in under a minute.
from openai import OpenAI client = OpenAI( base_url="https://codeburst.ai/api/v1", # ← the only change api_key="YOUR_CODEBURST_KEY", ) resp = client.chat.completions.create( model="codeburst-best", # or codeburst-agent / codeburst-vision / codeburst-swarm messages=[{"role": "user", "content": "Hello"}], ) print(resp.choices[0].message.content)
CodeBurst speaks the exact same OpenAI-compatible interface — so migrating is a base_url change. What you gain is routing that actively works around provider failures instead of passing them through.
Call any model by name, or use a smart alias and let CodeBurst route to the best one for the job — with failover already wired in.
codeburst-bestTop-quality reasoning — the strongest model currently upcodeburst-agentTool-calling & multi-step agents, with tool-format repair built incodeburst-visionImages — receipts, screenshots, photos (gpt-4o → gemini → scout)codeburst-swarmHard problems — multiple models vote for a more reliable answercodeburst-compressSize-aware context compaction — never overflows a model's windowcodeburst-fastLow-latency lane for quick, cheap turnscodeburst-1m / 2mMillion-token+ context for long documentscodeburst-128k / 256kPick a context tier explicitlyEach request lands on whichever of these is healthiest and best-fit for the job.
One OpenAI-compatible endpoint that routes to the best model and survives provider outages on its own.