Your simple prompts are burning premium tokens
Every "write a test" or "fix this typo" burns gpt-5.2 credits. nadirclaw routes simple prompts to cheaper models automatically. Save on every call that doesn't need your most expensive model.
Two commands. Zero configuration.
Most prompts don't need your best model
Code formatting, basic questions, simple edits: they don't need gpt-5.2 at $1.75/$14 (input/output) per 1M tokens or opus-4.6 at $15/$75, but that's what you're paying for.
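The math is easy to check yourself. A minimal sketch using the input-token prices listed above; the 2,000-token prompt size is a hypothetical example, not a measured figure:

```python
# Input-token prices ($ per 1M tokens) as listed on this page.
INPUT_PRICE = {
    "gpt-5.2": 1.75,
    "opus-4.6": 15.00,
    "gpt-5-mini": 0.25,
    "haiku-4.5": 1.00,
}

def input_cost(model: str, tokens: int) -> float:
    """Dollar cost of the input side of one call."""
    return tokens / 1_000_000 * INPUT_PRICE[model]

# A 2,000-token "fix this typo" prompt (hypothetical size):
print(input_cost("gpt-5.2", 2000))     # 0.0035
print(input_cost("gpt-5-mini", 2000))  # 0.0005
```

Same prompt, same answer quality for a trivial task, 7x cheaper on the input side alone.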
You're blind to where money goes
API bills show totals, not breakdowns. You have no idea which prompts cost $0.001 and which cost $0.50.
Changing models breaks workflow
Switching between gpt-5.2 and gpt-5-mini in your editor kills momentum. So you just pay more.
How it works
Install once. Route forever.
Start the router
Run NadirClaw locally. It sits between your app and OpenAI's API. No cloud services, no signup, no tracking.
Point your tools to localhost
Change your base URL from api.openai.com to localhost:8000. Works with Claude Code, Cursor, Aider, or any OpenAI-compatible client.
Watch costs drop
NadirClaw classifies every prompt and routes it to the cheapest model that can handle it. Check the dashboard to see where you're actually spending.
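The routing idea can be sketched in a few lines. This is a toy heuristic for illustration only, not NadirClaw's actual classifier; the keyword list and length threshold are assumptions:

```python
# Toy sketch of complexity-based routing (NOT NadirClaw's real classifier):
# score the prompt, then pick the cheapest model that can handle it.
CHEAP, PREMIUM = "gpt-5-mini", "gpt-5.2"

def classify(prompt: str) -> str:
    """Toy heuristic: long prompts or 'hard' keywords count as complex."""
    hard_words = {"architecture", "prove", "refactor", "design", "debug"}
    words = prompt.lower().split()
    if len(words) > 200 or any(w in hard_words for w in words):
        return "complex"
    return "simple"

def route(prompt: str) -> str:
    """Return the cheapest model adequate for the prompt."""
    return PREMIUM if classify(prompt) == "complex" else CHEAP

print(route("fix this typo"))                        # gpt-5-mini
print(route("design a sharded cache architecture"))  # gpt-5.2
```

The real value is that this decision happens per request, automatically, instead of you hardcoding one model for everything.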
See exactly where your money goes
Full request logs, per-model costs, latency tracking. Know which features of your app are expensive before the bill arrives.
One line changes everything
Literally one URL swap. That's it.
Before:

```python
import openai

client = openai.OpenAI(
    base_url="https://api.openai.com/v1",
    api_key="sk-..."
)
```

After:

```python
import openai

client = openai.OpenAI(
    base_url="http://localhost:8000",
    api_key="sk-..."
)
```
Simple prompts now hit gpt-5-mini ($0.25/1M) or haiku-4.5 ($1/1M) instead of gpt-5.2 ($1.75/$14) or opus-4.6 ($15/$75). Run `nadirclaw report` to see your real breakdown.
Observability built in
Every request through nadirclaw is logged automatically. No SDK changes, no decorators, no instrumentation.
Cost per request
See exactly what each prompt costs. Break down spend by model, by task, by user. Find the $5 prompt hiding in your $200 bill.
Full request logs
Every prompt and response captured. Debug weird agent behavior by reading the actual conversation, not guessing.
Latency tracking
p50, p95, p99 per model. See which calls are slow. Spot timeouts before they become a problem.
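For reference, here is how those percentiles are typically computed from raw per-request latencies (nearest-rank method; the sample numbers below are made up):

```python
import math

def percentile(samples: list[float], p: float) -> float:
    """Nearest-rank percentile of a non-empty list of samples."""
    xs = sorted(samples)
    k = math.ceil(p / 100 * len(xs)) - 1
    return xs[max(k, 0)]

# Hypothetical per-request latencies (ms) for one model:
latencies_ms = [120, 95, 110, 480, 130, 105, 2100, 115, 125, 100]
for p in (50, 95, 99):
    print(f"p{p}: {percentile(latencies_ms, p)} ms")
# p50: 115 ms, p95: 2100 ms, p99: 2100 ms
```

Note how the p95/p99 expose the 2100 ms outlier that an average would smooth over; that is the call you want to find before it becomes a timeout.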
Error rates and retries
How often are calls failing? Which models have the highest error rates? Are you retrying intelligently or burning money?
Classification breakdown
See what percentage of your traffic is simple vs complex. Understand your actual usage patterns, not assumptions.
Zero instrumentation
Other tools require decorators, SDK wrappers, or OpenTelemetry setup. nadirclaw logs everything at the proxy layer. Point your app at it and you're done.
How we compare
Verified against each product's public docs and pricing pages.
| Feature | nadirclaw | OpenRouter | Helicone | Portkey | vLLM Router | Langfuse | LangSmith |
|---|---|---|---|---|---|---|---|
| Auto Classification | ✓ Complexity classifier | — | — | — | — | — | — |
| Cost-Based Routing | ✓ Automatic | Manual model selection | Fallbacks only | ✓ Rule-based | Load balancing only | — | — |
| Observability | ✓ Built-in | Basic usage stats | ✓ Full suite | ✓ Full suite | — | ✓ Full suite | ✓ Full suite |
| Open Source | ✓ MIT | — | ✓ Apache 2.0 | Gateway only | ✓ Apache 2.0 | ✓ MIT | — |
| Pricing | Free | Per-token markup | Free 10K req, Pro $79/mo | Free 10K req, Pro undisclosed | Free | Free hobby, Pro $59/mo | Free 5K traces, Plus $39/seat |
nadirclaw's differentiator: automatic prompt complexity classification. Other routers need manual rules. We analyze the prompt and route it for you.
Get started now
Two commands. Seriously.