Kombai gives you two ways to power a task:
  • Model Routers: Kombai’s internal routers that automatically pick the ideal model for your task within a chosen cost tier.
  • Models: Direct access to individual frontier models from providers like OpenAI, Anthropic, Google, xAI, Moonshot, and Qwen.
Each model and router has a credit multiplier that is applied to the base credit usage of a task. You can switch between routers and models at any point within the same chat.
The credit multiplier is relative to the baseline (1x). For example, a task that consumes 10 credits on a 1x model would consume approximately 5 credits on a 0.5x model and 25 credits on a 2.5x model.
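The multiplier arithmetic above can be sketched in a few lines of Python. This is only an illustration: `estimate_credits` is a hypothetical helper, not part of Kombai, and because the figures above are described as approximate, actual usage may differ from the estimate.

```python
def estimate_credits(base_credits: float, multiplier: float) -> float:
    """Scale a task's base (1x) credit usage by a credit multiplier."""
    if base_credits < 0 or multiplier < 0:
        raise ValueError("credits and multiplier must be non-negative")
    # Round to two decimals to hide floating-point noise (e.g. 10 * 0.33).
    return round(base_credits * multiplier, 2)


base = 10  # credits the task would consume on a 1x model
print(estimate_credits(base, 0.5))  # 5.0
print(estimate_credits(base, 2.5))  # 25.0
```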

Model Routers

Model Routers are powered by a continuously optimized and benchmarked model stack. Kombai benchmarks the latest LLMs from top providers across a wide range of frontend tasks, from interpreting complex UI logic to multi-file refactors, and the internal router automatically selects the best model for the task within the cost tier you choose. This gives you granular control over cost while ensuring you consistently get the best performance per dollar, even as newer and better models emerge, and you don’t have to test or benchmark them yourself.
| Router | Credits | Best for |
| --- | --- | --- |
| Kombai-Auto | Auto | Automatically picks the best model for the current task |
| Kombai-Ultra | 2x | Best for hard tasks and deep reasoning. Uses Opus 4.8 |
| Kombai-High | 1x | Excels in complex problem-solving and reasoning |
| Kombai-Medium | 0.5x | Balances cost-efficiency with quality output |
| Kombai-Mini | 0.33x | Good for high volume, low-complexity tasks |
Kombai-Auto is the recommended default. It analyzes each task and routes it to the optimal model automatically, balancing quality and cost for you.
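To compare the fixed-cost tiers for a given task, the router multipliers can be placed in a small lookup. This is a minimal sketch, not Kombai's routing logic: the multipliers are copied from the router table, while `routers_within_budget` and the idea of a per-task credit budget are assumptions made for illustration.

```python
# Multipliers copied from the router table. Kombai-Auto is omitted because
# its cost is listed as "Auto" rather than as a fixed multiplier.
ROUTER_MULTIPLIERS = {
    "Kombai-Ultra": 2.0,
    "Kombai-High": 1.0,
    "Kombai-Medium": 0.5,
    "Kombai-Mini": 0.33,
}


def routers_within_budget(base_credits: float, budget: float) -> list[str]:
    """Return the fixed-tier routers whose estimated cost fits the budget.

    Results are ordered from the most expensive tier to the cheapest.
    """
    affordable = [
        (name, base_credits * multiplier)
        for name, multiplier in ROUTER_MULTIPLIERS.items()
        if base_credits * multiplier <= budget
    ]
    affordable.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in affordable]


# A task that costs 10 credits at 1x, with at most 6 credits to spend.
print(routers_within_budget(base_credits=10, budget=6))
# ['Kombai-Medium', 'Kombai-Mini']
```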

Models

If you prefer to run a specific model, choose one directly. Models are grouped by provider, and each shows its credit multiplier.

OpenAI

| Model | Credits | Description |
| --- | --- | --- |
| GPT 5.4 | 1x | OpenAI’s top tier model. |
| GPT 5.4 Mini | 0.5x | OpenAI’s mid tier model. |
| GPT 5.4 Nano | 0.33x | OpenAI’s low tier model. |
| GPT 5.5 | 2.5x | OpenAI’s flagship model. |

Anthropic

| Model | Credits | Description |
| --- | --- | --- |
| Claude Fable 5 | 2x (standard 4x) | Anthropic’s most capable widely-released (Mythos-class) model. |
| Claude Opus 4.8 | 2.5x | Creates the most sophisticated UIs. Anthropic’s flagship model. |
| Claude Opus 4.7 | 2.5x | Creates the most sophisticated UIs. Anthropic’s previous flagship model. |
| Claude Sonnet 4.6 | 1x | Excels at long horizon frontend tasks. |
| Claude Haiku 4.5 | 0.5x | Anthropic’s fast model. Good for quick tasks but creates lazy UIs. |
| Claude Opus 4.6 | 2x | Excels at all tasks. Earlier version of Opus 4.7 but more cost efficient. |
Limited-time launch offer: Claude Fable 5 runs at 2x credits, 50% off its standard 4x rate.

Gemini

| Model | Credits | Description |
| --- | --- | --- |
| Gemini 3.5 Flash | 0.5x | Google’s latest model. Creates beautiful UIs with a slight compromise on intelligence. |
| Gemini 3.1 Pro | 1x | Google’s flagship model. Most creative at UI tasks, at the cost of some instruction-following issues. |
| Gemini 3 Flash | 0.33x | Creates the best UIs at this model size. |
| Gemini 3.1 Flash Lite | 0.1x | Google’s fast and cheap model. |

xAI

| Model | Credits | Description |
| --- | --- | --- |
| Grok Build 0.1 | 0.33x | xAI’s latest early access model. |

Moonshot

| Model | Credits | Description |
| --- | --- | --- |
| Kimi K2.5 | 0.33x | Punches above its weight. Moonshot’s strong performing model. |
| Kimi K2.6 | 0.5x | Moonshot’s latest model, competes with the best. |

Qwen

| Model | Credits | Description |
| --- | --- | --- |
| Qwen 3.6 27B | 0.5x | Alibaba’s high-performance 27B model with 262K context. |

Thinking effort

Thinking effort controls how deeply a model reasons before it responds. Higher effort improves planning, edge-case handling, and self-correction for complex tasks, while lower effort returns faster, cheaper responses for simpler ones.
| Effort | When to use |
| --- | --- |
| None | No additional reasoning. Fastest responses for trivial changes. |
| Minimal | Very light reasoning for simple, well-defined tasks. |
| Low | Light reasoning that keeps responses quick. |
| Medium | Balanced reasoning suitable for most everyday tasks. |
| High | Deep reasoning for complex logic, refactors, and architectural work. |
| Extended | Maximum reasoning depth available on supported OpenAI models. |
| Max | Maximum reasoning depth available on supported Anthropic models. |
Not every model supports every level. When you open the thinking effort selector for a model, only the levels it supports are shown. Some models (such as Grok Build 0.1, Kimi K2.5, Kimi K2.6, and Qwen 3.6 27B) don’t expose a configurable thinking effort.

Supported thinking effort by model

| Model | Supported thinking efforts |
| --- | --- |
| Kombai-Auto, Kombai-Ultra, Kombai-High, Kombai-Medium, Kombai-Mini | Low, High |
| GPT 5.4 / 5.4 Mini / 5.4 Nano / 5.5 | None, Minimal, Low, Medium, High, Extended |
| Claude Fable 5 | Low, Medium, High, Extended, Max |
| Claude Opus 4.8 | None, Low, Medium, High, Extended, Max |
| Claude Opus 4.7 | None, Low, Medium, High, Extended, Max |
| Claude Sonnet 4.6 | None, Low, Medium, High, Max |
| Claude Haiku 4.5 | None, Minimal, Low, Medium, High |
| Claude Opus 4.6 | None, Low, Medium, High, Max |
| Gemini 3.5 Flash / 3.1 Pro / 3 Flash / 3.1 Flash Lite | Minimal, Low, Medium, High |
| Grok Build 0.1, Kimi K2.5, Kimi K2.6, Qwen 3.6 27B | Not configurable |
Higher thinking effort increases generation time and cost, but it does not consume your context window, allowing the agent to think deeply without losing track of large codebases.
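If you track model settings in your own tooling, the support matrix above can be expressed as a lookup. This is a sketch under stated assumptions: the levels are transcribed from the table, while `SUPPORTED_EFFORTS`, `supported_efforts`, and `is_supported` are hypothetical names and not a Kombai API. In the product itself, the effort selector already shows only the supported levels.

```python
# Effort levels per model, transcribed from the support table.
# "None" here is the effort level's name (a string), not Python's None.
_GROUPS = [
    (["Kombai-Auto", "Kombai-Ultra", "Kombai-High", "Kombai-Medium", "Kombai-Mini"],
     ["Low", "High"]),
    (["GPT 5.4", "GPT 5.4 Mini", "GPT 5.4 Nano", "GPT 5.5"],
     ["None", "Minimal", "Low", "Medium", "High", "Extended"]),
    (["Claude Fable 5"], ["Low", "Medium", "High", "Extended", "Max"]),
    (["Claude Opus 4.8", "Claude Opus 4.7"],
     ["None", "Low", "Medium", "High", "Extended", "Max"]),
    (["Claude Sonnet 4.6", "Claude Opus 4.6"],
     ["None", "Low", "Medium", "High", "Max"]),
    (["Claude Haiku 4.5"], ["None", "Minimal", "Low", "Medium", "High"]),
    (["Gemini 3.5 Flash", "Gemini 3.1 Pro", "Gemini 3 Flash", "Gemini 3.1 Flash Lite"],
     ["Minimal", "Low", "Medium", "High"]),
    # Models without a configurable thinking effort get an empty list.
    (["Grok Build 0.1", "Kimi K2.5", "Kimi K2.6", "Qwen 3.6 27B"], []),
]

SUPPORTED_EFFORTS = {
    model: list(efforts) for models, efforts in _GROUPS for model in models
}


def supported_efforts(model: str) -> list[str]:
    """Return the effort levels listed for a model (KeyError if unknown)."""
    return SUPPORTED_EFFORTS[model]


def is_supported(model: str, effort: str) -> bool:
    """Check whether a model lists the given effort level."""
    return effort in SUPPORTED_EFFORTS[model]


print(is_supported("Claude Sonnet 4.6", "Max"))       # True
print(is_supported("Claude Sonnet 4.6", "Extended"))  # False
print(supported_efforts("Kimi K2.6"))                 # []
```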

Select a model

You can switch models and routers directly from the chat input box:
  1. Click the model dropdown in the Kombai chat input box.
  2. Search for a model, or pick one from the Model Router or Models group (grouped by provider).
  3. To adjust reasoning depth, click the current thinking effort shown beside the brain icon on the model’s row, then choose a level from the list.
Your selection applies to all subsequent messages in the conversation until you change it.

Manage available models

You can control which routers and models appear in the chat dropdown from the Models tab of the Settings page.
  1. Click the Settings icon in Kombai’s top menubar to open the Settings page.
  2. Select the Models tab from the left sidebar.
  3. Use the toggle beside each entry under Model Routers and Models to show or hide it in the model dropdown.
Hiding a model only removes it from the dropdown; it does not affect any chat that is already using it.