AI Providers

Model Selection & Token Modes

Choose the right model for DevOps work, switch it in-session, and control cost with token modes.

Last updated: June 21, 2026

Start With The Provider Default

For most users, the best first step is simple:

choose a provider
keep its default model
work with it for a while
only then optimize for price, speed, or reasoning depth

Current provider defaults are:

OpenRouter: openai/gpt-oss-120b:free
OpenAI: gpt-4o
Anthropic: claude-sonnet-4-20250514

Change The Model In The TUI

Use:

text

/model

This opens the interactive selector for provider and model choice during a live session.

Set A Default Model In Config

yaml

ai:
  provider: openai
  model: gpt-4o

If you omit model, YeePilot uses the provider default.

Override A Model For One Run

bash

yeepilot exec --provider openai --model gpt-4o "summarize changed files"

This is ideal when you want a stronger or cheaper model for one task without changing your day-to-day default.

Token Modes

YeePilot supports two main token-use styles:

normal
saver

Switch during a session:

text

/tokenmode normal
/tokenmode saver

Set it in config:

yaml

ai:
  token_mode: saver

What `normal` is for

richer context retention
fuller answers
the best default for everyday use

What `saver` is for

long sessions
cost-sensitive work
smaller, tighter answers
reducing context pressure

Usage And Efficiency Controls

During a session, these commands help you understand and tune model usage:

text

/tokens
/efficiency

/tokens shows current usage context. /efficiency exposes the adaptive saver behavior that can reduce follow-up calls and verbose output when a session becomes expensive or token-heavy.

The related config is:

yaml

ai:
  efficiency:
    mode: adaptive
    policy: soft

For most users, adaptive with soft policy is the safest balance: YeePilot can reduce waste without suddenly changing the whole workflow.

These settings also influence how a model behaves in real usage:

yaml

ai:
  max_tokens: 4096
  conversation_max_history: 10
  output_truncate_length: 500
  max_response_length: 102400
  usage_warning_tokens: 0
  usage_warning_cost: 0

Use lower values when cost matters. Use higher values when you need deeper answers or more room for long outputs.

Practical Recommendations

use the provider default first
switch to saver before switching providers just to reduce cost
use stronger models for repo review, troubleshooting, or migration planning
use cheaper models for routine checks and repetitive tasks

AI Provider Overview

AI Providers

Thinking Modes

AI Providers

Start With The Provider Default

Change The Model In The TUI

Set A Default Model In Config

Override A Model For One Run

Token Modes

What normal is for

What saver is for

Usage And Efficiency Controls

Other Model-Related Settings

Practical Recommendations

What `normal` is for

What `saver` is for