Model Selection & Token Modes
Choose the right model for the job, switch it in-session, and control cost with token modes.
Last updated: May 12, 2026
Start With The Provider Default
For most users, the best first step is simple:
- choose a provider
- keep its default model
- work with it for a while
- only then optimize for price, speed, or reasoning depth
Current provider defaults are:
- OpenRouter:
openai/gpt-oss-120b:free - OpenAI:
gpt-4o - Anthropic:
claude-sonnet-4-20250514
Change The Model In The TUI
Use:
text
/modelThis opens the interactive selector for provider and model choice during a live session.
Set A Default Model In Config
yaml
ai:
provider: openai
model: gpt-4oIf you omit model, YeePilot uses the provider default.
Override A Model For One Run
bash
yeepilot exec --provider openai --model gpt-4o "summarize changed files"This is ideal when you want a stronger or cheaper model for one task without changing your day-to-day default.
Token Modes
YeePilot supports two main token-use styles:
normalsaver
Switch during a session:
text
/tokenmode normal
/tokenmode saverSet it in config:
yaml
ai:
token_mode: saverWhat normal is for
- richer context retention
- fuller answers
- the best default for everyday use
What saver is for
- long sessions
- cost-sensitive work
- smaller, tighter answers
- reducing context pressure
Other Model-Related Settings
These settings also influence how a model behaves in real usage:
yaml
ai:
max_tokens: 4096
conversation_max_history: 10
output_truncate_length: 500
max_response_length: 102400Use lower values when cost matters. Use higher values when you need deeper answers or more room for long outputs.
Practical Recommendations
- use the provider default first
- switch to
saverbefore switching providers just to reduce cost - use stronger models for repo review, troubleshooting, or migration planning
- use cheaper models for routine checks and repetitive tasks