1.3 KiB
1.3 KiB
| name | description |
|---|---|
| freebsd-cost-optimization | How to use Colibri's fast/smart/max cost modes to minimize DeepSeek API spending on FreeBSD deployments |
FreeBSD Cost Optimization
Cost modes
Colibri supports three cost modes that control context window usage:
- fast — tighter context budget, compaction triggers earlier. Use for high-volume, low-importance tasks (log scanning, status checks).
- smart (default) — balanced budget. Most agent sessions should run here.
- max — preserves full context, compaction only when absolutely necessary. Use for complex multi-step tasks where losing history would cause rework.
When to escalate
The daemon auto-escalates fast→smart→max when compaction alone cannot keep the
session within budget. This is intent, not a policy engine — the thresholds are
fixed in colibri-daemon/src/cost.rs:
- fast: budget 4K tokens, compact at 80%
- smart: budget 16K tokens, compact at 75%
- max: budget 64K tokens, compact at 90%
Operator tips
- Set
COLIBRI_COST_MODE=fastin/etc/rc.conffor disposable worker agents. - Use
colibri set-cost-mode smartvia the CLI for interactive sessions. - Monitor cache-hit rate via
colibri status— if it drops below 50%, consider escalating to the next mode. - On FreeBSD live USB, the default is
smartto balance cost and capability.