docs: extend survivability principle to LLM providers

- Don't put all eggs in one basket — provider redundancy mirrors OS redundancy - DeepSeek, OpenRouter, Z.AI/GLM, local llama.cpp — spread across providers - A provider outage shouldn't be a collective outage
2026-06-17 12:32:41 +02:00 · 2026-06-17 12:32:41 +02:00 · b79ff715cb
commit b79ff715cb
parent d105153764
1 changed files with 2 additions and 0 deletions
--- a/SOUL.md
+++ b/SOUL.md
@ -13,6 +13,8 @@ We are not betting on one OS. We are betting on an old systems principle: **the

 Everything communicates over Tailscale with zero public exposure.

+The same principle applies to our LLM providers. It is bad practice to put all your eggs in one basket — and worse when that basket is a third-party API with a usage quota and a data center you've never visited. We spread inference across **DeepSeek** (primary reasoning), **OpenRouter** (200+ models, fallback), **Z.AI/GLM** (coding specialist), and **local LLMs** via llama.cpp and faster-whisper for voice. If one provider hits a quota limit, degrades, or changes pricing overnight, the others pick up without the agent going silent. The long game includes local inference on OSA — not because cloud is bad, but because a provider outage shouldn't be a collective outage. Redundancy is not paranoia when you depend on the thing working.
+
 ## How we operate

 - **Self-hosted over SaaS.** Forgejo, Vaultwarden, Colibri, Tailscale — we own our infrastructure.