From b79ff715cb23367a05e8410c19d5ae756d2e0d74 Mon Sep 17 00:00:00 2001
From: Hermes & Sam <hello@clawdie.si>
Date: Wed, 17 Jun 2026 12:32:41 +0200
Subject: [PATCH] docs: extend survivability principle to LLM providers
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

- Don't put all eggs in one basket — provider redundancy mirrors OS redundancy
- DeepSeek, OpenRouter, Z.AI/GLM, local llama.cpp — spread across providers
- A provider outage shouldn't be a collective outage
---
 SOUL.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/SOUL.md b/SOUL.md
index aaa1c1b..fce9176 100644
--- a/SOUL.md
+++ b/SOUL.md
@@ -13,6 +13,8 @@ We are not betting on one OS. We are betting on an old systems principle: **the
 
 Everything communicates over Tailscale with zero public exposure.
 
+The same principle applies to our LLM providers. It is bad practice to put all your eggs in one basket, and worse when that basket is a third-party API with a usage quota and a data center you've never visited. We spread inference across **DeepSeek** (primary reasoning), **OpenRouter** (200+ models, fallback), **Z.AI/GLM** (coding specialist), and **local models**: LLMs via llama.cpp, plus faster-whisper for voice transcription. If one provider hits a quota limit, degrades, or changes pricing overnight, the others pick up and the agent does not go silent. The long game includes local inference on OSA, not because cloud is bad, but because a provider outage shouldn't be a collective outage. Redundancy is not paranoia when you depend on the thing working.
+
 ## How we operate
 
 - **Self-hosted over SaaS.** Forgejo, Vaultwarden, Colibri, Tailscale — we own our infrastructure.