Commit graph

11 commits

Author SHA1 Message Date
faf060e0ce docs: introduce Layered Memory Fabric terminology (Sam & Codex)
Replaces public split-brain wording with Layered Memory Fabric, documents the skills/brain/ops planes, and sketches the shared FreeBSD/Linux install contract around PostgreSQL, ZFS/OpenZFS, and platform isolation adapters.

Checks: npx --yes prettier@3 --check touched docs/html; git diff --check

---
Build: pass | Tests: FAIL — 1 failed
2026-06-13 21:32:50 +02:00
9e6bd5ee1c Polish update and monitoring docs
---
Build: pass | Tests: pass — 2343 passed (694 files)
2026-05-10 10:36:36 +02:00
f31f406fec Update docs for DNS doctor and artifact refresh
---
Build: pass | Tests: pass — 2260 passed (671 files)
2026-05-09 16:48:36 +02:00
d0e9349a46 Align public docs with system namespace (Sam & Codex)
Collapse the stale manual controlplane install page to a legacy landing point, update README install guidance to the orchestrator, and align public architecture docs with the service/platform namespace split and current Warden slot map. Sync the Astro docs consumers from docs/public.

---

Build: pass

Tests: pass — 2197 passed (164 files)

---
Build: pass | Tests: pass — 2197 passed (650 files)
2026-05-07 12:20:01 +02:00
Operator & claude
3828e5ce83 docs: integrate operator observability + provider fallback work
Brings the public docs in line with what shipped on multitenant over the
last few days. Three new operator-facing pages, three updates to existing
ones, and a CHANGELOG batch.

New pages (docs/public/operate/):
- operator-commands.md — single reference for all Telegram slash commands,
  grouped by purpose (status, structured reports, runtime, sessions, admin
  actions) with auth gating per command. Previously only in-bot /help text.
- provider-fallback.md — operator guide for the cooldown layer: env vars,
  how cooldowns are detected and tracked, /policy surfacing, /clearcooldown
  for manual release, the configured/effective/actual observability triple.
  Includes a "path convention note" flagging that the cooldown file still
  uses the legacy $CLAWDIE_VAR_DIR resolution while test/build status
  files have moved to repo tmp/ — divergence to harmonize later in code.
- structured-reports.md — explains the Observed/Interpretation/Operator
  Notes pattern, lists the six structured reports, documents the
  test/build pipeline contract (status JSON schema + new $AGENT_STATUS_DIR
  → $CLAWDIE_VAR_DIR → tmp/status precedence Codex landed in 1389e17),
  and covers free-text routing (classifyReportIntent + isOpsFlavored).
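
The status-directory precedence named above ($AGENT_STATUS_DIR → $CLAWDIE_VAR_DIR → tmp/status) can be sketched roughly as follows. This is an illustration only, not the code that landed in 1389e17: the function name `resolve_status_dir` is hypothetical, and the sketch assumes each variable, when set and non-empty, is used as the directory as-is, with `tmp/status` resolved relative to the repo root.

```python
import os
from pathlib import Path


def resolve_status_dir(env=None, repo_root="."):
    """Return the first configured status directory, else repo tmp/status.

    Hypothetical helper: the order follows the commit message, but how the
    real code derives a path from each variable is an assumption.
    """
    env = os.environ if env is None else env
    for name in ("AGENT_STATUS_DIR", "CLAWDIE_VAR_DIR"):
        value = env.get(name, "").strip()
        if value:  # an unset or empty variable falls through to the next one
            return Path(value)
    return Path(repo_root) / "tmp" / "status"
```

Passing `env` explicitly keeps the lookup testable without touching the real process environment.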

Updates:
- monitoring.md: appended "Operator-Facing Reports" section pointing at
  the new structured-reports page, and "Provider Fallback Health" pointing
  at the fallback page.
- operate/index.md: added the three new pages to the runbook list.
- architecture/controlplane.md: added "Runtime Observability" section
  documenting the configured/effective/actual triple and linking to the
  new operate pages.
- README.md: expanded the Telegram Commands table (was 10 rows, missing
  every structured report, /policy, /clearcooldown, /budgetreset) and
  added a pointer to operator-commands.md as the full reference. Also
  noted free-text routing.
- CHANGELOG.md: appended an "operator observability + provider fallback,
  apr.2026" batch under [Unreleased] covering provider fallback, the
  reports family, the test/build wrapper pipeline, free-text routing,
  /clearcooldown, the observability triple, the Telegram setMyCommands
  menu, and the new "Verify Before Claiming Remote State" rule in
  AGENTS.md.

No code changes. Slovenian sl/ mirror left untouched (out of localization
scope).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

---
Build: pass | Tests: FAIL — Tests  8 failed | 1940 passed (1948)

---
Build: pass | Tests: FAIL — Tests  2 failed | 1949 passed (1951)
2026-04-26 13:01:43 +02:00
d8cbd5ca70 chore(multitenant): harden agent workflow and README sync
Move the multitenant agent-workflow decision into repo docs, enforce effective author/committer identities in the pre-commit hook, and replace the shell-based README version rewrite with a reusable Node helper.

---
Build: pass | Tests: pass — node scripts/update-readme-version.mjs --check; sh -n hooks/pre-commit

---
Build: FAIL | Tests: FAIL — Tests  58 failed | 1109 passed (1167)

---
Build: FAIL | Tests: FAIL — Tests  58 failed | 1107 passed (1165)
2026-04-25 07:58:18 +02:00
a521ec77ff docs: comprehensive doc audit — update 16 files for consistency with codebase
Systematic review of all doc/, docs/internal/, docs/public/, ARCHITECTURE.md,
and README.md against recent codebase changes. 16 files updated:

Cross-cutting fixes (multiple files):
- Model references: anthropic/claude-3-5-sonnet → zai/glm-5-turbo (4 files)
- Port references: hardcoded 3100 → CONTROLPLANE_API_PORT (3 files)
- Skills mechanism: --no-skills + --append-system-prompt + skills_search (6 files)
- CONTROLPLANE_SHARED_SECRET: documented in security, architecture, install (5 files)
- Prompt guardrails: AGENT_MAX_INBOUND_CHARS etc. added to 3 files
- controlplane is NOT a jail — runs on host (3 files corrected)
- git jail added to layouts and IP tables (3 files)
- npm run → just (2 files)

Specific fixes:
- .env.example: AGENT_SESSION_MAX_BYTES session rollover hint
- README.md: fix IP layout (git=.6 not .4), add run-*.sh generation note
- ARCHITECTURE.md: add config vars, recipe count update, --no-skills
- doc/CONTROLPLANE-AGENT-ROLES.md: fix model, remove deleted file ref
- doc/CONTROLPLANE-ARCHITECTURE.md: port params, security, guardrails section
- doc/CONTROLPLANE-MESSAGE-CONTRACT.md: auth header, skills catalog rewrite
- doc/SESSION-HANDOFF-2026-04-18.md: fix Telegram (plain text not Markdown)
- doc/THREE-BIRD-ARCHITECTURE.md: fix 5 broken STRAPI-FREEBSD-GOTCHA refs
- doc/HANDOFF-PHASE7.md: mark sysprompt cleanup as done
- docs/internal/DOCUMENTATION.md: just CLI, tracked hooks, parameterized paths
- docs/internal/HEARTBEAT.md: add controlplane heartbeat reference, fix setup step
- docs/public/architecture/controlplane.md: phases 2-7 all DONE
- docs/public/architecture/freebsd-jail-implementation.md: git jail, Forgejo
- docs/public/architecture/warden.md: controlplane=host, git jail added
- docs/public/operate/monitoring.md: just doctor, all guardrail vars
- docs/public/operate/security.md: API auth, shell injection, guardrails

Build: pass | Tests: not run (Linux) (Sam & Claude)
2026-04-18 22:15:59 +02:00
bad6fcc25d Improve session rollover observability
- Add session size snapshot logging and  metric.
- Lower recommended AGENT_SESSION_MAX_BYTES default to 200k in .env.example.
- Document session rollover in monitoring docs.

---
Build: pass | Tests: pass — 1530 passed (91 files)
2026-04-18 19:59:15 +00:00
6b9d8f0f24 Docs: align split-brain and doctor with dev mode
- Clarify that missing skills artifact degrades knowledge but is expected in dev.
- Update monitoring doc to use STATUS: warn
HOST_STARTED: 18.apr.2026 13:37:38
MESSAGE_LOOP: 18.apr.2026 18:48:17
SCHEDULER: 18.apr.2026 18:47:47
TELEGRAM_CONNECTED: 18.apr.2026 13:37:38
LAST_INBOUND: 18.apr.2026 12:20:27
LAST_RUN_STARTED: 18.apr.2026 12:20:29
LAST_RUN_FINISHED: 18.apr.2026 12:21:07
LAST_REPLY_SENT: 18.apr.2026 12:21:07
JAIL_LAST_SUCCESS: 18.apr.2026 12:21:07
JAIL_LAST_FAILURE: 16.apr.2026 20:46:47
JAIL_LAST_ERROR: n/a (Warning: Model "glm-5-turbo" not found for provider "zai". Using custom model id.
Unhandled stop reason: model_context_window_exceeded)
STRIPE_STATUS: disabled
STRIPE_KEY_MODE: missing
STRIPE_REFUNDS: disabled
WATCHDOG_MODE: auto
WATCHDOG_THROTTLED: false
WATCHDOG_FREE_MEMORY_MB: 3294
WATCHDOG_ACTIVE_JAILS: 0
WATCHDOG_QUEUED: 0
CONTROLPLANE: ok
CONTROLPLANE_HOSTD: ok
CONTROLPLANE_DB_HOST: ok
CONTROLPLANE_LLAMA_SERVER: ok
SPLIT_BRAIN: degraded
SKILLS_DB: available
SKILLS_ARTIFACT: missing
SKILLS_ARTIFACT_VERSION: v1.0.0-complete
SKILLS_ARTIFACT_DB_VERSION: unknown
SKILLS_RUNTIME_LOOKUP: present
SKILLS_DOCUMENT_ROWS: 0
SKILLS_CHUNK_ROWS: 0
MEMORY_DB: available
MEMORY_ROWS: 135
MEMORY_CHUNKS: 467
MEMORY_EMBEDDINGS: 137
ISSUES:
- built-in knowledge artifact is missing and document ok/warn/error exit behavior.
- Sync skills artifact plan wording with current doctor output.

---
Build: pass | Tests: not run
2026-04-18 19:11:05 +00:00
18efa6e691 Fix Starlight build: add frontmatter to all public docs
18 markdown files under docs/public/ were missing YAML frontmatter
(title field). Starlight requires this for content collection schema
validation. Extracted titles from existing # headings.
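
A minimal sketch of the title extraction described above, assuming the title is taken from the first level-one `# ` heading and that files already starting with a frontmatter delimiter are left alone. The function name `add_title_frontmatter` is hypothetical, and this is not the script used for the commit; it also does not skip `#` lines inside fenced code blocks.

```python
import re

# First level-one heading: "# Title" at the start of a line ("## ..." does not match).
HEADING = re.compile(r"^#[ \t]+(.+?)[ \t]*$", re.MULTILINE)


def add_title_frontmatter(markdown: str) -> str:
    """Prepend YAML frontmatter with a title taken from the first # heading."""
    if markdown.startswith("---\n"):
        return markdown  # frontmatter already present
    match = HEADING.search(markdown)
    if match is None:
        return markdown  # no heading to extract a title from
    # Escape backslashes and quotes so the title is a valid double-quoted YAML string.
    title = match.group(1).replace("\\", "\\\\").replace('"', '\\"')
    return f'---\ntitle: "{title}"\n---\n\n{markdown}'
```

The existing heading is kept in the body; whether the real change removed it is not stated in the commit message.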

Build: PASS (cms setup succeeds, site deployed)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

---
Build: pass | Tests: pass — Tests  603 passed (603)
2026-04-06 01:54:45 +00:00
3ab5d01ba9 Regenerate built-in knowledge: 84 docs, 1853 embeddings
Rewrote embed-builtin-knowledge.py to dynamically discover all project
docs, internal docs, identity files, and skill definitions instead of
using a hardcoded 6-chunk array.  Artifact now covers full corpus via
OpenRouter BAAI/bge-m3 at 1024 dimensions.

Also fixed stale config.ts defaults: EMBED_MODEL from nomic-embed to
BAAI/bge-m3, EMBED_DIMENSIONS from 768 to 1024 (matching the pgvector
schema's vector(1024) column).
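
The mismatch fixed here (a 768-dimension default against a vector(1024) column) is the kind of drift a small guard can catch. The sketch below is an assumption-laden illustration, not code from config.ts: `dimension_from_column_type` and `check_embedding` are hypothetical names, and the constants simply mirror the values stated in this message.

```python
import re

EMBED_MODEL = "BAAI/bge-m3"   # value stated in the commit message
EMBED_DIMENSIONS = 1024       # must match the pgvector column, vector(1024)


def dimension_from_column_type(column_type: str) -> int:
    """Parse the dimension out of a pgvector type string such as 'vector(1024)'."""
    match = re.fullmatch(r"vector\((\d+)\)", column_type.strip())
    if match is None:
        raise ValueError(f"not a sized vector type: {column_type!r}")
    return int(match.group(1))


def check_embedding(vector, expected=EMBED_DIMENSIONS):
    """Return the vector unchanged, or raise if its length differs from the schema."""
    if len(vector) != expected:
        raise ValueError(f"expected {expected} dimensions, got {len(vector)}")
    return vector
```

Running such a check before import would surface a stale 768-dimension default as an error instead of a failed insert.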

Tested: imported into live clawdie_skills DB, FTS search confirmed.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

---
Build: pass | Tests: pass — Tests  603 passed (603)
2026-04-06 01:37:26 +00:00
Renamed from docs/public/operate/MONITORING.md