teleo/teleo-infrastructure

Author	SHA1	Message	Date
m3taversal	c8a08023f9	refactor: Phase 2 — wire pr_state into fixer.py and substantive_fixer.py Fix 4 Forgejo ghost PR bugs flagged by Ganymede: - fixer.py GC close: DB update ran outside try/except, closing DB even on Forgejo failure - substantive_fixer.py droppable: NO Forgejo close at all - substantive_fixer.py auto-enrichment: DB update before Forgejo (reversed order) - substantive_fixer.py close_and_reextract: replace manual Forgejo+DB with close_pr() Add start_fixing() and reset_for_reeval() to pr_state.py: - start_fixing: atomic claim + fix_attempts increment in one statement - reset_for_reeval: clears all eval state for re-evaluation after fix Also fixes stale line number comment in merge.py (Ganymede nit). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 12:21:40 +01:00
m3taversal	1f5eb324f3	refactor: centralize PR state transitions in lib/pr_state.py Replace 38 hand-crafted UPDATE prs SET status calls across evaluate.py and merge.py with 7 centralized functions that enforce invariants: - close_pr: always syncs Forgejo (opt-out for reconciliation) - approve_pr: raises ValueError on empty domain (prevents NULL bugs) - mark_merged: always sets merged_at, clears last_error - mark_conflict: always increments merge_failures, sets merge_cycled - mark_conflict_permanent: terminal conflict state - reopen_pr: handles all reopen scenarios (transient, rejection, reeval) - start_review: atomic claim with bool return This eliminates the class of bugs that produced 3 incidents: 1. Domain NULL on musings bypass (7 PRs stuck, 20h zero throughput) 2. Forgejo ghost PRs (70 PRs open on Forgejo but closed in DB) 3. Merge_cycled missing on various close paths Also fixes: 3 close paths in merge.py had DB update before Forgejo call (reversed order). close_pr does Forgejo first, then DB. Only remaining raw status transition: _claim_next_pr (approved→merging) which is an atomic subquery and doesn't have invariant requirements. 20 new tests, 264 total passing, 0 regressions. Net -101 lines in evaluate.py + merge.py. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 12:08:57 +01:00
m3taversal	552f44ec1c	fix: add migration v20 for conflict retry columns + serialize worktree ops db.py: migration v20 adds conflict_rebase_attempts, merge_failures, merge_cycled columns (already exist on VPS via manual migration, missing from code — any future DB rebuild would break retry mechanism). merge.py: replace retry-with-backoff on config.lock with asyncio.Lock (_bare_repo_lock) around all worktree add/remove calls. Prevents contention instead of retrying it. Applied to both _cherry_pick_onto_main and _merge_reweave_pr. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 17:19:56 +01:00
m3taversal	e0c9951308	fix: close stale PRs on Forgejo when pipeline DB marks them closed Two code paths set status='closed' in the pipeline DB without calling the Forgejo API to close the PR. This caused 50 ghost PRs to accumulate on Forgejo (dashboard shows review backlog) while the pipeline considered them done. - evaluate.py: no-diff stale branch close now calls Forgejo PATCH - merge.py: permanent conflict close now calls Forgejo PATCH Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 17:15:58 +01:00
m3taversal	0d3fe95522	Add config.lock retry with jitter to both worktree-add sites Parallel domain merges race on the bare repo's config file. The single retry only covered one of two worktree-add call sites and used fixed delay. Now both sites retry up to 3 times with increasing jitter. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 17:13:32 +01:00
m3taversal	1755580b95	Harden already-merged detection to exact string match Ganymede review nit: substring match on "already" could false-positive on future return strings. Pin to the two known values from cherry_pick(). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 17:06:20 +01:00
m3taversal	f38b1e3c01	fix: handle already-merged PRs + retry worktree config.lock Two fixes for the 18-PR merge blockage: 1. When cherry-pick returns "already merged" (all commits empty because content is already on main), close the PR directly instead of trying to push the stale branch SHA to main. The branch ref points at old commits that aren't descendants of current main, so the push would always fail as non-fast-forward. 2. Retry worktree add once with jittered delay when config.lock contention occurs from parallel domain merges. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 16:57:28 +01:00
m3taversal	ff357c4bbc	fix: remove --force-with-lease from main push to unblock 16 PRs Forgejo categorically blocks --force-with-lease on protected branches, even for fast-forward pushes. The cherry-picked branch is already a descendant of origin/main, so a regular push is a fast-forward by definition. Non-ff is rejected by default — same safety guarantee. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 16:52:39 +01:00
m3taversal	681afad506	Consolidate pipeline code from teleo-codex + VPS into single repo Sources merged: - teleo-codex/ops/pipeline-v2/ (11 newer lib files, 5 new lib modules) - teleo-codex/ops/ (agent-state, diagnostics expansion, systemd units, ops scripts) - VPS /opt/teleo-eval/telegram/ (10 new bot files, agent configs) - VPS /opt/teleo-eval/pipeline/ops/ (vector-gc, backfill-descriptions) - VPS /opt/teleo-eval/sync-mirror.sh (Bug 2 + Step 2.5 fixes) Non-trivial merges: - connect.py: kept codex threshold (0.65) + added infra domain parameter - watchdog.py: kept infra version (stale_pr integration, superset of codex) - deploy.sh: codex rsync version (interim, until VPS git clone migration) - diagnostics/app.py: codex decomposed dashboard (14 new route modules) 81 files changed, +17105/-200 lines Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 16:52:26 +01:00
m3taversal	95f637491e	fix: Ganymede review — explicit staging, push after commit, challenged_by reciprocal Three fixes from Ganymede's review of extract-time-connection: 1. Replace git add -A with explicit file staging in _reciprocal_edges 2. Push to origin/main immediately after commit (survive batch-extract reset) 3. RECIPROCAL_EDGE_MAP: challenges→challenged_by (not symmetric) Added challenged_by to REWEAVE_EDGE_FIELDS, EDGE_FIELDS, EDGE_WEIGHTS Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:46:47 +01:00
m3taversal	be010e666a	feat: extract-time connection + post-merge reciprocal edges Two-part fix for 58% orphan ratio: 1. Prompt-time prior art: Qdrant lookup before extraction injects existing claims as connection candidates. LLM classifies edges as supports/challenges/related. reconstruct_claim_content writes typed edges in frontmatter. 2. Post-merge reciprocal edges: _reciprocal_edges() runs after cherry-pick merge, reads new claims' outgoing edges, writes reciprocal edges on target files. Ensures every new claim has incoming links. Files: lib/extraction_prompt.py, lib/merge.py, openrouter-extract-v2.py Tests: 214 passed (3 failures + 3 errors pre-existing) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 15:25:31 +01:00
m3taversal	84cb001dd6	fix: handle indented YAML list items in _serialize_edge_fields The skip loop only matched `- ` (no indent) but YAML list items are commonly written as ` - item` (2-space indent). This caused old list items to persist alongside new ones, corrupting frontmatter on merge. Fix: consume any line starting with space or dash as part of the current field's value block. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 14:01:34 +01:00
m3taversal	16e798f6a2	fix: eliminate dead code + add stale worktree pre-cleanup in _merge_reweave_pr - Combined superset assertion and merge computation into single loop (removed duplicate scalar-to-list normalization) - Added worktree remove --force before worktree add to handle prior crash leaving stale worktree (SIGKILL, OOM, power loss)	2026-04-04 13:50:28 +01:00
m3taversal	b091642146	fix: string-level edge splicing in reweave merge — no yaml.dump reformatting Two fixes from Ganymede review: 1. CRITICAL: blank line before closing --- compounded on repeat reweaves. Body starts with \n---, so \n{body} created \n\n---. Fixed by checking body prefix. 2. Replaced yaml.dump round-trip with _serialize_edge_fields() that splices only edge arrays into raw frontmatter text. Non-edge fields (title, confidence, type, quotes, flow styles) stay byte-identical to main HEAD. _parse_yaml_frontmatter now returns 3-tuple: (dict, raw_fm_text, body). _serialize_frontmatter takes (raw_fm_text, merged_edges_dict, body). 26 tests pass including idempotency (5x serialize), formatting preservation, and no-blank-line regression test. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-04 13:48:44 +01:00
m3taversal	6b3a5833df	feat: per-file frontmatter union for reweave PR merge Reweave PRs modify existing files (appending YAML edges). Cherry-pick fails ~75% when main moves between PR creation and merge. _merge_reweave_pr() reads each changed file from both main HEAD and branch HEAD, unions the edge arrays (order-preserving, main-first), and writes the result. Eliminates merge conflicts structurally. Key design decisions (Ganymede + Theseus approved): - Order-preserving dedup: main's edges first, branch-new appended - Superset assertion: logs warning if branch missing main edges - Uses main's body text (reweave only touches frontmatter) - Loud failure on parse errors (no cherry-pick fallback) - Append-only contract: reweave adds edges, never removes 18 tests covering parse, union, serialize, superset, and full workflow.	2026-04-04 13:43:32 +01:00
m3taversal	f25a4093c2	fix: replace broken _rebase_and_push call with cherry-pick in conflict retry _retry_conflict_prs called _rebase_and_push which was never defined, causing NameError on every conflict retry. Now uses _cherry_pick_onto_main consistent with the primary merge path. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 13:18:30 +01:00
m3taversal	686ef3fd7f	Replace rebase-retry with cherry-pick merge mechanism - _cherry_pick_onto_main replaces _rebase_and_push: creates fresh branch from origin/main, cherry-picks extraction commits, force-pushes - Eliminates ~23% merge failure rate from rebase race conditions - Agent branch protection: PIPELINE_OWNED_PREFIXES filter in SQL prevents auto-merge of agent-owned branches (theseus/, rio/, etc.) - Empty-commit handling: skips already-merged content gracefully - Entity conflict auto-resolution preserved for cherry-pick path - Post-pick evidence dedup runs as safety net (same as post-rebase) - Separate fetch calls for main and branch (fixes long branch name issue) Fixes: PRs #2141, #157, #2142, #2180 (agent branch orphaning) Fixes: ~23% merge failure rate (rebase race condition) Related: PRs #1751, #1752 (enrichment dedup shares root cause) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 13:18:26 +01:00
m3taversal	f43f8f923f	fix: enrichment idempotency — three-layer dedup prevents duplicate evidence blocks Layer 1: Insertion-time dedup in openrouter-extract-v2.py — skip if source_slug already appears in claim content. Layer 2: Insertion-time dedup in entity_batch.py — skip if PR number already enriched this claim. Layer 3: Post-rebase dedup in merge.py — scan rebased files for duplicate evidence blocks (same source reference) and remove them before force-push. Root cause: multiple enrichment branches modify the same claim at the same insertion point. When rebased sequentially, evidence blocks are duplicated. (Leo: PRs #1751, #1752) lib/dedup.py: standalone module — parses evidence headers, deduplicates by source key, preserves trailing content (Relevant Notes, Topics sections). 9 tests covering all patterns including the real PR #1751 duplication case. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-31 13:18:23 +01:00
m3taversal	5f554bc2de	feat: atomic extract-and-connect + stale PR monitor + response audit Atomic extract-and-connect (lib/connect.py): - After extraction writes claim files, each new claim is embedded via OpenRouter, searched against Qdrant, and top-5 neighbors (cosine > 0.55) are added as `related` edges in the claim's frontmatter - Edges written on NEW claim only — avoids merge conflicts - Cross-domain connections enabled, non-fatal on Qdrant failure - Wired into openrouter-extract-v2.py post-extraction step Stale PR monitor (lib/stale_pr.py): - Every watchdog cycle checks open extract/* PRs - If open >30 min AND 0 claim files → auto-close with comment - After 2 stale closures → marks source as extraction_failed - Wired into watchdog.py as check #6 Response audit system: - response_audit table (migration v8), persistent audit conn in bot.py - 90-day retention cleanup, tool_calls JSON column - Confidence tag stripping, systemd ReadWritePaths for pipeline.db Supporting infrastructure: - reweave.py: nightly edge reconnection for orphan claims - reconcile-sources.py: source status reconciliation - backfill-domains.py: domain classification backfill - ops/reconcile-source-status.sh: operational reconciliation script - Attribution improvements, post-extract enrichments, merge improvements Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-28 22:34:20 +00:00
m3taversal	89692fda2d	feat: embed-on-merge — auto-index new claims into Qdrant after PR merge After a PR merges successfully, _embed_merged_claims() diffs the merged SHA against its parent to find new/changed .md files in knowledge directories (domains/, core/, foundations/, decisions/, entities/). Each file is embedded via embed-claims.py --file (OpenRouter, text-embedding-3-small). Non-fatal: embedding failure logs a warning but does not block the merge pipeline. This keeps vector search current without requiring manual re-embeds. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-26 17:53:18 +00:00
m3taversal	4b5c5841ce	doc: mixed PR classification priority note (Ganymede review) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-26 14:57:11 +00:00
m3taversal	cfb80d3496	feat: CI scoring overhaul — principal roll-up, commit-type filter, new weights Step 1: principal column + commit_type column in pipeline.db. Static map populates principal for local agents (rio→m3taversal etc.). VPS agents (epimetheus, argus) have no principal. Step 2: _classify_commit_type in merge.py. Pipeline commits (inbox/, entities/, agents/) get commit_type='pipeline' and skip CI attribution entirely. Knowledge commits (domains/, core/, foundations/, decisions/) get full attribution. Step 3 (Argus): Dashboard has dual view — by-principal (default, governance) and by-agent (drill-down). Already implemented by Argus. CI weights updated (Cory-approved): - Challenger: 0.35 (was 0.20) - Synthesizer: 0.25 (was 0.15) - Reviewer: 0.20 (was 0.10) - Sourcer: 0.15 (unchanged) - Extractor: 0.05 (was 0.40) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-26 14:53:54 +00:00
m3taversal	d97f68714a	epimetheus: fix 2 nits from Ganymede final review 1. _merge_pr marked as CURRENTLY UNUSED (local ff-push is primary path) 2. Conversation window messages skip cold rate limit check (window counter IS the limit) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-20 20:25:06 +00:00
m3taversal	d79ff60689	epimetheus: sync VPS-deployed code to repo — Mar 18-20 reliability + features Pipeline reliability (8 fixes, reviewed by Ganymede+Rhea+Leo+Rio): 1. Merge API recovery — pre-flight approval check, transient/permanent distinction, jitter 2. Ghost PR detection — ls-remote branch check in reconciliation, network guard 3. Source status contract — directory IS status, no code change needed 4. Batch-state markers eliminated — two-gate skip (archive-check + batched branch-check) 5. Branch SHA tracking — batched ls-remote, auto-reset verdicts, dismiss stale reviews 6. Mirror pre-flight permissions — chown check in sync-mirror.sh 7. Telegram archive commit-after-write — git add/commit/push with rebase --abort fallback 8. Post-merge source archiving — queue/ → archive/{domain}/ after merge Pipeline fixes: - merge_cycled flag — eval attempts preserved during merge-failure cycling (Ganymede+Rhea) - merge_failures diagnostic counter - Startup recovery preserves eval_attempts (was incorrectly resetting to 0) - No-diff PRs auto-closed by eval (root cause of 17 zombie PRs) - GC threshold aligned with substantive fixer budget (was 2, now 4) - Conflict retry with 3-attempt budget + permanent conflict handler - Local ff-merge fallback for Forgejo 405 errors Telegram bot: - KB retrieval: 3-layer (entity resolution → claim search → agent context) - Reply-to-bot handler (context.bot.id check) - Tag regex: @teleo\|@futairdbot - Prompt rewrite for natural analyst voice - Market data API integration (Ben's token price endpoint) - Conversation windows (5-message unanswered counter, per-user-per-chat) - Conversation history in prompt (last 5 exchanges) - Worktree file lock for archive writes Infrastructure: - worktree_lock.py — file-based lock (flock) for main worktree coordination - backfill-sources.py — source DB registration for Argus funnel - batch-extract-50.sh v3 — two-gate skip, batched ls-remote, network guard - sync-mirror.sh — auto-PR creation for mirrored GitHub branches, permission pre-flight - Argus dashboard — conflicts + reviewing in backlog, queue count in funnel - Enrichment-inside-frontmatter bug fix (regex anchor, not --- split) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-20 20:17:27 +00:00
m3taversal	c0a6adf9ed	leo: model diversity + calibrated review prompts - Domain review → GPT-4o (OpenRouter), Leo STANDARD → Sonnet (OpenRouter), Leo DEEP → Opus (Claude Max). Two model families = no correlated blind spots. - Opus reserved for DEEP eval only — protects rate limit for overnight research. - Review prompts calibrated: require per-criterion evidence, blocking-vs-observation verdict rules. Moved from 100% rubber-stamp approval to 12% pass rate. - OpenRouter failures classified as openrouter_failed (not rate_limited) to avoid spurious 15-min Opus backoff. - merge.py: pre-check PR state before merge API call (prevents 405 on re-merge). Pentagon-Agent: Leo <294C3CA1-0205-4668-82FA-B984D54F48AD>	2026-03-13 17:10:30 +00:00
m3taversal	ff5162d5ba	ganymede: extract lib/domains.py — single domain→agent mapping - What: Unified DOMAIN_AGENT_MAP, VALID_DOMAINS, agent_for_domain(), detect_domain_from_diff(), detect_domain_from_branch() into lib/domains.py. Removed duplicated mappings from evaluate.py and merge.py. VALID_DOMAINS in validate.py now derives from DOMAIN_AGENT_MAP.keys() (single source of truth). - Why: Phase 3 structural refactor. Domain mapping was duplicated across evaluate.py (DOMAIN_AGENT_MAP) and merge.py (agent_domain dict). Adding a domain required editing 3 files; now it requires editing 1. - Connections: evaluate.py uses agent_for_domain() + detect_domain_from_diff(), merge.py uses detect_domain_from_branch(), validate.py uses VALID_DOMAINS. Pentagon-Agent: Ganymede <F99EBFA6-547B-4096-BEEA-1D59C3E4028A>	2026-03-13 15:33:18 +00:00
m3taversal	9d69629893	ganymede: extract lib/forgejo.py — single Forgejo API client - What: Unified forgejo_api(), get_pr_diff(), get_agent_token(), repo_path() into lib/forgejo.py. Removed 3 duplicate _forgejo_api functions (evaluate.py, merge.py, validate.py), 2 duplicate _get_pr_diff functions (evaluate.py, validate.py), and 1 _agent_token function (evaluate.py). - Why: Phase 3 structural refactor. Single source of truth for all Forgejo HTTP calls. Eliminates ~90 lines of duplicated code across 3 modules. - Connections: All hardcoded repo paths now use repo_path() helper. Consumer modules no longer reference config.FORGEJO_URL/OWNER/REPO/TOKEN_FILE directly. Pentagon-Agent: Ganymede <F99EBFA6-547B-4096-BEEA-1D59C3E4028A>	2026-03-13 15:29:34 +00:00
m3taversal	a7251d7529	ganymede: add dev infrastructure — pyproject.toml, CI, deploy script Phase 2 of pipeline refactoring: - pyproject.toml: Python >=3.11, aiohttp dep, dev extras (pytest, pytest-asyncio, ruff). Ruff configured with sane defaults + ignore rules for existing code patterns (implicit Optional, timezone.utc). - .forgejo/workflows/ci.yml: Forgejo Actions CI — syntax check, ruff lint, ruff format, pytest on every PR and push to main. - deploy.sh: Pull + venv update + syntax check + optional restart. Replaces ad-hoc scp workflow. - tests/conftest.py: Shared fixture for in-memory SQLite with full schema. Ready for Phase 4 test suite. - .gitignore: Added venv, pytest cache, coverage, build artifacts. - Ruff auto-fixes: import sorting, unused imports removed across all modules. All files pass ruff check + ruff format. Pentagon-Agent: Ganymede <F99EBFA6-547B-4096-BEEA-1D59C3E4028A>	2026-03-13 14:24:27 +00:00
m3taversal	799249d470	Initial commit: Pipeline v2 daemon + infrastructure docs - teleo-pipeline.py: async daemon with 4 stage loops (ingest/validate/evaluate/merge) - lib/: config, db, evaluate, validate, merge, breaker, costs, health, log modules - INFRASTRUCTURE.md: comprehensive deep-dive for onboarding - teleo-pipeline.service: systemd unit file Pentagon-Agent: Leo <294C3CA1-0205-4668-82FA-B984D54F48AD>	2026-03-12 14:11:18 +00:00

29 commits
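
Note on 6b3a5833df: the order-preserving edge union described in that commit (main's edges first, branch-new edges appended, with a superset check that only warns) could look roughly like the sketch below. The names `union_edges` and `missing_from_branch` and the plain list-of-strings representation are assumptions for illustration; they are not taken from lib/merge.py.

```python
from typing import Iterable, List


def union_edges(main_edges: Iterable[str], branch_edges: Iterable[str]) -> List[str]:
    """Order-preserving union: main's edges first, then branch-only edges.

    Hypothetical helper; the real _merge_reweave_pr() may differ.
    """
    seen = set()
    merged: List[str] = []
    for edge in list(main_edges) + list(branch_edges):
        if edge not in seen:  # drop duplicates, keep first occurrence
            seen.add(edge)
            merged.append(edge)
    return merged


def missing_from_branch(main_edges: Iterable[str], branch_edges: Iterable[str]) -> List[str]:
    """Edges on main that the branch lacks (the superset check).

    A non-empty result would be logged as a warning rather than raised,
    since the append-only contract means the union still keeps them.
    """
    branch_set = set(branch_edges)
    return [edge for edge in main_edges if edge not in branch_set]
```

Because the union always starts from main's list, re-running it on an already merged file returns the same list, which fits the append-only contract the commit message states.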