Ganymede review findings:
1. source_channel was missing from CREATE TABLE (fresh installs wouldn't have it)
2. Default fallback changed from 'telegram' to 'unknown' — unknown prefixes
are genuinely unknown, not telegram
3. Cross-reference comments added between BRANCH_PREFIX_MAP and _CHANNEL_MAP
Also wires classify_source_channel into merge.py PR discovery INSERT.
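A minimal sketch of the classifier behavior described above (the prefix entries here are illustrative, not the real BRANCH_PREFIX_MAP):

```python
# Hypothetical prefix table — the real entries live in the pipeline code.
BRANCH_PREFIX_MAP = {
    "tg": "telegram",   # assumed example prefix
    "gh": "github",     # assumed example prefix
}

def classify_source_channel(branch: str) -> str:
    """Map a branch-name prefix to a source channel.

    Unknown prefixes fall back to 'unknown' (not 'telegram'),
    since they are genuinely unknown.
    """
    prefix = branch.split("-", 1)[0]
    return BRANCH_PREFIX_MAP.get(prefix, "unknown")
```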
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Bug: echo "alerted" ran regardless of curl success, permanently suppressing
alerts after a delivery failure. Fix: an if/then/else now wraps the state
write so the alerted state is only recorded when curl succeeds.
Warning: stale tracking refs after push steps caused false divergence.
Fix: re-fetch both remotes before comparing.
Both findings from Ganymede review of Step 6.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
github_feedback.py posts pipeline status to GitHub PRs at three touchpoints:
discovery ack, eval review result, and merge/close outcome. Only fires for
PRs with a github_pr link (set by sync-mirror.sh). All calls non-fatal.
contributor.py: expanded the git author fallback to scan all non-merge
commits (previously only the last commit was checked), and added teleo-bot
and github-actions[bot] to the bot filter list.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
When mirror auto-creates a Forgejo PR from a GitHub branch, look up the
GitHub PR number via API and store it in pipeline.db (github_pr column
from migration v21). Enables reverse mapping for feedback and back-sync.
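The DB write might look like this (table and column layout are assumptions apart from the github_pr column named above):

```python
import sqlite3

def link_github_pr(conn: sqlite3.Connection,
                   forgejo_pr: int, github_pr: int) -> None:
    """Store the GitHub PR number (looked up via the GitHub API)
    against the Forgejo PR row, enabling reverse mapping later."""
    conn.execute(
        "UPDATE prs SET github_pr = ? WHERE number = ?",
        (github_pr, forgejo_pr),
    )
    conn.commit()
```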
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Enables GitHub↔Forgejo PR linking for the contributor pipeline.
Mirror script will store GitHub PR number when creating Forgejo PRs,
allowing back-sync of eval feedback and merge/close status.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
External contributors who run `git merge main` create merge commits that
cherry-pick can't handle without the -m flag; --no-merges filters these out.
Added detection for branches with only merge commits but real content diff.
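The detection condition reduces to: no non-merge commits, but a non-empty content diff against main (a sketch; the real check shells out to git):

```python
def needs_manual_review(nonmerge_commits: list[str], diff_text: str) -> bool:
    """Flag branches whose only commits are merges but whose content
    diff vs main is non-empty — they can't be cherry-picked as-is."""
    return not nonmerge_commits and bool(diff_text.strip())
```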
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two changes to address the #1 rejection reason:
1. extraction_prompt.py: Explicitly tell LLM NOT to use [[wiki links]]
in body text — use connections/related_claims JSON fields instead.
Remove misleading "post-processor handles wiki links" language.
2. extract.py _get_kb_index(): Expand KB index to include entity stems
from entities/{domain}/ so the LLM knows what entities exist when
building connections. Previously only showed domain claims.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Orphan ratio at 39.6% (443/1118 claims) vs <15% target. Root cause:
reweave threshold 0.70 too strict for text-embedding-3-small — 56% of
orphans found "no neighbors." At 0.55, dry-run shows 0% no-neighbor
skips. Batch size 200 clears backlog in ~3-4 nights at ~$0.20/run.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Both the "already merged" path and _handle_permanent_conflicts closed PRs on
Forgejo without checking the return value. On API failure, the DB update would
proceed anyway, creating ghost PRs (DB=closed/merged, Forgejo=open). Now both
paths check for None return and skip DB updates on failure — same pattern as
close_pr in pr_state.py.
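The pattern, sketched with the API and DB calls as injected stand-ins (the real code calls forgejo_api and the pipeline DB directly; None-on-failure matches the description above):

```python
import asyncio

async def close_pr(pr_id: int, forgejo_close, db_update) -> bool:
    """Ghost-PR-safe close: Forgejo first, DB only on success.
    forgejo_close returns None on API failure."""
    resp = await forgejo_close(pr_id)
    if resp is None:
        return False          # leave DB untouched — no DB-closed/Forgejo-open split
    db_update(pr_id)
    return True
```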
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
There were TWO `if not unprocessed: return 0, 0` gates. The previous
fix (c763c99) only addressed the second one. The first, at line 746,
fires before the re-extraction query even runs. Replace it with a
comment explaining why we don't early-return there.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The re-extraction check sat below an early return that fires when the
unprocessed queue is empty. Sources in the needs_reextraction state were
never picked up unless new sources happened to arrive at the same time.
Move the re-extraction query above the gate so both paths run independently.
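The reordered control flow, reduced to a sketch (names and return shape are assumptions):

```python
def process_cycle(unprocessed: list, needs_reextraction: list) -> tuple[int, int]:
    """Re-extraction runs BEFORE the empty-queue gate, so sources in
    needs_reextraction are picked up even when no new sources arrived."""
    reextracted = len(needs_reextraction)
    if not unprocessed:
        return 0, reextracted
    return len(unprocessed), reextracted
```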
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Path(target).name strips directory components from LLM-generated
target filenames, preventing path traversal via ../. Same pattern
already applied to claim filenames (line 404) and entity filenames
(line 416). Ganymede-approved.
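The pattern in isolation:

```python
from pathlib import Path

def safe_filename(llm_target: str) -> str:
    """Strip directory components from an LLM-generated filename so
    ../ traversal can't escape the output directory."""
    return Path(llm_target).name
```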
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Three changes:
1. Drop underscore prefixes in eval_parse.py — functions are now the public
API of the module (filter_diff, parse_verdict, classify_issues, etc.).
All 12 functions renamed, imports updated in evaluate.py and tests.
2. Extract eval_actions.py from evaluate.py — 3 async PR disposition functions:
- post_formal_approvals: submit Forgejo reviews from 2 agents
- terminate_pr: close PR, post rejection comment, requeue source
- dispose_rejected_pr: disposition logic for rejected PRs on attempt 2+
evaluate.py drops from ~1140 to 911 lines.
3. 14 new tests in test_eval_actions.py covering all three functions.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Critical bug fix: close_pr now checks forgejo_api return value and
skips DB update on Forgejo failure, preventing ghost PRs (DB closed,
Forgejo open). Returns bool so callers can handle failures.
_terminate_pr checks return value — skips source requeue on failure.
stale_pr.py migrated from raw Forgejo+DB to close_pr (last raw close
transition eliminated).
eval_parse.py: 15 pure parsing functions extracted from evaluate.py
(~370 lines removed). Zero I/O, zero async, independently testable.
evaluate.py drops from ~1510 to ~1140 lines.
Tests: 295 passed (42 new eval_parse + 2 new close_pr), zero regressions.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Three fixes for conversation-sourced claim quality:
1. Trust hierarchy in extraction prompt: bot-generated numbers are
flagged as unverified context, not evidence. Directional claims
are extractable but specific figures require external verification.
Prevents laundering bot guesses into the KB as evidence.
2. Conversation-sourced claims tagged with verified: false and
source_type: conversation in frontmatter. Downstream consumers
(Leo, dashboard) can filter/flag these for verification.
3. GET /api/telegram-extractions endpoint for daily spot-checking.
Shows recent Telegram-sourced PRs with claim titles, status,
merge rate, and eval issues. Quick review surface.
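The tagging in item 2, sketched over a dict-based frontmatter (the field names are from this commit; the dict handling is an assumption):

```python
def tag_conversation_claim(frontmatter: dict) -> dict:
    """Mark a conversation-sourced claim so downstream consumers
    (Leo, dashboard) can filter or flag it for verification."""
    tagged = dict(frontmatter)
    tagged["verified"] = False
    tagged["source_type"] = "conversation"
    return tagged
```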
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Fix 4 Forgejo ghost PR bugs flagged by Ganymede:
- fixer.py GC close: DB update ran outside try/except, closing DB even on Forgejo failure
- substantive_fixer.py droppable: NO Forgejo close at all
- substantive_fixer.py auto-enrichment: DB update before Forgejo (reversed order)
- substantive_fixer.py close_and_reextract: replace manual Forgejo+DB with close_pr()
Add start_fixing() and reset_for_reeval() to pr_state.py:
- start_fixing: atomic claim + fix_attempts increment in one statement
- reset_for_reeval: clears all eval state for re-evaluation after fix
Also fixes stale line number comment in merge.py (Ganymede nit).
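The atomic claim in start_fixing could look like the following (status names and schema are assumptions; the point is the single-statement claim-plus-increment):

```python
import sqlite3

def start_fixing(conn: sqlite3.Connection, pr_id: int) -> bool:
    """Claim a PR for fixing and bump fix_attempts in one UPDATE,
    so two workers can't both claim it. Returns False if the row
    was already claimed (WHERE clause didn't match)."""
    cur = conn.execute(
        "UPDATE prs SET status='fixing', fix_attempts=fix_attempts+1 "
        "WHERE id=? AND status='rejected'",
        (pr_id,),
    )
    return cur.rowcount == 1
```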
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two changes:
1. extract.py: Enrichments now modify existing claim files by appending
evidence sections. Previously enrichment-only extractions were
discarded as null-result even when they contained valuable challenges.
2. extraction_prompt.py: Corrections should produce BOTH a claim (the
corrected knowledge) AND an enrichment (linking to what it corrects).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Replace 38 hand-crafted `UPDATE prs SET status` calls across evaluate.py
and merge.py with 7 centralized functions that enforce invariants:
- close_pr: always syncs Forgejo (opt-out for reconciliation)
- approve_pr: raises ValueError on empty domain (prevents NULL bugs)
- mark_merged: always sets merged_at, clears last_error
- mark_conflict: always increments merge_failures, sets merge_cycled
- mark_conflict_permanent: terminal conflict state
- reopen_pr: handles all reopen scenarios (transient, rejection, reeval)
- start_review: atomic claim with bool return
This eliminates the class of bugs that produced 3 incidents:
1. Domain NULL on musings bypass (7 PRs stuck, 20h zero throughput)
2. Forgejo ghost PRs (70 PRs open on Forgejo but closed in DB)
3. Merge_cycled missing on various close paths
Also fixes: 3 close paths in merge.py had DB update before Forgejo call
(reversed order). close_pr does Forgejo first, then DB.
Only remaining raw status transition: _claim_next_pr (approved→merging)
which is an atomic subquery and doesn't have invariant requirements.
20 new tests, 264 total passing, 0 regressions. Net -101 lines in
evaluate.py + merge.py.
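The approve_pr invariant, sketched with the DB write injected (signature is an assumption; the ValueError-on-empty-domain behavior is from the list above):

```python
def approve_pr(pr_id: int, domain: str, db_update) -> None:
    """Invariant-enforcing transition: refuse an empty domain so the
    NULL-domain deadlock class can't recur."""
    if not domain:
        raise ValueError(f"approve_pr({pr_id}): domain must be non-empty")
    db_update(pr_id, domain)
```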
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
When source format is "conversation", inject specialized extraction
rules that prioritize human corrections/pushback as highest-value
content. Fixes null-result on short but high-signal correction
messages. Maps corrections to existing KB claims as challenges.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
db.py: migration v20 adds conflict_rebase_attempts, merge_failures, and
merge_cycled columns (they already exist on the VPS via manual migration
but are missing from code; any future DB rebuild would break the retry
mechanism).
merge.py: replace retry-with-backoff on config.lock with asyncio.Lock
(_bare_repo_lock) around all worktree add/remove calls. Prevents
contention instead of retrying it. Applied to both _cherry_pick_onto_main
and _merge_reweave_pr.
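The locking shape, with the git helper injected (paths and helper are stand-ins; only the worktree add/remove calls hold the lock, not the merge work between them):

```python
import asyncio

_bare_repo_lock = asyncio.Lock()   # module-level, shared by all merges

async def with_worktree(branch: str, run_git) -> None:
    """Serialize worktree add/remove so parallel domain merges never
    contend on the bare repo's config.lock."""
    async with _bare_repo_lock:
        await run_git("worktree", "add", f"/tmp/wt-{branch}", branch)
    # ... merge work happens outside the lock ...
    async with _bare_repo_lock:
        await run_git("worktree", "remove", "--force", f"/tmp/wt-{branch}")
```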
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two code paths set status='closed' in the pipeline DB without calling
the Forgejo API to close the PR. This caused 50 ghost PRs to accumulate
on Forgejo (the dashboard showed a review backlog) while the pipeline
considered them done.
- evaluate.py: no-diff stale branch close now calls Forgejo PATCH
- merge.py: permanent conflict close now calls Forgejo PATCH
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Parallel domain merges race on the bare repo's config file. The single
retry only covered one of two worktree-add call sites and used fixed
delay. Now both sites retry up to 3 times with increasing jitter.
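The retry shape (parameters illustrative; delay grows with the attempt number and carries random jitter):

```python
import random
import time

def retry_with_jitter(op, attempts: int = 3, base: float = 0.5):
    """Retry a lock-contending operation with increasing jittered
    delay; re-raise on the final attempt."""
    for i in range(attempts):
        try:
            return op()
        except OSError:
            if i == attempts - 1:
                raise
            time.sleep(base * (i + 1) * (1 + random.random()))
```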
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Ganymede review nit: substring match on "already" could false-positive
on future return strings. Pin to the two known values from cherry_pick().
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Musings bypass and batch both_approve set status='approved' without
domain or auto_merge. Merge gate requires domain IS NOT NULL and
prefix match OR auto_merge=1. Result: agent PRs deadlocked for 20+ hours.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Output gate (output_gate.py): Deterministic classifier that blocks system/pipeline
messages from reaching public outputs. Pattern-based detection of PR numbers,
deploy logs, diagnostics, infrastructure references.
Tweet queue (x_publisher.py): Submit drafts through output gate + OPSEC filter,
enter approval_queue, auto-post to X via Twitter API v2 on Cory's approval.
Pluggable approval stages (approval_stages.py): Extensible architecture where
adding a new approval stage = implementing ApprovalStage.check(). Current stages:
OutputGate (stage 0), OPSEC (stage 1), Human (stage 10). Designed for future
agent voting, multi-human approval, and decision markets.
Also syncs approvals.py from VPS to local repo (was deployed but never committed).
18 tests pass.
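The pluggable shape in miniature (the check bodies here are toy stand-ins, not the real classifiers; stage numbers mirror the list above):

```python
from abc import ABC, abstractmethod

class ApprovalStage(ABC):
    """Adding a new approval stage = implementing check()."""
    order: int = 0

    @abstractmethod
    def check(self, draft: str) -> bool: ...

class OutputGate(ApprovalStage):
    order = 0
    def check(self, draft: str) -> bool:
        return "PR #" not in draft        # toy pipeline-chatter detector

class Opsec(ApprovalStage):
    order = 1
    def check(self, draft: str) -> bool:
        return "internal" not in draft.lower()

def run_stages(draft: str, stages: list[ApprovalStage]) -> bool:
    """A draft passes only if every stage approves, in order."""
    return all(s.check(draft) for s in sorted(stages, key=lambda s: s.order))
```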
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two fixes for the 18-PR merge blockage:
1. When cherry-pick returns "already merged" (all commits empty because
content is already on main), close the PR directly instead of trying
to push the stale branch SHA to main. The branch ref points at old
commits that aren't descendants of current main, so the push would
always fail as non-fast-forward.
2. Retry worktree add once with jittered delay when config.lock
contention occurs from parallel domain merges.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Forgejo categorically blocks --force-with-lease on protected branches,
even for fast-forward pushes. The cherry-picked branch is already a
descendant of origin/main, so a regular push is a fast-forward by
definition; non-fast-forward pushes are rejected by default, which gives
the same safety guarantee.
Pipeline /health returns 503 when idle/stalled, which is a valid
running state. Also increase post-restart wait from 15s to 30s
for pipeline HTTP server initialization.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Auto-deploy watches teleo-infrastructure (not teleo-codex) and syncs to
VPS working directories. New checkout path: deploy-infra/ (parallel to
existing deploy/ for 48h rollback). Path mapping updated for reorganized
repo structure (lib/, diagnostics/, telegram/ etc.).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Three fixes from Ganymede's review of extract-time-connection:
1. Replace git add -A with explicit file staging in _reciprocal_edges
2. Push to origin/main immediately after commit (survive batch-extract reset)
3. RECIPROCAL_EDGE_MAP: challenges→challenged_by (not symmetric)
Added challenged_by to REWEAVE_EDGE_FIELDS, EDGE_FIELDS, EDGE_WEIGHTS
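The asymmetric mapping in sketch form (only the challenges entry is from this commit; the other entries are illustrative):

```python
RECIPROCAL_EDGE_MAP = {
    "supports": "supported_by",          # assumed example
    "challenges": "challenged_by",       # asymmetric, per the fix above
    "related_claims": "related_claims",  # assumed symmetric example
}

def reciprocal_edge(field: str) -> str:
    """Return the edge field to write on the target side of an edge."""
    return RECIPROCAL_EDGE_MAP.get(field, field)
```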
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The skip loop only matched `- ` (no indent) but YAML list items are
commonly written as ` - item` (2-space indent). This caused old list
items to persist alongside new ones, corrupting frontmatter on merge.
Fix: consume any line starting with space or dash as part of the current
field's value block.
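A sketch of the widened consume rule (function name and line-based shape are assumptions):

```python
def skip_field_block(lines: list[str], start: int) -> int:
    """Return the index of the first line past the current field's
    value block: any line starting with a space or a dash belongs to
    it, covering both `- item` and indented `  - item` list styles."""
    i = start + 1
    while i < len(lines) and lines[i][:1] in (" ", "-"):
        i += 1
    return i
```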
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Combined superset assertion and merge computation into single loop
(removed duplicate scalar-to-list normalization)
- Added worktree remove --force before worktree add to handle prior
crash leaving stale worktree (SIGKILL, OOM, power loss)
Two fixes from Ganymede review:
1. CRITICAL: blank line before closing --- compounded on repeat reweaves.
Body starts with \n---, so \n{body} created \n\n---. Fixed by checking
body prefix.
2. Replaced yaml.dump round-trip with _serialize_edge_fields() that splices
only edge arrays into raw frontmatter text. Non-edge fields (title,
confidence, type, quotes, flow styles) stay byte-identical to main HEAD.
_parse_yaml_frontmatter now returns 3-tuple: (dict, raw_fm_text, body).
_serialize_frontmatter takes (raw_fm_text, merged_edges_dict, body).
26 tests pass including idempotency (5x serialize), formatting preservation,
and no-blank-line regression test.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Reweave PRs modify existing files (appending YAML edges). Cherry-pick
fails ~75% when main moves between PR creation and merge.
_merge_reweave_pr() reads each changed file from both main HEAD and
branch HEAD, unions the edge arrays (order-preserving, main-first),
and writes the result. Eliminates merge conflicts structurally.
Key design decisions (Ganymede + Theseus approved):
- Order-preserving dedup: main's edges first, branch-new appended
- Superset assertion: logs warning if branch missing main edges
- Uses main's body text (reweave only touches frontmatter)
- Loud failure on parse errors (no cherry-pick fallback)
- Append-only contract: reweave adds edges, never removes
18 tests covering parse, union, serialize, superset, and full workflow.
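The union at the heart of this can be sketched as:

```python
def union_edges(main_edges: list[str], branch_edges: list[str]) -> list[str]:
    """Order-preserving dedup: main's edges first, branch-only edges
    appended. Append-only — nothing from main is ever removed."""
    seen = set(main_edges)
    merged = list(main_edges)
    for edge in branch_edges:
        if edge not in seen:
            seen.add(edge)
            merged.append(edge)
    return merged
```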
Also fixes _is_entity path check to use Path.parts instead of string
containment, preventing false positives on paths like "domains/entities-overview/".
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Ganymede review cleanup — duplicate by_chat block was already resolved
during consolidation, this removes the leftover cosmetic blank line.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Root cause: _group_into_windows never checked time gaps or chat_id.
All messages went into one stream, capped at 10 per window. 120 msgs
from one chat → 12 windows → 12 source files → 12 extraction branches.
Fix:
- Group by chat_id first (different chats = different windows always)
- Split on actual time gaps (>window_seconds between messages)
- Cap at 50 messages per window (not 10)
- Consolidate substantive windows from same chat into one source file
at triage time (one source per chat per triage cycle)
6 tests in tests/test_tg_batching.py.
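The grouping rules can be sketched over (chat_id, timestamp) pairs (triage-time consolidation omitted; shape is an assumption):

```python
def group_into_windows(msgs, window_seconds=900, cap=50):
    """Group messages by chat_id, split on real time gaps, and cap
    window size at 50 — the three fixes listed above."""
    by_chat: dict = {}
    for chat_id, ts in sorted(msgs):
        windows = by_chat.setdefault(chat_id, [])
        if (not windows
                or ts - windows[-1][-1] > window_seconds
                or len(windows[-1]) >= cap):
            windows.append([])          # start a new window
        windows[-1].append(ts)
    return by_chat
```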
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
_retry_conflict_prs called _rebase_and_push, which was never defined,
causing a NameError on every conflict retry. It now uses
_cherry_pick_onto_main, consistent with the primary merge path.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>