teleo-codex

Author	SHA1	Message	Date
m3taversal	6361c7e9e8	Merge branch 'epimetheus/eval-cost-tracking'	2026-04-14 12:25:46 +01:00
m3taversal	5f287ae9c8	epimetheus: fix connect.py title→slug mismatch in vector-search edges claim_title payloads wrote unresolvable human-readable titles into frontmatter related fields. Switched to claim_path with slug extraction so reciprocal edges in merge.py can resolve targets. Renamed neighbor_titles→neighbor_slugs throughout for consistency. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 12:25:41 +01:00
m3taversal	5b9ce01412	epimetheus: wire LLM connections into typed frontmatter edges Extract.py was discarding LLM-provided connections — related_claims went into frontmatter as wiki-links but supports/challenges/depends_on from the connections field were ignored entirely. This is the primary driver of 50%+ orphan ratio. Now: connections[] → typed edge fields (supports/challenges/related) in YAML frontmatter. related_claims fall back to related edges. Post-write connect_new_claims() adds vector-search edges for claims the LLM missed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 12:01:21 +01:00
m3taversal	154f36f2d3	epimetheus: fix eval crash + wire per-PR cost tracking Three bugs fixed: 1. triage_pr() returns 3 values but line 611 unpacked 2 → ValueError on every non-deterministic PR (circuit breaker opened, 5 PRs stuck) 2. costs import was inside triage else-block → NameError on deterministic routes 3. pr_cost never written to prs.cost_usd → 0% cost tracking across 1,118 PRs Cost tracking now covers all 4 exit paths: domain failed, domain rejected, Leo failed, and normal completion. Uses additive UPDATE (cost_usd + ?) so re-evals accumulate correctly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 12:01:13 +01:00
m3taversal	7bfce6b706	commit telegram bot module from VPS — 20 files never previously in repo Pulled from /opt/teleo-eval/telegram/ on VPS. Includes: - bot.py (92K), kb_retrieval.py, kb_tools.py (agentic retrieval) - retrieval.py (RRF merge, query decomposition, entity traversal) - response.py (system prompt builder, response parser) - agent_config.py, agent_runner.py (multi-agent template unit support) - approval_stages.py, approvals.py, digest.py (approval workflow) - eval_checks.py, eval.py (response quality checks) - output_gate.py, x_publisher.py, x_client.py, x_search.py (X pipeline) - market_data.py, worktree_lock.py (utilities) - rio.yaml, theseus.yaml (agent configs) These files were deployed to VPS but never committed to the repo. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 11:02:32 +02:00
m3taversal	3461f2ad8f	apply Ganymede review fixes: delete misplaced ops/db.py, correct diff log, fix stale_pr DB update Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 10:57:43 +02:00
m3taversal	e27f6a7b91	commit pending pipeline changes: watchdog tier0 recovery, stale_pr cleanup, deploy.sh improvements - watchdog.py: tier0 auto-recovery (3 retries, 1h cooldown, audit trail) — pending Ganymede review - stale_pr.py: new module, closes extraction PRs open >30 min with zero claims - deploy.sh: expanded with new deployment features - validate.py, extract.py, cascade.py, db.py: minor fixes - backfill-descriptions.py: utility script - review_queue.py: minor fix Note: watchdog + stale_pr not yet deployed to VPS (reverted after missing import crash) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 10:14:54 +02:00
m3taversal	efe23f931a	ship: fix evaluator column + correct contributor attribution - Add domain_agent and domain_model to pr-lifecycle API response (data was queried but dropped before serialization — evaluator column showed blank) - Show model name tag next to evaluator (Gemini Flash, GPT-4o, etc.) - Re-attribute 1201 "pipeline (self-directed)" PRs to @m3taversal — these were Cory-directed, not autonomous overnight research - Re-attribute 252 NULL PRs to @m3taversal - Fix extract.py defaults: new PRs without proposed_by default to @m3taversal - Fix backfill script defaults: extract/ branches → @m3taversal, not "pipeline (self-directed)" - Only agent-named branches (rio/, theseus/, etc.) from research-session.sh remain as "(self-directed)" Pentagon-Agent: Ship <B8D06D3F-1589-4777-B2E7-B2460D51C81F>	2026-04-07 14:56:03 +00:00
m3taversal	9925576c13	ship: add contributor attribution tracing to PR lifecycle - Migration v19: submitted_by column on prs + sources tables - extract.py: propagates proposed_by from source frontmatter → PR record - merge.py: sets submitted_by from Forgejo author for human PRs - dashboard_prs.py: redesigned with Contributor column, improved claim visibility in expanded rows, cost estimates, evaluator chain display - dashboard_routes.py: submitted_by + source_path in pr-lifecycle API - backfill_submitted_by.py: one-time backfill (1525/1777 PRs matched) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 14:56:03 +00:00
m3taversal	adbe3bd911	fix: prevent reweave PR flood — freshen base, cleanup branches on failure Three fixes for the reweave merge failure cycle: 1. reweave.py: fetch + reset to origin/main before branch creation, eliminating the stale-base problem that caused ~75% merge failure rate 2. merge.py: delete remote branch when closing reweave PRs (in reconcile, merge failure, and conflict retry paths) — prevents discover_external_prs from rediscovering stale branches and creating new PRs every 18 minutes 3. merge.py: skip cherry-pick retry for reweave branches — reweave modifies existing files so cherry-pick always fails, go straight to close+delete Pentagon-Agent: Ship <f3064ef4-c330-4809-ad37-39290b2eaa5b> Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 14:56:03 +00:00
m3taversal	0591c4c0df	wire cascade, cross_domain, and review_records into pipeline - merge.py: import + await cascade_after_merge and cross_domain_after_merge after reciprocal edges, before branch deletion. Both non-fatal. Added conn.commit() before slow branch deletion (Ganymede Q4). - db.py: add record_review() helper + migration v18 (review_records table with indexes). Schema version 17→18. - evaluate.py: call record_review() at all 3 verdict points: domain_rejected → outcome=rejected approved → outcome=approved changes_requested → outcome=approved-with-changes Notes field captures review text (capped 4000 chars). Pentagon-Agent: Ship <E2A054E5-A6D6-4AE0-B0A3-F51A3B4DBCA5>	2026-04-07 14:56:03 +00:00
m3taversal	8f6057686e	fix: reweave regex fallback uses consistent YAML list format The regex fallback was writing list entries as ' - "title"' (2-space indent + quotes) while existing frontmatter uses '- title' (0-space indent, no quotes). This caused YAML parse failures during merge. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 14:56:03 +00:00
m3taversal	a68f38609d	fix: add date_errors to substantive fixer tag routing date_errors was evaluated but never routed to any fixer, leaving PRs stuck permanently. Now classified as FIXABLE with targeted prompt guidance. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 14:56:02 +00:00
m3taversal	05d74d5e32	sync: import all VPS pipeline + diagnostics code as baseline Imports 67 files from VPS (/opt/teleo-eval/) into repo as the single source of truth. Previously only 8 of 67 files existed in repo — the rest were deployed directly to VPS via SCP, causing massive drift. Includes: - pipeline/lib/: 33 Python modules (daemon core, extraction, evaluation, merge, cascade, cross-domain, costs, attribution, etc.) - pipeline/: main daemon (teleo-pipeline.py), reweave.py, batch-extract-50.sh - diagnostics/: 19 files (4-page dashboard, alerting, daily digest, review queue, tier1 metrics) - agent-state/: bootstrap, lib-state, cascade inbox processor, schema - systemd/: service unit files for reference - deploy.sh: rsync-based deploy with --dry-run, syntax checks, dirty-tree gate - research-session.sh: updated with Step 8.5 digest + cascade inbox processing No new code written — all files are exact copies from VPS as of 2026-04-06. From this point forward: edit in repo, commit, then deploy.sh. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 00:00:00 +01:00
m3taversal	2c0d428dc0	Add Phase 1+2 instrumentation: review records, cascade automation, cross-domain index, agent state Phase 1 — Audit logging infrastructure: - review_records table (migration v12) capturing every eval verdict with outcome, rejection reason, disagreement type - Cascade automation: auto-flag dependent beliefs/positions when merged claims change - Merge frontmatter stamps: last_review metadata on merged claim files Phase 2 — Cross-domain and state tracking: - Cross-domain citation index: entity overlap detection across domains on every merge - Agent-state schema v1: file-backed state for VPS agents (memory, tasks, inbox, metrics) - Cascade completion tracking: process-cascade-inbox.py logs review outcomes - research-session.sh: state hooks + cascade processing integration All changes are live on VPS. This commit brings the code under version control for review. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-02 10:50:49 +00:00

15 commits