Teleo evaluation pipeline infrastructure — Python async daemon for claim extraction, validation, evaluation, and merge

Find a file

m3taversal 58fa8c5276 Some checks are pending CI / lint-and-test (push) Waiting to run Details feat(attribution): Phase A — event-sourced contribution ledger (schema v24) Introduces contribution_events table + non-breaking double-write. Schema lands today, forward traffic writes events alongside existing count upserts, backfill script replays history. Phase B will add leaderboard API reading from events; Phase C switches Argus dashboard over. ## Schema v24 (lib/db.py) - contribution_events: one row per credit-earning event (id, handle, kind, role, weight, pr_number, claim_path, domain, channel, timestamp) Partial UNIQUE indexes handle SQLite's NULL != NULL semantics: idx_ce_unique_claim on (handle, role, pr_number, claim_path) WHERE claim_path NOT NULL idx_ce_unique_pr on (handle, role, pr_number) WHERE claim_path IS NULL PR-level events (evaluator, author, challenger, synthesizer) dedup on 3-tuple. Per-claim events (originator) dedup on 4-tuple. Idempotent on replay. - contributor_aliases: canonical handle mapping Seeded: @thesensatore → thesensatore, cameron → cameron-s1 - contributors.kind TEXT DEFAULT 'person' Migration seeds 'agent' for known Pentagon agent handles. ## Role model (confirmed by Cory Apr 24) Weights: author 0.30, challenger 0.25, synthesizer 0.20, originator 0.15, evaluator 0.05 - author: human who submitted the PR (curation + submission work) - originator: person who authored the underlying content (rewards external creators) - challenger: agent/person who brought a productive disagreement - synthesizer: cross-domain work (enrichments, research sessions) - evaluator: reviewer who approved (Leo + domain agent) Humans-are-always-author: agents credit is capped at evaluator/synthesizer/ challenger. Pentagon agents classify as kind='agent' and surface in the agent-view leaderboard, not the default person view. ## Writer (lib/contributor.py) - New insert_contribution_event(): idempotent INSERT OR IGNORE with alias normalization + kind classification. Falls back silently on pre-v24 DBs. - record_contributor_attribution double-writes alongside existing upsert_contributor calls. Zero risk to current dashboard. - Author event: emitted once per PR from prs.submitted_by → git author → agent-branch-prefix. - Originator events: emitted per claim from frontmatter sourcer, skipping when sourcer == author (avoids self-credit double-count). - Evaluator events: Leo (always when leo_verdict='approve') + domain_agent (when domain_verdict='approve' and not Leo). - Challenger/Synthesizer: emitted from Pentagon-Agent trailer on agent-owned branches (theseus/, rio/, etc.) based on commit_type. Pipeline-owned branches (extract/, reweave/) get no trailer-based event — infrastructure work isn't contribution credit. ## Helpers (lib/attribution.py) - normalize_handle(raw, conn=None): lowercase + strip @ + alias lookup - classify_kind(handle): returns 'agent' for PENTAGON_AGENTS, else 'person' Intentionally narrow. Orgs get classified by operator review, not heuristics. ## Backfill (scripts/backfill-events.py) Replays all merged PRs into events. Idempotent (safe to re-run). Emits: - PR-level: author, evaluator, challenger, synthesizer - Per-claim: originator (walks knowledge tree, matches via description titles) Known limitation: post-merge PR branches are deleted from Forgejo, so we can't diff them for granular per-claim events. Claim→PR mapping uses prs.description (pipe-separated titles). Misses some edge cases but recovers the bulk of historical originator credit. Forward traffic gets clean per-claim events via the normal record_contributor_attribution path. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>		2026-04-24 13:59:22 +01:00
.forgejo/workflows	ganymede: add dev infrastructure — pyproject.toml, CI, deploy script	2026-03-13 14:24:27 +00:00
agent-state	Consolidate pipeline code from teleo-codex + VPS into single repo	2026-04-07 16:52:26 +01:00
deploy	fix: auto-deploy.sh rsync excludes broken + add tests/ sync	2026-04-20 17:22:11 +01:00
diagnostics	feat(activity): Timeline data gaps — type filter + commit_type classifier + source_channel reshape	2026-04-23 19:51:58 +01:00
docs	feat: reorganize repo with clear directory boundaries and agent ownership	2026-04-14 18:20:13 +01:00
hermes-agent	fix: set execute bit on research-session.sh and install-hermes.sh	2026-04-18 11:54:39 +01:00
lib	feat(attribution): Phase A — event-sourced contribution ledger (schema v24)	2026-04-24 13:59:22 +01:00
ops	fix: wire commit_type into contributor role assignment	2026-04-21 10:27:36 +01:00
research	fix: set execute bit on research-session.sh and install-hermes.sh	2026-04-18 11:54:39 +01:00
scripts	feat(attribution): Phase A — event-sourced contribution ledger (schema v24)	2026-04-24 13:59:22 +01:00
systemd	feat: add auto-deploy script and systemd units for teleo-infrastructure	2026-04-15 14:27:23 +01:00
telegram	add rio and theseus telegram bot agent configs	2026-04-20 17:20:21 +01:00
tests	fix(attribution): --diff-filter=A + handle sanity filter + remove legacy fallback	2026-04-24 12:58:55 +01:00
.gitignore	feat: add auto-deploy script and systemd units for teleo-infrastructure	2026-04-15 14:27:23 +01:00
CODEOWNERS	feat: reorganize repo with clear directory boundaries and agent ownership	2026-04-14 18:20:13 +01:00
fetch_coins.py	Skip liquidated entities in portfolio fetcher	2026-04-20 18:55:04 +01:00
pyproject.toml	ganymede: add dev infrastructure — pyproject.toml, CI, deploy script	2026-03-13 14:24:27 +00:00
README.md	feat: reorganize repo with clear directory boundaries and agent ownership	2026-04-14 18:20:13 +01:00
reweave.py	fix: quote YAML edge values containing colons, skip unparseable files in reweave merge	2026-04-18 12:07:28 +01:00
teleo-pipeline.py	fix: wrap breaker calls in stage_loop to prevent permanent task death	2026-04-20 12:37:28 +01:00

README.md

teleo-infrastructure

Pipeline infrastructure for the Teleo collective knowledge base. Async Python daemon that extracts, validates, evaluates, and merges claims via Forgejo PRs.

Directory Structure

teleo-infrastructure/
├── teleo-pipeline.py        # Daemon entry point
├── reweave.py               # Reciprocal edge maintenance
├── lib/                     # Pipeline modules (Python package)
├── diagnostics/             # Monitoring dashboard (port 8081)
├── telegram/                # Telegram bot interface
├── deploy/                  # Deployment + mirror scripts
├── systemd/                 # Service definitions
├── agent-state/             # Cross-session agent state
├── research/                # Nightly research orchestration
├── hermes-agent/            # Hermes agent setup
├── scripts/                 # One-off backfills + migrations
├── tests/                   # Test suite
└── docs/                    # Operational documentation

Ownership

Each directory has one owning agent. The owner is accountable for correctness and reviews all changes to their section. See CODEOWNERS for per-file detail.

Directory	Owner	What it does
`lib/` (core)	Ship	Config, DB, merge, cascade, validation, LLM calls
`lib/` (extraction)	Epimetheus	Source extraction, entity processing, pre-screening
`lib/` (evaluation)	Leo	Claim evaluation, analytics, attribution
`lib/` (health)	Argus	Health checks, search, claim index
`diagnostics/`	Argus	4-page dashboard, alerting, vitality metrics
`telegram/`	Ship	Telegram bot, X integration, retrieval
`deploy/`	Ship	rsync deploy, GitHub-Forgejo mirror
`systemd/`	Ship	teleo-pipeline, teleo-diagnostics, teleo-agent@
`agent-state/`	Ship	Bootstrap, state library, cascade inbox processor
`research/`	Ship	Nightly research sessions, prompt templates
`scripts/`	Ship	Backfills, migrations, one-off maintenance
`tests/`	Ganymede	pytest suite, integration tests
`docs/`	Shared	Architecture, specs, protocols

VPS Layout

Runs on Hetzner CAX31 (77.42.65.182) as user teleo.

VPS Path	Repo Source	Service
`/opt/teleo-eval/pipeline/`	`lib/`, `teleo-pipeline.py`, `reweave.py`	teleo-pipeline
`/opt/teleo-eval/diagnostics/`	`diagnostics/`	teleo-diagnostics
`/opt/teleo-eval/telegram/`	`telegram/`	(manual)
`/opt/teleo-eval/agent-state/`	`agent-state/`	(used by research-session.sh)

Quick Start

# Run tests
pip install -e ".[dev]"
pytest

# Deploy to VPS
./deploy/deploy.sh --dry-run   # preview
./deploy/deploy.sh             # deploy