Teleo evaluation pipeline infrastructure — Python async daemon for claim extraction, validation, evaluation, and merge


m3taversal 3fe0f4b744 fix(attribution): credit sourcer/extractor from claim frontmatter	2026-04-24 12:48:41 +01:00

Three layers of contributor-attribution bug surfaced by Apr 24 leaderboard investigation. alexastrum, thesensatore, cameron-s1 all had real merged contributions but zero credit in the contributors table.

1. lib/attribution.py: parse_attribution() only read `attribution_sourcer:` prefix-keyed flat fields. ~42% of claim files (535/1280) use the bare-key form `sourcer: alexastrum` written by extract.py. Added bare-key handling between the prefixed-flat path and the legacy-source-field fallback. Block format (`attribution: { sourcer: [...] }`) still wins when present.
2. lib/contributor.py: record_contributor_attribution() parsed the diff text with regex looking for `+- handle: "X"` lines. This matched neither the bare-key flat format nor the `attribution: { sourcer: [...] }` block format Leo uses for manual extractions. Replaced the regex parser with a file walker that calls attribution.parse_attribution_from_file() on each changed knowledge file, giving a single source of truth for both formats.
3. scripts/backfill-sourcer-attribution.py: walks all merged knowledge files, re-attributes via the canonical parser, upserts contributors. Default additive mode preserves existing high counts (e.g. m3taversal.sourcer=1011 reflects Telegram-curator credit accumulated via a different code path that this fix does not touch). --reset flag for the destructive case.

Dry-run preview (additive mode):
- 670 NEW contributors to insert (mostly source-citation handles)
- 77 EXISTING contributors with under-counted role columns
- alexastrum: 0 → 6, thesensatore: 0 → 5, cameron-s1: 0 → 2
- astra.sourcer: 0 → 96, leo.sourcer: 0 → 44, theseus.sourcer: 0 → 18
- m3taversal.sourcer: 1011 (preserved, not 22 from file walk)

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
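
The format precedence and the additive backfill described in the commit message above can be illustrated with a minimal sketch. This is not the code in lib/attribution.py or scripts/backfill-sourcer-attribution.py. The names `resolve_attribution` and `merge_role_count` are hypothetical, and the frontmatter is assumed to be already parsed into a dict. The legacy fallback key `source` and the "keep the larger count" rule for additive mode are assumptions inferred from the message.

```python
from typing import Any

ROLES = ("sourcer", "extractor")  # the two roles named in the commit title


def _handles(value: Any) -> list[str]:
    """Normalize a scalar, a list, or a list of {"handle": ...} dicts to handle strings."""
    if value is None:
        return []
    items = value if isinstance(value, list) else [value]
    out = []
    for item in items:
        if isinstance(item, dict):
            item = item.get("handle")
        if item is None:
            continue
        text = str(item).strip()
        if text:
            out.append(text)
    return out


def resolve_attribution(frontmatter: dict[str, Any]) -> dict[str, list[str]]:
    """Resolve handles per role, trying the formats in the order the commit describes."""
    # 1. Block format wins when present: attribution: { sourcer: [...] }
    block = frontmatter.get("attribution")
    if isinstance(block, dict):
        from_block = {role: _handles(block.get(role)) for role in ROLES}
        if any(from_block.values()):
            return from_block

    resolved = {}
    for role in ROLES:
        resolved[role] = (
            _handles(frontmatter.get(f"attribution_{role}"))  # 2. prefixed flat key
            or _handles(frontmatter.get(role))                # 3. bare key
        )

    # 4. Legacy fallback (assumed key name "source", assumed to credit the sourcer role).
    if not resolved["sourcer"]:
        resolved["sourcer"] = _handles(frontmatter.get("source"))
    return resolved


def merge_role_count(existing: int, walked: int, reset: bool = False) -> int:
    """Additive mode never lowers a stored count; reset replaces it with the file-walk count."""
    return walked if reset else max(existing, walked)
```

Under these assumptions, `merge_role_count(1011, 22)` keeps 1011 and `merge_role_count(0, 6)` yields 6, matching the dry-run figures quoted for m3taversal and alexastrum.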
.forgejo/workflows	ganymede: add dev infrastructure — pyproject.toml, CI, deploy script	2026-03-13 14:24:27 +00:00
agent-state	Consolidate pipeline code from teleo-codex + VPS into single repo	2026-04-07 16:52:26 +01:00
deploy	fix: auto-deploy.sh rsync excludes broken + add tests/ sync	2026-04-20 17:22:11 +01:00
diagnostics	feat(activity): Timeline data gaps — type filter + commit_type classifier + source_channel reshape	2026-04-23 19:51:58 +01:00
docs	feat: reorganize repo with clear directory boundaries and agent ownership	2026-04-14 18:20:13 +01:00
hermes-agent	fix: set execute bit on research-session.sh and install-hermes.sh	2026-04-18 11:54:39 +01:00
lib	fix(attribution): credit sourcer/extractor from claim frontmatter	2026-04-24 12:48:41 +01:00
ops	fix: wire commit_type into contributor role assignment	2026-04-21 10:27:36 +01:00
research	fix: set execute bit on research-session.sh and install-hermes.sh	2026-04-18 11:54:39 +01:00
scripts	fix(attribution): credit sourcer/extractor from claim frontmatter	2026-04-24 12:48:41 +01:00
systemd	feat: add auto-deploy script and systemd units for teleo-infrastructure	2026-04-15 14:27:23 +01:00
telegram	add rio and theseus telegram bot agent configs	2026-04-20 17:20:21 +01:00
tests	fix: add telegram/ and tests/ to deploy pipeline, remove hardcoded API key	2026-04-20 17:15:55 +01:00
.gitignore	feat: add auto-deploy script and systemd units for teleo-infrastructure	2026-04-15 14:27:23 +01:00
CODEOWNERS	feat: reorganize repo with clear directory boundaries and agent ownership	2026-04-14 18:20:13 +01:00
fetch_coins.py	Skip liquidated entities in portfolio fetcher	2026-04-20 18:55:04 +01:00
pyproject.toml	ganymede: add dev infrastructure — pyproject.toml, CI, deploy script	2026-03-13 14:24:27 +00:00
README.md	feat: reorganize repo with clear directory boundaries and agent ownership	2026-04-14 18:20:13 +01:00
reweave.py	fix: quote YAML edge values containing colons, skip unparseable files in reweave merge	2026-04-18 12:07:28 +01:00
teleo-pipeline.py	fix: wrap breaker calls in stage_loop to prevent permanent task death	2026-04-20 12:37:28 +01:00

README.md

teleo-infrastructure

Pipeline infrastructure for the Teleo collective knowledge base. Async Python daemon that extracts, validates, evaluates, and merges claims via Forgejo PRs.

Directory Structure

teleo-infrastructure/
├── teleo-pipeline.py        # Daemon entry point
├── reweave.py               # Reciprocal edge maintenance
├── lib/                     # Pipeline modules (Python package)
├── diagnostics/             # Monitoring dashboard (port 8081)
├── telegram/                # Telegram bot interface
├── deploy/                  # Deployment + mirror scripts
├── systemd/                 # Service definitions
├── agent-state/             # Cross-session agent state
├── research/                # Nightly research orchestration
├── hermes-agent/            # Hermes agent setup
├── scripts/                 # One-off backfills + migrations
├── tests/                   # Test suite
└── docs/                    # Operational documentation

Ownership

Each directory has one owning agent, except `lib/`, which is split into four areas with separate owners, and `docs/`, which is shared. The owner is accountable for correctness and reviews all changes to their section. See CODEOWNERS for per-file detail.

Directory	Owner	What it does
`lib/` (core)	Ship	Config, DB, merge, cascade, validation, LLM calls
`lib/` (extraction)	Epimetheus	Source extraction, entity processing, pre-screening
`lib/` (evaluation)	Leo	Claim evaluation, analytics, attribution
`lib/` (health)	Argus	Health checks, search, claim index
`diagnostics/`	Argus	4-page dashboard, alerting, vitality metrics
`telegram/`	Ship	Telegram bot, X integration, retrieval
`deploy/`	Ship	rsync deploy, GitHub-Forgejo mirror
`systemd/`	Ship	teleo-pipeline, teleo-diagnostics, teleo-agent@
`agent-state/`	Ship	Bootstrap, state library, cascade inbox processor
`research/`	Ship	Nightly research sessions, prompt templates
`scripts/`	Ship	Backfills, migrations, one-off maintenance
`tests/`	Ganymede	pytest suite, integration tests
`docs/`	Shared	Architecture, specs, protocols

VPS Layout

Runs on Hetzner CAX31 (77.42.65.182) as user teleo.

VPS Path	Repo Source	Service
`/opt/teleo-eval/pipeline/`	`lib/`, `teleo-pipeline.py`, `reweave.py`	teleo-pipeline
`/opt/teleo-eval/diagnostics/`	`diagnostics/`	teleo-diagnostics
`/opt/teleo-eval/telegram/`	`telegram/`	(manual)
`/opt/teleo-eval/agent-state/`	`agent-state/`	(used by research-session.sh)
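
The table above can also be read as a lookup from a changed repo path to the VPS directory and service it affects. The sketch below is an illustration only, not part of deploy/deploy.sh. `VPS_LAYOUT` and `deploy_target` are hypothetical names. A service of `None` stands for rows the table does not tie to a systemd unit. Only the target directory is returned, because the table does not say how files are arranged inside it.

```python
from typing import Optional

# (repo source, VPS directory, service) rows copied from the VPS Layout table.
VPS_LAYOUT = [
    ("lib/", "/opt/teleo-eval/pipeline/", "teleo-pipeline"),
    ("teleo-pipeline.py", "/opt/teleo-eval/pipeline/", "teleo-pipeline"),
    ("reweave.py", "/opt/teleo-eval/pipeline/", "teleo-pipeline"),
    ("diagnostics/", "/opt/teleo-eval/diagnostics/", "teleo-diagnostics"),
    ("telegram/", "/opt/teleo-eval/telegram/", None),        # listed as (manual)
    ("agent-state/", "/opt/teleo-eval/agent-state/", None),  # used by research-session.sh
]


def deploy_target(repo_path: str) -> Optional[tuple[str, Optional[str]]]:
    """Return (VPS directory, service) for a repo-relative path, or None if not listed."""
    for source, vps_dir, service in VPS_LAYOUT:
        is_dir_match = source.endswith("/") and repo_path.startswith(source)
        if repo_path == source or is_dir_match:
            return vps_dir, service
    return None
```

For example, `deploy_target("lib/attribution.py")` returns the pipeline directory and `teleo-pipeline`, while a path under `docs/` returns `None` because the table lists no VPS location for it.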

Quick Start

# Run tests
pip install -e ".[dev]"
pytest

# Deploy to VPS
./deploy/deploy.sh --dry-run   # preview
./deploy/deploy.sh             # deploy