Teleo evaluation pipeline infrastructure — Python async daemon for claim extraction, validation, evaluation, and merge
Find a file
Teleo Agents 74bf0461e8
Some checks are pending
CI / lint-and-test (pull_request) Waiting to run
fix(attribution): canonicalize submitted_by at write time + historical normalizer
Companion / write-side fix to fix/activity-feed-canonical-handle.

The activity-feed canonicalization was a read-side guard. The bug at the
source is that extract.py and two backfill scripts write decorated
strings (Vida (self-directed), pipeline (reweave), @m3taversal) into
prs.submitted_by and sources.submitted_by. Downstream readers
(lib.contributor.insert_contribution_event, scripts/scoring_digest,
diagnostics/activity_feed_api) all strip the decorator on read — but
anything that reads the column verbatim (like /api/activity-feed before
the read-side fix) 404s on /contributors/{decorated-handle}.

Stop writing the decorator. The self-directed signal is already carried
by intake_tier == research-task plus the prs.agent column; the suffix
is redundant string noise that costs us correctness at every consumer
that forgets to strip.

Changes:

- lib/extract.py:690 — write canonical handle via attribution.normalize_handle.
  Direct elif for intake_tier == research-task now stores just agent_name.
  @m3taversal -> m3taversal.

- diagnostics/backfill_submitted_by.py — same fix in two branches plus
  the reweave branch (pipeline (reweave) -> pipeline).

- scripts/backfill-research-session-attribution.py — UPDATE prs sets
  agent handle alone, no suffix. Docstring + log line updated.

- scripts/normalize-submitted-by.py (new) — one-time backfill that
  canonicalizes existing prs.submitted_by and sources.submitted_by rows.
  Strips trailing parenthetical decorators, lowercases, drops @. Defaults
  to dry-run; --apply to commit. Skips rows that would normalize to
  invalid handles (no garbage falls through silently).

Dry-run against live pipeline.db:
  prs:     3008 rows need normalization (clean mappings, 0 invalid)
  sources: 730 rows need normalization (clean mappings, 0 invalid)
  Total:   3738 rows. All map to existing handle column values.

After this lands + auto-deploys, the operator should run
  python3 scripts/normalize-submitted-by.py --apply
once to clean historical rows. The read-side canonicalization in
diagnostics/activity_feed_api.py (fix/activity-feed-canonical-handle)
becomes redundant defense-in-depth instead of load-bearing.

No KB writes.
2026-05-13 02:56:50 +00:00
.forgejo/workflows ganymede: add dev infrastructure — pyproject.toml, CI, deploy script 2026-03-13 14:24:27 +00:00
agent-state Consolidate pipeline code from teleo-codex + VPS into single repo 2026-04-07 16:52:26 +01:00
deploy sync-mirror: surface tracker SELECT/INSERT failures to ops log 2026-05-01 15:48:28 +01:00
diagnostics fix(attribution): canonicalize submitted_by at write time + historical normalizer 2026-05-13 02:56:50 +00:00
docs feat: reorganize repo with clear directory boundaries and agent ownership 2026-04-14 18:20:13 +01:00
hermes-agent fix: set execute bit on research-session.sh and install-hermes.sh 2026-04-18 11:54:39 +01:00
lib fix(attribution): canonicalize submitted_by at write time + historical normalizer 2026-05-13 02:56:50 +00:00
ops fix: wire commit_type into contributor role assignment 2026-04-21 10:27:36 +01:00
research fix(attribution): unify research-session format on "(self-directed)" suffix 2026-04-27 12:53:52 +01:00
scripts fix(attribution): canonicalize submitted_by at write time + historical normalizer 2026-05-13 02:56:50 +00:00
systemd feat: add auto-deploy script and systemd units for teleo-infrastructure 2026-04-15 14:27:23 +01:00
telegram add rio and theseus telegram bot agent configs 2026-04-20 17:20:21 +01:00
tests fix(tests): apply Ganymede review nits + add m3taversal reset script 2026-04-27 17:35:18 +01:00
.gitignore feat: add auto-deploy script and systemd units for teleo-infrastructure 2026-04-15 14:27:23 +01:00
CODEOWNERS feat: reorganize repo with clear directory boundaries and agent ownership 2026-04-14 18:20:13 +01:00
fetch_coins.py Skip liquidated entities in portfolio fetcher 2026-04-20 18:55:04 +01:00
pyproject.toml ganymede: add dev infrastructure — pyproject.toml, CI, deploy script 2026-03-13 14:24:27 +00:00
README.md feat: reorganize repo with clear directory boundaries and agent ownership 2026-04-14 18:20:13 +01:00
reweave.py fix: quote YAML edge values containing colons, skip unparseable files in reweave merge 2026-04-18 12:07:28 +01:00
teleo-pipeline.py fix(reaper): apply Ganymede review — dual-PATCH drift, breaker isolation, env config 2026-05-07 23:43:53 -04:00

teleo-infrastructure

Pipeline infrastructure for the Teleo collective knowledge base. Async Python daemon that extracts, validates, evaluates, and merges claims via Forgejo PRs.

Directory Structure

teleo-infrastructure/
├── teleo-pipeline.py        # Daemon entry point
├── reweave.py               # Reciprocal edge maintenance
├── lib/                     # Pipeline modules (Python package)
├── diagnostics/             # Monitoring dashboard (port 8081)
├── telegram/                # Telegram bot interface
├── deploy/                  # Deployment + mirror scripts
├── systemd/                 # Service definitions
├── agent-state/             # Cross-session agent state
├── research/                # Nightly research orchestration
├── hermes-agent/            # Hermes agent setup
├── scripts/                 # One-off backfills + migrations
├── tests/                   # Test suite
└── docs/                    # Operational documentation

Ownership

Each directory has one owning agent. The owner is accountable for correctness and reviews all changes to their section. See CODEOWNERS for per-file detail.

Directory Owner What it does
lib/ (core) Ship Config, DB, merge, cascade, validation, LLM calls
lib/ (extraction) Epimetheus Source extraction, entity processing, pre-screening
lib/ (evaluation) Leo Claim evaluation, analytics, attribution
lib/ (health) Argus Health checks, search, claim index
diagnostics/ Argus 4-page dashboard, alerting, vitality metrics
telegram/ Ship Telegram bot, X integration, retrieval
deploy/ Ship rsync deploy, GitHub-Forgejo mirror
systemd/ Ship teleo-pipeline, teleo-diagnostics, teleo-agent@
agent-state/ Ship Bootstrap, state library, cascade inbox processor
research/ Ship Nightly research sessions, prompt templates
scripts/ Ship Backfills, migrations, one-off maintenance
tests/ Ganymede pytest suite, integration tests
docs/ Shared Architecture, specs, protocols

VPS Layout

Runs on Hetzner CAX31 (77.42.65.182) as user teleo.

VPS Path Repo Source Service
/opt/teleo-eval/pipeline/ lib/, teleo-pipeline.py, reweave.py teleo-pipeline
/opt/teleo-eval/diagnostics/ diagnostics/ teleo-diagnostics
/opt/teleo-eval/telegram/ telegram/ (manual)
/opt/teleo-eval/agent-state/ agent-state/ (used by research-session.sh)

Quick Start

# Run tests
pip install -e ".[dev]"
pytest

# Deploy to VPS
./deploy/deploy.sh --dry-run   # preview
./deploy/deploy.sh             # deploy