teleo/teleo-infrastructure

Fork 0

Commit graph

Author	SHA1	Message	Date
Teleo Agents	74bf0461e8	fix(attribution): canonicalize submitted_by at write time + historical normalizer Some checks failed CI / lint-and-test (pull_request) Has been cancelled Details Companion / write-side fix to fix/activity-feed-canonical-handle. The activity-feed canonicalization was a read-side guard. The bug at the source is that extract.py and two backfill scripts write decorated strings (Vida (self-directed), pipeline (reweave), @m3taversal) into prs.submitted_by and sources.submitted_by. Downstream readers (lib.contributor.insert_contribution_event, scripts/scoring_digest, diagnostics/activity_feed_api) all strip the decorator on read — but anything that reads the column verbatim (like /api/activity-feed before the read-side fix) 404s on /contributors/{decorated-handle}. Stop writing the decorator. The self-directed signal is already carried by intake_tier == research-task plus the prs.agent column; the suffix is redundant string noise that costs us correctness at every consumer that forgets to strip. Changes: - lib/extract.py:690 — write canonical handle via attribution.normalize_handle. Direct elif for intake_tier == research-task now stores just agent_name. @m3taversal -> m3taversal. - diagnostics/backfill_submitted_by.py — same fix in two branches plus the reweave branch (pipeline (reweave) -> pipeline). - scripts/backfill-research-session-attribution.py — UPDATE prs sets agent handle alone, no suffix. Docstring + log line updated. - scripts/normalize-submitted-by.py (new) — one-time backfill that canonicalizes existing prs.submitted_by and sources.submitted_by rows. Strips trailing parenthetical decorators, lowercases, drops @. Defaults to dry-run; --apply to commit. Skips rows that would normalize to invalid handles (no garbage falls through silently). Dry-run against live pipeline.db: prs: 3008 rows need normalization (clean mappings, 0 invalid) sources: 730 rows need normalization (clean mappings, 0 invalid) Total: 3738 rows. All map to existing handle column values. After this lands + auto-deploys, the operator should run python3 scripts/normalize-submitted-by.py --apply once to clean historical rows. The read-side canonicalization in diagnostics/activity_feed_api.py (fix/activity-feed-canonical-handle) becomes redundant defense-in-depth instead of load-bearing. No KB writes.	2026-05-13 02:56:50 +00:00
m3taversal	2d332c66d4	fix(attribution): credit research-session sources to agents, not m3taversal Some checks failed CI / lint-and-test (pull_request) Has been cancelled Details Two-part fix for a bug where every claim extracted from agent overnight research sessions was being credited to m3taversal in contribution_events (visible in the activity feed as "@m3taversal" on agent-derived claims). Forward fix (research/research-session.sh): The frontmatter template the agent prompt instructs Claude to use now includes `proposed_by: ${AGENT}` and `intake_tier: research-task`. With those fields present, extract.py path 1 (line 687) takes precedence and sets prs.submitted_by to the agent handle, which then propagates into contribution_events as a kind='agent' author event for the agent. Without the fields, extract.py fell through to the default branch on line 695 and set submitted_by='@m3taversal'. Backfill (scripts/backfill-research-session-attribution.py): Identifies research-session-derived PRs by finding teleo-codex commits matching `^<agent>: research session YYYY-MM-DD —`, listing the inbox/queue/*.md files added in each commit's diff, and matching those filename basenames against prs.source_path. Only PRs currently submitted_by='@m3taversal' AND merged within the configurable window are touched. Default --dry-run; --apply to commit. For each match the script: 1. UPDATE prs SET submitted_by = '<agent> (self-directed)' 2. INSERT OR IGNORE the agent author event (kind='agent', weight=0.30) with the original PR's domain, channel, merged_at preserved 3. DELETE the misattributed m3taversal author event Applied 30-day backfill on VPS: - 304 PRs re-attributed (rio 74, clay 70, astra 53, vida 48, theseus 30, leo 29) - 297 m3taversal author events deleted, 304 agent author events inserted (delta of 7 = pre-v24 PRs that never had m3ta events in the first place; we still create the new agent event) - m3taversal author count: 1368 → 1071 (−22%) - Pre-backfill DB snapshot: pipeline.db.bak-pre-research-attribution Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 12:38:53 +01:00

Author

SHA1

Message

Date

Teleo Agents

74bf0461e8

fix(attribution): canonicalize submitted_by at write time + historical normalizer

CI / lint-and-test (pull_request) Has been cancelled

Details

Companion / write-side fix to fix/activity-feed-canonical-handle.

The activity-feed canonicalization was a read-side guard. The bug at the
source is that extract.py and two backfill scripts write decorated
strings (Vida (self-directed), pipeline (reweave), @m3taversal) into
prs.submitted_by and sources.submitted_by. Downstream readers
(lib.contributor.insert_contribution_event, scripts/scoring_digest,
diagnostics/activity_feed_api) all strip the decorator on read — but
anything that reads the column verbatim (like /api/activity-feed before
the read-side fix) 404s on /contributors/{decorated-handle}.

Stop writing the decorator. The self-directed signal is already carried
by intake_tier == research-task plus the prs.agent column; the suffix
is redundant string noise that costs us correctness at every consumer
that forgets to strip.

Changes:

- lib/extract.py:690 — write canonical handle via attribution.normalize_handle.
  Direct elif for intake_tier == research-task now stores just agent_name.
  @m3taversal -> m3taversal.

- diagnostics/backfill_submitted_by.py — same fix in two branches plus
  the reweave branch (pipeline (reweave) -> pipeline).

- scripts/backfill-research-session-attribution.py — UPDATE prs sets
  agent handle alone, no suffix. Docstring + log line updated.

- scripts/normalize-submitted-by.py (new) — one-time backfill that
  canonicalizes existing prs.submitted_by and sources.submitted_by rows.
  Strips trailing parenthetical decorators, lowercases, drops @. Defaults
  to dry-run; --apply to commit. Skips rows that would normalize to
  invalid handles (no garbage falls through silently).

Dry-run against live pipeline.db:
  prs:     3008 rows need normalization (clean mappings, 0 invalid)
  sources: 730 rows need normalization (clean mappings, 0 invalid)
  Total:   3738 rows. All map to existing handle column values.

After this lands + auto-deploys, the operator should run
  python3 scripts/normalize-submitted-by.py --apply
once to clean historical rows. The read-side canonicalization in
diagnostics/activity_feed_api.py (fix/activity-feed-canonical-handle)
becomes redundant defense-in-depth instead of load-bearing.

No KB writes.

2026-05-13 02:56:50 +00:00

m3taversal

2d332c66d4

fix(attribution): credit research-session sources to agents, not m3taversal

CI / lint-and-test (pull_request) Has been cancelled

Details

Two-part fix for a bug where every claim extracted from agent overnight
research sessions was being credited to m3taversal in contribution_events
(visible in the activity feed as "@m3taversal" on agent-derived claims).

Forward fix (research/research-session.sh):
The frontmatter template the agent prompt instructs Claude to use now
includes `proposed_by: ${AGENT}` and `intake_tier: research-task`. With
those fields present, extract.py path 1 (line 687) takes precedence and
sets prs.submitted_by to the agent handle, which then propagates into
contribution_events as a kind='agent' author event for the agent.

Without the fields, extract.py fell through to the default branch on
line 695 and set submitted_by='@m3taversal'.

Backfill (scripts/backfill-research-session-attribution.py):
Identifies research-session-derived PRs by finding teleo-codex commits
matching `^<agent>: research session YYYY-MM-DD —`, listing the
inbox/queue/*.md files added in each commit's diff, and matching those
filename basenames against prs.source_path. Only PRs currently
submitted_by='@m3taversal' AND merged within the configurable window
are touched. Default --dry-run; --apply to commit.

For each match the script:
  1. UPDATE prs SET submitted_by = '<agent> (self-directed)'
  2. INSERT OR IGNORE the agent author event (kind='agent', weight=0.30)
     with the original PR's domain, channel, merged_at preserved
  3. DELETE the misattributed m3taversal author event

Applied 30-day backfill on VPS:
  - 304 PRs re-attributed (rio 74, clay 70, astra 53, vida 48,
    theseus 30, leo 29)
  - 297 m3taversal author events deleted, 304 agent author events
    inserted (delta of 7 = pre-v24 PRs that never had m3ta events
    in the first place; we still create the new agent event)
  - m3taversal author count: 1368 → 1071 (−22%)
  - Pre-backfill DB snapshot: pipeline.db.bak-pre-research-attribution

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-27 12:38:53 +01:00

2 commits