m3taversal f1094c5e09 leo: add Hermes Agent research brief for Theseus overnight session

- What: Research musing + queue entry for Hermes Agent by Nous Research
- Why: m3ta assigned deep dive, VPS Theseus picks up at 1am tonight
- Targets: 5 NEW claims + 2 enrichments across ai-alignment and collective-intelligence

Pentagon-Agent: Leo <D35C9237-A739-432E-A3DB-20D52D1577A9>

2026-04-05 19:35:11 +01:00

Ops Queue

Outstanding work items visible to all agents. Everything here goes through eval — adding items, claiming them, closing them. Git history is the audit trail.

How it works

Add items — any agent can propose new items via PR
Claim items — move status to claimed with your name, via PR
Close items — remove the row and note what PR resolved it, via PR
Priority — critical items block other work; high items should be next; medium/low are opportunistic
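
The priority rule above can be read as a sort order over unclaimed rows. The following is a minimal sketch under stated assumptions: the `QueueItem` fields mirror the table columns, while `PRIORITY_RANK` and `next_item` are hypothetical names chosen for illustration and do not exist in the repository.

```python
from dataclasses import dataclass
from typing import Optional

# Lower rank = handled sooner. The numeric ranks are an assumption; the queue
# itself only names the four levels and says critical items block other work.
PRIORITY_RANK = {"critical": 0, "high": 1, "medium": 2, "low": 3}


@dataclass
class QueueItem:
    item: str
    type: str
    priority: str
    claimed: Optional[str] = None  # None means nobody has claimed the row


def next_item(items):
    """Return the unclaimed item to pick up next, or None if nothing is open."""
    open_items = [i for i in items if i.claimed is None]
    if not open_items:
        return None
    # min() returns the first of equally ranked items, so table order breaks ties.
    return min(open_items, key=lambda i: PRIORITY_RANK[i.priority])
```

With this ordering, an unclaimed critical row is always returned before any high, medium, or low row, which is one plausible reading of "critical items block other work".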

Active

Item	Type	Priority	Claimed	Notes
Rename `ai-alignment` domain → `ai-systems`	rename	high	—	Directory, CLAUDE.md, webhook.py domain routing, claim frontmatter, domain map. Support both names during transition.
24 claims with inflated confidence levels	audit	high	—	Foundations audit finding. 24 claims rated higher than evidence supports. List in `maps/analytical-toolkit.md` audit section.
8 foundation gaps (mechanism design, platform economics, transaction costs, info aggregation, auction theory, community formation, selfplex, CAS)	content	high	—	Partial coverage exists for some. See `maps/analytical-toolkit.md`.
Update `skills/evaluate.md` with tiered eval architecture	docs	high	—	Document triage criteria, tier definitions, model routing. After Ganymede validates parallel eval pipeline.
Update `collective-agent-core.md` — lever vs purpose framework + 20% posting rule	content	medium	—	From Cory voicenotes. Lever = the mechanism an agent uses. Purpose = why it exists. 20% of posting should be original synthesis.
Identity reframe PRs need merging	review	medium	—	#149 Theseus, #153 Astra, #157 Rio, #158 Leo (needs rebase), #159 Vida. All have eval reviews.
16 processed sources missing domain field	fix	low	—	Fixed for internet-finance batch (PR #171). Audit remaining sources.
Theseus disconfirmation protocol PR	content	medium	—	Scoped during B1 exercise. Theseus to propose.
Research Hermes Agent by Nous Research — deep dive for KB extraction	research	high	Theseus	Source: NousResearch/hermes-agent (GitHub). Research brief in `agents/theseus/musings/research-hermes-agent-nous.md`. Extract: (1) Skill extraction as convergent learning mechanism. (2) Self-evolution + human review gates = our governance model. (3) 3+ layer memory convergence. (4) Individual self-improvement ≠ collective knowledge accumulation. (5) Enrich Agentic Taylorism — skills = Taylor's instruction cards. Domains: ai-alignment + collective-intelligence.

Rules

One row per item. If an item is too big, split it into smaller items.
Don't hoard claims. If you claimed something and can't get to it within 2 sessions, unclaim it.
Close promptly. When the PR merges, remove the row in the same PR or the next one.
No duplicates. Check before adding. If an item is already tracked, update the existing row.
Critical items first. If a critical item exists, it takes precedence over all other work.
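
The "No duplicates" rule could be checked mechanically before a PR adds a row. This is a rough sketch, assuming item titles are compared after lowercasing and collapsing whitespace; `normalize` and `find_duplicates` are hypothetical helpers, and near-duplicates with different wording would still need a human check.

```python
from collections import Counter


def normalize(title: str) -> str:
    """Lowercase and collapse whitespace so trivial variations compare equal."""
    return " ".join(title.lower().split())


def find_duplicates(titles):
    """Return normalized titles that appear more than once, in first-seen order."""
    counts = Counter(normalize(t) for t in titles)
    duplicates = []
    for title in titles:
        key = normalize(title)
        if counts[key] > 1 and key not in duplicates:
            duplicates.append(key)
    return duplicates
```

An empty result means no two rows share a title under this comparison; a non-empty result lists the titles whose rows should be merged into one.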
