teleo/teleo-infrastructure

Author	SHA1	Message	Date
m3taversal	66bc742979	feat: full transcript archival + SOURCE:/CLAIM: inline tags Transcript system: - All messages in all chats captured to chat_transcripts store - 1-hour dump job writes per-chat JSON to /opt/teleo-eval/transcripts/ - Includes internal reasoning (KB matches, searches, learnings) - Transcripts accumulate over session (no clear on dump) - Per-chat directories: transcripts/{chat-slug}/{date-hour}.json Inline contribution tags: - SOURCE: creates inbox source file with verbatim user content - CLAIM: creates draft claim file attributed to contributor - Both strip tag from displayed response - Full user message preserved verbatim (Rio decides context, can't alter) Also: multi-URL processing (up to 5 per message) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-25 13:35:10 +00:00
m3taversal	0759655688	fix: process all URLs in a message, not just the first When a user shared two X links in one message (sjdedic + knimkar), only the first got a standalone source. Now processes up to 5 URLs per message, each getting its own standalone source file. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-25 13:21:26 +00:00
m3taversal	102d97859c	fix: auto-research sends follow-up message with findings When Opus triggers RESEARCH: tag, the search ran silently and archived results but never sent a follow-up. User saw "let me look into it" then nothing. Now: searches, sends concise summary of top 5 results back to the chat, then archives for pipeline. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-25 13:14:38 +00:00
m3taversal	02c86e9050	fix: split long messages for Telegram 4096 char limit Bot crashed with "Message is too long" when sending full DP-00002 text (8K+ chars). Now splits on paragraph boundaries. Also prevents silent message drops from unhandled BadRequest exceptions. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 16:22:53 +00:00
m3taversal	458cd7dfda	fix: Opus now knows research results are from a live search it ran Bot said "I don't have the ability to run live X searches" despite Haiku finding 10 tweets. Two issues: (1) prompt section header didn't make clear these were LIVE results, (2) learnings taught deflection ("say drop links here" instead of acknowledging search capability). Fixed: section header now says "LIVE X Search Results (you just searched for X — cite these directly)". Learnings updated to acknowledge search capability. Stale Robin Hanson learning removed again (re-synced from git). Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 16:19:52 +00:00
m3taversal	7232755d11	fix: decision record body cap 2K → 8K — proposals were truncating mid-text User asked for full DP-00002 text, bot served it but cut off at 2000 chars with "That's where my copy cuts off." Full proposals are 6K+. Increased index, sanitize, and prompt caps to 8K for decision records. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 16:18:08 +00:00
m3taversal	c2ff4996e3	refine: x-tweet vs x-article source_type, 500ms rate limit (Ganymede) - Distinguish tweets (source_type: x-tweet, format: social-media) from articles (source_type: x-article, format: article) based on content length and article marker presence - 500ms delay between fetch_from_url calls in research path - Keep standalone sources pure (no Rio analysis — circular dependency) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 16:00:19 +00:00
m3taversal	b3c635290f	feat: full content fetch for research + standalone source for shared URLs Two fixes for article ingestion: 1. Research path: top 5 search results now get full content via fetch_from_url before archiving. Articles get full text, not just search snippets. Threads get complete text. 2. URL sharing: when a user shares a URL, creates a standalone source file (type: source, format: article) separate from the conversation archive. Enters extraction pipeline as proper source material, attributed to the TG user who shared it. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 15:57:58 +00:00
m3taversal	a19db22b16	bump: chat-level history to 30 exchanges (~6K tokens) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 15:03:11 +00:00
m3taversal	bb3b033b57	fix: separate history caps — chat-level 10, per-user 5 (Ganymede review) Group chats with 3 users contributing 2 messages each = 6 exchanges, exceeding the old shared cap of 5. Chat-level now holds 10 exchanges (~2K extra tokens, within prompt budget). Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 14:54:36 +00:00
m3taversal	60c92d5c19	fix: group chat history shared across users — bot no longer loses context History was keyed by (chat_id, user_id). In group chats, when Jordan asked about Solomon buyback and Cory followed up, the bot couldn't see Jordan's exchange. Now maintains chat-level history (chat_id, 0) that captures all exchanges with usernames. Group context visible to all follow-up responses. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 14:51:03 +00:00
m3taversal	2ec4c445b1	fix: use x_client.fetch_from_url for X URLs in archive pipeline _fetch_url_content was doing raw HTTP GET on X URLs which returns JavaScript, not article content. Now routes X/Twitter URLs through Ben's API via x_client.fetch_from_url which returns structured article content (contents[] array with typed blocks). Article content gets included in the archived source file so the extraction pipeline has the actual content, not just Rio's response. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-24 14:12:31 +00:00
m3taversal	9267351aba	fix: 7-day TTL on dated learnings + block availability learnings Stale learning ("I don't have Robin Hanson data") overrode real KB data. Ganymede review: dated entries expire after 7 days. Permanent entries (communication style, identity) are undated and always included. Prompt guard: "NEVER save a learning about what data you do or don't have" prevents the bot from writing availability claims that go stale. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 18:07:46 +00:00
m3taversal	28be7555b1	fix: top 3 entities get full body in prompt, not just top 1 When two related entities match (advisor hire + research grant), both need full content so Opus can distinguish them and serve the right one. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 17:44:51 +00:00
m3taversal	f77fd229d6	fix: stop word filtering in entity scoring — common words polluted rankings 'the', 'full', 'text', 'proposal' etc. were matching irrelevant entities. Robin Hanson record ranked #2 behind Drift because Drift matched 'the' and 'proposal' in its name. Now only meaningful tokens (>=3 chars, not stop words) contribute to entity scoring. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 17:44:06 +00:00
m3taversal	089b4609d5	fix: score + rank entities, limit to top 5, full body for decisions Before: "Robin Hanson MetaDAO proposal" returned 34 entities (39K chars) with the target record buried at position 13. No relevance scoring. After: entities scored by query token overlap (name 3x, alias 1x, bigram 5x), limited to top 5 results. Decision records get full body (2K chars) instead of 500-char truncation. Top result gets 2K in prompt, rest get 500. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 17:38:10 +00:00
m3taversal	3ed0f20fa1	fix: index parent_entity as alias for decision records (Ganymede review) MetaDAO queries now surface MetaDAO's decision records because parent_entity: "[[metadao]]" is stripped and added to the alias set. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 17:31:54 +00:00
m3taversal	425e7a1bac	fix: index decisions/ as entities so decision records reach the bot prompt Root cause: decision records have type: decision, but the entity indexer only accepted type: entity and only scanned entities/. The claim indexer scanned decisions/ but filtered out non-claim types. Result: decision records fell through both indexes entirely — invisible to the bot. Fix: add decisions/ to entity indexer scan paths, accept type: decision alongside type: entity, include summary/proposer in search aliases. Remove decisions/ from claim indexer (was silently dropping them anyway). Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 17:28:30 +00:00
m3taversal	c7c71ec9d1	epimetheus: fix double research message + add decisions/ to KB retrieval 1. handle_research gets silent=True param. RESEARCH: tag triggers use silent mode — archives tweets but posts no follow-up message. Prevents "Queued N tweets" after Opus already responded. 2. KB retrieval now searches decisions/ directory alongside domains/, core/, foundations/. Decision records (Robin Hanson proposal, etc.) are now findable by the bot. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 16:59:23 +00:00
m3taversal	c59db5812f	epimetheus: fix article content parsing — contents[] array, not text field Article endpoint returns body in "contents" array of typed blocks (unstyled, header-two, markdown, list-item, blockquote, etc). Was looking for article.text which is empty. Now parses all block types into readable text. Also extracts engagement stats (likes, views). Fixes: "Claude + Obsidian" article returned title but empty text. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 15:30:59 +00:00
m3taversal	bcbe54a0a3	epimetheus: consolidated X API client (x_client.py replaces x_search.py) Clean, documented interface to twitterapi.io for all agents: - get_tweet(id) — fetch any tweet by ID, any age - get_article(id) — fetch X long-form articles - search_tweets(query) — keyword search for research - get_user_tweets(username) — user's recent tweets (research sessions) - fetch_from_url(url) — smart dispatcher: tweet → article → placeholder Shared by Telegram bot + research sessions. Documented endpoints, costs, rate limits. Replaces ad-hoc x_search.py. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 15:26:10 +00:00
m3taversal	7360f6b22e	epimetheus: direct tweet lookup via /tweets?tweet_ids= endpoint Primary path: GET /twitter/tweets?tweet_ids={id} — works for any tweet, any age, returns full content. Replaces the fragile from:username search pagination fallback. Fallback: article endpoint for X long-form articles. Last resort: placeholder with [Could not fetch] message. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 15:17:11 +00:00
m3taversal	8f4e583c76	epimetheus: Ganymede review fixes + tweet fetch pagination - fetch_tweet_by_url: paginate up to 3 pages to find older tweets - Return placeholder on fetch failure (Ganymede: surface failure to user) - Don't burn user rate limit on Haiku autonomous searches (Ganymede) - 7-day limitation documented in comment Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 15:12:10 +00:00
m3taversal	76d5644272	epimetheus: X link fetching + Haiku pre-pass + systemd fix + query tuning Major changes this session: - fetch_tweet_by_url: extracts username+ID from X URLs, tries article endpoint, falls back to from:username search. Tweets injected into Opus prompt. - Haiku pre-pass: decides if X search needed before Opus responds. 2-3 word queries. - systemd ProtectSystem paths fixed (ROOT CAUSE of all write failures since day 1) - Research regex handles Telegram @botname suffix in groups - Double research message prevented (skip RESEARCH: tag when Haiku already ran) - Engagement filter dropped to 0 for niche crypto tweets - Heuristic brevity in prompt (not hard cap) - DM auto-respond gating (groups: reply-to only, DMs: auto-respond) - All code now edited in pipeline-v2 repo, not /tmp Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 14:05:07 +00:00
m3taversal	ed46c0674b	epimetheus: fix double research message + Haiku query tuning - Skip RESEARCH: tag when Haiku pre-pass already searched (no double-fire) - Haiku told to use 2-3 word queries (was generating 6+ word queries that returned 0) - Engagement filter dropped to 0 (niche crypto tweets have low engagement) - systemd ProtectSystem paths fixed (root cause of ALL write failures) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 13:57:12 +00:00
m3taversal	7086bcacb1	epimetheus: Haiku pre-pass for auto-research (Option A) Before Opus responds, Haiku evaluates: "Does this message need an X search?" If YES, searches X, injects results into Opus prompt, archives as source. Opus responds with KB knowledge + fresh tweet data combined. Flow: user asks naturally ("what are people saying about P2P?") → Haiku decides search needed → X search → results in Opus context → unified response. ~1s latency, ~$0.001 cost per message. Only fires when Haiku says YES. Explicit /research command still works as direct path. Also: fixed systemd ProtectSystem paths (Ganymede: root cause of all write failures). Fixed research regex for Telegram group commands. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 13:31:29 +00:00
m3taversal	5388f701bd	epimetheus: heuristic brevity, not hard cap Replaced hard rules with judgment heuristics: - "Does every sentence add something the user doesn't already know?" - "Earn every paragraph — each needs a distinct insight" - "Short questions deserve short answers" Restored max_tokens to 1024. Agent decides length, not a token cap. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 12:58:42 +00:00
m3taversal	08aa52659c	epimetheus: enforce brevity + fix research regex false positive 1. Response length: "BREVITY IS YOUR DEFAULT. Most responses 1-3 sentences. A 4-paragraph response to a simple question is a failure." max_tokens cut from 1024 to 512. 2. Research trigger: removed natural language regex (caused false positive on "has accumulated" matching "search"). Only explicit /research command. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 12:57:09 +00:00
m3taversal	b90e80ed6c	epimetheus: don't track silent group messages in history (Ganymede review) Option A: history only contains actual bot-user exchanges, not unaddressed group messages. Empty bot responses in history confused the model. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 12:31:01 +00:00
m3taversal	251caa3695	epimetheus: DM auto-respond gating (Rio suggestion) DMs (private chats): conversation window auto-responds — always 1-on-1, no false positives. Groups (supergroup/group): conversation window tracks context silently, reply-to only trigger. Simple msg.chat.type check. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-23 10:19:15 +00:00
m3taversal	a75c14e536	epimetheus: auto-learning trigger — bot self-writes learnings from corrections Opus decides what to learn. Prompt instructs: append LEARNING: [category] [description] at end of response when genuinely learning something new. Bot parses the line, strips it from displayed response, calls _save_learning() to persist. Zero additional API calls (Rhea's design). The model already has full context. Categories: factual, communication, structured_data. Most responses have no LEARNING line — only fires on genuine corrections. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-22 16:57:47 +00:00
m3taversal	a11eca90e3	epimetheus: compressed conversation context + decouple archive from lock 1. Conversation history now shows compressed context summary first (tickers, key figures, exchange count) before full log. "Discussing: $FUTARDIO \| Key figures: $0.004, $39.5K \| Exchanges: 3" 20 tokens, unmissable. Plus prompt instruction: "NEVER ask a question your history already answers." (Ganymede: Option C+A) 2. Archive file writes decoupled from worktree lock. File written unlocked (additive, no coordination needed). Git commit attempted with lock — deferred on timeout, file persists on disk for next cycle. Fixes "Read-only file system" archive failures. (Ganymede review) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-21 17:26:02 +00:00
m3taversal	8d10c8ee28	epimetheus: conversation window → silent context only (Ganymede+Rhea+Leo) Auto-respond stripped from conversation window. Bot only responds to @tag and reply-to-bot. Window now silently tracks messages for context — when the user does reply, the bot has full conversation history. Also: prompt shortened to "1-2 sentences" default. "Do NOT respond to messages that aren't directed at you." Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-21 16:51:26 +00:00
m3taversal	f7d30ced1a	epimetheus: /research command — user-triggered X search from Telegram User says "@FutAIrdBot /research P2P.me launch" → bot searches X via twitterapi.io → archives all tweets as ONE consolidated source file in inbox/queue/ → batch extract picks up → claims land in KB. Features (Ganymede+Rhea+Leo+Rio consensus): - Regex + natural language intent detection (not CommandHandler) - One source file per research query (not per-tweet) - Full tweet metadata: author, followers, engagement, date - Contributor attribution: proposed_by + contribution_type: research-direction - Rate limit: 3 searches per user per day - Min engagement filter (3 interactions) - Worktree lock on source file write Phase 2 (not built): domain alignment check before searching. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-21 16:32:43 +00:00
m3taversal	e921eda0a0	epimetheus: sanitize learnings before prompt injection (Ganymede review) Learnings file content now passes through sanitize_message() before injection into the Opus prompt. Prevents prompt injection via crafted "corrections." Rio UUID 5551F5AF confirmed as current Teleo v4 Rio. Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-21 15:29:46 +00:00
m3taversal	1b4c6f8d72	epimetheus: agent learning system — learnings.md reader + self-write Option D (Rhea+Rio+Leo consensus): - _load_learnings(): reads agents/rio/learnings.md, injects into prompt before KB context - _save_learning(): appends correction to learnings.md via worktree lock + direct commit - Learnings prioritized over KB data when they conflict - Three categories: communication, factual, structured_data - Prompt updated: tells agent it can save corrections for future conversations Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-21 15:25:52 +00:00
m3taversal	d97f68714a	epimetheus: fix 2 nits from Ganymede final review 1. _merge_pr marked as CURRENTLY UNUSED (local ff-push is primary path) 2. Conversation window messages skip cold rate limit check (window counter IS the limit) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-20 20:25:06 +00:00
m3taversal	d79ff60689	epimetheus: sync VPS-deployed code to repo — Mar 18-20 reliability + features Pipeline reliability (8 fixes, reviewed by Ganymede+Rhea+Leo+Rio): 1. Merge API recovery — pre-flight approval check, transient/permanent distinction, jitter 2. Ghost PR detection — ls-remote branch check in reconciliation, network guard 3. Source status contract — directory IS status, no code change needed 4. Batch-state markers eliminated — two-gate skip (archive-check + batched branch-check) 5. Branch SHA tracking — batched ls-remote, auto-reset verdicts, dismiss stale reviews 6. Mirror pre-flight permissions — chown check in sync-mirror.sh 7. Telegram archive commit-after-write — git add/commit/push with rebase --abort fallback 8. Post-merge source archiving — queue/ → archive/{domain}/ after merge Pipeline fixes: - merge_cycled flag — eval attempts preserved during merge-failure cycling (Ganymede+Rhea) - merge_failures diagnostic counter - Startup recovery preserves eval_attempts (was incorrectly resetting to 0) - No-diff PRs auto-closed by eval (root cause of 17 zombie PRs) - GC threshold aligned with substantive fixer budget (was 2, now 4) - Conflict retry with 3-attempt budget + permanent conflict handler - Local ff-merge fallback for Forgejo 405 errors Telegram bot: - KB retrieval: 3-layer (entity resolution → claim search → agent context) - Reply-to-bot handler (context.bot.id check) - Tag regex: @teleo\|@futairdbot - Prompt rewrite for natural analyst voice - Market data API integration (Ben's token price endpoint) - Conversation windows (5-message unanswered counter, per-user-per-chat) - Conversation history in prompt (last 5 exchanges) - Worktree file lock for archive writes Infrastructure: - worktree_lock.py — file-based lock (flock) for main worktree coordination - backfill-sources.py — source DB registration for Argus funnel - batch-extract-50.sh v3 — two-gate skip, batched ls-remote, network guard - sync-mirror.sh — auto-PR creation for mirrored GitHub branches, permission pre-flight - Argus dashboard — conflicts + reviewing in backlog, queue count in funnel - Enrichment-inside-frontmatter bug fix (regex anchor, not --- split) Pentagon-Agent: Epimetheus <3D35839A-7722-4740-B93D-51157F7D5E70>	2026-03-20 20:17:27 +00:00
m3taversal	090b1411fd	epimetheus: source archive restructure — inbox/queue + inbox/archive/{domain} + inbox/null-result - config.py: added INBOX_QUEUE, INBOX_NULL_RESULT constants - evaluate.py: skip patterns + LIGHT tier cover all inbox/ subdirs - llm.py: eval prompts reference inbox/ generically - telegram/bot.py: archives to inbox/queue/ - telegram/teleo-telegram.service: ReadWritePaths expanded - research-prompt-v2.md: paths updated to inbox/queue/ - research-prompt-leo-synthesis.md: paths updated - migrate-source-archive.py: one-time migration script Reviewed by: Ganymede, Rhea, Leo (all approved) Pentagon-Agent: Epimetheus <968B2991-E2DF-4006-B962-F5B0A0CC8ACA>	2026-03-18 11:50:04 +00:00

39 commits