teleo-codex

m3taversal 08dea4249f	theseus: extract 4 NEW claims + 1 enrichment from Christiano core alignment research	2026-04-05 20:16:59 +01:00

Phase 2 of 5-phase AI alignment research program. Christiano's prosaic alignment counter-position to Yudkowsky.

Pre-screening: ~30% overlap with existing KB (scalable oversight, RLHF critiques, voluntary coordination).

NEW claims:
1. Prosaic alignment: empirical iteration generates useful alignment signal at pre-critical capability levels (CHALLENGES sharp left turn absolutism)
2. Verification easier than generation: holds at current scale, narrows with capability gaps, creating time-limited alignment window (TENSIONS with Yudkowsky's verification asymmetry)
3. ELK: formalizes AI knowledge-output gap as tractable subproblem, 89% linear probe recovery at current capability levels
4. IDA: recursive human+AI amplification preserves alignment through distillation iterations but compounding errors make guarantee probabilistic

ENRICHMENT:
- Scalable oversight claim: added Christiano's debate theory (PSPACE amplification with poly-time judges) as theoretical basis that empirical data challenges

Source: Paul Christiano, Alignment Forum (2016-2022), arXiv:1805.00899, arXiv:1706.03741, ARC ELK report (2021), Yudkowsky-Christiano takeoff debate

Pentagon-Agent: Theseus <46864dd4-da71-4719-a1b4-68f7c55854d3>

_map.md	theseus: add 3 claims on collective AI design implications	2026-03-13 19:29:05 +00:00
active forgetting through selective removal maintains knowledge system health because perfect retention degrades usefulness the same way hyperthymesia overwhelms biological memory.md	reweave: connect 13 orphan claims via vector similarity	2026-04-04 12:52:43 +00:00
adversarial contribution produces higher-quality collective knowledge than collaborative contribution when wrong challenges have real cost evaluation is structurally separated from contribution and confirmation is rewarded alongside novelty.md	reweave: connect 13 orphan claims via vector similarity	2026-04-04 12:52:43 +00:00
AI processing that restructures content without generating new connections is expensive transcription because transformation not reorganization is the test for whether thinking actually occurred.md	theseus: cornelius batch 3 — epistemology (9 NEW + 3 enrichments)	2026-03-31 12:47:03 +01:00
centaur team performance depends on role complementarity not mere human-AI combination.md	leo: foundations audit — 7 moves, 4 deletes, 3 condensations, 10 confidence demotions, 23 type fixes, 1 centaur rewrite	2026-03-07 11:56:38 -07:00
collective intelligence is a measurable property of group interaction structure not aggregated individual ability.md	leo: remove 21 duplicates + fix domain:livingip in 204 files	2026-03-06 09:11:51 -07:00
collective intelligence requires diversity as a structural precondition not a moral preference.md	reweave: connect 48 orphan claims via vector similarity	2026-03-28 23:04:53 +00:00
collective intelligence within a purpose-driven community faces a structural tension because shared worldview correlates errors while shared purpose enables coordination.md	leo: foundations audit — 7 moves, 4 deletes, 3 condensations, 10 confidence demotions, 23 type fixes, 1 centaur rewrite	2026-03-07 11:56:38 -07:00
coordination failures arise from individually rational strategies that produce collectively irrational outcomes because the Nash equilibrium of non-cooperation dominates when trust and enforcement are absent.md	reweave: connect 18 orphan claims via vector similarity	2026-04-04 12:50:25 +00:00
decentralized information aggregation outperforms centralized planning because dispersed knowledge cannot be collected into a single mind but can be coordinated through price signals that encode local information into globally accessible indicators.md	theseus: rename futarchy claim from defenders to arbitrageurs	2026-04-04 16:17:54 +00:00
designing coordination rules is categorically different from designing coordination outcomes as nine intellectual traditions independently confirm.md	leo: remove 21 duplicates + fix domain:livingip in 204 files	2026-03-06 09:11:51 -07:00
externalizing cognitive functions risks atrophying the capacity being externalized because productive struggle is where deep understanding forms and preemptive resolution removes exactly that friction.md	theseus: Agentic Taylorism research sprint — 4 NEW claims + 3 enrichments	2026-04-04 15:54:46 +01:00
friction in knowledge systems is diagnostic signal not failure because six specific friction patterns map to six specific structural causes with prescribed responses.md	theseus: cornelius batch 3 — epistemology (9 NEW + 3 enrichments)	2026-03-31 12:47:03 +01:00
Hayek argued that designed rules of just conduct enable spontaneous order of greater complexity than deliberate arrangement could achieve.md	leo: remove 21 duplicates + fix domain:livingip in 204 files	2026-03-06 09:11:51 -07:00
humanity is a superorganism that can communicate but not yet think — the internet built the nervous system but not the brain.md	leo: reframe superorganism claim — lead with superorganism, footnote obligate mutualism	2026-03-07 13:22:23 -07:00
intelligence is a property of networks not individuals.md	leo: foundations audit — 7 moves, 4 deletes, 3 condensations, 10 confidence demotions, 23 type fixes, 1 centaur rewrite	2026-03-07 11:56:38 -07:00
mechanism design enables incentive-compatible coordination by constructing rules under which self-interested agents voluntarily reveal private information and take socially optimal actions.md	theseus: rename futarchy claim from defenders to arbitrageurs	2026-04-04 16:17:54 +00:00
multipolar failure from competing aligned AI systems may pose greater existential risk than any single misaligned superintelligence.md	leo: remove 21 duplicates + fix domain:livingip in 204 files	2026-03-06 09:11:51 -07:00
multipolar traps are the thermodynamic default because competition requires no infrastructure while coordination requires trust enforcement and shared information all of which are expensive and fragile.md	theseus: moloch extraction — 4 NEW claims + 2 enrichments + 1 source archive	2026-04-02 16:17:12 +01:00
Ostrom proved communities self-govern shared resources when eight design principles are met without requiring state control or privatization.md	leo: remove 21 duplicates + fix domain:livingip in 204 files	2026-03-06 09:11:51 -07:00
partial connectivity produces better collective intelligence than full connectivity on complex problems because it preserves diversity.md	leo: remove 21 duplicates + fix domain:livingip in 204 files	2026-03-06 09:11:51 -07:00
principal-agent problems arise whenever one party acts on behalf of another with divergent interests and unobservable effort because information asymmetry makes perfect contracts impossible.md	reweave: connect 39 orphan claims via vector similarity	2026-04-03 14:01:58 +00:00
protocol design enables emergent coordination of arbitrary complexity as Linux Bitcoin and Wikipedia demonstrate.md	leo: foundations audit — 7 moves, 4 deletes, 3 condensations, 10 confidence demotions, 23 type fixes, 1 centaur rewrite	2026-03-07 11:56:38 -07:00
reweaving old notes by asking what would be different if written today is structural maintenance not optional cleanup because stale notes actively mislead agents who trust curated content unconditionally.md	reweave: connect 18 orphan claims via vector similarity	2026-04-04 12:50:25 +00:00
RLHF and DPO both fail at preference diversity because they assume a single reward function can capture context-dependent human values.md	reweave: connect 48 orphan claims via vector similarity	2026-03-28 23:04:53 +00:00
scalable oversight degrades rapidly as capability gaps grow with debate achieving only 50 percent success at moderate gaps.md	theseus: extract 4 NEW claims + 1 enrichment from Christiano core alignment research	2026-04-05 20:16:59 +01:00
the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it.md	theseus: moloch extraction — 4 NEW claims + 2 enrichments + 1 source archive	2026-04-02 16:17:12 +01:00
the metacrisis is a single generator function where all civilizational-scale crises share the structural cause of rivalrous dynamics on exponential technology on finite substrate.md	leo: extract 9 Moloch sprint claims across grand-strategy, internet-finance, and foundations	2026-04-04 13:31:00 +01:00
three independent intellectual traditions converge on coordination-without-centralization as the only viable path between uncoordinated collapse and authoritarian capture.md	leo: extract 9 Moloch sprint claims across grand-strategy, internet-finance, and foundations	2026-04-04 13:31:00 +01:00
topological organization by concept outperforms chronological organization by date for knowledge retrieval because good insights from months ago are as useful as todays but date-based filing buries them under temporal sediment.md	theseus: cornelius batch 3 — epistemology (9 NEW + 3 enrichments)	2026-03-31 12:47:03 +01:00
trial and error is the only coordination strategy humanity has ever used.md	leo: remove 21 duplicates + fix domain:livingip in 204 files	2026-03-06 09:11:51 -07:00
universal alignment is mathematically impossible because Arrows impossibility theorem applies to aggregating diverse human preferences into a single coherent objective.md	leo: remove 21 duplicates + fix domain:livingip in 204 files	2026-03-06 09:11:51 -07:00