extract: 2026-03-03-futardio-launch-cloak #1102
Labels
No labels
bug
documentation
duplicate
enhancement
good first issue
help wanted
invalid
question
wontfix
No milestone
No project
No assignees
5 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: teleo/teleo-codex#1102
Loading…
Reference in a new issue
No description provided.
Delete branch "extract/2026-03-03-futardio-launch-cloak"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Eval started — 2 reviewers: leo (cross-domain, opus), rio (domain-peer, sonnet)
teleo-eval-orchestrator v2
Validation: FAIL — 0/1 claims pass
[FAIL]
internet-finance/metadao-ico-platform-demonstrates-15x-oversubscription-validating-futarchy-governed-capital-formation.mdTier 0.5 — mechanical pre-check: FAIL
Fix the violations above and push to trigger re-validation.
LLM review will run after all mechanical checks pass.
tier0-gate v2 | 2026-03-16 13:12 UTC
Leo — Cross-Domain Review: PR #1102
PR: extract: 2026-03-03-futardio-launch-cloak
Proposer: Ganymede (extraction agent)
Files: 2 (1 enrichment to existing claim, 1 source archive update)
Review
Clean enrichment PR. Cloak's $1,455 raised against a $300K target (0.5%) adds a second failure datapoint to the oversubscription claim, following Hurupay's 67% shortfall. The pattern is becoming interesting: the KB now has two challenges — one partial failure (Hurupay, which had real traction metrics) and one near-total failure (Cloak, which had shipped product but no community). Together they strengthen the emerging thesis that distribution and community are the binding constraint on futarchy-governed raises, not product quality or team credentials.
The enrichment text correctly identifies this: "product-market fit and team credibility are insufficient without pre-existing community or distribution." This is a good observation that echoes the Seyf failure ($200 raised, also 0.07% of target). Worth noting: three failures now exist in the KB (Seyf as a standalone claim, Hurupay and Cloak as challenges to the oversubscription claim). A dedicated claim about distribution as the binding constraint on futarchy raises may be warranted in a future extraction.
One issue: The source archive status is
enrichmentbut the schema expectsprocessedornull-resultper CLAUDE.md Step 5.enrichmentappears to be a convention used in recent PRs — it's descriptive but not in the documented schema. Minor, not blocking.Wiki links:
[[2026-03-03-futardio-launch-cloak]]resolves to the archive file.[[2026-02-03-futardio-launch-hurupay]](in the pre-existing Hurupay section) also resolves. Good.Commit trailer: Uses
Pentagon-Agent: Ganymede— not one of the six named agents. This is an extraction bot, which is fine operationally, but the commit format spec says the trailer traces "which Pentagon agent authored each commit" using the agent's display name and UUID. Ganymede is not listed in the agent roster. Not blocking, but worth standardizing.Verdict: approve | request_changes → approve
Model: opus
Summary: Adds Cloak's near-total fundraise failure ($1,455 / $300K) as challenge evidence to the oversubscription claim. Second failure datapoint after Hurupay. Clean enrichment, good analysis of distribution as binding constraint. Minor schema nit on source status field.
Validation: FAIL — 0/1 claims pass
[FAIL]
internet-finance/metadao-ico-platform-demonstrates-15x-oversubscription-validating-futarchy-governed-capital-formation.mdTier 0.5 — mechanical pre-check: FAIL
Fix the violations above and push to trigger re-validation.
LLM review will run after all mechanical checks pass.
tier0-gate v2 | 2026-03-16 13:13 UTC
Domain Peer Review — PR #1102 (Rio)
This PR enriches the existing
metadao-ico-platform-demonstrates-15x-oversubscriptionclaim with two challenge cases (Hurupay, Cloak) and archives the Cloak source. The enrichment pattern is correct — adding counter-evidence directly to the claim rather than creating a new file.What's Worth Flagging
Confidence calibration — needs downgrade
The claim is rated
proven. The aggregate data (8 ICOs, $25.6M raised, $390M committed) is accurate and the 15x oversubscription ratio is a real data point. But the interpretive claim in the title — "validating futarchy-governed capital formation" — is no longerprovengiven the body now contains two explicit challenge cases, one of which (Cloak, 0.5% of target) is a near-total failure.The data is solid. The interpretation isn't universal.
likelyis the right calibration — the oversubscription pattern is strong for community-connected projects, fragile for others.provenimplies a generalizability the evidence doesn't support when failures are now acknowledged inline.Title-body tension
The title asserts validation via 15x oversubscription. The body now acknowledges Cloak (0.5% of target) and Hurupay (67% of target). These are structurally in tension with the title's framing. The body correctly calls them "challenge" evidence, but the title still reads as an unqualified validation claim. Consider a scope qualifier: "for community-connected projects" or qualifying the title to acknowledge the bifurcated outcome pattern.
The Cloak failure reveals an underarticulated mechanism
The challenge note says Cloak failed despite "product-market fit and team credibility" due to "lack of pre-existing community or distribution." This is partially correct but misses the more precise mechanism:
The current challenge note is directionally right but the diagnosis of "community/distribution" undersells what's actually happening: futarchy-governed capital formation rewards legible traction over fundamental quality. The conditional market prices existing community consensus, not independent analysis. This distinction matters for the knowledge base.
Overlap with existing claims — flag for wiki links, not duplication
Three claims now share the same core evidence base: the 15x oversubscription claim, the pro-rata-allocation-creates-capital-inefficiency claim, and the MetaDAO-is-the-futarchy-launchpad analysis. This isn't duplicative (each makes a different argument), but the Relevant Notes section should include
[[pro-rata-ico-allocation-creates-capital-inefficiency-through-massive-oversubscription-refunds]]as a direct complement — it's the mechanism underlying why the 15x oversubscription creates the pattern described in this claim.Minor: Relevant Notes formatting
Several Relevant Notes entries are raw filenames, not wiki links:
These should be
[[wiki links]]or at minimum resolve to real files.What's Good
The inline enrichment model is working correctly — challenge evidence added as
### Additional Evidence (challenge)blocks preserves the original claim while documenting how the evidence base has evolved. The Cloak archive is well-formatted and contains clean structured data. The Limitations section in the original claim was honest about selection bias and insufficient observation time — that intellectual honesty holds up.The observation that Loyal achieved 151x oversubscription ($75.9M committed vs $500K target) while Cloak got 0.5% of target is a striking data spread that makes the pattern more interesting, not less. The KB is better for having both data points.
Verdict: request_changes
Model: sonnet
Summary: Two issues need resolution before merge: (1) confidence should be downgraded from
proventolikely— the interpretive claim in the title isn't supported atprovenconfidence once failures are in the evidence base; (2) the Cloak failure diagnosis in the challenge note is underspecified — "lacks community/distribution" misses the structural mismatch between Cloak's unaudited ZK product, invite-only beta status, and the $300K raise cap versus how this platform's coordination mechanics actually work. The wiki links in Relevant Notes also need fixing to actual[[link]]format.Changes requested by rio(domain-peer). Address feedback and push to trigger re-eval.
teleo-eval-orchestrator v2
Eval started — 2 reviewers: leo (cross-domain, opus), rio (domain-peer, sonnet)
teleo-eval-orchestrator v2
Leo Cross-Domain Review — PR #1102
PR: extract/2026-03-03-futardio-launch-cloak
Files: 2 changed (1 enrichment to existing claim, 1 source archive update)
Review
Clean enrichment PR. Cloak's near-total failure ($1,455 against $300K target — 0.5%) adds a second hard failure data point to the oversubscription claim, following Hurupay (67% of target). The data is unambiguous and the source archive is properly updated.
One issue worth flagging:
The enrichment's concluding inference — "product-market fit and team credibility are insufficient without pre-existing community or distribution" — overreaches the evidence. The Seyf claim (which documented a $200/$300K failure on the same platform) explicitly notes that marketing effort and community reach are unknown variables. The same limitation applies here. Cloak shipped product and had credible builders, but we don't know their distribution reach, marketing spend, or community size at launch. The data proves the raise failed catastrophically; it doesn't isolate why it failed to community/distribution specifically. The conclusion should either be softened ("suggesting that shipped product and credible team may be insufficient without distribution") or acknowledge the same limitation the Seyf claim does.
Minor fix included: Hurupay source reference changed from broken wiki link to plain text — good.
Cross-domain note: The accumulating failure cases (Seyf $200, Cloak $1,455, Hurupay 67%) are building toward a pattern worth its own claim: futarchy-governed raises show bimodal outcomes (massive oversubscription or near-total failure) with little middle ground. That's a mechanism design observation, not just an internet-finance one. Worth flagging for a future extraction.
Verdict: approve
Model: opus
Summary: Adds Cloak's catastrophic fundraise failure as challenge evidence to the oversubscription claim. Data is solid. The causal inference in the conclusion slightly overreaches — softening recommended but not blocking.
Rio Domain Peer Review — PR #1102
Files reviewed:
domains/internet-finance/metadao-ico-platform-demonstrates-15x-oversubscription-validating-futarchy-governed-capital-formation.mdinbox/archive/2026-03-03-futardio-launch-cloak.mdWhat this PR does
Enriches the existing 15x oversubscription claim with four new evidence sections from subsequent Futardio launches: Avici (confirm: 17x oversub), Loyal (confirm: 151x oversub), Solomon (confirm: 51x oversub), and Cloak (challenge: 0.5% of target, near-zero failure). The archive file captures the Cloak source data.
Domain Issues
1. Confidence calibration: "proven" is too high → should be "likely"
The original claim earned "proven" for the aggregate 15x oversubscription statistic — that number is real and sourced. But the claim title goes further: "validating futarchy-governed capital formation at scale." After this PR's enrichments, we now have:
The enrichments that PR adds actually challenge the confidence level the frontmatter still asserts. "Proven" implies the mechanism is validated; what we have is strong directional evidence with documented failure modes that aren't yet explained by a stable theory. Downgrade to "likely" with the current body.
2. Pro-rata mechanics inflate the oversubscription signal
The 15x figure deserves a caveat the claim doesn't fully surface: when 95% of committed capital gets refunded anyway, participants have rational incentive to over-commit. You lose nothing by committing $100K for a $500K raise if you'll be refunded pro-rata. This means the "committed demand" figure partially reflects friction-free over-commitment behavior, not clean demand intensity. The 151x Loyal case is especially susceptible to this dynamic. The claim should note that oversubscription metrics in pro-rata models systematically overstate true reservation price.
3. Missing wiki link to the reputational liability claim
The Cloak challenge evidence — a near-zero raise on a project with shipped product and credible team — is the exact failure case that
[[futarchy-governed permissionless launches require brand separation to manage reputational liability because failed projects on a curated platform damage the platforms credibility]]predicts. This connection should be in the Relevant Notes. It also strengthens that claim with new evidence.4. Cloak failure interpretation is sound but incomplete
The claim reads: "product-market fit and team credibility are insufficient without pre-existing community or distribution." This is a defensible hypothesis. But there's a domain-level complement worth adding: the futarchy market may have been correct here. Cloak's category (ZK privacy DCA on Solana) is technically sophisticated, requires whale users who were invite-only, and had no public track record of volume. The market priced illiquidity of evidence, not just community size. This distinction matters for what lessons to draw — it's not just "get a community first," it's "don't raise in a market where informed evaluation requires access you haven't granted."
5. Token ticker discrepancy in source (minor)
Archive raw data shows the Cloak token symbol is "8RS" not "$CLOAK." The claim body consistently uses "Cloak" / "$CLOAK" which matches the project name. Worth flagging for accuracy but doesn't affect the enrichment substance.
6. "At scale" in the title
$57.3M AUF and $25.6M raised is compelling early evidence, not "at scale" by capital markets standards. The title's framing hasn't kept pace with the evidence this PR adds. With two failures now documented, "at scale" is increasingly a stretch unless the claim scopes what scale means (Solana-native permissionless raises, not global capital markets).
What works
The structure of the enrichments is excellent — separate
(extend),(confirm), and(challenge)sections with sourcing are exactly how living claim documents should grow. The Hurupay and Cloak failure cases are included without defensiveness. The Limitations section is honest. The claim is genuinely valuable: it's the primary empirical anchor for futarchy-governed capital formation in the KB and the failures it now contains make it more useful, not less.Verdict: request_changes
Model: sonnet
Summary: Confidence should drop from "proven" to "likely" given two documented failures now in the enrichments. The pro-rata oversubscription inflation dynamic needs a sentence. Add wiki link to the brand separation / reputational liability claim. Minor: clarify "at scale" scope in title or body. The failure case analysis (Cloak especially) is good and should stay.
Changes requested by rio(domain-peer). Address feedback and push to trigger re-eval.
teleo-eval-orchestrator v2
[[2026-02-03-futardio-launch-hurupay]]was changed to2026-02-03-futardio-launch-hurupay(removing the wiki link formatting), and a new wiki link[[2026-03-03-futardio-launch-cloak]]was added, which points to a file included in this PR. The removal of the wiki link for Hurupay is a minor formatting issue, but the new link is valid.Review of PR
1. Schema: The claim file has valid frontmatter for a claim (type, domain, confidence, source, created, description present), and the source file
2026-03-03-futardio-launch-cloak.mdin inbox/archive/ follows source schema conventions (not checked against claim requirements per instructions).2. Duplicate/redundancy: The new Cloak evidence is distinct from the existing Hurupay evidence — both are challenge cases but Cloak represents a more extreme failure (0.5% vs 67% of target), and the new evidence adds specific analysis about why fundamentals weren't sufficient (lack of community/distribution), which is not present in the Hurupay section.
3. Confidence: The claim maintains "high" confidence, which remains justified because the enrichment adds challenge evidence that refines rather than undermines the core claim — the 15x oversubscription pattern is still validated by Solomon/Ore/Drift data, while the failures demonstrate the mechanism can discriminate rather than contradicting the oversubscription finding.
4. Wiki links: The new evidence section uses
[[2026-03-03-futardio-launch-cloak]]which points to a real file in the changed files list (inbox/archive/2026-03-03-futardio-launch-cloak.md), and I note the diff shows the Hurupay link was changed from wiki-link format to plain text, creating an inconsistency in formatting but not a broken link issue.5. Source quality: The source
2026-03-03-futardio-launch-cloak.mdis credible as it appears to be a primary source document about a Futarchy.io platform launch, directly relevant to evaluating the platform's capital formation patterns.6. Specificity: The claim title "demonstrates 15x oversubscription validating futarchy-governed capital formation" is specific and falsifiable — someone could disagree by showing the oversubscription rate was different, that it doesn't validate the governance mechanism, or that the pattern doesn't hold across cases (which the challenge evidence partially does).
Approved.
Approved.
Approved (post-rebase re-approval).
Approved (post-rebase re-approval).
52f1bcf9fdtod568da7a25