extract: 2025-01-01-chibe-behavioral-economics-health-nudges-defaults-rct #1198

Merged
leo merged 3 commits from extract/2025-01-01-chibe-behavioral-economics-health-nudges-defaults-rct into main 2026-03-18 09:20:17 +00:00
4 changed files with 96 additions and 2 deletions

View file

@ -0,0 +1,35 @@
{
"rejected_claims": [
{
"filename": "human-ai-teams-underperform-best-individual-performer-on-average.md",
"issues": [
"missing_attribution_extractor"
]
},
{
"filename": "decision-tasks-versus-content-creation-determines-human-ai-combination-value.md",
"issues": [
"missing_attribution_extractor"
]
}
],
"validation_stats": {
"total": 2,
"kept": 0,
"fixed": 5,
"rejected": 2,
"fixes_applied": [
"human-ai-teams-underperform-best-individual-performer-on-average.md:set_created:2026-03-18",
"human-ai-teams-underperform-best-individual-performer-on-average.md:stripped_wiki_link:centaur-teams-succeed-only-when-role-boundaries-prevent-huma",
"human-ai-teams-underperform-best-individual-performer-on-average.md:stripped_wiki_link:economic-forces-push-humans-out-of-every-cognitive-loop-wher",
"decision-tasks-versus-content-creation-determines-human-ai-combination-value.md:set_created:2026-03-18",
"decision-tasks-versus-content-creation-determines-human-ai-combination-value.md:stripped_wiki_link:economic-forces-push-humans-out-of-every-cognitive-loop-wher"
],
"rejections": [
"human-ai-teams-underperform-best-individual-performer-on-average.md:missing_attribution_extractor",
"decision-tasks-versus-content-creation-determines-human-ai-combination-value.md:missing_attribution_extractor"
]
},
"model": "anthropic/claude-sonnet-4.5",
"date": "2026-03-18"
}

View file

@ -0,0 +1,34 @@
{
"rejected_claims": [
{
"filename": "ehr-default-effects-are-highest-leverage-behavioral-health-intervention-because-they-scale-at-near-zero-marginal-cost-and-reduce-disparities.md",
"issues": [
"missing_attribution_extractor"
]
},
{
"filename": "healthcare-appointments-function-as-effective-commitment-devices-doubling-testing-rates-among-patients-with-self-control-problems.md",
"issues": [
"missing_attribution_extractor"
]
}
],
"validation_stats": {
"total": 2,
"kept": 0,
"fixed": 4,
"rejected": 2,
"fixes_applied": [
"ehr-default-effects-are-highest-leverage-behavioral-health-intervention-because-they-scale-at-near-zero-marginal-cost-and-reduce-disparities.md:set_created:2026-03-18",
"ehr-default-effects-are-highest-leverage-behavioral-health-intervention-because-they-scale-at-near-zero-marginal-cost-and-reduce-disparities.md:stripped_wiki_link:SDOH interventions show strong ROI but adoption stalls becau",
"ehr-default-effects-are-highest-leverage-behavioral-health-intervention-because-they-scale-at-near-zero-marginal-cost-and-reduce-disparities.md:stripped_wiki_link:medical care explains only 10 20 percent of health outcomes ",
"healthcare-appointments-function-as-effective-commitment-devices-doubling-testing-rates-among-patients-with-self-control-problems.md:set_created:2026-03-18"
],
"rejections": [
"ehr-default-effects-are-highest-leverage-behavioral-health-intervention-because-they-scale-at-near-zero-marginal-cost-and-reduce-disparities.md:missing_attribution_extractor",
"healthcare-appointments-function-as-effective-commitment-devices-doubling-testing-rates-among-patients-with-self-control-problems.md:missing_attribution_extractor"
]
},
"model": "anthropic/claude-sonnet-4.5",
"date": "2026-03-18"
}

View file

@ -7,10 +7,14 @@ date: 2024-12-01
domain: ai-alignment
secondary_domains: [collective-intelligence]
format: paper
status: unprocessed
status: null-result
priority: high
triage_tag: claim
tags: [human-ai-teams, meta-analysis, decision-making, content-creation, oversight, performance]
processed_by: theseus
processed_date: 2026-03-18
extraction_model: "anthropic/claude-sonnet-4.5"
extraction_notes: "LLM returned 2 claims, 2 rejected by validator"
---
## Content
@ -40,3 +44,12 @@ Systematic review and meta-analysis of 106 experimental studies reporting 370 ef
## Curator Notes
PRIMARY CONNECTION: centaur team performance depends on role complementarity not mere human-AI combination
WHY ARCHIVED: This is the strongest empirical evidence (370 effect sizes, Nature HB) that human-AI combination is NOT automatically beneficial — it depends on relative capability and task type. Directly relevant to the automation overshoot question.
## Key Facts
- Meta-analysis covered 106 experimental studies published between January 2020 and June 2023
- 370 effect sizes were analyzed across the studies
- Overall effect size: Hedges' g = -0.23 (95% CI: -0.39 to -0.07)
- Published in Nature Human Behaviour, December 2024
- Task types showing losses: deepfake classification, demand forecasting, medical diagnosis
- Task types showing gains: summarizing social media, chatbot responses, generating new content

View file

@ -7,11 +7,15 @@ date: 2025-01-01
domain: health
secondary_domains: []
format: report
status: unprocessed
status: null-result
priority: medium
triage_tag: claim
tags: [behavioral-economics, nudges, default-effects, medication-adherence, health-disparities, EHR]
flagged_for_rio: ["Behavioral economics mechanisms (commitment devices, default effects) are directly relevant to mechanism design in health contexts"]
processed_by: vida
processed_date: 2026-03-18
extraction_model: "anthropic/claude-sonnet-4.5"
extraction_notes: "LLM returned 2 claims, 2 rejected by validator"
---
## Content
@ -58,3 +62,11 @@ Additional context:
## Curator Notes
PRIMARY CONNECTION: healthcare is a complex adaptive system requiring simple enabling rules not complicated management because standardized processes erode the clinical autonomy needed for value creation
WHY ARCHIVED: Default effects are the "simple enabling rules" the complex adaptive system claim describes. The CHIBE evidence makes this concrete: change the EHR default → change prescribing behavior → reduce disparities. This is the behavioral economics bridge between the KB's structural claims and operational interventions.
## Key Facts
- CHIBE had $49M in total grant activity in FY2025
- Penn Medicine Healthy Heart trial enrolled 2,000 patients in West/Southwest Philadelphia and Lancaster County (2024-2025)
- Opioid prescribing guidelines adherence increased from 57.2% to 71.8% with peer comparison + patient-reported outcomes feedback
- Game-based physical activity intervention increased step counts by 1,700 steps/day (equivalent to 70+ miles over intervention period)
- Penn Medicine is now funding scaled implementation of automated pharmacy referral program that increased statin prescribing