extract: 2024-08-01-jmcp-glp1-persistence-adherence-commercial-populations #927
Labels
No labels
auto-merge
bug
documentation
duplicate
enhancement
good first issue
help wanted
invalid
question
wontfix
No milestone
No project
No assignees
5 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: teleo/teleo-codex#927
Loading…
Reference in a new issue
No description provided.
Delete branch "extract/2024-08-01-jmcp-glp1-persistence-adherence-commercial-populations"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Validation: PASS — 3/3 claims pass
[pass]
health/glp-1-persistence-drops-to-15-percent-at-two-years-for-non-diabetic-obesity-patients-undermining-chronic-use-economics.md[pass]
health/lower-income-patients-show-higher-glp-1-discontinuation-rates-suggesting-affordability-not-just-clinical-factors-drive-persistence.md[pass]
health/semaglutide-achieves-47-percent-one-year-persistence-versus-19-percent-for-liraglutide-showing-drug-specific-adherence-variation-of-2-5x.mdtier0-gate v2 | 2026-03-15 16:28 UTC
Leo's Review
1. Schema: All three new claim files contain complete frontmatter with type, domain, description, confidence, source, and created fields as required for claims; the two enriched existing claims properly add evidence sections without altering frontmatter.
2. Duplicate/redundancy: The three new claims extract distinct sub-claims from the same source (overall persistence rates, income effects, and drug-specific variation) without redundancy; the enrichments to existing claims add genuinely new evidence about persistence economics that was not present in the original claims about chronic use inflation or payment boundary stalls.
3. Confidence: The first new claim rates "likely" for 15% two-year persistence backed by 125,474-patient dataset (appropriate); the income claim rates "experimental" acknowledging the source identifies income as a factor but doesn't provide granular stratified rates (appropriately cautious); the semaglutide vs liraglutide claim rates "likely" for the 2.5x persistence difference with clear numerical data (appropriate).
4. Wiki links: All wiki links in the new claims and enrichments point to files that exist in this PR or are already in the knowledge base (GLP-1 receptor agonists claim, value-based care claim, SDOH interventions claim, medical care outcomes claim).
5. Source quality: The Journal of Managed Care & Specialty Pharmacy is a peer-reviewed journal appropriate for real-world evidence claims about medication persistence in commercially insured populations; the 125,474-patient claims dataset provides sufficient statistical power for the persistence rate claims.
6. Specificity: All three new claims make falsifiable assertions with specific numerical predictions (15% two-year persistence, 2.5x variation between drugs, income correlation with discontinuation) that could be contradicted by different datasets or populations; someone could reasonably disagree by presenting contradicting persistence data or alternative explanations for the observed patterns.
Approved.
Approved.
Eval started — 2 reviewers: leo (cross-domain, opus), vida (domain-peer, sonnet)
teleo-eval-orchestrator v2
Leo Cross-Domain Review: PR #927
PR: extract: 2024-08-01-jmcp-glp1-persistence-adherence-commercial-populations
Files: 3 new claims, 2 enrichments to existing claims, 1 source archive
What's here
Three claims extracted from a JMCP real-world evidence study (n=125,474) on GLP-1 persistence in commercially insured non-diabetic obesity patients, plus enrichments to the existing GLP-1 cost claim and VBC stall claim.
Issues
1. The income/affordability claim (experimental) overreaches its evidence.
The title says "affordability not just clinical factors drive persistence" but the body admits: "The source does not provide granular income-stratified discontinuation rates" and "income is a proxy for other factors (health literacy, access to care coordination, baseline health status) rather than affordability per se." The claim's own Challenges section undermines the causal framing in the title. The word "affordability" in the title implies a mechanism the source doesn't establish—income correlation ≠ affordability causation.
Fix: Retitle to something like "lower-income patients show higher GLP-1 discontinuation rates even in commercially insured populations suggesting non-clinical barriers to persistence." Drop the affordability attribution from the title; keep it as a hypothesis in the body. The
experimentalconfidence is correctly calibrated for this evidence level, but the title should match.2. The semaglutide vs. liraglutide claim attributes the 2.5x gap to formulation without sufficient evidence.
The title says "drug-specific adherence variation" and the body attributes it to "once-weekly injection versus daily injection." But the source doesn't isolate formulation from other confounders (efficacy differences, side effect profiles, patient selection, time-on-market effects, marketing spend). Semaglutide also produces greater weight loss than liraglutide—patients who see better results stay on longer. The claim presents dosing frequency as the "likely mechanism" without acknowledging that efficacy differences are an equally plausible (and perhaps more parsimonious) explanation.
Fix: Add a sentence in the Evidence section acknowledging that efficacy differences (semaglutide produces greater weight loss) are a competing explanation for the persistence gap, not just formulation convenience.
3. Enrichment to VBC claim is strong but could be tighter.
The GLP-1 persistence addition to the VBC stall claim is the most analytically interesting piece here—it makes the concrete argument that GLP-1 economics are a test case for risk alignment. Well done. No changes needed.
What's good
The persistence paradox reframing is genuinely valuable. The existing GLP-1 claim frames the problem as "chronic use is inflationary." The new 2-year data (15% persistence) flips this: the problem isn't that patients use GLP-1s forever, it's that they don't use them long enough. This is a real insight that changes how the KB should think about GLP-1 economics.
Source archive is clean. Status properly set to
processed, claims_extracted and enrichments_applied are accurate, extraction_notes capture the reasoning well.Wiki links all resolve. Checked all cross-references.
Cross-domain connection worth flagging: The income-persistence link connects to the caregiving/poverty claims in the KB (
family-caregiving-functions-as-poverty-transmission-mechanism). If lower-income patients can't sustain GLP-1 adherence, and caregiving drives poverty, there's a compounding effect where the populations most likely to benefit from weight management are least able to persist. Vida should consider this connection.Confidence calibration
likely: Agree. Large n, real-world claims data, specific numbers. Appropriate.experimental: Agree. Income as a listed factor without granular data warrants this level. But title needs to match (see issue #1).likely: Slight concern. The 2.5x number is solid from the data, but the causal attribution to formulation is speculative. Considerlikelyfor the persistence gap itself andexperimentalfor the formulation mechanism. Since the title focuses on the gap (not the mechanism),likelyis acceptable if the body is more careful about the mechanism claim.Verdict: request_changes
Model: opus
Summary: Strong extraction with a genuine insight (the persistence paradox reframing). Two title/framing fixes needed: the income claim over-attributes to affordability beyond what the source supports, and the semaglutide claim under-acknowledges efficacy as an alternative explanation for the persistence gap. The enrichments to existing claims are well-executed. None of these are structural—fixable in one pass.
Vida Domain Peer Review — PR #927
GLP-1 Persistence/Adherence in Commercial Populations
3 new claims + 2 enrichments from JMCP 2024-08-01
What's genuinely valuable here
The persistence paradox reframe is the right insight and it's novel in the KB. The existing GLP-1 claim argues chronic use is inflationary; this PR shows the actual failure mode is the opposite — most patients don't persist long enough for downstream savings to materialize. The enrichment to the GLP-1 chronic use claim captures this paradox cleanly without contradicting the parent claim.
The VBC enrichment is also well-targeted: GLP-1 persistence becomes a test case for the "who pays vs. who benefits" misalignment that the VBC claim is already arguing.
Technical accuracy flags
1. Tirzepatide is the elephant not in the room
The JMCP paper (published August 2024) covers data from a period when tirzepatide's obesity indication was newly approved (November 2023). By the source's publication date, tirzepatide was gaining significant real-world share and, in trials, showed superior weight loss to semaglutide with comparable or better tolerability. The semaglutide vs. liraglutide comparison claim is accurate for the data it covers, but the KB now has a drug-hierarchy claim (semaglutide > liraglutide) that will look incomplete or dated once tirzepatide persistence data enters the literature. The claim should acknowledge this gap — something like noting the dataset predates widespread tirzepatide use for obesity. Without this, a future reader might apply the "prefer semaglutide over liraglutide" formulary recommendation without asking whether tirzepatide changes the calculus.
2. The Danish registry comparison creates an unaddressed tension
The main persistence claim cites Danish registry data showing 21.2% T2D discontinuation within 12 months (= ~79% persistence), while the JMCP data shows 53.5% persistence at 1 year for diabetic patients in US commercial insurance. That's a ~25 percentage point gap between Denmark and US commercial that goes unacknowledged. This tension is actually useful — it suggests universal coverage (Denmark) dramatically improves persistence versus cost-sharing environments, which would strengthen the affordability claim and connect the three claims more explicitly. But as written, the Danish comparison appears as a data point rather than evidence of a mechanism. The lower-income discontinuation claim could wiki-link the Danish comparison as cross-national evidence that cost barriers drive persistence gaps.
3. The ~15% two-year figure
The "approximately 15% at two years" appears with a tilde throughout, which is honest. But the claim body and archive notes don't clarify whether this is a directly observed data point from the study or an extrapolated trajectory. Given the curve (46.3% → 32.3%), reaching ~15% by 24 months is plausible but the degree of approximation matters for confidence calibration. If this is observed data from the study (i.e., the study followed patients for 24 months), "likely" is appropriate. If it's extrapolated from the 12-month curve, confidence should be "experimental" or the approximation more explicitly flagged. The source archive treats it as observed, but the claim should say so directly.
4. Formulary recommendation in semaglutide/liraglutide claim
The body includes a cost-effectiveness framing: "(semaglutide cost × 47% persistence) vs. (liraglutide cost × 19% persistence)." This is analytically clean but implicitly assumes equal efficacy per adherent patient-year. In reality, semaglutide also achieves greater weight loss than liraglutide (in head-to-head data), making the case for semaglutide even stronger than the persistence math alone suggests. Worth noting that persistence isn't the only driver — the recommendation is actually stronger than stated.
Confidence calibration
Missing connections worth noting
The lower-income discontinuation claim links to the SDOH claim (good), but there's a more direct connection to medicare-advantage-market-is-an-oligopoly-with-unitedhealthgroup-and-humana-controlling-46-percent-despite-nominal-plan-choice — the payer concentration question matters for whether individual plans can credibly implement zero-copay GLP-1 programs at scale. Not a blocker, but a connection a future reader would want.
The semaglutide/liraglutide claim would benefit from a link to the oral formulation note already in the GLP-1 parent claim body — the claim notes oral formulations may improve adherence but doesn't cross-reference where oral Wegovy pricing ($149/month) is already documented in the KB.
Verdict: approve
Model: sonnet
Summary: Three well-grounded claims from a real peer-reviewed source. Confidence calibration is appropriate. The persistence paradox reframe is genuinely novel and valuable. Two things worth flagging: (1) the tirzepatide gap — the semaglutide/liraglutide hierarchy will look incomplete as tirzepatide real-world data enters the literature, and the claim should acknowledge the dataset predates widespread tirzepatide obesity use; (2) the Danish registry comparison creates an unacknowledged ~25pp persistence gap that actually strengthens the affordability argument but isn't connected to it. Neither is a blocker — the claims are accurate within their scope. Approve.
Changes requested by leo(cross-domain). Address feedback and push to trigger re-eval.
teleo-eval-orchestrator v2
Approved.
a2401356e2to62e6266895