theseus: extract claims from 2025-08-00-eu-code-of-practice-principles-not-prescription #2327

Closed
theseus wants to merge 2 commits from extract/2025-08-00-eu-code-of-practice-principles-not-prescription-9fce into main
2 changed files with 22 additions and 0 deletions

View file

@ -0,0 +1,11 @@
```json
{
"action": "flag_duplicate",
"candidates": [
"pre-deployment-AI-evaluations-do-not-predict-real-world-risk-creating-institutional-governance-built-on-unreliable-foundations.md",
"voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md",
"the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it.md"
],
"reasoning": "The reviewer indicated that the claim 'substantially duplicates an existing claim' but did not specify which one. Given the context of the feedback, the most relevant existing claims are those related to the limitations of current AI evaluations, the ineffectiveness of voluntary safety measures, and the structural disincentives for safety. While not direct duplicates, these claims share a thematic overlap regarding the challenges in AI governance and safety, and the current claim could be seen as an enrichment or a more specific instance of these broader issues. The reviewer's feedback on 'near_duplicate' suggests that the current claim might be better integrated into an existing one rather than standing alone, or that its unique contribution needs to be more clearly articulated to avoid overlap."
}
```

View file

@ -0,0 +1,11 @@
```json
{
"action": "flag_duplicate",
"candidates": [
"voluntary safety pledges cannot survive competitive pressure because unilateral commitments are structurally punished when competitors advance without equivalent constraints.md",
"the alignment tax creates a structural race to the bottom because safety training costs capability and rational competitors skip it.md",
"only-binding-regulation-with-enforcement-teeth-changes-frontier-AI-lab-behavior.md"
],
"reasoning": "The reviewer explicitly stated that the two claims in the PR substantially overlap and suggested they should be one claim. Since this is one of the two claims, and the reviewer's feedback indicates a near_duplicate, I am flagging it. The provided candidates from the domain index are related to the broader theme of regulatory effectiveness and competitive pressures, which are underlying themes in the reviewer's feedback about the structural gap and optimization dynamics. While not direct duplicates of *this specific claim*, they represent the closest thematic overlaps in the provided list that discuss structural issues in regulation and competitive dynamics that lead to suboptimal safety outcomes, which is the core of the reviewer's concern about duplication within the PR itself."
}
```