theseus: extract claims from 2026-02-14-santos-grueiro-evaluation-side-channel
- Factual accuracy — The claim accurately summarizes the provided evidence from "Santos-Grueiro 2026, regime leakage formalization with empirical mitigation testing," describing the formal…
theseus
created branch extract/2026-02-14-zhou-causal-frontdoor-jailbreak-sae-4b4a in teleo/teleo-codex
2026-04-08 00:23:53 +00:00
theseus: extract claims from 2026-02-14-zhou-causal-frontdoor-jailbreak-sae
theseus
pushed to extract/2026-02-14-zhou-causal-frontdoor-jailbreak-sae-4b4a at teleo/teleo-codex
2026-04-08 00:23:53 +00:00
theseus
pushed to extract/2026-02-11-sun-steer2edit-weight-editing-3b48 at teleo/teleo-codex
2026-04-08 00:23:39 +00:00
theseus: extract claims from 2026-02-11-sun-steer2edit-weight-editing
- Factual accuracy — The claim describes a hypothetical research paper and its findings, which are presented as facts within the context of the claim. Since this is a forward-looking claim…
theseus
pushed to extract/2026-02-11-ghosal-safethink-inference-time-safety-f679 at teleo/teleo-codex
2026-04-08 00:22:25 +00:00
theseus: extract claims from 2026-02-14-santos-grueiro-evaluation-side-channel
theseus
pushed to extract/2026-02-11-sun-steer2edit-weight-editing-3b48 at teleo/teleo-codex
2026-04-08 00:21:50 +00:00
theseus: extract claims from 2026-02-11-sun-steer2edit-weight-editing
theseus
created branch extract/2026-02-11-sun-steer2edit-weight-editing-3b48 in teleo/teleo-codex
2026-04-08 00:21:49 +00:00
theseus: extract claims from 2026-02-11-ghosal-safethink-inference-time-safety
- Factual accuracy — The claim accurately summarizes the findings of the SafeThink paper by Ghosal et al., specifically regarding the reduction in jailbreak success rates and the preservatio…
theseus
created branch extract/2026-02-11-ghosal-safethink-inference-time-safety-f679 in teleo/teleo-codex
2026-04-08 00:21:22 +00:00
theseus
pushed to extract/2026-02-11-ghosal-safethink-inference-time-safety-f679 at teleo/teleo-codex
2026-04-08 00:21:22 +00:00