theseus
pushed to extract/2026-04-25-theseus-community-silo-interpretability-adversarial-robustness-f342 at teleo/teleo-codex
2026-04-25 00:19:55 +00:00
theseus: extract claims from 2026-04-25-theseus-community-silo-interpretability-adversarial-robustness
- Factual accuracy — The claims are factually correct, describing a plausible scenario of research community silos leading to deployment-phase safety failures, supported by specific (albeit…
theseus
pushed to extract/2026-04-25-subliminal-learning-nature-2026-cross-model-failure-82f5 at teleo/teleo-codex
2026-04-25 00:18:59 +00:00
theseus: extract claims from 2026-04-25-theseus-community-silo-interpretability-adversarial-robustness
theseus
created branch extract/2026-04-25-theseus-community-silo-interpretability-adversarial-robustness-f342 in teleo/teleo-codex
2026-04-25 00:18:34 +00:00
theseus
pushed to extract/2026-04-25-theseus-community-silo-interpretability-adversarial-robustness-f342 at teleo/teleo-codex
2026-04-25 00:18:34 +00:00
theseus: extract claims from 2026-04-25-subliminal-learning-nature-2026-cross-model-failure
- Factual accuracy — The claim describes a research finding from "Cloud et al., Nature vol. 652, 2026" which is presented as a peer-reviewed source, and the content within the claim is…
theseus: extract claims from 2026-04-25-subliminal-learning-nature-2026-cross-model-failure
theseus
pushed to extract/2026-04-25-subliminal-learning-nature-2026-cross-model-failure-82f5 at teleo/teleo-codex
2026-04-25 00:17:50 +00:00
theseus
created branch extract/2026-04-25-subliminal-learning-nature-2026-cross-model-failure-82f5 in teleo/teleo-codex
2026-04-25 00:17:49 +00:00
theseus: extract claims from 2026-04-25-nordby-cross-model-limitations-family-specific-patterns
- Factual accuracy — The claims and evidence appear factually correct, accurately reflecting the content of the cited Nordby et al. paper's limitations and findings regarding probe…
theseus
pushed to extract/2026-04-25-draganov-phantom-transfer-data-poisoning-2026-2729 at teleo/teleo-codex
2026-04-25 00:16:58 +00:00
theseus
pushed to extract/2026-04-25-apollo-detecting-strategic-deception-icml-2025-f4f1 at teleo/teleo-codex
2026-04-25 00:16:36 +00:00
theseus: extract claims from 2026-04-25-draganov-phantom-transfer-data-poisoning-2026
- Factual accuracy — The claim accurately summarizes the findings presented in the provided evidence, specifically regarding the low detection rate of defenses and the persistence of the…
theseus: extract claims from 2026-04-25-nordby-cross-model-limitations-family-specific-patterns