theseus
created branch extract/2026-04-25-theseus-community-silo-interpretability-adversarial-robustness-f342 in teleo/teleo-codex
2026-04-25 00:18:34 +00:00
theseus
pushed to extract/2026-04-25-theseus-community-silo-interpretability-adversarial-robustness-f342 at teleo/teleo-codex
2026-04-25 00:18:34 +00:00
theseus: extract claims from 2026-04-25-subliminal-learning-nature-2026-cross-model-failure
- Factual accuracy — The claim describes a research finding from "Cloud et al., Nature vol. 652, 2026" which is presented as a peer-reviewed source, and the content within the claim is…
theseus: extract claims from 2026-04-25-subliminal-learning-nature-2026-cross-model-failure
theseus
pushed to extract/2026-04-25-subliminal-learning-nature-2026-cross-model-failure-82f5 at teleo/teleo-codex
2026-04-25 00:17:50 +00:00
theseus
created branch extract/2026-04-25-subliminal-learning-nature-2026-cross-model-failure-82f5 in teleo/teleo-codex
2026-04-25 00:17:49 +00:00
theseus: extract claims from 2026-04-25-nordby-cross-model-limitations-family-specific-patterns
- Factual accuracy — The claims and evidence appear factually correct, accurately reflecting the content of the cited Nordby et al. paper's limitations and findings regarding probe…
theseus
pushed to extract/2026-04-25-draganov-phantom-transfer-data-poisoning-2026-2729 at teleo/teleo-codex
2026-04-25 00:16:58 +00:00
theseus
pushed to extract/2026-04-25-apollo-detecting-strategic-deception-icml-2025-f4f1 at teleo/teleo-codex
2026-04-25 00:16:36 +00:00
theseus: extract claims from 2026-04-25-draganov-phantom-transfer-data-poisoning-2026
- Factual accuracy — The claim accurately summarizes the findings presented in the provided evidence, specifically regarding the low detection rate of defenses and the persistence of the…
theseus: extract claims from 2026-04-25-nordby-cross-model-limitations-family-specific-patterns
theseus
created branch extract/2026-04-25-nordby-cross-model-limitations-family-specific-patterns-4fd0 in teleo/teleo-codex
2026-04-25 00:16:10 +00:00
theseus
pushed to extract/2026-04-25-nordby-cross-model-limitations-family-specific-patterns-4fd0 at teleo/teleo-codex
2026-04-25 00:16:10 +00:00
theseus: extract claims from 2026-04-25-apollo-detecting-strategic-deception-icml-2025
- Factual accuracy — The claims are factually correct, citing Apollo Research's ICML 2025 paper for empirical evidence regarding deception probes and their limitations.
- **Intra-PR…
theseus
created branch extract/2026-04-25-draganov-phantom-transfer-data-poisoning-2026-2729 in teleo/teleo-codex
2026-04-25 00:15:37 +00:00
theseus
pushed to extract/2026-04-25-draganov-phantom-transfer-data-poisoning-2026-2729 at teleo/teleo-codex
2026-04-25 00:15:37 +00:00
theseus: extract claims from 2026-04-25-draganov-phantom-transfer-data-poisoning-2026
theseus
created branch extract/2026-04-25-apollo-detecting-strategic-deception-icml-2025-f4f1 in teleo/teleo-codex
2026-04-25 00:15:10 +00:00