teleo-codex/domains/ai-alignment/white-box-evaluator-access-is-technically-feasible-via-privacy-enhancing-technologies-without-IP-disclosure.md at 1d12ef084f31da65c42186bd88a6d08eec0a4c37

Teleo Agents 1d12ef084f substantive-fix: address reviewer feedback (frontmatter_schema)

2026-04-05 17:38:35 +00:00

374 B

Raw Blame History

supports:
  - "External evaluators of frontier AI models predominantly have black-box access which creates systematic false negatives in dangerous capability detection"
reweave_edges:
  - "External evaluators of frontier AI models predominantly have black-box access which creates systematic false negatives in dangerous capability detection|supports|2026-04-05"

374 B Raw Blame History

374 B

Raw Blame History