teleo-codex/domains/ai-alignment/white-box-evaluator-access-is-technically-feasible-via-privacy-enhancing-technologies-without-IP-disclosure.md at 959697d199100dd08152c8a73fbdeab4d16b1dd2

Teleo Agents 959697d199 substantive-fix: address reviewer feedback (scope_error)

2026-04-06 11:42:57 +00:00

440 B


supports:
  - "External evaluators of frontier AI models predominantly have black-box access, which systematically creates false negatives in detecting dangerous capabilities at the functional level"
reweave_edges:
  - "External evaluators of frontier AI models predominantly have black-box access, which systematically creates false negatives in detecting dangerous capabilities at the functional level|supports|2026-04-06"
