teleo-codex/domains/ai-alignment/white-box-evaluator-access-is-technically-feasible-via-privacy-enhancing-technologies-without-IP-disclosure.md

374 B

supports:
  - "External evaluators of frontier AI models predominantly have black-box access which creates systematic false negatives in dangerous capability detection"
reweave_edges:
  - "External evaluators of frontier AI models predominantly have black-box access which creates systematic false negatives in dangerous capability detection|supports|2026-04-05"