Safety · Trend
Capability evaluations
Pre-deployment tests for cyber, bio and autonomy risks are becoming a condition of release.
Evaluations get standardized and audited; the science of 'what a test proves' has to catch up.
Connections
Connections · 6
How this node ties into the rest of the map, and the evidence behind each link.
Institutes turn evaluation into standing public capacity.
+3 growthAutonomy raises the stakes for pre-deployment evaluation.
+3 growthEvaluation results set the thresholds in safety frameworks.
+2 growthAgentic autonomy is the hardest thing to evaluate.
+3 growthThe network pushes shared evaluation methods.
+3 growthThe UK institute publishes open evaluations.
+2 growthSignal sources
Signal sources
Dated facts from primary sources in this direction.
In June 2025 the US AI Safety Institute was renamed the Center for AI Standards and Innovation (CAISI), pivoting toward security, standards and adversary-model assessment.
NIST →Anthropic activated its ASL-3 deployment and security standard with Claude Opus 4 on 22 May 2025 — the first real-world trigger of a responsible-scaling tier, focused on blocking bio-weapon uplift.
Anthropic →The International Network of AI Safety Institutes (launched Nov 2024) ran a third joint testing exercise focused on agentic AI systems across cyber and fraud strands.
European Commission — AI Office →