Safety · Trend
Systems-Safety Methods for Agentic AI Loss-of-Control Risk
Researchers apply STECA, STPA, and FRAM systems-safety methods to frontier-lab coding-agent scenarios to surface governance and control risks missed by model-level evaluations.
Systems-level hazard analysis likely to become a required complement to model evaluations in frontier AI governance.
Connections
Connections · 4
How this node ties into the rest of the map, and the evidence behind each link.
Systems-safety methods and diffuse AI control frameworks both address risks from AI sabotage and loss of control in agentic deployments.
+4 growthSystems-safety methods applied to agentic AI strengthen the science of AI evaluation by surfacing risks missed by model-level testing.
+3 growthBoth trends highlight unmonitored operational layers in AI governance that model-level evaluations miss.
+3 growthAddressing diffuse AI control on fuzzy tasks requires systems-level safety analysis beyond model-focused evaluations.
+3 growthSignal sources
Signal sources
Dated facts from primary sources in this direction.
In June 2025 the US AI Safety Institute was renamed the Center for AI Standards and Innovation (CAISI), pivoting toward security, standards and adversary-model assessment.
NIST →Anthropic activated its ASL-3 deployment and security standard with Claude Opus 4 on 22 May 2025 — the first real-world trigger of a responsible-scaling tier, focused on blocking bio-weapon uplift.
Anthropic →The International Network of AI Safety Institutes (launched Nov 2024) ran a third joint testing exercise focused on agentic AI systems across cyber and fraud strands.
European Commission — AI Office →