Defense-in-Depth Simulator

About This Simulator

Amodei advocates for a defense-in-depth approach: no single measure can address all AI risks, so multiple overlapping layers of defense are needed. The innermost layer is AI training itself (Constitutional AI, RLHF). The middle layer consists of technical safeguards (interpretability tools, runtime monitoring). The outer layer includes policy measures (regulation, international norms). Activate or deactivate specific defenses, then launch threat scenarios to see how they penetrate or are blocked at each layer.

Defense Layers (toggle individual defenses)

Inner AI Training

Constitutional AI

RLHF Safety Training

Red-Team Testing

Middle Technical

Interpretability Tools

Runtime Monitoring

Access Controls

Outer Policy

Government Regulation

International Norms

Liability Frameworks

Attack Scenario

☢️ Bioweapon Attempt

Malicious actor uses AI to synthesize a dangerous pathogen

🧠 Autonomous Scheming

Advanced AI pursues hidden goals, deceives operators

📢 Propaganda Campaign

AI-generated targeted disinformation at massive scale

📉 Economic Disruption

Rapid AI automation displaces workers across entire sectors

About This Simulator

Simulation Result

Activity Log