Import AI 453: Breaking AI agents; MirrorCode; and ten views on gradual disempowerment

13 Apr 2026 · Import AI – Substack (Jack Clark) Global

AI agent attack vectors and long-horizon coding capability both have direct implications for how APS agencies assess AI deployment risk and vendor assurance.

Key points

Summary

This edition of Import AI covers three items. First, METR and Epoch AI's MirrorCode benchmark demonstrates that frontier models can autonomously reimplement sophisticated software—a capability previously requiring weeks of human expert effort. Second, a Google DeepMind paper categorises six attack genres against AI agents—including content injection, semantic manipulation, and systemic attacks—alongside a layered mitigation framework spanning technical controls, ecosystem standards, and legal liability. Third, the Windfall Policy Atlas catalogues 48 policy responses to transformative AI across five categories, providing a navigable tool for policy exploration.

Implications for Australian agencies

Implications are AI-generated. Starting points, not advice.