CAISI Signs Agreements Regarding Frontier AI National Security Testing With Google DeepMind, Microsoft and xAI

5 May 2026 · NIST Information Technology RSS US

The US is scaling mandatory-style pre-deployment frontier AI evaluation infrastructure, a benchmark that Australian AISI and DISR policy teams will likely be asked to consider.

Key points

CAISI formalises pre-deployment and post-deployment evaluation agreements with Google DeepMind, Microsoft, and xAI.
Evaluations can cover models with reduced or removed safeguards and can take place in classified environments, supported by the interagency TRAINS Taskforce.
This US model of government-led frontier AI safety testing may inform expectations placed on Australia's AISI.

Summary

NIST's Center for AI Standards and Innovation (CAISI) has signed expanded agreements with Google DeepMind, Microsoft, and xAI to conduct pre-deployment evaluations, post-deployment assessments, and targeted research on frontier AI capabilities and national security risks. Evaluations can occur in classified environments and include models with reduced or removed safeguards, supported by the interagency TRAINS Taskforce. CAISI now serves as the US government's primary industry contact for commercial AI testing, having completed over 40 evaluations to date, including on unreleased models.

Implications for Australian agencies

Monitor: Australia's AISI and DISR policy teams may want to monitor CAISI's published outputs from these evaluations for early signal on frontier AI capability developments and national security risks.
Consider: Agencies involved in AI safety and governance could assess how CAISI's model of formalised pre-deployment access agreements compares to current Australian AISI arrangements and whether similar structures are being considered domestically.

Implications are AI-generated. Starting points, not advice.