CAISI Evaluation of DeepSeek AI Models Finds Shortcomings and Risks

30 Sep 2025 · NIST – AI News (topic 2753736) US

Concrete security and bias findings from a credible government evaluator give Australian agencies evidence-based grounds to assess or restrict DeepSeek use.

Key points

Summary

NIST's Center for AI Standards and Innovation (CAISI) has published an evaluation of three DeepSeek models (R1, R1-0528, V3.1) against US frontier models from OpenAI and Anthropic. Key findings show DeepSeek trails on performance benchmarks, costs more per equivalent task, is far more vulnerable to agent hijacking and jailbreaking attacks, and propagates CCP-aligned narratives at four times the rate of US reference models. The evaluation was directed under President Trump's AI Action Plan and frames findings explicitly in national security terms. Despite these risks, DeepSeek model downloads have increased nearly 1,000% since January 2025, indicating widespread adoption in the global AI ecosystem.

Implications for Australian agencies

Implications are AI-generated. Starting points, not advice.