Introducing v0.5 of the AI Safety Benchmark from MLCommons

MIT AI Risk Repository – Blog(Global) 25 Dec 2025 52

Openly available AI safety benchmarking tools give APS agencies a practical starting point for evaluating chat AI systems against defined hazard categories.

Key points

MLCommons AI Safety Benchmark v0.5 defines 13 hazard categories for evaluating chat-based AI system safety.
Practical testing tools including ModelBench are openly available, making this usable for agency-level AI evaluation.
V0.5 has been superseded by V1.0 (AILuminate, Feb 2025); this spotlight is retrospective context, not a new release.

Implications for Australian agencies

Consider Agencies evaluating or procuring chat-based AI systems could consider referencing the MLCommons AILuminate benchmark (V1.0) as an external safety evaluation standard.
Monitor AI governance teams may want to monitor the MLCommons AI Safety Working Group's benchmark evolution, particularly as it expands to image and multimodal AI systems.

Implications are AI-generated. Starting points, not advice — see methodology for how they're framed.

View original source

Appeared in: Weekly digest, 22 December 2025