The Ethics of Advanced AI Assistants

3 Feb 2026 · MIT AI Risk Repository – Blog Global

A structured risk taxonomy for AI assistants provides APS governance teams with a reference framework for scoping AI assistant deployments and evaluations.

Key points

Google DeepMind researchers systematically map ethical and societal risks of advanced AI assistants across three domains.
The paper identifies an 'evaluation gap' where current assessments focus on models rather than broader sociotechnical systems.
The framework is one of 24 catalogued in the MIT AI Risk Repository, useful as reference material rather than operational guidance.

Summary

This MIT AI Risk Repository spotlight summarises a 2024 Google DeepMind paper mapping ethical and societal risks of advanced AI assistants - defined as agents that plan and execute actions via natural language interfaces. The framework covers three domains: value alignment and misuse, human-assistant interaction risks (including dependency, manipulation, and privacy), and societal-scale impacts such as misinformation, inequality, and job displacement. A notable finding is the 'evaluation gap': existing assessment approaches focus on model-level performance rather than the broader sociotechnical system. The paper recommends evaluations that account for human-AI interaction, multi-agent behaviour, and societal effects.

Implications for Australian agencies

Consider Agencies assessing AI assistant deployments could use this taxonomy to structure risk identification across value alignment, user interaction, and societal impact dimensions.
Monitor Policy teams developing AI assistant evaluation criteria may want to monitor how the 'evaluation gap' concept influences emerging assessment standards and frameworks internationally.

Implications are AI-generated. Starting points, not advice.