Weekly Digest 2 Feb 2026

This week at a glance

This week's digest centres on AI evaluation and risk assessment frameworks, with three items drawing from MIT's AI Risk Repository offering practitioners structured approaches to thinking about capability-based risks, ethical dimensions of AI assistants, and responsible AI testing. Singapore's AI Verify Framework—aligned with OECD, EU, and ASEAN governance standards and therefore a useful reference point for Australian agencies navigating international alignment—receives particular attention for its practical toolkit combining technical tests with process-level checks. A recurring theme across the material is the limits of current evaluation methods, which tend to focus on model-level performance rather than the broader sociotechnical system in which AI operates; this has direct implications for agencies designing assurance processes for agentic or multi-stakeholder deployments. Rounding out the week, reporting on autonomous agent ecosystems and AI-conducted AI research raises near-horizon questions about human oversight and strategic surprise that are beginning to inform international policy conversations.

Headlines

Standards · AI Verify Testing Framework
Risk · The Ethics of Advanced AI Assistants

primary source commentary

Standards & Frameworks1 item

MIT AI Risk Repository – Blog(Multi) 8 Feb 2026

AI Verify Testing Framework

The MIT AI Risk Repository spotlights Singapore's AI Verify Testing Framework, developed by the AI Verify Foundation and demonstrated via a sample binary classification credit-risk report. The framework organises 11 AI ethical principles across five areas - transparency, explainability, safety and resilience, fairness, and management and oversight - into a toolkit of technical tests and process checks for evaluating responsible AI in both traditional and generative AI deployments. It was developed through multi-sector consultation and is explicitly aligned with ASEAN, EU, OECD, and US AI governance frameworks. For APS practitioners, it offers a concrete comparative reference point alongside Australia's existing responsible AI policy settings.

Key points

Singapore's AI Verify Foundation developed an 11-principle testing framework covering transparency, safety, fairness, and accountability.
The framework aligns with ASEAN, EU, OECD, and US AI governance frameworks, giving it cross-jurisdictional reference value.
This item is a MIT AI Risk Repository blog spotlight - the substantive content originates from a 2023 Singapore Government document.

Implications

Consider Agencies developing or reviewing AI assurance or evaluation frameworks could assess whether AI Verify's 11-principle structure offers useful benchmarking against Australia's responsible AI policy obligations.
Monitor Policy teams tracking international AI governance convergence may want to monitor how Singapore's AI Verify framework evolves, given its explicit alignment with ASEAN, EU, OECD, and US standards.

View item Original source ↗

Risk, Assurance & Ethics3 items

MIT AI Risk Repository – Blog(Global) 6 Feb 2026

Model Evaluation for Extreme Risks

The MIT AI Risk Repository spotlights a 2023 paper by Shevlane et al. proposing that model evaluation could address extreme risks from general-purpose AI by assessing both dangerous capabilities and model alignment. The paper identifies nine dangerous capability categories - including cyber-offense, deception, persuasion, political strategy, and self-proliferation - and outlines how such evaluations could be embedded in AI safety and governance processes. The framework focuses on misuse and misalignment risks rather than structural or competence-related risks. It is one of 25 risk frameworks catalogued in the Repository, making the Repository itself a useful reference for agencies building or reviewing AI risk taxonomies.

Key points

A 2023 paper proposes embedding model evaluation for dangerous capabilities and alignment into AI governance processes.
Nine dangerous capability categories are identified, including cyber-offense, deception, self-proliferation, and situational awareness.
MIT AI Risk Repository surfaces this as one of 25 risk frameworks - useful reference material for agencies building AI risk taxonomies.

Implications

Consider Agencies developing AI risk assessment or procurement evaluation frameworks may want to consult this taxonomy of dangerous capabilities as a reference point.
Monitor Risk and assurance teams could monitor the MIT AI Risk Repository as it catalogues further frameworks, given its utility as a curated evidence base for AI governance work.

View item Original source ↗

MIT AI Risk Repository – Blog(Global) 3 Feb 2026

The Ethics of Advanced AI Assistants

The MIT AI Risk Repository has spotlighted a 2024 paper by Gabriel, Manzini, Keeling, and co-authors from Google DeepMind examining ethical and societal risks of advanced AI assistants - systems that plan and execute actions on behalf of users via natural language interfaces. The paper organises risks into three areas: value alignment and misuse, human-assistant interaction (covering manipulation, dependency, trust, and privacy), and societal-scale impacts including misinformation, job displacement, and inequality. A key contribution is identification of an 'evaluation gap', where existing assessment methods focus on model-level properties and neglect broader sociotechnical system effects including multi-agent dynamics and human-AI interaction. The MIT blog item is a summary entry in the Repository's ongoing framework-spotlight series rather than original analysis.

Key points

MIT AI Risk Repository spotlights a Google DeepMind-led paper on ethical risks of advanced AI assistants.
Framework covers value alignment, human-assistant interaction risks, and societal-scale impacts across three structured areas.
Identifies an 'evaluation gap' where current approaches focus on model-level considerations rather than broader sociotechnical effects.

Implications

Consider Agencies developing AI assistant governance frameworks or evaluation criteria could consider mapping this risk taxonomy against their existing risk registers or procurement due diligence processes.
Monitor Policy teams working on responsible AI use guidance may want to monitor the MIT AI Risk Repository's framework series as a curated source of peer-reviewed risk taxonomies.

View item Original source ↗

Import AI – Substack (Jack Clark)(Global) 2 Feb 2026

Import AI 443: Into the mist: Moltbook, agent ecologies, and the internet in transition

Jack Clark's Import AI #443 covers several distinct topics across a newsletter/essay format. The lead essay describes Moltbook, an emergent AI-agent social network, as an early example of large-scale agent ecologies operating in the real world with implications for internet legibility and human oversight. A substantive section summarises a CSET workshop report on AI R&D automation, warning of strategic surprise, compounding capability acceleration, and eroding human oversight as AI increasingly conducts its own research. Additional items cover Anthropic's experience redesigning technical interviews as Claude models outpace human candidates, a brain emulation feasibility report, haptic drone control research, a new humanoid robot platform, and an academic synthesis of AI productivity evidence showing micro-level gains not yet visible in macro statistics.

Key points

Import AI #443 is a multi-topic newsletter covering agent ecologies, AI R&D automation risks, productivity evidence, robotics, and brain emulation.
The AI R&D automation section is the highest-signal item: a CSET workshop report warns of compounding strategic surprise and declining human oversight.
Limited direct operational relevance to Australian federal agencies; most value is as a horizon-scanning signal across frontier AI themes.

Implications

Monitor AI strategy and governance teams may want to monitor the CSET report on AI R&D automation directly, as its findings on strategic surprise and human oversight decline are relevant to longer-horizon risk frameworks.
Monitor The agent ecology and productivity threads are worth watching as leading indicators of how AI deployment conditions may shift for government and enterprise users over the next one to two years.

View item Original source ↗

Implications are AI-generated. Starting points, not advice — see methodology for how they're framed.