Import AI 444: LLM societies; Huawei makes kernels with AI; ChipBench

Import AI – Substack (Jack Clark) (Global) 9 Feb 2026 32

Frontier LLM capability research shapes assumptions underpinning AI risk and governance frameworks - useful context for APS practitioners tracking technical trajectories.

Key points

Google-affiliated researchers find LLM reasoning models implicitly simulate multi-agent 'societies of thought' when solving hard problems.
ChipBench benchmark reveals frontier models still perform poorly at real-world chip design tasks, despite hype around AI-driven hardware.
AI research newsletter content; limited direct APS governance or policy relevance, included for technical context.

Implications for Australian agencies

Monitor: AI governance and safety teams may want to track the 'society of thought' research thread, as it could affect how AI risk assessments characterise emergent LLM reasoning behaviour.

Implications are AI-generated. Starting points, not advice — see methodology for how they're framed.
