Import AI 444: LLM societies; Huawei makes kernels with AI; ChipBench

9 Feb 2026 · Import AI – Substack (Jack Clark) Global

Emerging research on how reasoning models actually work internally may eventually inform AI assurance and explainability approaches in government contexts.

Key points

Summary

This edition of Import AI covers two research developments. First, a Google-affiliated study finds that advanced LLM reasoning models appear to simulate multiple internal perspectives or personas when solving hard problems - a phenomenon the authors call 'societies of thought' - observed in DeepSeek-R1 and QwQ-32B. Second, researchers from UC San Diego and Columbia introduce ChipBench, a more demanding benchmark for AI-assisted chip design in Verilog, finding that no current frontier model performs well on realistic industrial tasks. Both items are primarily of interest to AI researchers and technical practitioners rather than APS governance or policy staff.

Implications for Australian agencies

Implications are AI-generated. Starting points, not advice.