Import AI 448: AI R&D; Bytedance's CUDA-writing agent; on-device satellite AI

9 Mar 2026 · Import AI – Substack (Jack Clark) Global

A credible measurement framework for AI R&D automation gives APS policy teams a concrete reference for what governments should demand from AI developers as oversight mechanisms.

Key points

GovAI and Oxford researchers propose 14 measurable metrics for tracking AI R&D automation progress toward recursive self-improvement.
The framework explicitly calls on governments to develop confidential reporting systems to monitor AI R&D automation data from companies.
AI capability timelines are accelerating faster than expert forecasters predicted, with agent task horizons already exceeding earlier end-of-2026 estimates.

Summary

This edition of Import AI covers three substantive topics. Most APS-relevant is a GovAI/Oxford paper proposing 14 metrics for measuring AI R&D Automation (AIRDA) - the degree to which AI is building AI - as a precursor to governing recursive self-improvement. The paper explicitly assigns roles to governments (confidential reporting systems), companies (internal tracking), and third parties (public estimation tools). The newsletter also covers a noted AI forecaster revising her capability timelines upward significantly, and an Indian edge-computing traffic surveillance deployment using commodity GPU hardware and open-source vision models.

Implications for Australian agencies

Monitor Policy teams working on AI governance frameworks may want to monitor the GovAI/Oxford AIRDA paper as a potential input to future reporting or transparency obligations for AI developers operating in Australia.
Consider Agencies tracking AI safety and frontier risk could consider whether any of the 14 proposed metrics map usefully onto Australia's existing AI incident or assurance reporting discussions.

Implications are AI-generated. Starting points, not advice.