Inference Scaling and AI Governance

The shift from scaling up the compute used to pre-train AI systems (pre-training compute) to scaling up the compute used to run them (inference compute) may have profound effects on AI governance. The nature of these effects depends crucially on whether this new inference compute will primarily be used to improve model performance during external deployment or as part of a more complex training programme within the lab. Rapid scaling of inference-at-deployment would somewhat lower the importance of open-weight models (and of securing the weights of closed models), reduce the impact of the first human-level models, change the business model for frontier AI, reduce the need for power-intensive data centres, and potentially undermine AI governance measures that rely on training-compute thresholds. Rapid scaling of inference-during-training would have more ambiguous effects, ranging from a revitalisation of pre-training scaling to a form of recursive self-improvement via iterated distillation and amplification.
