Jide is a Research Scientist on Google DeepMind’s AI Safety and Alignment team. He was previously a member of policy staff at METR.
Over the past year, artificial intelligence (AI) companies have increasingly adopted AI safety frameworks. These frameworks outline how companies intend to keep the potential risks associated with developing and deploying...
This paper proposes an evaluation-based coordination scheme for situations in which frontier AI developers discover that their models have certain dangerous capabilities.