Aligned with whom? Direct and social goals for AI systems

In this Brookings working paper, Korinek and Balwit discuss the AI alignment problem – how to ensure that AI systems pursue the goals that we want them to pursue. This article distinguishes two types of alignment problems depending on whose goals we consider, and analyzes the different solutions necessitated by each. The direct alignment problem considers whether an AI system accomplishes the goals of the entity operating it. In contrast, the social alignment problem considers the effects of an AI system on larger groups or on society more broadly. In particular, it also considers whether the system imposes externalities on others. Whereas solutions to the direct alignment problem typically center around more robust implementation, social alignment problems typically arise because of conflicts between individual and group-level goals, elevating the importance of AI governance to mediate such conflicts. Addressing the social alignment problem requires both enforcing existing norms on their developers and operators and designing new norms that apply directly to AI systems.

Read paper

Aligned with whom? Direct and social goals for AI systems

Theme

Date

author

s

Share

Research Summary

Footnotes

Further reading

Related publications

Policy Advice and Opinion

Tort Law and Frontier AI Governance

May 2024

Lawfare Article

Matthew van der Merwe, Ketan Ramakrishnan, Markus Anderljung

Policy Advice and Opinion

Response to the RFI Related to NIST's Assignments Under the Executive Order Concerning AI

February 2024

Jonas Schuett, Leonie Koessler, Markus Anderljung

Policy Advice and Opinion

Towards Publicly Accountable Frontier LLMs

November 2023

Research Paper

Markus Anderljung, Everett Thornton Smith, Joe O'Brien, Lisa Soder, Benjamin Bucknall, Emma Bluemke, et al.

Policy Advice and Opinion

Tort Law and Frontier AI Governance

May 2024

Lawfare Article

Matthew van der Merwe, Ketan Ramakrishnan, Markus Anderljung

Policy Advice and Opinion

Response to the RFI Related to NIST's Assignments Under the Executive Order Concerning AI

February 2024

Jonas Schuett, Leonie Koessler, Markus Anderljung

Policy Advice and Opinion

Towards Publicly Accountable Frontier LLMs

November 2023

Research Paper

Markus Anderljung, Everett Thornton Smith, Joe O'Brien, Lisa Soder, Benjamin Bucknall, Emma Bluemke, et al.

Policy Advice and Opinion

Preparing the Workforce for an Uncertain AI Future

November 2023

Testimony

Anton Korinek