Normativity and AI Alignment

Building AI that understands and operates within human normative systems

2026

2026 arXiv

Legal Alignment for Safe and Ethical AI

arXiv:2601.04175
With Noam Kolt, Nicholas Caputo, Jack Boeglin, Cullen O'Keefe, and others

Develops the concept of "legal alignment"—training AI systems to understand and operate within legal frameworks as a path to broader normative alignment.

2026 Contemporary Debates in the Ethics of Artificial Intelligence

Can AI Be Governed? Only if We Build Normatively Competent AI

Contemporary Debates in the Ethics of Artificial Intelligence
Edited by Sven Nyholm, Atoosa Kasirzadeh, and John Zerilli; Wiley-Blackwell

Argues that effective AI governance requires AI systems that can understand and reason about human normative systems, rather than merely follow explicit rules, so that they can participate in the complex equilibrium of human values and norms.

2025

2025 Phil Trans B

Metanorms Generate Stable Yet Adaptable Normative Social Order in a Politically Decentralized Society

Philosophical Transactions of the Royal Society B
With Sarah Mathew, Danson Mwangi, Samir Reynolds

Based on vignette experiments with 369 Turkana participants in Kenya, demonstrates how metanorms—rules that govern the process by which norms are interpreted, changed and enforced—enable societies to balance normative stability and adaptability through their dispute resolution institutions.

2019

2019 AIES

Incomplete Contracting and AI Alignment

AAAI/ACM Conference on AI, Ethics, and Society
With Dylan Hadfield-Menell

Reframes the AI alignment problem through the lens of incomplete contract theory, showing how legal and economic insights about managing incomplete specifications apply to aligning AI with human values.

Cooperative AI

How to make AI agents that interact, cooperate, and coordinate

2025

2025 Nature Human Behaviour

The Impact of Advanced AI Systems on Democracy

Nature Human Behaviour
With Christopher Summerfield, Lisa Argyle, and others

Assesses the potential impacts—both positive and negative—of advanced AI systems on democratic institutions, processes, and participation.

2025 arXiv

Multi-Agent Risks from Advanced AI

arXiv:2502.14143
With Lewis Hammond, Alan Chan, and others

Analyzes risks that emerge specifically from interactions among multiple advanced AI systems, including coordination failures, conflicts, and emergent behaviors.

2025 NBER

An Economy of AI Agents

Economics of Transformative AI, NBER
With Andrew Koh

Examines how economic principles apply to a world where AI agents transact, cooperate, and compete, and what governance structures such an economy requires.

2025 arXiv

Infrastructure for AI Agents

arXiv:2501.10114
With Alan Chan, Kevin Wei, and others

Proposes the technical and institutional infrastructure needed to support safe and beneficial deployment of autonomous AI agents at scale.

AI Governance

Regulatory frameworks and institutions for advanced AI

2025

2025 Jurimetrics

Regulatory Markets: The Future of AI Governance

Jurimetrics: The Journal of Law, Science, and Technology, Winter 2026
With Jack Clark

Proposes regulatory markets—a governance mechanism where governments require AI companies to purchase regulatory services from government-licensed private regulators—to overcome limitations of both command-and-control regulation and industry self-regulation.

2024

2024 Science

Regulating Advanced Artificial Agents

Science, Vol. 384
With Michael K. Cohen, Noam Kolt, Yoshua Bengio, Stuart Russell

Addresses the challenge of governing AI agents that can act autonomously in the world, proposing regulatory approaches tailored to agentic AI systems.

2024 AIES

Responsible Reporting for Frontier AI Development

AAAI/ACM Conference on AI, Ethics, and Society
With Noam Kolt, Markus Anderljung, Joslyn Barnhart, and others

Proposes standards and practices for how frontier AI developers should report capabilities, risks, and safety measures to regulators and the public.

2024 arXiv

Computing Power and the Governance of Artificial Intelligence

arXiv:2402.08797
With Girish Sastry, Lennart Heim, Markus Anderljung, Miles Brundage, and others

Analyzes how compute resources can serve as a lever for AI governance, examining tracking, allocation, and control mechanisms for computational infrastructure.

2024 arXiv

AI Model Registries: A Foundational Tool for AI Governance

arXiv:2410.09645
With Elliot McKernon, Gwyn Glasser, Deric Cheng

Proposes national registries for frontier AI models to enhance governance, drawing parallels to analogous industries while balancing safety oversight with innovation support.

2023

2023 arXiv

International Institutions for Advanced AI

arXiv:2307.04699
With Lewis Ho, Robert Trager, Yoshua Bengio, Miles Brundage, and others

Explores the design of international governance institutions needed to manage risks from advanced AI, drawing on lessons from nuclear nonproliferation and other domains.
