The Alignment Project Research Agenda

The Alignment Project is a global fund of over £15 million, dedicated to accelerating progress in AI control and alignment research. It is backed by an international coalition of governments, industry, venture capital and philanthropic funders.

This sequence sets out the research areas we are excited to fund; we hope this list of research ideas offers a novel contribution to the alignment field. We have deliberately focused on areas that we think the AI safety community currently underrates.

Apply now to join researchers worldwide in advancing AI safety.

For those with experience scaling and running ambitious projects, apply to our Strategy & Operations role here.

The Alignment Project by UK AISI

Research Areas in Information Theory and Cryptography (The Alignment Project by UK AISI)

Research Areas in Computational Complexity Theory (The Alignment Project by UK AISI)

Research Areas in Economic Theory and Game Theory (The Alignment Project by UK AISI)

Research Areas in Probabilistic Methods (The Alignment Project by UK AISI)

Research Areas in Learning Theory (The Alignment Project by UK AISI)

Research Areas in Evaluation and Guarantees in Reinforcement Learning (The Alignment Project by UK AISI)

Research Areas in Cognitive Science (The Alignment Project by UK AISI)

Research Areas in Interpretability (The Alignment Project by UK AISI)

Research Areas in Benchmark Design and Evaluation (The Alignment Project by UK AISI)

Research Areas in Methods for Post-training and Elicitation (The Alignment Project by UK AISI)

Research Areas in AI Control (The Alignment Project by UK AISI)