Leon Lang

Karma: 2,365

I am the Head of Curriculum Development at Iliad, responsible for the content of the Iliad Intensive. Previously, I did a PhD at the University of Amsterdam working on AI Safety and Alignment, and specifically safety risks of Reinforcement Learning from Human Feedback (RLHF). I also worked on abstract multivariate information theory and equivariant deep learning. https://langleon.github.io/

Announcing: Iliad’s Fall 2026 Programs

David Udell, Alexander Gietelink Oldenziel and Leon Lang

30 May 2026 4:37 UTC

64 points

7 comments1 min readLW link

The Iliad Intensive Course Materials

Leon Lang, David Udell and Alexander Gietelink Oldenziel

11 May 2026 18:55 UTC

153 points

4 comments13 min readLW link

(docs.google.com)

What is the Iliad Intensive?

Leon Lang, Alexander Gietelink Oldenziel and David Udell

15 Apr 2026 18:49 UTC

95 points

15 comments2 min readLW link

A Technical Introduction to Solomonoff Induction without K-Complexity

Leon Lang26 Nov 2025 21:36 UTC

76 points

22 comments25 min readLW link

The Coding Theorem — A Link between Complexity and Probability

Leon Lang10 Aug 2025 15:34 UTC

36 points

8 comments9 min readLW link

X explains Z% of the variance in Y

Leon Lang20 Jun 2025 12:17 UTC

160 points

36 comments9 min readLW link

How to work through the ARENA program on your own

Leon Lang3 Jun 2025 17:38 UTC

38 points

5 comments6 min readLW link

[Paper Blogpost] When Your AIs Deceive You: Challenges with Partial Observability in RLHF

Leon Lang22 Oct 2024 13:57 UTC

51 points

2 comments18 min readLW link

(arxiv.org)

We Should Prepare for a Larger Representation of Academia in AI Safety

Leon Lang13 Aug 2023 18:03 UTC

90 points

14 comments5 min readLW link

Andrew Ng wants to have a conversation about extinction risk from AI

Leon Lang5 Jun 2023 22:29 UTC

31 points

2 comments1 min readLW link

(twitter.com)

Evaluating Language Model Behaviours for Shutdown Avoidance in Textual Scenarios

Simon Lermen, Teun van der Weij and Leon Lang

16 May 2023 10:53 UTC

26 points

0 comments13 min readLW link

[Appendix] Natural Abstractions: Key Claims, Theorems, and Critiques

LawrenceC, Erik Jenner and Leon Lang

16 Mar 2023 16:38 UTC

48 points

0 comments13 min readLW link

Natural Abstractions: Key Claims, Theorems, and Critiques

LawrenceC, Leon Lang and Erik Jenner

16 Mar 2023 16:37 UTC

251 points

26 comments45 min readLW link 3 reviews

Andrew Huberman on How to Optimize Sleep

Leon Lang2 Feb 2023 20:17 UTC

37 points

6 comments6 min readLW link

Experiment Idea: RL Agents Evading Learned Shutdownability

Leon Lang16 Jan 2023 22:46 UTC

31 points

7 comments17 min readLW link

(docs.google.com)

Disentangling Shard Theory into Atomic Claims

Leon Lang13 Jan 2023 4:23 UTC

86 points

6 comments18 min readLW link

Citability of Lesswrong and the Alignment Forum

Leon Lang8 Jan 2023 22:12 UTC

51 points

2 comments1 min readLW link

A Short Dialogue on the Meaning of Reward Functions

Leon Lang, Quintin Pope and peligrietzer

19 Nov 2022 21:04 UTC

45 points

0 comments3 min readLW link

Leon Lang’s Shortform

Leon Lang2 Oct 2022 10:05 UTC

2 points

95 comments1 min readLW link

Distribution Shifts and The Importance of AI Safety

Leon Lang29 Sep 2022 22:38 UTC

17 points

2 comments9 min readLW link