All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 345 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Making progress bars for Alignment

Kabir Kumar3 Jan 2025 21:25 UTC

2 points

0 comments1 min readLW link

(lu.ma)

The Intelligence Curse

lukedrago3 Jan 2025 19:07 UTC

155 points

27 comments18 min readLW link

(lukedrago.substack.com)

Introducing Squiggle AI

ozziegooen3 Jan 2025 17:53 UTC

92 points

15 comments8 min readLW link

Human study on AI spear phishing campaigns

Simon Lermen, Fred Heiding and Andrew Kao

3 Jan 2025 15:11 UTC

81 points

8 comments5 min readLW link

Mearsheimer’s Double Standard: Realism for Russia, Idealism for Israel

Ghdz3 Jan 2025 13:52 UTC

−15 points

2 comments4 min readLW link

The subset parity learning problem: much more than you wanted to know

Dmitry Vaintrob3 Jan 2025 9:13 UTC

107 points

19 comments11 min readLW link

Building AI safety benchmark environments on themes of universal human values

Roland Pihlakas and Three Laws

3 Jan 2025 4:24 UTC

18 points

3 comments12 min readLW link

(docs.google.com)

Emotional Superrationality

nullproxy2 Jan 2025 22:54 UTC

−6 points

4 comments11 min readLW link

Playing with Otamatones

jefftk2 Jan 2025 19:50 UTC

12 points

0 comments1 min readLW link

(www.jefftk.com)

7. Iterate the Game: Racing Where?

Allison Duettmann2 Jan 2025 19:06 UTC

11 points

0 comments9 min readLW link

6. Increase Intelligence: Welcome AI Players

Allison Duettmann2 Jan 2025 19:06 UTC

6 points

1 comment19 min readLW link

5. Uphold Voluntarism: Digital Defense

Allison Duettmann2 Jan 2025 19:05 UTC

3 points

0 comments18 min readLW link

4. Uphold Voluntarism: Physical Defense

Allison Duettmann2 Jan 2025 19:04 UTC

6 points

2 comments23 min readLW link

3. Improve Cooperation: Better Technologies

Allison Duettmann2 Jan 2025 19:03 UTC

5 points

2 comments23 min readLW link

2. Skim the Manual: Intelligent Voluntary Cooperation

Allison Duettmann2 Jan 2025 19:02 UTC

13 points

3 comments18 min readLW link

1. Meet the Players: Value Diversity

Allison Duettmann2 Jan 2025 19:00 UTC

32 points

2 comments10 min readLW link

Preface

Allison Duettmann2 Jan 2025 18:59 UTC

31 points

2 comments7 min readLW link

The AI Agent Revolution: Beyond the Hype of 2025

DimaG2 Jan 2025 18:55 UTC

−7 points

1 comment28 min readLW link

On False Dichotomies

nullproxy2 Jan 2025 18:54 UTC

−3 points

0 comments5 min readLW link

Preference Inversion

Benquo2 Jan 2025 18:15 UTC

54 points

48 comments4 min readLW link

(benjaminrosshoffman.com)

Alignment Is Not All You Need

Adam Jones2 Jan 2025 17:50 UTC

45 points

10 comments6 min readLW link

(adamjones.me)

What’s the short timeline plan?

Marius Hobbhahn2 Jan 2025 14:59 UTC

373 points

51 comments23 min readLW link

AI #97: 4

Zvi2 Jan 2025 14:10 UTC

45 points

4 comments40 min readLW link

(thezvi.wordpress.com)

[Question] Can private companies test LVTs?

Yair Halberstadt2 Jan 2025 11:08 UTC

7 points

0 comments1 min readLW link

Grammars, subgrammars, and combinatorics of generalization in transformers

Dmitry Vaintrob2 Jan 2025 9:37 UTC

36 points

0 comments17 min readLW link

[Question] 2025 Alignment Predictions

anaguma2 Jan 2025 5:37 UTC

3 points

3 comments1 min readLW link

Grading my 2024 AI predictions

Nikola Jurkovic2 Jan 2025 5:01 UTC

19 points

1 comment3 min readLW link

Practicing Bayesian Epistemology with “Two Boys” Probability Puzzles

Liron2 Jan 2025 4:42 UTC

43 points

14 comments6 min readLW link

Implications of Moral Realism on AI Safety

Myles H2 Jan 2025 2:58 UTC

7 points

1 comment3 min readLW link

Read The Sequences As If They Were Written Today

Peter Berggren2 Jan 2025 2:51 UTC

65 points

7 comments4 min readLW link

A Collection of Empirical Frames about Language Models

Daniel Tan2 Jan 2025 2:49 UTC

27 points

0 comments3 min readLW link

My January alignment theory Nanowrimo

Dmitry Vaintrob2 Jan 2025 0:07 UTC

53 points

2 comments2 min readLW link

Intranasal mRNA Vaccines?

J Bostock1 Jan 2025 23:46 UTC

26 points

2 comments3 min readLW link

Example of GPU-accelerated scientific computing with PyTorch

Tahp1 Jan 2025 23:01 UTC

6 points

0 comments6 min readLW link

(passwordpaper.com)

Economic Post-ASI Transition

Joel Burget1 Jan 2025 22:37 UTC

19 points

11 comments1 min readLW link

2024 in AI predictions

jessicata1 Jan 2025 20:29 UTC

127 points

3 comments8 min readLW link

Approaches to Group Singing

jefftk1 Jan 2025 12:50 UTC

12 points

1 comment3 min readLW link

(www.jefftk.com)

Alienable (not Inalienable) Right to Buy

FlorianH1 Jan 2025 12:19 UTC

9 points

7 comments4 min readLW link

AGI is what generates evolutionarily fit and novel information

onur1 Jan 2025 9:22 UTC

1 point

0 comments6 min readLW link

(solmaz.io)

The OODA Loop—Observe, Orient, Decide, Act

Davis_Kingsley1 Jan 2025 8:00 UTC

55 points

2 comments11 min readLW link

Comment on “Death and the Gorgon”

Zack_M_Davis1 Jan 2025 5:47 UTC

124 points

35 comments8 min readLW link

Fireplace and Candle Smoke

jefftk1 Jan 2025 1:50 UTC

37 points

4 comments1 min readLW link

(www.jefftk.com)

Riffing on Machines of Loving Grace

an1lam1 Jan 2025 1:06 UTC

9 points

0 comments1 min readLW link

(an1lam.substack.com)

new chinese stealth aircraft

bhauth1 Jan 2025 0:19 UTC

58 points

3 comments6 min readLW link

(bhauth.com)

The Roots of Progress 2024 in review

jasoncrawford1 Jan 2025 0:02 UTC

27 points

0 comments11 min readLW link

(newsletter.rootsofprogress.org)

Genesis

PeterMcCluskey31 Dec 2024 22:01 UTC

18 points

0 comments2 min readLW link

(bayesianinvestor.com)

Favorite colors of some LLMs.

Canaletto31 Dec 2024 21:22 UTC

10 points

3 comments7 min readLW link

My AGI safety research—2024 review, ’25 plans

Steven Byrnes31 Dec 2024 21:05 UTC

111 points

5 comments8 min readLW link 1 review

How Business Solved (?) the Human Alignment Problem

Gianluca Calcagni31 Dec 2024 20:39 UTC

−2 points

1 comment8 min readLW link

Turing-Test-Passing AI implies Aligned AI

Roko31 Dec 2024 19:59 UTC

−9 points

31 comments5 min readLW link