Beyond Hangriness: A Deeper Framework for Emotional Clarity

jaredclucas · 30 Jul 2025 23:59 UTC
−7 points
0 comments · 5 min read · LW link

LLMs Are Already Misaligned: Simple Experiments Prove It

Mackam · 30 Jul 2025 23:48 UTC
12 points
10 comments · 7 min read · LW link

Replicators—Pandora’s dangerous children

James Stephen Brown · 30 Jul 2025 22:39 UTC
19 points
2 comments · 3 min read · LW link

Exploration hacking: can reasoning models subvert RL?

30 Jul 2025 22:02 UTC
16 points
4 comments · 9 min read · LW link

Optimizing The Final Output Can Obfuscate CoT (Research Note)

30 Jul 2025 21:26 UTC
196 points
22 comments · 6 min read · LW link

A Timing Problem for Instrumental Convergence

rhys southan · 30 Jul 2025 19:15 UTC
2 points
44 comments · 1 min read · LW link
(link.springer.com)

Childhood and Education: College Admissions

Zvi · 30 Jul 2025 17:40 UTC
51 points
11 comments · 18 min read · LW link
(thezvi.wordpress.com)

Apply to SPAR Fall 2025—80+ projects!

agucova · 30 Jul 2025 17:34 UTC
19 points
0 comments · 1 min read · LW link

Dimensions of logical time as economic strategies

tayzzyronth · 30 Jul 2025 16:56 UTC
10 points
2 comments · 7 min read · LW link

On Wireheading

Dave92F1 · 30 Jul 2025 16:26 UTC
9 points
4 comments · 3 min read · LW link

Uncertain Updates: July 2025

Gordon Seidoh Worley · 30 Jul 2025 14:50 UTC
8 points
0 comments · 2 min read · LW link
(uncertainupdates.substack.com)

Will AGI Emerge Through Self-Generated Reward Loops?

Moksh Nirvaan · 30 Jul 2025 13:17 UTC
5 points
0 comments · 1 min read · LW link

Sex Determination as a Bottleneck to Species Development

Morpheus · 30 Jul 2025 8:27 UTC
20 points
5 comments · 1 min read · LW link

[Question] When will the Fooming Shoggoths songs from LessOnline 2025 come out?

Brendan Long · 30 Jul 2025 4:04 UTC
15 points
1 comment · 1 min read · LW link

My Empathy Is Rarely Kind

johnswentworth · 30 Jul 2025 3:49 UTC
73 points
230 comments · 4 min read · LW link

Pitfalls of Building UDT Agents

Cole Wyeth · 30 Jul 2025 3:27 UTC
26 points
5 comments · 7 min read · LW link

China proposes new global AI cooperation organisation

Matrice Jacobine · 30 Jul 2025 2:50 UTC
84 points
8 comments · 1 min read · LW link
(www.reuters.com)

Neel Nanda MATS Applications Open (Due Aug 29)

Neel Nanda · 30 Jul 2025 0:55 UTC
22 points
0 comments · 7 min read · LW link
(tinyurl.com)

Better than logarithmic returns to reasoning?

Oliver Sourbut · 30 Jul 2025 0:50 UTC
14 points
5 comments · 2 min read · LW link

They’re a simulation and you must love anyway

Andrew Huang · 30 Jul 2025 0:01 UTC
9 points
0 comments · 17 min read · LW link

Critique of “The Case for Strong Longtermism”

Zeren · 29 Jul 2025 23:58 UTC
1 point
0 comments · 2 min read · LW link

Jagged Vs. Continuous intelligence

Mohsen · 29 Jul 2025 23:57 UTC
0 points
0 comments · 1 min read · LW link

[Question] Are two potentially simple techniques an example of Mencken’s law?

StanislavKrym · 29 Jul 2025 23:37 UTC
4 points
4 comments · 2 min read · LW link

The many paths to permanent disempowerment even with shutdownable AIs (MATS project summary for feedback)

GideonF · 29 Jul 2025 23:20 UTC
55 points
6 comments · 9 min read · LW link

Against racing to AGI: Cooperation, deterrence, and catastrophic risks

Max_He-Ho · 29 Jul 2025 22:23 UTC
4 points
0 comments · 1 min read · LW link
(philpapers.org)

Very Light Hardshell Suitcases

jefftk · 29 Jul 2025 20:10 UTC
9 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Misalignments and RL failure modes in the early stage of superintelligence

shu yang · 29 Jul 2025 18:23 UTC
13 points
0 comments · 13 min read · LW link

Low P(x-risk) as the Bailey for Low P(doom)

Vladimir_Nesov · 29 Jul 2025 18:01 UTC
48 points
29 comments · 2 min read · LW link

Building Black-box Scheming Monitors

29 Jul 2025 17:41 UTC
39 points
18 comments · 11 min read · LW link

Deliberative Credit Assignment (DCA): Making Faithful Reasoning Profitable

Florian_Dietz · 29 Jul 2025 16:23 UTC
9 points
0 comments · 17 min read · LW link

Want to work in US emerging technology policy? Horizon fellowship applications are live

PolicyTakes · 29 Jul 2025 16:15 UTC
12 points
0 comments · 1 min read · LW link
(horizonpublicservice.org)

Spilling the Tea

Zvi · 29 Jul 2025 14:20 UTC
34 points
8 comments · 12 min read · LW link
(thezvi.wordpress.com)

How one logical fallacy killed God, corrupted Science and now fuels the AI race

Jáchym Fibír · 29 Jul 2025 13:50 UTC
−39 points
10 comments · 7 min read · LW link
(www.phiand.ai)

About 30% of Humanity’s Last Exam chemistry/biology answers are likely wrong

bohaska · 29 Jul 2025 11:59 UTC
208 points
10 comments · 4 min read · LW link
(www.futurehouse.org)

People Are Less Happy Than They Seem

Jakub Halmeš · 29 Jul 2025 6:03 UTC
19 points
6 comments · 1 min read · LW link
(unpredictabletokens.substack.com)

I wrote a song parody

CronoDAS · 29 Jul 2025 6:00 UTC
41 points
3 comments · 1 min read · LW link

Teaching kids to swim

Steven Byrnes · 29 Jul 2025 3:10 UTC
55 points
12 comments · 3 min read · LW link

Recursions on LessOnline 2025

Error · 29 Jul 2025 0:09 UTC
53 points
10 comments · 19 min read · LW link

A Precocious Baby’s Guide to Anthropics

Linch · 28 Jul 2025 22:20 UTC
23 points
0 comments · 2 min read · LW link
(linch.substack.com)

Simplex Progress Report—July 2025

28 Jul 2025 21:58 UTC
107 points
2 comments · 15 min read · LW link

Someone should fund an AGI Blockbuster

pinto · 28 Jul 2025 21:14 UTC
5 points
11 comments · 4 min read · LW link

On the Evolvability of Biological Immortality

simon4603 · 28 Jul 2025 21:13 UTC
20 points
0 comments · 35 min read · LW link

Is Interpretability for Control or for Science?

James Enouen · 28 Jul 2025 21:12 UTC
3 points
0 comments · 3 min read · LW link

Procrastination Drill

silentbob · 28 Jul 2025 20:54 UTC
62 points
8 comments · 2 min read · LW link

Transcript: OpenAI’s Chief Economist and COO interviewed about AI’s economic impacts

sjadler · 28 Jul 2025 20:40 UTC
14 points
0 comments · 49 min read · LW link

The necessity of security for play, and play for seeing reality

jimmy · 28 Jul 2025 19:27 UTC
18 points
0 comments · 19 min read · LW link

The AGI Awakeness valley of doom and three pathways to slowing

GideonF · 28 Jul 2025 18:44 UTC
19 points
0 comments · 16 min read · LW link
(open.substack.com)

[Question] What are non-obvious class markers?

Max Niederman · 28 Jul 2025 15:35 UTC
14 points
14 comments · 1 min read · LW link

How to Update If Pre-Training is Dead

Noah Birnbaum · 28 Jul 2025 14:47 UTC
23 points
4 comments · 1 min read · LW link

AI Companion Piece

Zvi · 28 Jul 2025 12:30 UTC
38 points
5 comments · 13 min read · LW link
(thezvi.wordpress.com)