All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Call on AI Companies: Publish Your Whistleblowing Policies

karl31 Jul 2025 22:04 UTC

20 points

3 comments7 min readLW link

Do Not Render Your Counterfactuals

AlphaAndOmega31 Jul 2025 21:35 UTC

112 points

19 comments5 min readLW link

(open.substack.com)

Emergence Is Beautiful—beauty and meaning in an entropic universe

James Stephen Brown31 Jul 2025 19:00 UTC

8 points

0 comments5 min readLW link

Sharpening the Shears: 8 Lessons from Garden Leave

Jordan Rubin31 Jul 2025 18:57 UTC

8 points

0 comments4 min readLW link

(jordanmrubin.substack.com)

AISN #60: The AI Action Plan

Corin Katzke and Dan H

31 Jul 2025 18:20 UTC

6 points

0 comments4 min readLW link

(newsletter.safe.ai)

Approximating Human Preferences Using a Multi-Judge Learned System

JoseFaustino, eitan sprejer, Fernando Avalos and Augusto Bernardi

31 Jul 2025 18:01 UTC

19 points

0 comments13 min readLW link

Follow-up to “My Empathy Is Rarely Kind”

johnswentworth31 Jul 2025 17:21 UTC

81 points

42 comments2 min readLW link

Book Review: The MANIAC

Annapurna31 Jul 2025 15:18 UTC

15 points

6 comments2 min readLW link

(jorgevelez.substack.com)

Red-Thing-Ism

J Bostock31 Jul 2025 14:09 UTC

103 points

9 comments3 min readLW link

AI #127: Continued Claude Code Complications

Zvi31 Jul 2025 13:40 UTC

32 points

4 comments43 min readLW link

(thezvi.wordpress.com)

I am worried about near-term non-LLM AI developments

testingthewaters31 Jul 2025 13:15 UTC

260 points

60 comments5 min readLW link

What do we do about the Inevitable?

CSDD31 Jul 2025 10:22 UTC

−7 points

0 comments4 min readLW link

[Question] Several questions about Zen koans

Said Achmiz31 Jul 2025 6:35 UTC

24 points

21 comments3 min readLW link

Beyond Hangriness: A Deeper Framework for Emotional Clarity

jaredclucas30 Jul 2025 23:59 UTC

−7 points

0 comments5 min readLW link

LLMs Are Already Misaligned: Simple Experiments Prove It

Mackam30 Jul 2025 23:48 UTC

12 points

10 comments7 min readLW link

Replicators—Pandora’s dangerous children

James Stephen Brown30 Jul 2025 22:39 UTC

20 points

2 comments3 min readLW link

Exploration hacking: can reasoning models subvert RL?

Damon Falck, Joschka Braun and Eyon Jang

30 Jul 2025 22:02 UTC

25 points

4 comments9 min readLW link

[Research Note] Optimizing The Final Output Can Obfuscate CoT

lukemarks, jacob_drori, cloud and TurnTrout

30 Jul 2025 21:26 UTC

203 points

23 comments6 min readLW link

A Timing Problem for Instrumental Convergence

rhys southan30 Jul 2025 19:15 UTC

3 points

45 comments1 min readLW link

(link.springer.com)

Childhood and Education: College Admissions

Zvi30 Jul 2025 17:40 UTC

54 points

11 comments18 min readLW link

(thezvi.wordpress.com)

Apply to SPAR Fall 2025—80+ projects!

agucova30 Jul 2025 17:34 UTC

19 points

0 comments1 min readLW link

Dimensions of logical time as economic strategies

tayzzyronth30 Jul 2025 16:56 UTC

10 points

2 comments7 min readLW link

On Wireheading

Dave92F130 Jul 2025 16:26 UTC

10 points

4 comments3 min readLW link

Uncertain Updates: July 2025

Gordon Seidoh Worley30 Jul 2025 14:50 UTC

8 points

0 comments2 min readLW link

(uncertainupdates.substack.com)

Will AGI Emerge Through Self-Generated Reward Loops?

Moksh Nirvaan30 Jul 2025 13:17 UTC

5 points

0 comments1 min readLW link

Sex Determination as a Bottleneck to Species Development

Morpheus30 Jul 2025 8:27 UTC

20 points

5 comments1 min readLW link

[Question] When will the Fooming Shoggoths songs from LessOnline 2025 come out?

Brendan Long30 Jul 2025 4:04 UTC

15 points

1 comment1 min readLW link

My Empathy Is Rarely Kind

johnswentworth30 Jul 2025 3:49 UTC

73 points

233 comments4 min readLW link

Pitfalls of Building UDT Agents

Cole Wyeth30 Jul 2025 3:27 UTC

27 points

5 comments7 min readLW link

China proposes new global AI cooperation organisation

Matrice Jacobine30 Jul 2025 2:50 UTC

85 points

8 comments1 min readLW link

(www.reuters.com)

Neel Nanda MATS Applications Open (Due Aug 29)

Neel Nanda30 Jul 2025 0:55 UTC

23 points

0 comments7 min readLW link

(tinyurl.com)

Better than logarithmic returns to reasoning?

Oliver Sourbut30 Jul 2025 0:50 UTC

14 points

5 comments3 min readLW link

(www.oliversourbut.net)

They’re a simulation and you must love anyway

Andrew Huang30 Jul 2025 0:01 UTC

9 points

0 comments17 min readLW link

Critique of “The Case for Strong Longtermism”

Zeren29 Jul 2025 23:58 UTC

1 point

0 comments2 min readLW link

Jagged Vs. Continuous intelligence

Mohsen29 Jul 2025 23:57 UTC

0 points

0 comments1 min readLW link

[Question] Are two potentially simple techniques an example of Mencken’s law?

StanislavKrym29 Jul 2025 23:37 UTC

4 points

4 comments2 min readLW link

The many paths to permanent disempowerment even with shutdownable AIs (MATS project summary for feedback)

GideonF29 Jul 2025 23:20 UTC

64 points

8 comments9 min readLW link

Against racing to AGI: Cooperation, deterrence, and catastrophic risks

Max_He-Ho29 Jul 2025 22:23 UTC

4 points

0 comments1 min readLW link

(philpapers.org)

Very Light Hardshell Suitcases

jefftk29 Jul 2025 20:10 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

Misalignments and RL failure modes in the early stage of superintelligence

shu yang29 Jul 2025 18:23 UTC

13 points

0 comments13 min readLW link

Low P(x-risk) as the Bailey for Low P(doom)

Vladimir_Nesov29 Jul 2025 18:01 UTC

50 points

29 comments2 min readLW link

Building Black-box Scheming Monitors

CorrigibleAgent, richbc, Simon Storf and Marius Hobbhahn

29 Jul 2025 17:41 UTC

46 points

18 comments11 min readLW link

Deliberative Credit Assignment (DCA): Making Faithful Reasoning Profitable

Florian_Dietz29 Jul 2025 16:23 UTC

9 points

0 comments17 min readLW link

Want to work in US emerging technology policy? Horizon fellowship applications are live

PolicyTakes29 Jul 2025 16:15 UTC

12 points

0 comments1 min readLW link

(horizonpublicservice.org)

Spilling the Tea

Zvi29 Jul 2025 14:20 UTC

34 points

8 comments12 min readLW link

(thezvi.wordpress.com)

How one logical fallacy killed God, corrupted Science and now fuels the AI race

Jáchym Fibír29 Jul 2025 13:50 UTC

−39 points

10 comments7 min readLW link

(www.phiand.ai)

About 30% of Humanity’s Last Exam chemistry/biology answers are likely wrong

bohaska29 Jul 2025 11:59 UTC

212 points

11 comments4 min readLW link

(www.futurehouse.org)

People Are Less Happy Than They Seem

Jakub Halmeš29 Jul 2025 6:03 UTC

20 points

6 comments1 min readLW link

(unpredictabletokens.substack.com)

I wrote a song parody

CronoDAS29 Jul 2025 6:00 UTC

44 points

4 comments1 min readLW link

Teaching kids to swim

Steven Byrnes29 Jul 2025 3:10 UTC

56 points

13 comments3 min readLW link