All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 212223 24 25 26 27 28 29 30 31

Job Listing (closed): CBAI Operations Associates

Maite Abadia-Manthei21 Jul 2025 22:53 UTC

1 point

0 comments1 min readLW link

(www.cbai.ai)

If Anyone Builds It, Everyone Dies: Call for Translators (for Supplementary Materials)

yams21 Jul 2025 22:37 UTC

112 points

12 comments1 min readLW link

Why Reality Has A Well-Known Math Bias

Linch21 Jul 2025 22:13 UTC

42 points

20 comments1 min readLW link

(linch.substack.com)

Questions about animal welfare markets

Austin Chen21 Jul 2025 21:54 UTC

9 points

0 comments5 min readLW link

Directly Try Solving Alignment for 5 weeks

Kabir Kumar21 Jul 2025 21:51 UTC

86 points

4 comments6 min readLW link

(beta.ai-plans.com)

Navigating Respect: How to bid boldly, and when to humble yourself preemptively

jimmy21 Jul 2025 20:30 UTC

14 points

2 comments12 min readLW link

Grizzly Man screening, tacos, carlsmith discussion

Quinn21 Jul 2025 19:48 UTC

6 points

0 comments1 min readLW link

[Question] Refining Generalized Hangriness: Emotional Processing as Thinking Tech

M. Key 21 Jul 2025 18:49 UTC

10 points

1 comment7 min readLW link

Detecting High-Stakes Interactions with Activation Probes

Arrrlex, williambankes, Urja Pawar, Phil Blandfort, David Scott Krueger and Dmitrii Krasheninnikov

21 Jul 2025 18:21 UTC

50 points

0 comments4 min readLW link

GDM also claims IMO gold medal

Yair Halberstadt21 Jul 2025 17:18 UTC

61 points

3 comments1 min readLW link

(deepmind.google)

Visualizing AI Alignment Failures as Topological Navigation Errors in Conceptual Space

CC4CI21 Jul 2025 16:54 UTC

1 point

0 comments1 min readLW link

LLM Daydreaming (gwern.net)

Noosphere8921 Jul 2025 16:50 UTC

18 points

2 comments10 min readLW link

(gwern.net)

[Question] Moral realism—basic Q

Dagon21 Jul 2025 16:20 UTC

8 points

12 comments1 min readLW link

HRT in Menopause: A candidate for a case study of epistemology in epidemiology, statistics & medicine

foodforthought21 Jul 2025 16:18 UTC

40 points

2 comments4 min readLW link

Using Older AI Models as a Form of Boycott

Jacob121 Jul 2025 12:18 UTC

6 points

2 comments1 min readLW link

Substack for Best Posts

jefftk21 Jul 2025 12:10 UTC

11 points

1 comment2 min readLW link

(www.jefftk.com)

Monthly Roundup #32: July 2025

Zvi21 Jul 2025 12:00 UTC

41 points

10 comments37 min readLW link

(thezvi.wordpress.com)

Reasons to vote in non-deterministic elections

B Jacobs21 Jul 2025 11:09 UTC

8 points

1 comment8 min readLW link

(bobjacobs.substack.com)

Creative writing with LLMs, part 1: Prompting for fiction

Kaj_Sotala21 Jul 2025 8:47 UTC

39 points

10 comments20 min readLW link

Just Make a New Rule!

Zack_M_Davis21 Jul 2025 5:54 UTC

9 points

25 comments4 min readLW link

[Fiction] Our Trial

Nina Panickssery21 Jul 2025 3:56 UTC

73 points

1 comment3 min readLW link

(ninapanickssery.substack.com)

My First Month with Math Academy: An Experience Report from a Middle School Dropout.

L.M.Sherlock21 Jul 2025 3:18 UTC

5 points

0 comments29 min readLW link

(lmsherlock.substack.com)

AI Safety course intro blog

Boaz Barak21 Jul 2025 2:35 UTC

18 points

0 comments1 min readLW link

(windowsontheory.org)

An Outsider’s Roadmap into AI Safety Research (2025)

Luis M. Montoya21 Jul 2025 2:03 UTC

9 points

4 comments10 min readLW link

[Question] Help me learn more about AI

Mark Tranter21 Jul 2025 1:49 UTC

1 point

0 comments1 min readLW link

Unbounded Embedded Agency: AEDT w.r.t. rOSI

Cole Wyeth20 Jul 2025 23:46 UTC

36 points

0 comments16 min readLW link

AI-Oriented Investments

PeterMcCluskey20 Jul 2025 21:31 UTC

30 points

0 comments1 min readLW link

(bayesianinvestor.com)

On The Shoulders of Substrates—how one phenomenon lays the foundation for the next

James Stephen Brown20 Jul 2025 21:11 UTC

14 points

1 comment3 min readLW link

(nonzerosum.games)

Life of Posts?

jmh20 Jul 2025 21:04 UTC

10 points

3 comments1 min readLW link

LLMs Can’t See Pixels or Characters

Brendan Long20 Jul 2025 20:00 UTC

100 points

44 comments4 min readLW link

(www.brendanlong.com)

Do “adult developmental stages” theories have any pre-theoretic motivation?

Said Achmiz20 Jul 2025 14:37 UTC

35 points

19 comments3 min readLW link

Parallel Parking and possibly Instrumental Convergence

CstineSublime20 Jul 2025 10:37 UTC

2 points

10 comments3 min readLW link

Plato’s Trolley

dr_s20 Jul 2025 10:07 UTC

37 points

11 comments7 min readLW link

Shallow Water is Dangerous Too

jefftk20 Jul 2025 2:30 UTC

236 points

24 comments2 min readLW link

(www.jefftk.com)

Your AI Safety org could get EU funding up to €9.08M. Here’s how (+ free personalized support) Update: Webinar 18/8 Link Below

SamuelK20 Jul 2025 1:30 UTC

68 points

4 comments3 min readLW link

Make More Grayspaces

Duncan Sabien (Inactive)19 Jul 2025 22:22 UTC

314 points

65 comments13 min readLW link

Cheating at Bets with the Even Odds Algorithm

omark19 Jul 2025 22:06 UTC

12 points

3 comments6 min readLW link

Can We Trust the Judge? A novel method of Modelling Human Bias and Systematic Error in Debate-Based Scalable Oversight

Andreea Zaman19 Jul 2025 21:44 UTC

1 point

0 comments7 min readLW link

Peeling Back The Remoteness of Sources

adamShimi19 Jul 2025 17:41 UTC

16 points

1 comment13 min readLW link

(formethods.substack.com)

Sequential Coherence: A Bottleneck in Automation

eeeee, xavi_ferres and felixgaston

19 Jul 2025 15:27 UTC

26 points

2 comments11 min readLW link

How Misaligned AI Personas Lead to Human Extinction – Step by Step

Writer19 Jul 2025 13:59 UTC

14 points

0 comments7 min readLW link

(youtu.be)

L0 is not a neutral hyperparameter

chanind and Adrià Garriga-alonso

19 Jul 2025 13:51 UTC

24 points

3 comments5 min readLW link

From Messy Shelves to Master Librarians: Toy-Model Exploration of Block-Diagonal Geometry in LM Activations

Yuxiao19 Jul 2025 12:26 UTC

6 points

1 comment4 min readLW link

OpenAI Claims IMO Gold Medal

Mikhail Samin19 Jul 2025 9:58 UTC

77 points

74 comments1 min readLW link

(x.com)

On the deep (uncurable?) vulnerability of MCPs

awu19 Jul 2025 2:50 UTC

5 points

6 comments1 min readLW link

(www.generalanalysis.com)

[Question] Best way to ask laypeople for conditional probabilities in a Bayes net?

Zack Friedman19 Jul 2025 2:45 UTC

11 points

1 comment1 min readLW link

[Question] Get sued or kill someone: The trolly problems of Psychological practice.

Brad Dunn18 Jul 2025 23:35 UTC

12 points

2 comments3 min readLW link

resume limiting

bhauth18 Jul 2025 23:31 UTC

18 points

13 comments2 min readLW link

(www.bhauth.com)

[Linkpost] How Am I Getting Along with AI?

Gunnar_Zarncke18 Jul 2025 22:26 UTC

11 points

0 comments1 min readLW link

(jessiefischbein.substack.com)

Agents lag behind AI 2027′s schedule

OhadA18 Jul 2025 21:49 UTC

25 points

7 comments4 min readLW link