All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 232425 26 27 28 29 30 31

Analysis of Variational Sparse Autoencoders

Zach Baker23 Aug 2025 23:58 UTC

12 points

0 comments10 min readLW link

Thoughts About how RLHF and Related “Prosaic” Approaches Could be Used to Create Robustly Aligned AIs.

williawa23 Aug 2025 21:05 UTC

10 points

14 comments4 min readLW link

On the Function of Faith in A Probably-Simulated Universe

testingthewaters23 Aug 2025 20:28 UTC

−8 points

12 comments7 min readLW link

(aclevername.substack.com)

The Data Scaling Hypothesis

harsimony23 Aug 2025 18:18 UTC

5 points

0 comments1 min readLW link

How a Non-Dual Language Could Redefine AI Safety

Marcio Díaz23 Aug 2025 16:40 UTC

1 point

6 comments3 min readLW link

The Great Game: Game Theory for Collective Intelligence

Rome Viharo23 Aug 2025 15:04 UTC

−2 points

0 comments2 min readLW link

The Startup Jungle

Logan Kieller23 Aug 2025 14:59 UTC

7 points

0 comments8 min readLW link

(agenticconjectures.substack.com)

The most common mistakes people make starting EA orgs

KatWoods23 Aug 2025 14:18 UTC

2 points

0 comments4 min readLW link

Futility Illusions

silentbob23 Aug 2025 10:54 UTC

31 points

10 comments5 min readLW link

Legal Personhood—Corporate Ownership & Formation

Stephen Martin23 Aug 2025 5:45 UTC

4 points

0 comments3 min readLW link

AI 2027 Response Followup

SE Gyges23 Aug 2025 4:41 UTC

9 points

3 comments9 min readLW link

(www.lesswrong.com)

Pasta Cooking Time

jefftk23 Aug 2025 3:00 UTC

22 points

1 comment1 min readLW link

(www.jefftk.com)

Pet Ownership

incident-recipient23 Aug 2025 1:54 UTC

11 points

0 comments3 min readLW link

Reflections on writing 15 daily blog posts

CstineSublime23 Aug 2025 1:50 UTC

12 points

0 comments4 min readLW link

How Econ 101 makes us blinder on trade, morals, jobs with AI – and on marginal costs

FlorianH23 Aug 2025 0:59 UTC

21 points

5 comments8 min readLW link

(nearlyfar.org)

Memory Decoding Journal Club: Behavioral time scale synaptic plasticity underlies CA1 place fields

Devin Ward23 Aug 2025 0:53 UTC

1 point

0 comments1 min readLW link

Yudkowsky on “Don’t use p(doom)”

Raemon22 Aug 2025 23:44 UTC

100 points

40 comments4 min readLW link

Banning Said Achmiz (and broader thoughts on moderation)

habryka22 Aug 2025 23:02 UTC

253 points

399 comments30 min readLW link

(∃ Stochastic Natural Latent) Implies (∃ Deterministic Natural Latent)

johnswentworth and David Lorell

22 Aug 2025 21:46 UTC

126 points

10 comments9 min readLW link

One more reason for AI capable of independent moral reasoning: alignment itself and cause prioritisation

Michele Campolo22 Aug 2025 15:53 UTC

−3 points

0 comments3 min readLW link

The Buddhism & AI Initiative

Chris Scammell22 Aug 2025 15:50 UTC

29 points

2 comments2 min readLW link

DeepSeek v3.1 Is Not Having a Moment

Zvi22 Aug 2025 15:50 UTC

41 points

2 comments3 min readLW link

(thezvi.wordpress.com)

Doing good… best?

Michele Campolo22 Aug 2025 15:48 UTC

−1 points

6 comments2 min readLW link

With enough knowledge, any conscious agent acts morally

Michele Campolo22 Aug 2025 15:44 UTC

−2 points

9 comments36 min readLW link

CEO of Microsoft AI’s “Seemingly Conscious AI” Post

Stephen Martin22 Aug 2025 13:58 UTC

64 points

8 comments8 min readLW link

An Introduction to Credal Sets and Infra-Bayes Learnability

Brittany Gelb22 Aug 2025 13:03 UTC

33 points

2 comments13 min readLW link

Legal Personhood—Contracts (Part 2)

Stephen Martin22 Aug 2025 4:53 UTC

5 points

0 comments2 min readLW link

When Money Becomes Power

Gabriel Alfour22 Aug 2025 4:14 UTC

69 points

15 comments7 min readLW link

(cognition.cafe)

Proof Section to an Introduction to Credal Sets and Infra-Bayes Learnability

Brittany Gelb21 Aug 2025 23:11 UTC

13 points

0 comments10 min readLW link

Resampling Conserves Redundancy (Approximately)

johnswentworth and David Lorell

21 Aug 2025 22:43 UTC

66 points

22 comments6 min readLW link

The anti-fragile culture

lincolnquirk21 Aug 2025 21:41 UTC

30 points

1 comment10 min readLW link

A Conservative Vision For AI Alignment

Davidmanheim and Ram Rachum

21 Aug 2025 18:14 UTC

25 points

35 comments12 min readLW link

Emergent morality in AI weakens the Orthogonality Thesis

dawnstrata21 Aug 2025 17:57 UTC

−1 points

3 comments11 min readLW link

Four ways learning Econ makes people dumber re: future AI

Steven Byrnes21 Aug 2025 17:52 UTC

359 points

52 comments6 min readLW link

(x.com)

Memory Decoding Journal Club: Behavioral time scale synaptic plasticity underlies CA1 place fields

Devin Ward21 Aug 2025 16:13 UTC

1 point

0 comments1 min readLW link

Could one country outgrow the rest of the world?

Tom Davidson21 Aug 2025 15:32 UTC

45 points

23 comments17 min readLW link

(newsletter.forethought.org)

What is “Meaningness”

Gordon Seidoh Worley and SpectrumDT

21 Aug 2025 14:57 UTC

11 points

0 comments15 min readLW link

AI #130: Talking Past The Sale

Zvi21 Aug 2025 13:50 UTC

38 points

4 comments60 min readLW link

(thezvi.wordpress.com)

Critiques of FDT Often Stem From Confusion About Newcomblike Problems

Heighn21 Aug 2025 13:19 UTC

7 points

19 comments5 min readLW link

Legal Personhood—Contracts (Part 1)

Stephen Martin21 Aug 2025 5:23 UTC

10 points

0 comments7 min readLW link

Being honest with AIs

Lukas Finnveden21 Aug 2025 3:57 UTC

76 points

6 comments17 min readLW link

(blog.redwoodresearch.org)

ACX Fall Meetup 2025 @ Klang Valley, Malaysia

Yi-Yang21 Aug 2025 3:34 UTC

2 points

0 comments1 min readLW link

French Non-Profit Law: Associations are as cool as American churches

Lucie Philippon20 Aug 2025 22:02 UTC

43 points

6 comments3 min readLW link

AI Safety Comms Retreat

Vishakha20 Aug 2025 20:54 UTC

4 points

0 comments1 min readLW link

The trouble with “enlightenment”

Gordon Seidoh Worley20 Aug 2025 19:00 UTC

15 points

4 comments4 min readLW link

(uncertainupdates.substack.com)

An epistemic advantage of working as a moderate

Buck20 Aug 2025 17:47 UTC

215 points

95 comments4 min readLW link

My AGI timeline updates from GPT-5 (and 2025 so far)

ryan_greenblatt20 Aug 2025 16:11 UTC

169 points

14 comments4 min readLW link

come work on dangerous capability mitigations at Anthropic

Dave Orr20 Aug 2025 15:11 UTC

33 points

9 comments1 min readLW link

AI Companion Conditions

Zvi20 Aug 2025 15:00 UTC

58 points

2 comments10 min readLW link

(thezvi.wordpress.com)

[Question] What to do with pre-order if I live in Russia?

EniScien20 Aug 2025 13:39 UTC

10 points

1 comment2 min readLW link