Alignment First, Intelligence Later

Chris Lakin · Mar 30, 2025, 10:26 PM
18 points
5 comments · 3 min read · LW link

[Question] Why do many people who care about AI Safety not clearly endorse PauseAI?

humnrdble · Mar 30, 2025, 6:06 PM
45 points
42 comments · 2 min read · LW link

Enumerating objects a model “knows” using entity-detection features.

Alex Gibson · Mar 30, 2025, 4:58 PM
6 points
2 comments · 6 min read · LW link

Bonn ACX Meetup Spring 2025

Fernand0 · Mar 30, 2025, 3:12 PM
2 points
1 comment · 1 min read · LW link

What does aligning AI to an ideology mean for true alignment?

StanislavKrym · Mar 30, 2025, 3:12 PM
1 point
0 comments · 8 min read · LW link

How to enjoy fail attempts without self-deception (technique)

YanLyutnev · Mar 30, 2025, 1:49 PM
9 points
0 comments · 9 min read · LW link

Memory Persistence within Conversation Threads with Multimodal LLMs

sjay8 · Mar 30, 2025, 7:16 AM
4 points
0 comments · 1 min read · LW link

How I talk to those above me

Maxwell Peterson · Mar 30, 2025, 6:54 AM
102 points
16 comments · 8 min read · LW link

How do SAE Circuits Fail? A Case Study Using a Starts-with-‘E’ Letter Detection Task

adsingh-64 · Mar 30, 2025, 12:47 AM
1 point
0 comments · 3 min read · LW link

Climbing the Hill of Experiments

nomagicpill · Mar 29, 2025, 8:37 PM
4 points
0 comments · 6 min read · LW link
(nomagicpill.github.io)

[Question] Does the AI control agenda broadly rely on no FOOM being possible?

Noosphere89 · Mar 29, 2025, 7:38 PM
22 points
3 comments · 1 min read · LW link

Exercising Rationality

Eggs · Mar 29, 2025, 7:08 PM
4 points
0 comments · 4 min read · LW link

Yeshua’s Basilisk

Alex Beyman · Mar 29, 2025, 6:11 PM
8 points
1 comment · 4 min read · LW link

AI Needs Us? Information Theory and Humans as data

tomdekan · Mar 29, 2025, 3:51 PM
0 points
6 comments · 4 min read · LW link

Auto Shutdown Script

jefftk · Mar 29, 2025, 1:10 PM
16 points
5 comments · 1 min read · LW link
(www.jefftk.com)

Proposal for a Post-Labor Societal Structure to Mitigate ASI Risks: The ‘Game Culture Civilization’ (GCC) Model

Beyond Singularity · Mar 29, 2025, 11:31 AM
2 points
0 comments · 4 min read · LW link

Tormenting Gemini 2.5 with the [[[]]][][[]] Puzzle

Czynski · Mar 29, 2025, 2:51 AM
48 points
36 comments · 3 min read · LW link

Singularity Survival Guide: A Bayesian Guide for Navigating the Pre-Singularity Period

mbrooks · Mar 28, 2025, 11:21 PM
6 points
4 comments · 2 min read · LW link

Softmax, Emmett Shear’s new AI startup focused on “Organic Alignment”

Chris Lakin · Mar 28, 2025, 9:23 PM
59 points
1 comment · 1 min read · LW link
(www.corememory.com)

The Pando Problem: Rethinking AI Individuality

Jan_Kulveit · Mar 28, 2025, 9:03 PM
128 points
14 comments · 13 min read · LW link

Selection Pressures on LM Personas

Raymond Douglas · Mar 28, 2025, 8:33 PM
30 points
0 comments · 3 min read · LW link

AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability

DanielFilan · Mar 28, 2025, 6:40 PM
23 points
0 comments · 89 min read · LW link

[Question] Share AI Safety Ideas: Both Crazy and Not. №2

ank · Mar 28, 2025, 5:22 PM
2 points
10 comments · 1 min read · LW link

AI x Bio Workshop

Allison Duettmann · Mar 28, 2025, 5:21 PM
16 points
0 comments · 1 min read · LW link

[Question] How many times faster can the AGI advance the science than humans do?

StanislavKrym · Mar 28, 2025, 3:16 PM
0 points
0 comments · 1 min read · LW link

Gemini 2.5 is the New SoTA

Zvi · Mar 28, 2025, 2:20 PM
52 points
1 comment · 12 min read · LW link
(thezvi.wordpress.com)

Will the Need to Retrain AI Models from Scratch Block a Software Intelligence Explosion?

Tom Davidson · Mar 28, 2025, 2:12 PM
10 points
0 comments · 3 min read · LW link

How We Might All Die in A Year

Greg C · Mar 28, 2025, 1:22 PM
5 points
13 comments · 21 min read · LW link
(x.com)

The vision of Bill Thurston

TsviBT · Mar 28, 2025, 11:45 AM
50 points
34 comments · 4 min read · LW link

What Uniparental Disomy Tells Us About Improper Imprinting in Humans

Morpheus · Mar 28, 2025, 11:24 AM
32 points
1 comment · 6 min read · LW link
(www.tassiloneubauer.com)

Explaining British Naval Dominance During the Age of Sail

Arjun Panickssery · Mar 28, 2025, 5:47 AM
199 points
17 comments · 4 min read · LW link
(arjunpanickssery.substack.com)

Will the AGIs be able to run the civilisation?

StanislavKrym · Mar 28, 2025, 4:50 AM
−4 points
2 comments · 3 min read · LW link

[Question] Is AGI actually that likely to take off given the world energy consumption?

StanislavKrym · Mar 27, 2025, 11:13 PM
2 points
2 comments · 1 min read · LW link

[Linkpost] The value of initiating a pursuit in temporal decision-making

Gunnar_Zarncke · Mar 27, 2025, 9:47 PM
13 points
0 comments · 2 min read · LW link

Alignment through atomic agents

micseydel · Mar 27, 2025, 6:43 PM
−1 points
0 comments · 1 min read · LW link

Machines of Stolen Grace

Riley Tavassoli · Mar 27, 2025, 6:15 PM
2 points
0 comments · 5 min read · LW link

An argument for asexuality

filthy_hedonist · Mar 27, 2025, 6:08 PM
−2 points
10 comments · 1 min read · LW link

On the plausibility of a “messy” rogue AI committing human-like evil

Jacob Griffith · Mar 27, 2025, 6:06 PM
6 points
0 comments · 7 min read · LW link

AI Moral Alignment: The Most Important Goal of Our Generation

Ronen Bar · Mar 27, 2025, 6:04 PM
3 points
0 comments · 8 min read · LW link
(forum.effectivealtruism.org)

Tracing the Thoughts of a Large Language Model

Adam Jermyn · Mar 27, 2025, 5:20 PM
304 points
24 comments · 10 min read · LW link
(www.anthropic.com)

Computational Superposition in a Toy Model of the U-AND Problem

Adam Newgas · Mar 27, 2025, 4:56 PM
18 points
2 comments · 11 min read · LW link

Mistral Large 2 (123B) seems to exhibit alignment faking

Mar 27, 2025, 3:39 PM
80 points
4 comments · 13 min read · LW link

AIS Netherlands is looking for a Founding Executive Director (EOI form)

Mar 27, 2025, 3:30 PM
15 points
0 comments · 4 min read · LW link

AI #109: Google Fails Marketing Forever

Zvi · Mar 27, 2025, 2:50 PM
42 points
12 comments · 35 min read · LW link
(thezvi.wordpress.com)

What life will be like for humans if aligned ASI is created

james oofou · Mar 27, 2025, 10:06 AM
3 points
6 comments · 2 min read · LW link

What is scaffolding?

Mar 27, 2025, 9:06 AM
10 points
0 comments · 2 min read · LW link
(aisafety.info)

Workflow vs interface vs implementation

Sniffnoy · Mar 27, 2025, 7:38 AM
12 points
0 comments · 1 min read · LW link

Quick thoughts on the difficulty of widely conveying a non-stereotyped position

Sniffnoy · Mar 27, 2025, 7:30 AM
12 points
0 comments · 5 min read · LW link

Doing principle-of-charity better

Sniffnoy · Mar 27, 2025, 5:19 AM
22 points
1 comment · 3 min read · LW link

X as phenomenon vs as policy, Goodhart, and the AB problem

Sniffnoy · Mar 27, 2025, 4:32 AM
13 points
0 comments · 2 min read · LW link