Corrigibility’s Desirability is Timing-Sensitive

RobertM · Dec 26, 2024, 10:24 PM
29 points
4 comments · 3 min read · LW link

PCR retrospective

bhauth · Dec 26, 2024, 9:20 PM
24 points
0 comments · 8 min read · LW link
(bhauth.com)

AI #96: o3 But Not Yet For Thee

Zvi · Dec 26, 2024, 8:30 PM
58 points
8 comments · 36 min read · LW link
(thezvi.wordpress.com)

Super human AI is a very low hanging fruit!

Hzn · Dec 26, 2024, 7:00 PM
−4 points
0 comments · 7 min read · LW link

The Field of AI Alignment: A Postmortem, and What To Do About It

johnswentworth · Dec 26, 2024, 6:48 PM
302 points
160 comments · 8 min read · LW link

ReSolsticed vol I: “We’re Not Going Quietly”

Raemon · Dec 26, 2024, 5:52 PM
61 points
4 comments · 19 min read · LW link

[Question] Are Sparse Autoencoders a good idea for AI control?

Gerard Boxo · Dec 26, 2024, 5:34 PM
3 points
4 comments · 1 min read · LW link

A Three-Layer Model of LLM Psychology

Jan_Kulveit · Dec 26, 2024, 4:49 PM
218 points
13 comments · 8 min read · LW link

Human, All Too Human—Superintelligence requires learning things we can’t teach

Ben Turtel · Dec 26, 2024, 4:26 PM
−13 points
4 comments · 1 min read · LW link
(bturtel.substack.com)

[Question] Why don’t we currently have AI agents?

ChristianKl · Dec 26, 2024, 3:26 PM
8 points
10 comments · 1 min read · LW link

[Question] What would be the IQ and other benchmarks of o3 that uses $1 million worth of compute resources to answer one question?

avturchin · Dec 26, 2024, 11:08 AM
16 points
2 comments · 1 min read · LW link

The Economics & Practicality of Starting Mars Colonization

Zero Contradictions · Dec 26, 2024, 10:56 AM
2 points
1 comment · 1 min read · LW link
(zerocontradictions.net)

Terminal goal vs Intelligence

Donatas Lučiūnas · Dec 26, 2024, 8:10 AM
−12 points
24 comments · 1 min read · LW link

Streamlining my voice note process

Vlad Sitalo · Dec 26, 2024, 6:04 AM
6 points
1 comment · 7 min read · LW link
(vlad.roam.garden)

Whistleblowing Twitter Bot

Mckiev · Dec 26, 2024, 4:09 AM
19 points
5 comments · 2 min read · LW link

Open Thread Winter 2024/2025

habryka · Dec 25, 2024, 9:02 PM
23 points
59 comments · 1 min read · LW link

Exploring Cooperation: The Path to Utopia

Davidmanheim · Dec 25, 2024, 6:31 PM
11 points
0 comments · LW link
(exploringcooperation.substack.com)

Living with Rats in College

lsusr · Dec 25, 2024, 10:44 AM
28 points
0 comments · 1 min read · LW link

[Question] What Have Been Your Most Valuable Casual Conversations At Conferences?

johnswentworth · Dec 25, 2024, 5:49 AM
54 points
21 comments · 1 min read · LW link

The Opening Salvo: 1. An Ontological Consciousness Metric: Resistance to Behavioral Modification as a Measure of Recursive Awareness

Peterpiper · Dec 25, 2024, 2:29 AM
−3 points
0 comments · 5 min read · LW link

The Deep Lore of LightHaven, with Oliver Habryka (TBC episode 228)

Dec 24, 2024, 10:45 PM
45 points
4 comments · 91 min read · LW link
(thebayesianconspiracy.substack.com)

Acknowledging Background Information with P(Q|I)

JenniferRM · Dec 24, 2024, 6:50 PM
29 points
8 comments · 14 min read · LW link

Game Theory and Behavioral Economics in The Stock Market

Jaiveer Singh · Dec 24, 2024, 6:15 PM
1 point
0 comments · 3 min read · LW link

[Question] What are the main arguments against AGI?

Edy Nastase · Dec 24, 2024, 3:49 PM
1 point
6 comments · 1 min read · LW link

[Question] Recommendations on communities that discuss AI applications in society

Annapurna · Dec 24, 2024, 1:37 PM
7 points
2 comments · 1 min read · LW link

AIs Will Increasingly Fake Alignment

Zvi · Dec 24, 2024, 1:00 PM
89 points
0 comments · 52 min read · LW link
(thezvi.wordpress.com)

Apply to the 2025 PIBBSS Summer Research Fellowship

Dec 24, 2024, 10:25 AM
15 points
0 comments · 2 min read · LW link

Human-AI Complementarity: A Goal for Amplified Oversight

Dec 24, 2024, 9:57 AM
27 points
4 comments · 1 min read · LW link
(deepmindsafetyresearch.medium.com)

Preliminary Thoughts on Flirting Theory

Alice Blair · Dec 24, 2024, 7:37 AM
14 points
6 comments · 3 min read · LW link

[Question] Why is neuron count of human brain relevant to AI timelines?

samuelshadrach · Dec 24, 2024, 5:15 AM
6 points
7 comments · 1 min read · LW link

How Much to Give is a Pragmatic Question

jefftk · Dec 24, 2024, 4:20 AM
12 points
1 comment · 2 min read · LW link
(www.jefftk.com)

Do you need a better map of your myriad of maps to the territory?

CstineSublime · Dec 24, 2024, 2:00 AM
11 points
2 comments · 5 min read · LW link

Panology

JenniferRM · Dec 23, 2024, 9:40 PM
17 points
10 comments · 5 min read · LW link

Aristotle, Aquinas, and the Evolution of Teleology: From Purpose to Meaning.

Spiritus Dei · Dec 23, 2024, 7:37 PM
−9 points
0 comments · 6 min read · LW link

People aren’t properly calibrated on FrontierMath

cakubilo · Dec 23, 2024, 7:35 PM
31 points
4 comments · 3 min read · LW link

Near- and medium-term AI Control Safety Cases

Martín Soto · Dec 23, 2024, 5:37 PM
9 points
0 comments · 6 min read · LW link

[Rationality Malaysia] 2024 year-end meetup!

Doris Liew · Dec 23, 2024, 4:02 PM
1 point
0 comments · 1 min read · LW link

Printable book of some rationalist creative writing (from Scott A. & Eliezer)

CounterBlunder · Dec 23, 2024, 3:44 PM
10 points
0 comments · 1 min read · LW link

Monthly Roundup #25: December 2024

Zvi · Dec 23, 2024, 2:20 PM
18 points
3 comments · 26 min read · LW link
(thezvi.wordpress.com)

Exploring the petertodd / Leilan duality in GPT-2 and GPT-J

mwatkins · Dec 23, 2024, 1:17 PM
12 points
1 comment · 17 min read · LW link

[Question] What are the strongest arguments for very short timelines?

Kaj_Sotala · Dec 23, 2024, 9:38 AM
101 points
79 comments · 1 min read · LW link

Reduce AI Self-Allegiance by saying “he” instead of “I”

Knight Lee · Dec 23, 2024, 9:32 AM
10 points
4 comments · 2 min read · LW link

Funding Case: AI Safety Camp 11

Dec 23, 2024, 8:51 AM
60 points
4 comments · 6 min read · LW link
(manifund.org)

What is compute governance?

Dec 23, 2024, 6:32 AM
6 points
0 comments · 2 min read · LW link
(aisafety.info)

Stop Making Sense

JenniferRM · Dec 23, 2024, 5:16 AM
16 points
0 comments · 3 min read · LW link

Hire (or Become) a Thinking Assistant

Raemon · Dec 23, 2024, 3:58 AM
138 points
49 comments · 8 min read · LW link

Non-Obvious Benefits of Insurance

jefftk · Dec 23, 2024, 3:40 AM
21 points
5 comments · 2 min read · LW link
(www.jefftk.com)

Vision of a positive Singularity

RussellThor · Dec 23, 2024, 2:19 AM
4 points
0 comments · 4 min read · LW link

Ideologies are slow and necessary, for now

Gabriel Alfour · Dec 23, 2024, 1:57 AM
15 points
1 comment · 1 min read · LW link
(cognition.cafe)

[Question] Has Anthropic checked if Claude fakes alignment for intended values too?

Maloew · Dec 23, 2024, 12:43 AM
4 points
1 comment · 1 min read · LW link