All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 789 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

$300 for the best sci-fi prompt: the results

RomanSJan 3, 2024, 7:10 PM

16 points

19 comments7 min readLW link

Agent membranes/boundaries and formalizing “safety”

ChipmonkJan 3, 2024, 5:55 PM

26 points

46 comments3 min readLW link

Safety First: safety before full alignment. The deontic sufficiency hypothesis.

ChipmonkJan 3, 2024, 5:55 PM

48 points

3 comments3 min readLW link

Practically A Book Review: Appendix to “Nonlinear’s Evidence: Debunking False and Misleading Claims” (ThingOfThings)

tailcalledJan 3, 2024, 5:07 PM

111 points

25 comments2 min readLW link

(thingofthings.substack.com)

Trivial Mathematics as a Path Forward

ACrackedPotJan 3, 2024, 4:41 PM

−4 points

2 comments2 min readLW link

Copyright Confrontation #1

ZviJan 3, 2024, 3:50 PM

34 points

7 comments18 min readLW link

(thezvi.wordpress.com)

[Question] Theoretically, could we balance the budget painlessly?

Logan ZoellnerJan 3, 2024, 2:46 PM

4 points

12 comments1 min readLW link

Johannes’ Biography

Johannes C. MayerJan 3, 2024, 1:27 PM

24 points

0 comments10 min readLW link

What Helped Me—Kale, Blood, CPAP, X-tiamine, Methylphenidate

Johannes C. MayerJan 3, 2024, 1:22 PM

35 points

12 comments2 min readLW link

[Question] Does LessWrong make a difference when it comes to AI alignment?

PhilosophicalSoulJan 3, 2024, 12:21 PM

18 points

13 comments1 min readLW link

[Question] Terminology: <something>-ware for ML?

Oliver SourbutJan 3, 2024, 11:42 AM

17 points

27 comments1 min readLW link

Trading off Lives

jefftkJan 3, 2024, 3:40 AM

53 points

12 comments2 min readLW link

(www.jefftk.com)

MonoPoly Restricted Trust

ymeskhoutJan 2, 2024, 11:02 PM

42 points

37 comments9 min readLW link

Agent membranes and causal distance

ChipmonkJan 2, 2024, 10:43 PM

20 points

3 comments3 min readLW link

Focusing on Mal-Alignment

John FisherJan 2, 2024, 7:51 PM

1 point

0 comments1 min readLW link

Gentleness and the artificial Other

Joe CarlsmithJan 2, 2024, 6:21 PM

313 points

33 comments11 min readLW link

Otherness and control in the age of AGI

Joe CarlsmithJan 2, 2024, 6:15 PM

43 points

0 comments7 min readLW link

Apologizing is a Core Rationalist Skill

johnswentworthJan 2, 2024, 5:47 PM

156 points

42 comments5 min readLW link

Cortés, AI Risk, and the Dynamics of Competing Conquerors

James_MillerJan 2, 2024, 4:37 PM

14 points

2 comments3 min readLW link

OpenAI’s Preparedness Framework: Praise & Recommendations

Orpheus16Jan 2, 2024, 4:20 PM

66 points

1 comment7 min readLW link

Dating Roundup #2: If At First You Don’t Succeed

ZviJan 2, 2024, 4:00 PM

54 points

29 comments47 min readLW link

(thezvi.wordpress.com)

Looking for Reading Recommendations: Content Moderation, Power & Censorship

Joerg WeissJan 2, 2024, 11:37 AM

2 points

7 comments1 min readLW link

AI Is Not Software

DavidmanheimJan 2, 2024, 7:58 AM

58 points

29 comments5 min readLW link

Are Metaculus AI Timelines Inconsistent?

Chris_LeongJan 2, 2024, 6:47 AM

17 points

7 comments2 min readLW link

Boston Solstice 2023 Retrospective

jefftkJan 2, 2024, 3:10 AM

33 points

0 comments6 min readLW link

(www.jefftk.com)

Steering Llama-2 with contrastive activation additions

Nina Panickssery, Wuschel Schulz, NickGabs, Meg, evhub and TurnTrout

Jan 2, 2024, 12:47 AM

125 points

29 comments8 min readLW link

(arxiv.org)

Twin Cities ACX Meetup—January 2024

Timothy M.Jan 1, 2024, 9:13 PM

1 point

2 comments1 min readLW link

San Francisco ACX Meetup “First Saturday”

guenaelJan 1, 2024, 8:58 PM

1 point

1 comment1 min readLW link

Mech Interp Challenge: January—Deciphering the Caesar Cipher Model

CallumMcDougallJan 1, 2024, 6:03 PM

17 points

0 comments3 min readLW link

Aldix and the Book of Life

villeJan 1, 2024, 5:23 PM

1 point

0 comments4 min readLW link

(medium.com)

Metaculus Hosts ACX 2024 Prediction Contest

ChristianWilliamsJan 1, 2024, 4:38 PM

4 points

0 comments LW link

(www.metaculus.com)

The Act Itself: Exceptionless Moral Norms

SebastianG Jan 1, 2024, 4:06 PM

5 points

11 comments6 min readLW link

Deception Chess

Chris LandJan 1, 2024, 3:40 PM

7 points

2 comments4 min readLW link

Stop talking about p(doom)

Isaac KingJan 1, 2024, 10:57 AM

42 points

22 comments3 min readLW link

[Question] What should a non-genius do in the face of rapid progress in GAI to ensure a decent life?

kalerJan 1, 2024, 8:22 AM

11 points

16 comments1 min readLW link

A hermeneutic net for agency

TsviBTJan 1, 2024, 8:06 AM

58 points

4 comments30 min readLW link

2023 in AI predictions

jessicataJan 1, 2024, 5:23 AM

107 points

35 comments5 min readLW link

Rhythm Stage Setup Components

jefftkJan 1, 2024, 3:10 AM

10 points

4 comments2 min readLW link

(www.jefftk.com)

Bayesian updating in real life is mostly about understanding your hypotheses

Max HJan 1, 2024, 12:10 AM

68 points

4 comments11 min readLW link

Dark Art: Inception

Abu IbrahimDec 31, 2023, 9:09 PM

11 points

0 comments3 min readLW link

A case for AI alignment being difficult

jessicataDec 31, 2023, 7:55 PM

106 points

59 comments15 min readLW link 1 review

(unstableontology.com)

The Roots of Progress 2023 in review

jasoncrawfordDec 31, 2023, 6:16 PM

22 points

0 comments11 min readLW link

(rootsofprogress.org)

Extended Navel-Gazing On My 2023 Donations

jennDec 31, 2023, 6:10 PM

8 points

0 comments LW link

(jenn.site)

aisafety.info, the Table of Content

Charbel-RaphaëlDec 31, 2023, 1:57 PM

23 points

1 comment11 min readLW link

AIOS

samhealyDec 31, 2023, 1:23 PM

−3 points

5 comments6 min readLW link

AI Alignment Metastrategy

Vanessa KosoyDec 31, 2023, 12:06 PM

124 points

13 comments7 min readLW link

[Question] Does the hardness of AI alignment undermine FOOM?

TruePathDec 31, 2023, 11:05 AM

8 points

14 comments1 min readLW link

Speed of Failing

nano_brascaDec 31, 2023, 10:39 AM

8 points

0 comments2 min readLW link

[Question] Estimating Returns to Intelligence vs Numbers, Strength and Looks

TruePathDec 31, 2023, 10:03 AM

3 points

6 comments1 min readLW link

Planning to build a cryptographic box with perfect secrecy

Lysandre TerrisseDec 31, 2023, 9:31 AM

40 points

6 comments11 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer