Takeaways from calibration training

Olli Järviniemi · Jan 29, 2023, 7:09 PM
45 points
2 comments · 3 min read · LW link · 1 review

Structure, creativity, and novelty

TsviBT · Jan 29, 2023, 2:30 PM
19 points
4 comments · 7 min read · LW link

What is the ground reality of countries taking steps to recalibrate AI development towards Alignment first?

Nebuch · Jan 29, 2023, 1:26 PM
8 points
6 comments · 3 min read · LW link

Compendium of problems with RLHF

Charbel-Raphaël · Jan 29, 2023, 11:40 AM
120 points
16 comments · 10 min read · LW link

My biggest takeaway from Redwood Research REMIX

Alok Singh · Jan 29, 2023, 11:00 AM
0 points
0 comments · 1 min read · LW link
(alok.github.io)

EA novel published on Amazon

Timothy Underwood · Jan 29, 2023, 8:33 AM
17 points
0 comments · LW link

Reverse RSS Stats

jefftk · Jan 29, 2023, 3:40 AM
12 points
2 comments · 1 min read · LW link
(www.jefftk.com)

Why and How to Graduate Early [U.S.]

Tego · Jan 29, 2023, 1:28 AM
53 points
9 comments · 8 min read · LW link · 1 review

Stop-gradients lead to fixed point predictions

Jan 28, 2023, 10:47 PM
37 points
2 comments · 24 min read · LW link

Eli Dourado AMA on the Progress Forum

jasoncrawford · Jan 28, 2023, 10:18 PM
19 points
0 comments · 1 min read · LW link
(rootsofprogress.org)

LW Filter Tags (Rationality/World Modeling now promoted in Latest Posts)

Jan 28, 2023, 10:14 PM
60 points
4 comments · 3 min read · LW link

No Fire in the Equations

Carlos Ramirez · Jan 28, 2023, 9:16 PM
−16 points
4 comments · 3 min read · LW link

Optimality is the tiger, and annoying the user is its teeth

Christopher King · Jan 28, 2023, 8:20 PM
25 points
6 comments · 2 min read · LW link

On not getting contaminated by the wrong obesity ideas

Natália · Jan 28, 2023, 8:18 PM
306 points
69 comments · 30 min read · LW link

Advice I found helpful in 2022

Orpheus16 · Jan 28, 2023, 7:48 PM
36 points
5 comments · 2 min read · LW link

The Knockdown Argument Paradox

Bryan Frances · Jan 28, 2023, 7:23 PM
−12 points
6 comments · 8 min read · LW link

Less Wrong/ACX Budapest Feb 4th Meetup

Jan 28, 2023, 2:49 PM
2 points
0 comments · 1 min read · LW link

Reflections on Deception & Generality in Scalable Oversight (Another OpenAI Alignment Review)

Shoshannah Tekofsky · Jan 28, 2023, 5:26 AM
53 points
7 comments · 7 min read · LW link

A Simple Alignment Typology

Shoshannah Tekofsky · Jan 28, 2023, 5:26 AM
34 points
2 comments · 2 min read · LW link

Spooky action at a distance in the loss landscape

Jan 28, 2023, 12:22 AM
61 points
4 comments · 7 min read · LW link
(www.jessehoogland.com)

WaPo: “Big Tech was moving cautiously on AI. Then came ChatGPT.”

Julian Bradshaw · Jan 27, 2023, 10:54 PM
26 points
5 comments · 1 min read · LW link
(www.washingtonpost.com)

Literature review of TAI timelines

Jan 27, 2023, 8:07 PM
35 points
7 comments · 2 min read · LW link
(epochai.org)

Scaling Laws Literature Review

Pablo Villalobos · Jan 27, 2023, 7:57 PM
36 points
1 comment · 4 min read · LW link
(epochai.org)

The role of Bayesian ML in AI safety—an overview

Marius Hobbhahn · Jan 27, 2023, 7:40 PM
31 points
6 comments · 10 min read · LW link

Assigning Praise and Blame: Decoupling Epistemology and Decision Theory

Jan 27, 2023, 6:16 PM
59 points
5 comments · 3 min read · LW link

[Question] How could humans dominate over a super intelligent AI?

Marco Discendenti · Jan 27, 2023, 6:15 PM
−5 points
8 comments · 1 min read · LW link

ChatGPT understands language

philosophybear · Jan 27, 2023, 7:14 AM
27 points
4 comments · 6 min read · LW link
(philosophybear.substack.com)

Jar of Chocolate

jefftk · Jan 27, 2023, 3:40 AM
10 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Basics of Rationalist Discourse

Duncan Sabien (Inactive) · Jan 27, 2023, 2:40 AM
284 points
193 comments · 36 min read · LW link · 4 reviews

The recent banality of rationality (and effective altruism)

CraigMichael · Jan 27, 2023, 1:19 AM
−6 points
7 comments · 11 min read · LW link

11 heuristics for choosing (alignment) research projects

Jan 27, 2023, 12:36 AM
50 points
5 comments · 1 min read · LW link

A different observation of Vavilov Day

Elizabeth · Jan 26, 2023, 9:50 PM
30 points
1 comment · 1 min read · LW link
(acesounderglass.com)

All AGI Safety questions welcome (especially basic ones) [~monthly thread]

Jan 26, 2023, 9:01 PM
39 points
81 comments · 2 min read · LW link

Just another thought experiment

Bohdan Kudlai · Jan 26, 2023, 7:29 PM
−11 points
0 comments · 1 min read · LW link

Exquisite Oracle: A Dadaist-Inspired Literary Game for Many Friends (or 1 AI)

Yitz · Jan 26, 2023, 6:26 PM
6 points
1 comment · 1 min read · LW link

AI Risk Management Framework | NIST

DragonGod · Jan 26, 2023, 3:27 PM
36 points
4 comments · 2 min read · LW link
(www.nist.gov)

“How to Escape from the Simulation”—Seeds of Science call for reviewers

rogersbacon · Jan 26, 2023, 3:11 PM
12 points
0 comments · 1 min read · LW link

Loom: Why and How to use it

brook · Jan 26, 2023, 2:34 PM
2 points
5 comments · LW link

Covid 1/26/23: Case Count Crash

Zvi · Jan 26, 2023, 12:50 PM
32 points
5 comments · 9 min read · LW link
(thezvi.wordpress.com)

[Question] How are you currently modeling COVID contagiousness?

CounterBlunder · Jan 26, 2023, 4:46 AM
2 points
2 comments · 1 min read · LW link

[Question] What’s the simplest concrete unsolved problem in AI alignment?

agg · Jan 26, 2023, 4:15 AM
28 points
4 comments · 1 min read · LW link

2022 Less Wrong Census/Survey: Request for Comments

Screwtape · Jan 25, 2023, 8:57 PM
5 points
29 comments · 1 min read · LW link

Next steps after AGISF at UMich

JakubK · Jan 25, 2023, 8:57 PM
10 points
0 comments · 5 min read · LW link
(docs.google.com)

AGI will have learnt utility functions

beren · Jan 25, 2023, 7:42 PM
38 points
4 comments · 13 min read · LW link

[RFC] Possible ways to expand on “Discovering Latent Knowledge in Language Models Without Supervision”.

Jan 25, 2023, 7:03 PM
48 points
6 comments · 12 min read · LW link

Spreading messages to help with the most important century

HoldenKarnofsky · Jan 25, 2023, 6:20 PM
75 points
4 comments · 18 min read · LW link
(www.cold-takes.com)

My Model Of EA Burnout

LoganStrohl · Jan 25, 2023, 5:52 PM
259 points
50 comments · 5 min read · LW link · 1 review

Thoughts on the impact of RLHF research

paulfchristiano · Jan 25, 2023, 5:23 PM
253 points
102 comments · 9 min read · LW link

[Question] Could AI be used to engineer a sociopolitical situation where humans can solve the problems surrounding AGI?

hollowing · Jan 25, 2023, 5:17 PM
1 point
6 comments · 1 min read · LW link

Progress links and tweets, 2023-01-25

jasoncrawford · Jan 25, 2023, 4:12 PM
8 points
0 comments · 1 min read · LW link
(rootsofprogress.org)