All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

On value in humans, other animals, and AI

Michele CampoloJan 31, 2023, 11:33 PM

3 points

17 comments5 min readLW link

Criticism of the main framework in AI alignment

Michele CampoloJan 31, 2023, 11:01 PM

19 points

2 comments6 min readLW link

Nice Clothes are Good, Actually

Gordon Seidoh WorleyJan 31, 2023, 7:22 PM

72 points

28 comments4 min readLW link

[Linkpost] Human-narrated audio version of “Is Power-Seeking AI an Existential Risk?”

Joe CarlsmithJan 31, 2023, 7:21 PM

12 points

1 comment1 min readLW link

No Really, Attention is ALL You Need—Attention can do feedforward networks

Robert_AIZIJan 31, 2023, 6:48 PM

29 points

7 comments6 min readLW link

(aizi.substack.com)

Talk to me about your summer/career plans

Orpheus16Jan 31, 2023, 6:29 PM

31 points

3 comments2 min readLW link

Mechanistic Interpretability Quickstart Guide

Neel NandaJan 31, 2023, 4:35 PM

42 points

3 comments6 min readLW link

(www.neelnanda.io)

New Hackathon: Robustness to distribution changes and ambiguity

Charbel-RaphaëlJan 31, 2023, 12:50 PM

12 points

3 comments1 min readLW link

Squiggle: Why and how to use it

brookJan 31, 2023, 12:37 PM

3 points

0 comments LW link

Beware of Fake Alternatives

silentbobJan 31, 2023, 10:21 AM

57 points

11 comments4 min readLW link 1 review

Inner Misalignment in “Simulator” LLMs

Adam ScherlisJan 31, 2023, 8:33 AM

84 points

12 comments4 min readLW link

Why AI experts’ jobs are always decades from being automated

Allen HoskinsJan 31, 2023, 3:01 AM

0 points

1 comment5 min readLW link

(open.substack.com)

Apply to HAIST/MAIA’s AI Governance Workshop in DC (Feb 17-20)

Phosphorous, Xander Davies, CMD, Paramedic and tlevin

Jan 31, 2023, 2:06 AM

28 points

0 comments2 min readLW link

EA & LW Forum Weekly Summary (23rd − 29th Jan ’23)

Zoe WilliamsJan 31, 2023, 12:36 AM

12 points

0 comments LW link

Saying things because they sound good

Adam ZernerJan 31, 2023, 12:17 AM

23 points

6 comments2 min readLW link

South Bay Meetup

DavidFriedmanJan 30, 2023, 11:35 PM

2 points

0 comments1 min readLW link

Peter Thiel’s speech at Oxford Debating Union on technological stagnation, Nuclear weapons, COVID, Environment, Alignment, ‘anti-anti anti-anti-classical liberalism’, Bostrom, LW, etc.

M. Y. ZuoJan 30, 2023, 11:31 PM

8 points

33 comments1 min readLW link

Medical Image Registration: The obscure field where Deep Mesaoptimizers are already at the top of the benchmarks. (post + colab notebook)

HastingsJan 30, 2023, 10:46 PM

35 points

1 comment3 min readLW link

Humans Can Be Manually Strategic

ScrewtapeJan 30, 2023, 10:35 PM

13 points

0 comments3 min readLW link

Why I hate the “accident vs. misuse” AI x-risk dichotomy (quick thoughts on “structural risk”)

David Scott Krueger (formerly: capybaralet)Jan 30, 2023, 6:50 PM

34 points

41 comments2 min readLW link

2022 Unofficial LessWrong General Census

ScrewtapeJan 30, 2023, 6:36 PM

97 points

33 comments2 min readLW link

Call for submissions: “(In)human Values and Artificial Agency”, ALIFE 2023

the gears to ascensionJan 30, 2023, 5:37 PM

29 points

4 comments1 min readLW link

(humanvaluesandartificialagency.com)

What I mean by “alignment is in large part about making cognition aimable at all”

So8resJan 30, 2023, 3:22 PM

171 points

25 comments2 min readLW link

The Energy Requirements and Feasibility of Off-World Mining

clansJan 30, 2023, 3:07 PM

31 points

1 comment8 min readLW link

(locationtbd.home.blog)

Whatever their arguments, Covid vaccine sceptics will probably never convince me

contrarianbritJan 30, 2023, 1:42 PM

8 points

10 comments3 min readLW link

(thomasprosser.substack.com)

Simulacra Levels Summary

ZviJan 30, 2023, 1:40 PM

77 points

14 comments7 min readLW link

(thezvi.wordpress.com)

A Few Principles of Successful AI Design

VestoziaJan 30, 2023, 10:42 AM

1 point

0 comments8 min readLW link

Against Boltzmann mesaoptimizers

porbyJan 30, 2023, 2:55 AM

77 points

6 comments4 min readLW link

How Likely is Losing a Google Account?

jefftkJan 30, 2023, 12:20 AM

52 points

12 comments3 min readLW link

(www.jefftk.com)

Model-driven feedback could amplify alignment failures

aogJan 30, 2023, 12:00 AM

21 points

1 comment2 min readLW link

Takeaways from calibration training

Olli JärviniemiJan 29, 2023, 7:09 PM

45 points

2 comments3 min readLW link 1 review

Structure, creativity, and novelty

TsviBTJan 29, 2023, 2:30 PM

19 points

4 comments7 min readLW link

What is the ground reality of countries taking steps to recalibrate AI development towards Alignment first?

NebuchJan 29, 2023, 1:26 PM

8 points

6 comments3 min readLW link

Compendium of problems with RLHF

Charbel-RaphaëlJan 29, 2023, 11:40 AM

120 points

16 comments10 min readLW link

My biggest takeaway from Redwood Research REMIX

Alok SinghJan 29, 2023, 11:00 AM

0 points

0 comments1 min readLW link

(alok.github.io)

EA novel published on Amazon

Timothy UnderwoodJan 29, 2023, 8:33 AM

17 points

0 comments LW link

Reverse RSS Stats

jefftkJan 29, 2023, 3:40 AM

12 points

2 comments1 min readLW link

(www.jefftk.com)

Why and How to Graduate Early [U.S.]

TegoJan 29, 2023, 1:28 AM

53 points

9 comments8 min readLW link 1 review

Stop-gradients lead to fixed point predictions

Johannes Treutlein, Caspar Oesterheld, Rubi J. Hudson and Emery Cooper

Jan 28, 2023, 10:47 PM

37 points

2 comments24 min readLW link

Eli Dourado AMA on the Progress Forum

jasoncrawfordJan 28, 2023, 10:18 PM

19 points

0 comments1 min readLW link

(rootsofprogress.org)

LW Filter Tags (Rationality/World Modeling now promoted in Latest Posts)

Ruby and RobertM

Jan 28, 2023, 10:14 PM

60 points

4 comments3 min readLW link

No Fire in the Equations

Carlos RamirezJan 28, 2023, 9:16 PM

−16 points

4 comments3 min readLW link

Optimality is the tiger, and annoying the user is its teeth

Christopher KingJan 28, 2023, 8:20 PM

25 points

6 comments2 min readLW link

On not getting contaminated by the wrong obesity ideas

NatáliaJan 28, 2023, 8:18 PM

306 points

69 comments30 min readLW link

Advice I found helpful in 2022

Orpheus16Jan 28, 2023, 7:48 PM

36 points

5 comments2 min readLW link

The Knockdown Argument Paradox

Bryan FrancesJan 28, 2023, 7:23 PM

−12 points

6 comments8 min readLW link

Less Wrong/ACX Budapest Feb 4th Meetup

Richard Horvath and Timothy Underwood

Jan 28, 2023, 2:49 PM

2 points

0 comments1 min readLW link

Reflections on Deception & Generality in Scalable Oversight (Another OpenAI Alignment Review)

Shoshannah TekofskyJan 28, 2023, 5:26 AM

53 points

7 comments7 min readLW link

A Simple Alignment Typology

Shoshannah TekofskyJan 28, 2023, 5:26 AM

34 points

2 comments2 min readLW link

Spooky action at a distance in the loss landscape

Jesse Hoogland and Filip Sondej

Jan 28, 2023, 12:22 AM

61 points

4 comments7 min readLW link

(www.jessehoogland.com)

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer