AI #35: Responsible Scaling Policies

Zvi

Oct 26, 2023, 1:30 PM

66 points

10 comments · 55 min read · LW link

(thezvi.wordpress.com)

RA Bounty: Looking for feedback on screenplay about AI Risk

Writer

Oct 26, 2023, 1:23 PM

32 points

6 comments · 1 min read · LW link

Sensor Exposure can Compromise the Human Brain in the 2020s

trevor

Oct 26, 2023, 3:31 AM

17 points

6 comments · 10 min read · LW link

Notes on “How do we become confident in the safety of a machine learning system?”

RohanS

Oct 26, 2023, 3:13 AM

4 points

0 comments · 13 min read · LW link

Apply to the Constellation Visiting Researcher Program and Astra Fellowship, in Berkeley this Winter

Nate Thomas

Oct 26, 2023, 3:07 AM

42 points

10 comments · 1 min read · LW link

CHAI internship applications are open (due Nov 13)

Erik Jenner

Oct 26, 2023, 12:53 AM

34 points

0 comments · 3 min read · LW link

Architects of Our Own Demise: We Should Stop Developing AI Carelessly

Roko

Oct 26, 2023, 12:36 AM

170 points

75 comments · 3 min read · LW link

EA Infrastructure Fund: June 2023 grant recommendations

Linch

Oct 26, 2023, 12:35 AM

21 points

0 comments · 12 min read · LW link

Responsible Scaling Policies Are Risk Management Done Wrong

simeon_c

Oct 25, 2023, 11:46 PM

123 points

35 comments · 22 min read · LW link · 1 review

(www.navigatingrisks.ai)

AI as a science, and three obstacles to alignment strategies

So8res

Oct 25, 2023, 9:00 PM

193 points

80 comments · 11 min read · LW link

My hopes for alignment: Singular learning theory and whole brain emulation

Garrett Baker

Oct 25, 2023, 6:31 PM

61 points

5 comments · 12 min read · LW link

[Question] Lying to chess players for alignment

Zane

Oct 25, 2023, 5:47 PM

99 points

54 comments · 1 min read · LW link

Anthropic, Google, Microsoft & OpenAI announce Executive Director of the Frontier Model Forum & over $10 million for a new AI Safety Fund

Zach Stein-Perlman

Oct 25, 2023, 3:20 PM

31 points

8 comments · 4 min read · LW link

(www.frontiermodelforum.org)

“The Economics of Time Travel”—call for reviewers (Seeds of Science)

rogersbacon

Oct 25, 2023, 3:13 PM

4 points

2 comments · 1 min read · LW link

Compositional preference models for aligning LMs

Tomek Korbak

Oct 25, 2023, 12:17 PM

18 points

2 comments · 5 min read · LW link

[Question] Should the US House of Representatives adopt rank choice voting for leadership positions?

jmh

Oct 25, 2023, 11:16 AM

16 points

6 comments · 1 min read · LW link

Researchers believe they have found a way for artists to fight back against AI style capture

vernamcipher

Oct 25, 2023, 10:54 AM

3 points

1 comment · 1 min read · LW link

(finance.yahoo.com)

Why We Disagree

zulupineapple

Oct 25, 2023, 10:50 AM

7 points

2 comments · 2 min read · LW link

Beyond the Data: Why aid to poor doesn’t work

Lyrongolem

Oct 25, 2023, 5:03 AM

2 points

31 comments · 12 min read · LW link

Announcing Epoch’s newly expanded Parameters, Compute and Data Trends in Machine Learning database

Robi Rahman, Jaime Sevilla Molina, Tamay, Ege Erdil, Pablo Villalobos, Ben Cottier and Matthew Barnett

Oct 25, 2023, 2:55 AM

18 points

0 comments · 1 min read · LW link

(epochai.org)

What is a Sequencing Read?

jefftk

Oct 25, 2023, 2:10 AM

17 points

2 comments · 2 min read · LW link

(www.jefftk.com)

Verifiable private execution of machine learning models with Risc0?

mako yass

Oct 25, 2023, 12:44 AM

30 points

2 comments · 2 min read · LW link

[Question] How to Resolve Forecasts With No Central Authority?

Nathan Young

Oct 25, 2023, 12:28 AM

17 points

6 comments · 1 min read · LW link

Thoughts on responsible scaling policies and regulation

paulfchristiano

Oct 24, 2023, 10:21 PM

221 points

33 comments · 6 min read · LW link

The Screenplay Method

Yeshua God

Oct 24, 2023, 5:41 PM

−15 points

0 comments · 25 min read · LW link

Blunt Razor

fryolysis

Oct 24, 2023, 5:27 PM

3 points

0 comments · 2 min read · LW link

Halloween Problem

Saint Blasphemer

Oct 24, 2023, 4:46 PM

−10 points

1 comment · 1 min read · LW link

Who is Harry Potter? Some predictions.

Donald Hobson

Oct 24, 2023, 4:14 PM

23 points

7 comments · 2 min read · LW link

Book Review: Going Infinite

Zvi

Oct 24, 2023, 3:00 PM

246 points

113 comments · 97 min read · LW link · 1 review

(thezvi.wordpress.com)

[Interview w/ Quintin Pope] Evolution, values, and AI Safety

fowlertm

Oct 24, 2023, 1:53 PM

11 points

0 comments · 1 min read · LW link

Lying is Cowardice, not Strategy

Connor Leahy and Gabriel Alfour

Oct 24, 2023, 1:24 PM

29 points

73 comments · 5 min read · LW link

(cognition.cafe)

[Question] Anyone Else Using Brilliant?

Sable

Oct 24, 2023, 12:12 PM

19 points

0 comments · 1 min read · LW link

Announcing #AISummitTalks featuring Professor Stuart Russell and many others

otto.barten

Oct 24, 2023, 10:11 AM

17 points

1 comment · 1 min read · LW link

Linkpost: A Post Mortem on the Gino Case

Linch

Oct 24, 2023, 6:50 AM

89 points

7 comments · 2 min read · LW link

(www.theorgplumber.com)

South Bay SSC Meetup, San Jose, November 5th.

David Friedman

Oct 24, 2023, 4:50 AM

2 points

1 comment · 1 min read · LW link

AI Pause Will Likely Backfire (Guest Post)

jsteinhardt

Oct 24, 2023, 4:30 AM

47 points

6 comments · 15 min read · LW link

(bounded-regret.ghost.io)

Human wanting

TsviBT

Oct 24, 2023, 1:05 AM

53 points

1 comment · 10 min read · LW link

Towards Understanding Sycophancy in Language Models

Ethan Perez, mrinank_sharma, Meg and Tomek Korbak

Oct 24, 2023, 12:30 AM

66 points

0 comments · 2 min read · LW link

(arxiv.org)

Manifold Halloween Hackathon

Austin Chen

Oct 23, 2023, 10:47 PM

8 points

0 comments · 1 min read · LW link

Open Source Replication & Commentary on Anthropic’s Dictionary Learning Paper

Neel Nanda

Oct 23, 2023, 10:38 PM

93 points

12 comments · 9 min read · LW link

The Shutdown Problem: An AI Engineering Puzzle for Decision Theorists

EJT

Oct 23, 2023, 9:00 PM

79 points

22 comments · 39 min read · LW link

(philpapers.org)

AI Alignment [Incremental Progress Units] this Week (10/22/23)

Logan Zoellner

Oct 23, 2023, 8:32 PM

22 points

0 comments · 6 min read · LW link

(midwitalignment.substack.com)

z is not the cause of x

hrbigelow

Oct 23, 2023, 5:43 PM

6 points

2 comments · 9 min read · LW link

Some of my predictable updates on AI

Aaron_Scher

Oct 23, 2023, 5:24 PM

32 points

8 comments · 9 min read · LW link

Programmatic backdoors: DNNs can use SGD to run arbitrary stateful computation

Fabien Roger and Buck

Oct 23, 2023, 4:37 PM

107 points

3 comments · 8 min read · LW link

Machine Unlearning Evaluations as Interpretability Benchmarks

NickyP and Nandi

Oct 23, 2023, 4:33 PM

33 points

2 comments · 11 min read · LW link

VLM-RM: Specifying Rewards with Natural Language

ChengCheng, David Lindner and Ethan Perez

Oct 23, 2023, 2:11 PM

20 points

2 comments · 5 min read · LW link

(far.ai)

Contra Dance Dialect Survey

jefftk

Oct 23, 2023, 1:40 PM

11 points

0 comments · 1 min read · LW link

(www.jefftk.com)

[Question] Which LessWrongers are (aspiring) YouTubers?

Mati_Roy

Oct 23, 2023, 1:21 PM

22 points

13 comments · 1 min read · LW link

[Question] What is an “anti-Occamian prior”?

Zane

Oct 23, 2023, 2:26 AM

35 points

22 comments · 1 min read · LW link
