All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 567 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Singular learning theory and bridging from ML to brain emulations

kave and Garrett Baker

Nov 1, 2023, 9:31 PM

26 points

16 comments29 min readLW link

My thoughts on the social response to AI risk

Matthew BarnettNov 1, 2023, 9:17 PM

157 points

37 comments10 min readLW link

Reactions to the Executive Order

ZviNov 1, 2023, 8:40 PM

77 points

4 comments29 min readLW link

(thezvi.wordpress.com)

Dario Amodei’s prepared remarks from the UK AI Safety Summit, on Anthropic’s Responsible Scaling Policy

Zac Hatfield-DoddsNov 1, 2023, 6:10 PM

83 points

1 comment4 min readLW link

(www.anthropic.com)

Book Review: Determined by Sapolsky

Kailuo WangNov 1, 2023, 5:37 PM

1 point

0 comments7 min readLW link

AI Alignment: A Comprehensive Survey

Stephen McAleerNov 1, 2023, 5:35 PM

20 points

1 comment1 min readLW link

(arxiv.org)

A list of all the deadlines in Biden’s Executive Order on AI

Valentin BaltadzhievNov 1, 2023, 5:14 PM

26 points

2 comments11 min readLW link

2023 LessWrong Community Census, Request for Comments

ScrewtapeNov 1, 2023, 4:32 PM

43 points

37 comments2 min readLW link

[Question] Snapshot of narratives and frames against regulating AI

Jan_KulveitNov 1, 2023, 4:30 PM

36 points

19 comments3 min readLW link

Commensal Institutions

SableNov 1, 2023, 4:01 PM

8 points

12 comments4 min readLW link

(affablyevil.substack.com)

ChatGPT’s Ontological Landscape

Bill BenzonNov 1, 2023, 3:12 PM

7 points

0 comments4 min readLW link

On the Executive Order

ZviNov 1, 2023, 2:20 PM

100 points

4 comments30 min readLW link

(thezvi.wordpress.com)

Chinese scientists acknowledge xrisk & call for international regulatory body [Linkpost]

Orpheus16Nov 1, 2023, 1:28 PM

44 points

4 comments1 min readLW link

(www.ft.com)

[Question] Forecasting Questions: What do you want to predict on AI?

Nathan YoungNov 1, 2023, 1:17 PM

7 points

2 comments1 min readLW link

Mission Impossible: Dead Reckoning Part 1 AI Takeaways

ZviNov 1, 2023, 12:52 PM

47 points

13 comments6 min readLW link

Robustness of Contrast-Consistent Search to Adversarial Prompting

Nandi, i, Jamie Wright, Seamus_F and hugofry

Nov 1, 2023, 12:46 PM

18 points

1 comment7 min readLW link

The Bletchley Declaration on AI Safety

Hauke HillebrandtNov 1, 2023, 11:44 AM

17 points

0 comments LW link

(www.gov.uk)

Bay Winter Solstice 2023: Song & speech auditions

tcheasdfjklNov 1, 2023, 4:17 AM

17 points

2 comments1 min readLW link

On Having No Clue

Chris_LeongNov 1, 2023, 1:36 AM

20 points

11 comments1 min readLW link

Balancing Security Mindset with Collaborative Research: A Proposal

MadHatterNov 1, 2023, 12:46 AM

9 points

3 comments4 min readLW link

Computational Approaches to Pathogen Detection

jefftkNov 1, 2023, 12:30 AM

32 points

5 comments5 min readLW link

(www.jefftk.com)

Thoughts on the AI Safety Summit company policy requests and responses

So8resOct 31, 2023, 11:54 PM

169 points

14 comments10 min readLW link

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks

Dan HOct 31, 2023, 7:34 PM

35 points

1 comment6 min readLW link

(newsletter.safe.ai)

If AIs become self-aware, what religion will they have?

mnvrOct 31, 2023, 5:29 PM

−17 points

3 comments4 min readLW link

Self-Blinded L-Theanine RCT

niplavOct 31, 2023, 3:24 PM

53 points

12 comments3 min readLW link

AI Safety 101 - Chapter 5.2 - Unrestricted Adversarial Training

Charbel-RaphaëlOct 31, 2023, 2:34 PM

17 points

0 comments19 min readLW link

Preventing Language Models from hiding their reasoning

Fabien Roger and ryan_greenblatt

Oct 31, 2023, 2:34 PM

119 points

15 comments12 min readLW link 1 review

AI Safety 101 - Chapter 5.1 - Debate

Charbel-RaphaëlOct 31, 2023, 2:29 PM

15 points

0 comments13 min readLW link

M&A in AI

Hauke HillebrandtOct 31, 2023, 12:20 PM

2 points

0 comments LW link

Urging an International AI Treaty: An Open Letter

Olli JärviniemiOct 31, 2023, 11:26 AM

48 points

2 comments1 min readLW link

(aitreaty.org)

[Closed] Agent Foundations track in MATS

Vanessa KosoyOct 31, 2023, 8:12 AM

54 points

1 comment1 min readLW link

(www.matsprogram.org)

Intrinsic Drives and Extrinsic Misuse: Two Intertwined Risks of AI

jsteinhardtOct 31, 2023, 5:10 AM

40 points

0 comments12 min readLW link

(bounded-regret.ghost.io)

Focus on existential risk is a distraction from the real issues. A false fallacy

Nik SamoylovOct 30, 2023, 11:42 PM

−19 points

11 comments2 min readLW link

Will releasing the weights of large language models grant widespread access to pandemic agents?

jefftkOct 30, 2023, 6:22 PM

47 points

25 comments LW link

(arxiv.org)

[Linkpost] Two major announcements in AI governance today

AngélinaOct 30, 2023, 5:28 PM

1 point

1 comment1 min readLW link

(www.whitehouse.gov)

Grokking Beyond Neural Networks

Jack MillerOct 30, 2023, 5:28 PM

10 points

0 comments2 min readLW link

(arxiv.org)

Response to “Coordinated pausing: An evaluation-based coordination scheme for frontier AI developers”

Matthew WeardenOct 30, 2023, 5:27 PM

5 points

2 comments6 min readLW link

(matthewwearden.co.uk)

Jailbreak and Guard Aligned Language Models with Only Few In-Context Demonstrations

Zeming WeiOct 30, 2023, 5:22 PM

3 points

1 comment1 min readLW link

5 Reasons Why Governments/Militaries Already Want AI for Information Warfare

trevorOct 30, 2023, 4:30 PM

32 points

0 comments10 min readLW link

[Linkpost] Biden-Harris Executive Order on AI

berenOct 30, 2023, 3:20 PM

3 points

0 comments1 min readLW link

AI Alignment [progress] this Week (10/29/2023)

Logan ZoellnerOct 30, 2023, 3:02 PM

15 points

4 comments6 min readLW link

(midwitalignment.substack.com)

Improving the Welfare of AIs: A Nearcasted Proposal

ryan_greenblattOct 30, 2023, 2:51 PM

114 points

9 comments20 min readLW link 1 review

President Biden Issues Executive Order on Safe, Secure, and Trustworthy Artificial Intelligence

Tristan WilliamsOct 30, 2023, 11:15 AM

171 points

39 comments LW link

(www.whitehouse.gov)

GPT-2 XL’s capacity for coherence and ontology clustering

MiguelDevOct 30, 2023, 9:24 AM

6 points

2 comments41 min readLW link

Charbel-Raphaël and Lucius discuss interpretability

Mateusz Bagiński, Charbel-Raphaël and Lucius Bushnaq

Oct 30, 2023, 5:50 AM

111 points

7 comments21 min readLW link

Multi-Winner 3-2-1 Voting

Yoav RavidOct 30, 2023, 3:31 AM

14 points

6 comments3 min readLW link

math terminology as convolution

bhauthOct 30, 2023, 1:05 AM

34 points

1 comment4 min readLW link

(www.bhauth.com)

Grokking, memorization, and generalization — a discussion

Kaarel and Dmitry Vaintrob

Oct 29, 2023, 11:17 PM

75 points

11 comments23 min readLW link

Comp Sci in 2027 (Short story by Eliezer Yudkowsky)

sudoOct 29, 2023, 11:09 PM

203 points

24 comments10 min readLW link 1 review

(nitter.net)

Mathematically-Defined Optimization Captures A Lot of Useful Information

J BostockOct 29, 2023, 5:17 PM

19 points

0 comments2 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer