All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 234 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

[Question] What are your favorite posts, podcast episodes, and recorded talks, on AI timelines, or factors that would influence AI timelines?

nonzerosum2 Nov 2023 22:42 UTC

2 points

0 comments1 min readLW link

One Day Sooner

Screwtape2 Nov 2023 19:00 UTC

137 points

8 comments8 min readLW link 1 review

Propaganda or Science: A Look at Open Source AI and Bioterrorism Risk

1a3orn2 Nov 2023 18:20 UTC

194 points

79 comments23 min readLW link

AI #36: In the Background

Zvi2 Nov 2023 18:00 UTC

45 points

5 comments37 min readLW link

(thezvi.wordpress.com)

Doubt Certainty

RationalDino2 Nov 2023 17:43 UTC

4 points

13 comments3 min readLW link

Saying the quiet part out loud: trading off x-risk for personal immortality

disturbance2 Nov 2023 17:43 UTC

84 points

89 comments5 min readLW link

Mech Interp Challenge: November—Deciphering the Cumulative Sum Model

CallumMcDougall2 Nov 2023 17:10 UTC

18 points

2 comments2 min readLW link

Estimating effective dimensionality of MNIST models

Arjun Panickssery2 Nov 2023 14:13 UTC

41 points

3 comments1 min readLW link

Averages and sample sizes

mruwnik2 Nov 2023 9:52 UTC

15 points

2 comments8 min readLW link

ACX/LW/EA crossover meetup

RasmusHB2 Nov 2023 5:57 UTC

2 points

0 comments1 min readLW link

Upcoming Feedback Opportunity on Dual-Use Foundation Models

Chris_Leong2 Nov 2023 4:28 UTC

3 points

0 comments1 min readLW link

Public Weights?

jefftk2 Nov 2023 2:50 UTC

49 points

19 comments3 min readLW link

(www.jefftk.com)

[Question] Should people build productizations of open source AI models?

lc2 Nov 2023 1:26 UTC

23 points

0 comments1 min readLW link

Singular learning theory and bridging from ML to brain emulations

kave and Garrett Baker

1 Nov 2023 21:31 UTC

26 points

16 comments29 min readLW link

My thoughts on the social response to AI risk

Matthew Barnett1 Nov 2023 21:17 UTC

146 points

37 comments10 min readLW link

Reactions to the Executive Order

Zvi1 Nov 2023 20:40 UTC

77 points

4 comments29 min readLW link

(thezvi.wordpress.com)

Dario Amodei’s prepared remarks from the UK AI Safety Summit, on Anthropic’s Responsible Scaling Policy

Zac Hatfield-Dodds1 Nov 2023 18:10 UTC

85 points

1 comment4 min readLW link

(www.anthropic.com)

Book Review: Determined by Sapolsky

Kailuo Wang1 Nov 2023 17:37 UTC

1 point

0 comments7 min readLW link

AI Alignment: A Comprehensive Survey

Stephen McAleer1 Nov 2023 17:35 UTC

22 points

1 comment1 min readLW link

(arxiv.org)

A list of all the deadlines in Biden’s Executive Order on AI

Valentin Baltadzhiev1 Nov 2023 17:14 UTC

26 points

2 comments11 min readLW link

2023 LessWrong Community Census, Request for Comments

Screwtape1 Nov 2023 16:32 UTC

43 points

37 comments2 min readLW link

[Question] Snapshot of narratives and frames against regulating AI

Jan_Kulveit1 Nov 2023 16:30 UTC

36 points

19 comments3 min readLW link

Commensal Institutions

Sable1 Nov 2023 16:01 UTC

8 points

12 comments4 min readLW link

(affablyevil.substack.com)

ChatGPT’s Ontological Landscape

Bill Benzon1 Nov 2023 15:12 UTC

7 points

0 comments4 min readLW link

On the Executive Order

Zvi1 Nov 2023 14:20 UTC

100 points

4 comments30 min readLW link

(thezvi.wordpress.com)

Chinese scientists acknowledge xrisk & call for international regulatory body [Linkpost]

Orpheus161 Nov 2023 13:28 UTC

44 points

4 comments1 min readLW link

(www.ft.com)

[Question] Forecasting Questions: What do you want to predict on AI?

Nathan Young1 Nov 2023 13:17 UTC

7 points

2 comments1 min readLW link

Mission Impossible: Dead Reckoning Part 1 AI Takeaways

Zvi1 Nov 2023 12:52 UTC

47 points

13 comments6 min readLW link

Robustness of Contrast-Consistent Search to Adversarial Prompting

Nandi, i, Jamie Wright, Seamus_F and hugofry

1 Nov 2023 12:46 UTC

18 points

1 comment7 min readLW link

The Bletchley Declaration on AI Safety

Hauke Hillebrandt1 Nov 2023 11:44 UTC

17 points

0 comments4 min readLW link

(www.gov.uk)

Bay Winter Solstice 2023: Song & speech auditions

tcheasdfjkl1 Nov 2023 4:17 UTC

17 points

2 comments1 min readLW link

On Having No Clue

Chris_Leong1 Nov 2023 1:36 UTC

20 points

11 comments1 min readLW link

Balancing Security Mindset with Collaborative Research: A Proposal

MadHatter1 Nov 2023 0:46 UTC

9 points

3 comments4 min readLW link

Computational Approaches to Pathogen Detection

jefftk1 Nov 2023 0:30 UTC

32 points

5 comments5 min readLW link

(www.jefftk.com)

Thoughts on the AI Safety Summit company policy requests and responses

So8res31 Oct 2023 23:54 UTC

169 points

14 comments10 min readLW link

AISN #25: White House Executive Order on AI, UK AI Safety Summit, and Progress on Voluntary Evaluations of AI Risks

Dan H31 Oct 2023 19:34 UTC

35 points

1 comment6 min readLW link

(newsletter.safe.ai)

If AIs become self-aware, what religion will they have?

mnvr31 Oct 2023 17:29 UTC

−17 points

3 comments4 min readLW link

Self-Blinded L-Theanine RCT

niplav31 Oct 2023 15:24 UTC

53 points

12 comments3 min readLW link

AI Safety 101 - Chapter 5.2 - Unrestricted Adversarial Training

Charbel-Raphaël31 Oct 2023 14:34 UTC

17 points

0 comments19 min readLW link

Preventing Language Models from hiding their reasoning

Fabien Roger and ryan_greenblatt

31 Oct 2023 14:34 UTC

121 points

15 comments12 min readLW link 1 review

AI Safety 101 - Chapter 5.1 - Debate

Charbel-Raphaël31 Oct 2023 14:29 UTC

15 points

0 comments13 min readLW link

M&A in AI

Hauke Hillebrandt31 Oct 2023 12:20 UTC

2 points

0 comments6 min readLW link

Urging an International AI Treaty: An Open Letter

Olli Järviniemi31 Oct 2023 11:26 UTC

48 points

2 comments1 min readLW link

(aitreaty.org)

[Closed] Agent Foundations track in MATS

Vanessa Kosoy31 Oct 2023 8:12 UTC

54 points

1 comment1 min readLW link

(www.matsprogram.org)

Intrinsic Drives and Extrinsic Misuse: Two Intertwined Risks of AI

jsteinhardt31 Oct 2023 5:10 UTC

40 points

0 comments12 min readLW link

(bounded-regret.ghost.io)

Focus on existential risk is a distraction from the real issues. A false fallacy

Nik Samoylov30 Oct 2023 23:42 UTC

−19 points

11 comments2 min readLW link

Will releasing the weights of large language models grant widespread access to pandemic agents?

jefftk30 Oct 2023 18:22 UTC

47 points

25 comments1 min readLW link

(arxiv.org)

[Linkpost] Two major announcements in AI governance today

Angélina30 Oct 2023 17:28 UTC

1 point

1 comment1 min readLW link

(www.whitehouse.gov)

Grokking Beyond Neural Networks

Jack Miller30 Oct 2023 17:28 UTC

10 points

0 comments2 min readLW link

(arxiv.org)

Response to “Coordinated pausing: An evaluation-based coordination scheme for frontier AI developers”

Matthew Wearden30 Oct 2023 17:27 UTC

5 points

2 comments6 min readLW link

(matthewwearden.co.uk)