Page 3
Leveling Up Or Leveling Off? Understanding The Science Behind Skill Plateaus · lynettebye · Jun 16, 2023, 12:18 AM · 45 points · 9 comments · 18 min read · LW link
If you are too stressed, walk away from the front lines · Neil · Jun 12, 2023, 2:26 PM · 44 points · 14 comments · 5 min read · LW link
How tall is the Shard, really? · philh · Jun 23, 2023, 8:10 AM · 44 points · 10 comments · 9 min read · LW link · (reasonableapproximation.net)
A summary of current work in AI governance · constructive · Jun 17, 2023, 6:41 PM · 44 points · 1 comment · 11 min read · LW link · (forum.effectivealtruism.org)
resolving some neural network mysteries · bhauth · Jun 19, 2023, 12:09 AM · 44 points · 6 comments · 2 min read · LW link · (www.bhauth.com)
On the Apple Vision Pro · Zvi · Jun 14, 2023, 5:50 PM · 44 points · 17 comments · 11 min read · LW link · (thezvi.wordpress.com)
Anthropically Blind: the anthropic shadow is reflectively inconsistent · Christopher King · Jun 29, 2023, 2:36 AM · 43 points · 40 comments · 10 min read · LW link
One implementation of regulatory GPU restrictions · porby · Jun 4, 2023, 8:34 PM · 42 points · 6 comments · 5 min read · LW link
The (local) unit of intelligence is FLOPs · boazbarak · Jun 5, 2023, 6:23 PM · 42 points · 7 comments · 5 min read · LW link
Unfaithful Explanations in Chain-of-Thought Prompting · Miles Turpin · Jun 3, 2023, 12:22 AM · 42 points · 8 comments · 7 min read · LW link
Cryonics Career Survey (more jobs than you think) · Mati_Roy · Jun 18, 2023, 2:13 AM · 41 points · 1 comment · 2 min read · LW link
Nature: “Stop talking about tomorrow’s AI doomsday when AI poses risks today” · Ben Smith · Jun 28, 2023, 5:59 AM · 40 points · 8 comments · 2 min read · LW link · (www.nature.com)
Dreams of “Mathopedia” · Nicholas / Heather Kross · Jun 2, 2023, 1:30 AM · 40 points · 16 comments · 2 min read · LW link · (www.thinkingmuchbetter.com)
Catastrophic Risks from AI #1: Introduction · Dan H, Mantas Mazeika and TW123 · Jun 22, 2023, 5:09 PM · 40 points · 1 comment · 5 min read · LW link · (arxiv.org)
AI-Plans.com—a contributable compendium · Iknownothing · Jun 25, 2023, 2:40 PM · 39 points · 7 comments · 4 min read · LW link · (ai-plans.com)
[Question] What money-pumps exist, if any, for deontologists? · Daniel Kokotajlo · Jun 28, 2023, 7:08 PM · 39 points · 35 comments · 1 min read · LW link
Bengio’s FAQ on Catastrophic AI Risks · Vaniver · Jun 29, 2023, 11:04 PM · 39 points · 0 comments · 1 min read · LW link · (yoshuabengio.org)
AISC team report: Soft-optimization, Bayes and Goodhart · Simon Fischer, benjaminko, jazcarretao, DFNaiff and Jeremy Gillen · Jun 27, 2023, 6:05 AM · 38 points · 2 comments · 15 min read · LW link
Metaphors for AI, and why I don’t like them · boazbarak · Jun 28, 2023, 10:47 PM · 38 points · 18 comments · 12 min read · LW link
Catastrophic Risks from AI #2: Malicious Use · Dan H, Mantas Mazeika and TW123 · Jun 22, 2023, 5:10 PM · 38 points · 1 comment · 17 min read · LW link · (arxiv.org)
Correctly Calibrated Trust · habryka · Jun 24, 2023, 7:48 PM · 38 points · 3 comments · 11 min read · LW link · (forum.effectivealtruism.org)
Solomonoff induction still works if the universe is uncomputable, and its usefulness doesn’t require knowing Occam’s razor · Christopher King · Jun 18, 2023, 1:52 AM · 38 points · 28 comments · 4 min read · LW link
The Sharp Right Turn: sudden deceptive alignment as a convergent goal · avturchin · Jun 6, 2023, 9:59 AM · 38 points · 5 comments · 1 min read · LW link
Wildfire of strategicness · TsviBT · Jun 5, 2023, 1:59 PM · 38 points · 19 comments · 1 min read · LW link
Why I am not a longtermist (May 2022) · boazbarak · Jun 6, 2023, 8:36 PM · 38 points · 19 comments · 9 min read · LW link · (windowsontheory.org)
Aura as a proprioceptive glitch · pchvykov · Jun 12, 2023, 7:30 PM · 37 points · 4 comments · 4 min read · LW link
<$750k grants for General Purpose AI Assurance/Safety Research · Phosphorous · Jun 13, 2023, 4:45 AM · 37 points · 1 comment · 1 min read · LW link · (cset.georgetown.edu)
Society Library seeking contributions for canonical AI Safety debate map · Jarred Filmer · Jun 6, 2023, 6:15 PM · 36 points · 0 comments · 1 min read · LW link · (www.societylibrary.org)
Why libertarians are advocating for regulation on AI · RobertM · Jun 14, 2023, 8:59 PM · 36 points · 13 comments · 4 min read · LW link
[Linkpost] Large Language Models Converge on Brain-Like Word Representations · Bogdan Ionut Cirstea · Jun 11, 2023, 11:20 AM · 36 points · 12 comments · 1 min read · LW link
“Natural is better” is a valuable heuristic · Neil · Jun 20, 2023, 10:25 PM · 35 points · 16 comments · 4 min read · LW link
The Dictatorship Problem · alyssavance · Jun 11, 2023, 2:45 AM · 35 points · 145 comments · 11 min read · LW link
10 quick takes about AGI · Max H · Jun 20, 2023, 2:22 AM · 35 points · 17 comments · 7 min read · LW link
Scaffolded LLMs: Less Obvious Concerns · Stephen Fowler · Jun 16, 2023, 10:39 AM · 34 points · 15 comments · 14 min read · LW link
Anthropic | Charting a Path to AI Accountability · Gabe M · Jun 14, 2023, 4:43 AM · 34 points · 2 comments · 3 min read · LW link · (www.anthropic.com)
Experiments in Evaluating Steering Vectors · Gytis Daujotas · Jun 19, 2023, 3:11 PM · 34 points · 4 comments · 4 min read · LW link
Meta-conversation shouldn’t be taboo · Adam Zerner · Jun 5, 2023, 12:19 AM · 34 points · 36 comments · 4 min read · LW link
The AGI Race Between the US and China Doesn’t Exist. · Eva_B · Jun 3, 2023, 12:22 AM · 33 points · 15 comments · 7 min read · LW link · (evabehrens.substack.com)
Epistemic spot checking one claim in The Precipice · Isaac King · Jun 27, 2023, 1:03 AM · 33 points · 3 comments · 1 min read · LW link
Announcing AISafety.info’s Write-a-thon (June 16-18) and Second Distillation Fellowship (July 3-October 2) · steven0461 · Jun 3, 2023, 2:03 AM · 33 points · 1 comment · 2 min read · LW link
Intelligence Officials Say U.S. Has Retrieved Craft of Non-Human Origin · lc · Jun 6, 2023, 3:54 AM · 33 points · 151 comments · 1 min read · LW link · (thedebrief.org)
Multiple stages of fallacy—justifications and non-justifications for the multiple stage fallacy · AronT · Jun 13, 2023, 5:37 PM UTC · 33 points · 2 comments · 5 min read · LW link · (coordinationishard.substack.com)
Transformative AGI by 2043 is <1% likely · Ted Sanders · Jun 6, 2023, 5:36 PM UTC · 33 points · 117 comments · 5 min read · LW link · (arxiv.org)
On the Cost of Thriving Index · Zvi · Jun 26, 2023, 3:30 PM UTC · 33 points · 6 comments · 9 min read · LW link · (thezvi.wordpress.com)
Never Fight The Last War · ChristianKl · Jun 20, 2023, 12:35 PM UTC · 32 points · 4 comments · 1 min read · LW link
“LLMs Don’t Have a Coherent Model of the World”—What it Means, Why it Matters · Davidmanheim · Jun 1, 2023, 7:46 AM UTC · 32 points · 2 comments · 7 min read · LW link
Andrew Ng wants to have a conversation about extinction risk from AI · Leon Lang · Jun 5, 2023, 10:29 PM UTC · 32 points · 2 comments · 1 min read · LW link · (twitter.com)
UK PM: $125M for AI safety · Hauke Hillebrandt · Jun 12, 2023, 12:33 PM UTC · 31 points · 11 comments · 1 min read · LW link · (twitter.com)
Park Toys · jefftk · Jun 23, 2023, 4:00 PM UTC · 31 points · 5 comments · 1 min read · LW link · (www.jefftk.com)
Philosophical Cyborg (Part 1) · ukc10014, Roman Leventov and NicholasKees · Jun 14, 2023, 4:20 PM UTC · 31 points · 4 comments · 13 min read · LW link