All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 202122 23 24 25 26 27 28 29 30 31

Muddling Along Is More Likely Than Dystopia

Jeffrey Heninger20 Oct 2023 21:25 UTC

92 points

10 comments8 min readLW link

What’s Hard About The Shutdown Problem

johnswentworth20 Oct 2023 21:13 UTC

102 points

33 comments4 min readLW link

Holly Elmore and Rob Miles dialogue on AI Safety Advocacy

Bird Concept, Robert Miles and Holly_Elmore

20 Oct 2023 21:04 UTC

163 points

30 comments27 min readLW link

TOMORROW: the largest AI Safety protest ever!

Holly_Elmore20 Oct 2023 18:15 UTC

105 points

26 comments2 min readLW link

The Overkill Conspiracy Hypothesis

ymeskhout20 Oct 2023 16:51 UTC

27 points

9 comments7 min readLW link

I Would Have Solved Alignment, But I Was Worried That Would Advance Timelines

307th20 Oct 2023 16:37 UTC

126 points

33 comments9 min readLW link

Internal Target Information for AI Oversight

Paul Colognese20 Oct 2023 14:53 UTC

15 points

0 comments5 min readLW link

On the proper date for solstice celebrations

jchan20 Oct 2023 13:55 UTC

16 points

0 comments4 min readLW link

Are (at least some) Large Language Models Holographic Memory Stores?

Bill Benzon20 Oct 2023 13:07 UTC

11 points

4 comments6 min readLW link

Mechanistic interpretability of LLM analogy-making

Sergii20 Oct 2023 12:53 UTC

2 points

0 comments4 min readLW link

(grgv.xyz)

How To Socialize With Psycho(logist)s

Sable20 Oct 2023 11:33 UTC

38 points

11 comments3 min readLW link

(affablyevil.substack.com)

Revealing Intentionality In Language Models Through AdaVAE Guided Sampling

jdp20 Oct 2023 7:32 UTC

119 points

15 comments22 min readLW link

Features and Adversaries in MemoryDT

Joseph Bloom and Jay Bailey

20 Oct 2023 7:32 UTC

31 points

6 comments25 min readLW link

AI Safety Hub Serbia Soft Launch

DusanDNesic20 Oct 2023 7:11 UTC

64 points

1 comment3 min readLW link

(forum.effectivealtruism.org)

Announcing new round of “Key Phenomena in AI Risk” Reading Group

DusanDNesic and Nora_Ammann

20 Oct 2023 7:11 UTC

15 points

2 comments1 min readLW link

Unpacking the dynamics of AGI conflict that suggest the necessity of a premptive pivotal act

Eli Tyre20 Oct 2023 6:48 UTC

63 points

2 comments8 min readLW link

Genocide isn’t Decolonization

Rob Ennals20 Oct 2023 4:14 UTC

33 points

20 comments5 min readLW link

(messyprogress.substack.com)

Trying to understand John Wentworth’s research agenda

johnswentworth, habryka and David Lorell

20 Oct 2023 0:05 UTC

100 points

13 comments12 min readLW link

Boost your productivity, happiness and health with this one weird trick

ajc58619 Oct 2023 23:30 UTC

9 points

9 comments1 min readLW link

A Good Explanation of Differential Gears

Johannes C. Mayer19 Oct 2023 23:07 UTC

48 points

4 comments1 min readLW link

(youtu.be)

Evening Wiki(pedia) Workout

mcint19 Oct 2023 21:29 UTC

1 point

1 comment1 min readLW link

New roles on my team: come build Open Phil’s technical AI safety program with me!

Ajeya Cotra19 Oct 2023 16:47 UTC

83 points

6 comments4 min readLW link

[Question] Infinite tower of meta-probability

bilibili19 Oct 2023 16:44 UTC

6 points

5 comments3 min readLW link

A NotKillEveryoneIsm Argument for Accelerating Deep Learning Research

Logan Zoellner19 Oct 2023 16:28 UTC

−6 points

6 comments5 min readLW link

(midwitalignment.substack.com)

Knowledge Base 5: Business model

iwis19 Oct 2023 16:06 UTC

−4 points

2 comments1 min readLW link

AI #34: Chipping Away at Chip Exports

Zvi19 Oct 2023 15:00 UTC

36 points

19 comments59 min readLW link

(thezvi.wordpress.com)

Is Yann LeCun strawmanning AI x-risks?

Chris_Leong19 Oct 2023 11:35 UTC

26 points

4 comments1 min readLW link

[Video] Too much Empiricism kills you

Johannes C. Mayer19 Oct 2023 5:08 UTC

19 points

0 comments1 min readLW link

(youtu.be)

Are humans misaligned with evolution?

TekhneMakre and jacob_cannell

19 Oct 2023 3:14 UTC

43 points

13 comments18 min readLW link

Brains, Planes, Blimps, and Algorithms

ai dan18 Oct 2023 21:26 UTC

1 point

0 comments6 min readLW link

The (partial) fallacy of dumb superintelligence

Seth Herd18 Oct 2023 21:25 UTC

38 points

5 comments4 min readLW link

[Question] Does AI governance needs a “Federalist papers” debate?

azsantosk18 Oct 2023 21:08 UTC

40 points

4 comments1 min readLW link

Metaculus Launches Conditional Cup to Explore Linked Forecasts

ChristianWilliams18 Oct 2023 20:41 UTC

9 points

0 comments1 min readLW link

(www.metaculus.com)

AI Safety 101 : Reward Misspecification

markov18 Oct 2023 20:39 UTC

33 points

4 comments31 min readLW link

2023 East Coast Rationalist Megameetup

Screwtape18 Oct 2023 20:33 UTC

8 points

0 comments1 min readLW link

Superforecasting the premises in “Is power-seeking AI an existential risk?”

Joe Carlsmith18 Oct 2023 20:23 UTC

31 points

3 comments5 min readLW link

The Real Fanfic Is The Friends We Made Along The Way

Eneasz18 Oct 2023 19:21 UTC

92 points

1 comment27 min readLW link 1 review

(deathisbad.substack.com)

AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI

Dan H and Corin Katzke

18 Oct 2023 17:06 UTC

14 points

0 comments6 min readLW link

(newsletter.safe.ai)

Back to the Past to the Future

Prometheus18 Oct 2023 16:51 UTC

5 points

0 comments1 min readLW link

How to Eradicate Global Extreme Poverty [RA video with fundraiser!]

aggliu and Writer

18 Oct 2023 15:51 UTC

50 points

5 comments9 min readLW link

(youtu.be)

On Interpretability’s Robustness

Léo Dana18 Oct 2023 13:18 UTC

11 points

0 comments4 min readLW link

At 87, Pearl is still able to change his mind

rotatingpaguro18 Oct 2023 4:46 UTC

150 points

15 comments5 min readLW link

(Non-deceptive) Suboptimality Alignment

Sodium18 Oct 2023 2:07 UTC

5 points

1 comment9 min readLW link

magnetic cryo-FTIR

bhauth18 Oct 2023 1:59 UTC

10 points

0 comments4 min readLW link

(www.bhauth.com)

Hints about where values come from

Spiracular and TsviBT

18 Oct 2023 0:07 UTC

24 points

13 comments10 min readLW link

Labs should be explicit about why they are building AGI

peterbarnett17 Oct 2023 21:09 UTC

216 points

18 comments1 min readLW link 1 review

Eleuther releases Llemma: An Open Language Model For Mathematics

mako yass17 Oct 2023 20:03 UTC

22 points

0 comments1 min readLW link

(blog.eleuther.ai)

Investigating the learning coefficient of modular addition: hackathon project

Nina Panickssery and Dmitry Vaintrob

17 Oct 2023 19:51 UTC

97 points

5 comments12 min readLW link

Worldwork for Ethics

False Name17 Oct 2023 18:55 UTC

10 points

1 comment24 min readLW link

[Question] When building an organization, there are lots of ways to prevent financial corruption of personnel. But what are the ways to prevent corruption via social status, political power, etc.?

M. Y. Zuo17 Oct 2023 18:51 UTC

19 points

3 comments1 min readLW link