All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 202122 23 24 25 26 27 28 29 30

Vision Weekend US Edition

Allison Duettmann20 Sep 2023 21:28 UTC

4 points

0 comments1 min readLW link

Foresight Vision Weekend Europe Edition

Allison Duettmann20 Sep 2023 21:25 UTC

3 points

0 comments1 min readLW link

Notes on ChatGPT’s “memory” for strings and for events

Bill Benzon20 Sep 2023 18:12 UTC

3 points

0 comments10 min readLW link

Belief and the Truth

Sam I am20 Sep 2023 17:38 UTC

2 points

14 comments5 min readLW link

(open.substack.com)

Image Hijacks: Adversarial Images can Control Generative Models at Runtime

Scott Emmons, Luke Bailey and Euan Ong

20 Sep 2023 15:23 UTC

58 points

9 comments1 min readLW link

(arxiv.org)

Interpretability Externalities Case Study—Hungry Hungry Hippos

Magdalena Wache20 Sep 2023 14:42 UTC

64 points

22 comments2 min readLW link

An Elementary Introduction to Infra-Bayesianism

CarolusRenniusVitellius20 Sep 2023 14:29 UTC

16 points

0 comments1 min readLW link

Weekly Incidence Including Delay

jefftk20 Sep 2023 14:00 UTC

11 points

0 comments2 min readLW link

(www.jefftk.com)

[Question] The stereotype of male classical music lovers being gay

BB620 Sep 2023 13:23 UTC

13 points

6 comments1 min readLW link

Housing Roundup #6

Zvi20 Sep 2023 13:10 UTC

27 points

8 comments14 min readLW link

(thezvi.wordpress.com)

Careless talk on US-China AI competition? (and criticism of CAIS coverage)

Oliver Sourbut20 Sep 2023 12:46 UTC

18 points

3 comments10 min readLW link 3 reviews

(www.oliversourbut.net)

A New Bayesian Decision Theory

Pareto Optimal20 Sep 2023 9:36 UTC

−6 points

0 comments1 min readLW link

(paretooptimal.substack.com)

Protest against Meta’s irreversible proliferation (Sept 29, San Francisco)

Holly_Elmore19 Sep 2023 23:40 UTC

54 points

33 comments1 min readLW link

The AI Explosion Might Never Happen

snewman19 Sep 2023 23:20 UTC

22 points

31 comments9 min readLW link

Science of Deep Learning more tractably addresses the Sharp Left Turn than Agent Foundations

NickGabs19 Sep 2023 22:06 UTC

22 points

2 comments6 min readLW link

Formalizing «Boundaries» with Markov blankets

Chris Lakin19 Sep 2023 21:01 UTC

23 points

20 comments3 min readLW link

Precision of Sets of Forecasts

niplav19 Sep 2023 18:19 UTC

20 points

5 comments10 min readLW link

The Proxy Political Party

antidefault19 Sep 2023 17:47 UTC

−3 points

4 comments1 min readLW link

(antidefault.net)

The Limits of the Existence Proof Argument for General Intelligence

Amadeus Pagel19 Sep 2023 17:45 UTC

−21 points

3 comments1 min readLW link

(amadeuspagel.com)

[Question] Is there a publicly available list of examples of frontier model capabilities?

Max Kearney19 Sep 2023 17:45 UTC

1 point

0 comments1 min readLW link

Tallinn, Estonia – ACX Meetups Everywhere Autumn 2023

Andrew19 Sep 2023 16:24 UTC

1 point

0 comments1 min readLW link

Anthropic’s Responsible Scaling Policy & Long-Term Benefit Trust

Zac Hatfield-Dodds19 Sep 2023 15:09 UTC

85 points

26 comments3 min readLW link 1 review

(www.anthropic.com)

AISN #22: The Landscape of US AI Legislation - Hearings, Frameworks, Bills, and Laws

Dan H19 Sep 2023 14:44 UTC

20 points

0 comments5 min readLW link

(newsletter.safe.ai)

Compilation of Profit for Good Redteaming and Responses

Brad West 19 Sep 2023 13:34 UTC

1 point

0 comments9 min readLW link

[Link post] Michael Nielsen’s “Notes on Existential Risk from Artificial Superintelligence”

Joel Becker19 Sep 2023 13:31 UTC

67 points

12 comments6 min readLW link

(michaelnotebook.com)

[Question] Do LLMs Implement NLP Algorithms for Better Next Token Predictions?

simeon_c19 Sep 2023 12:28 UTC

5 points

1 comment1 min readLW link

On martingales

Joey Marcellino19 Sep 2023 11:39 UTC

8 points

4 comments4 min readLW link

Luck based medicine: angry eldritch sugar gods edition

Elizabeth19 Sep 2023 4:40 UTC

75 points

14 comments9 min readLW link

(acesounderglass.com)

Don’t Think About the Thing Behind the Curtain.

keltan19 Sep 2023 2:07 UTC

4 points

0 comments5 min readLW link

Panel with Israeli Prime Minister on existential risk from AI

Michaël Trazzi18 Sep 2023 23:16 UTC

22 points

2 comments1 min readLW link

(x.com)

Some reasons why I frequently prefer communicating via text

Adam Zerner18 Sep 2023 21:50 UTC

54 points

18 comments2 min readLW link

Why I Don’t Believe The Law of the Excluded Middle

Thoth Hermes18 Sep 2023 18:53 UTC

−11 points

46 comments5 min readLW link

(thothhermes.substack.com)

Forecasting for Policy (FORPOL) - Main takeaways, practical learnings & report

janklenha18 Sep 2023 17:44 UTC

2 points

0 comments4 min readLW link

The Talk: a brief explanation of sexual dimorphism

Malmesbury18 Sep 2023 16:23 UTC

553 points

79 comments16 min readLW link 3 reviews

[Question] Where might I direct promising-to-me researchers to apply for alignment jobs/grants?

abramdemski18 Sep 2023 16:20 UTC

45 points

10 comments1 min readLW link

[Review] Move First, Think Later: Sense and Nonsense in Improving Your Chess

Arjun Panickssery18 Sep 2023 15:10 UTC

36 points

2 comments6 min readLW link

(arjunpanickssery.substack.com)

Technical AI Safety Research Landscape [Slides]

Magdalena Wache18 Sep 2023 13:56 UTC

50 points

2 comments4 min readLW link

The omnizoid—Heighn FDT Debate #5

Heighn18 Sep 2023 11:54 UTC

4 points

0 comments3 min readLW link

Ask for Feelings not Tunes

jefftk18 Sep 2023 2:10 UTC

11 points

0 comments1 min readLW link

(www.jefftk.com)

Three ways interpretability could be impactful

Arthur Conmy18 Sep 2023 1:02 UTC

47 points

8 comments4 min readLW link

Show LW: Get a phone call if prediction markets predict nuclear war

Lorenzo17 Sep 2023 22:25 UTC

35 points

8 comments1 min readLW link

(recursing.github.io)

Microdooms averted by working on AI Safety

Nikola Jurkovic17 Sep 2023 21:46 UTC

34 points

3 comments3 min readLW link

(forum.effectivealtruism.org)

Eugenics Performed By A Blind, Idiot God

Bentham's Bulldog17 Sep 2023 20:37 UTC

66 points

11 comments2 min readLW link

Actually, “personal attacks after object-level arguments” is a pretty good rule of epistemic conduct

Max H17 Sep 2023 20:25 UTC

37 points

15 comments7 min readLW link

Joseph Bloom on choosing AI Alignment over bio, what many aspiring researchers get wrong, and more (interview)

Ruby and Joseph Bloom

17 Sep 2023 18:45 UTC

27 points

2 comments8 min readLW link

Catalyst books

Catnee17 Sep 2023 17:05 UTC

7 points

2 comments1 min readLW link

Telopheme, telophore, and telotect

TsviBT17 Sep 2023 16:24 UTC

46 points

7 comments8 min readLW link

How to think about slowing AI

Zach Stein-Perlman17 Sep 2023 16:00 UTC

14 points

2 comments3 min readLW link

(forum.effectivealtruism.org)

Book Review: Consciousness Explained (as the Great Catalyst)

Rafael Harth17 Sep 2023 15:30 UTC

23 points

14 comments22 min readLW link 1 review

Reflexive decision theory is an unsolved problem

Richard_Kennaway17 Sep 2023 14:15 UTC

40 points

30 comments4 min readLW link