All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar AprMayJun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

South Bay ACX/LW Meetup

IS8 May 2023 23:55 UTC

2 points

0 comments1 min readLW link

H-JEPA might be technically alignable in a modified form

Roman Leventov8 May 2023 23:04 UTC

12 points

2 comments7 min readLW link

All AGI Safety questions welcome (especially basic ones) [May 2023]

steven04618 May 2023 22:30 UTC

34 points

44 comments2 min readLW link

Predictable updating about AI risk

Joe Carlsmith8 May 2023 21:53 UTC

297 points

25 comments36 min readLW link 1 review

Annotated reply to Bengio’s “AI Scientists: Safe and Useful AI?”

Roman Leventov8 May 2023 21:26 UTC

18 points

2 comments7 min readLW link

(yoshuabengio.org)

Are healthy choices effective for improving live expectancy anymore?

Christopher King8 May 2023 21:25 UTC

4 points

4 comments1 min readLW link

LeCun’s “A Path Towards Autonomous Machine Intelligence” has an unsolved technical alignment problem

Steven Byrnes8 May 2023 19:35 UTC

144 points

38 comments15 min readLW link

Product Endorsement: Apollo Neuro

Elizabeth8 May 2023 19:00 UTC

47 points

28 comments5 min readLW link

(acesounderglass.com)

Acausal trade naturally results in the Nash bargaining solution

Christopher King8 May 2023 18:13 UTC

3 points

0 comments4 min readLW link

Inference Speed is Not Unbounded

Onid8 May 2023 16:24 UTC

35 points

32 comments16 min readLW link

[Crosspost] Unveiling the American Public Opinion on AI Moratorium and Government Intervention: The Impact of Media Exposure

otto.barten8 May 2023 14:09 UTC

7 points

0 comments6 min readLW link

(forum.effectivealtruism.org)

Thriving in the Weird Times: Preparing for the 100X Economy

Lucie Philippon and Charbel-Raphaël

8 May 2023 13:44 UTC

23 points

16 comments2 min readLW link

Housing and Transit Roundup #4

Zvi8 May 2023 13:30 UTC

25 points

0 comments11 min readLW link

(thezvi.wordpress.com)

Dance Profit Sharing

jefftk8 May 2023 13:10 UTC

11 points

3 comments2 min readLW link

(www.jefftk.com)

How “AGI” could end up being many different specialized AI’s stitched together

titotal8 May 2023 12:32 UTC

9 points

2 comments9 min readLW link

What does it take to ban a thing?

qbolec8 May 2023 11:00 UTC

66 points

18 comments5 min readLW link

Solomonoff’s solipsism

Mergimio H. Doefevmil8 May 2023 6:55 UTC

−13 points

9 comments1 min readLW link

A technical note on bilinear layers for interpretability

Lee Sharkey8 May 2023 6:06 UTC

59 points

0 comments1 min readLW link

(arxiv.org)

[Question] Is EDT correct? Does “EDT” == “logical EDT” == “logical CDT”?

Vivek Hebbar8 May 2023 2:07 UTC

13 points

2 comments1 min readLW link

LLM cognition is probably not human-like

Max H8 May 2023 1:22 UTC

27 points

15 comments7 min readLW link

[Question] If alignment problem was unsolvable, would that avoid doom?

Kinrany7 May 2023 22:13 UTC

3 points

3 comments1 min readLW link

An artificially structured argument for expecting AGI ruin

Rob Bensinger7 May 2023 21:52 UTC

91 points

26 comments19 min readLW link

Where “the Sequences” Are Wrong

Thoth Hermes7 May 2023 20:21 UTC

−15 points

5 comments14 min readLW link

(thothhermes.substack.com)

What’s wrong with being dumb?

Adam Zerner7 May 2023 18:31 UTC

14 points

17 comments2 min readLW link

Categories of Arguing Style : Why being good among rationalists isn’t enough to argue with everyone

Camille B. 7 May 2023 17:45 UTC

16 points

0 comments23 min readLW link

Self-Administered Gell-Mann Amnesia

krs7 May 2023 17:44 UTC

1 point

1 comment1 min readLW link

Understanding mesa-optimization using toy models

tilmanr, rusheb, Guillaume Corlouer, Dan Valentine, afspies, mivanitskiy and Can

7 May 2023 17:00 UTC

46 points

6 comments10 min readLW link

How to have Polygenically Screened Children

GeneSmith7 May 2023 16:01 UTC

372 points

128 comments27 min readLW link 1 review

Statistical models & the irrelevance of rare exceptions

patrissimo7 May 2023 15:59 UTC

36 points

6 comments2 min readLW link

Let’s look for coherence theorems

Valdes7 May 2023 14:45 UTC

26 points

18 comments6 min readLW link

Graphical Representations of Paul Christiano’s Doom Model

Nathan Young7 May 2023 13:03 UTC

9 points

0 comments1 min readLW link

An anthropomorphic AI dilemma

TsviBT7 May 2023 12:44 UTC

26 points

0 comments7 min readLW link

Violin Supports

jefftk7 May 2023 12:10 UTC

12 points

1 comment1 min readLW link

(www.jefftk.com)

Properties of Good Textbooks

niplav7 May 2023 8:38 UTC

50 points

11 comments1 min readLW link

Against sacrificing AI transparency for generality gains

Ape in the coat7 May 2023 6:52 UTC

4 points

0 comments2 min readLW link

TED talk by Eliezer Yudkowsky: Unleashing the Power of Artificial Intelligence

bayesed7 May 2023 5:45 UTC

49 points

36 comments1 min readLW link

(www.youtube.com)

Thinking of Convenience as an Economic Term

ozziegooen7 May 2023 1:21 UTC

6 points

0 comments12 min readLW link

(forum.effectivealtruism.org)

Corrigibility, Much more detail than anyone wants to Read

Logan Zoellner7 May 2023 1:02 UTC

27 points

3 comments7 min readLW link

Residual stream norms grow exponentially over the forward pass

StefanHex and TurnTrout

7 May 2023 0:46 UTC

79 points

24 comments9 min readLW link

On the Loebner Silver Prize (a Turing test)

hold_my_fish7 May 2023 0:39 UTC

18 points

2 comments2 min readLW link

Time and Energy Costs to Erase a Bit

DaemonicSigil6 May 2023 23:29 UTC

24 points

32 comments7 min readLW link

How much do you believe your results?

Eric Neyman6 May 2023 20:31 UTC

524 points

18 comments15 min readLW link 4 reviews

(ericneyman.wordpress.com)

Long Covid Risks: 2023 Update

Elizabeth6 May 2023 18:20 UTC

69 points

11 comments4 min readLW link

(acesounderglass.com)

Is “red” for GPT-4 the same as “red” for you?

Yusuke Hayashi6 May 2023 17:55 UTC

9 points

6 comments2 min readLW link

The Broader Fossil Fuel Community

Jeffrey Heninger6 May 2023 14:49 UTC

16 points

1 comment3 min readLW link

Estimating Norovirus Prevalence

jefftk6 May 2023 11:40 UTC

16 points

0 comments2 min readLW link

(www.jefftk.com)

Alignment as Function Fitting

A.H.6 May 2023 11:38 UTC

7 points

0 comments12 min readLW link

My preferred framings for reward misspecification and goal misgeneralisation

Yi-Yang6 May 2023 4:48 UTC

27 points

1 comment8 min readLW link

You don’t need to be a genius to be in AI safety research

Claire Short6 May 2023 2:32 UTC

15 points

1 comment6 min readLW link

Naturalist Collection

LoganStrohl6 May 2023 0:37 UTC

71 points

7 comments15 min readLW link