Imperial Russia was doing fine without the Soviets

Davis Kedrosky, 5 Jul 2022 22:24 UTC

6 points

3 comments, 14 min read, LW link

(daviskedrosky.substack.com)

A Pattern Language For Rationality

Vaniver, 5 Jul 2022 19:08 UTC

75 points

14 comments, 15 min read, LW link

How to destroy the universe with a hypercomputer

Trevor Cappallo, 5 Jul 2022 19:05 UTC

2 points

3 comments, 1 min read, LW link

The curious case of Pretty Good human inner/outer alignment

PavleMiha, 5 Jul 2022 19:04 UTC

41 points

45 comments, 4 min read, LW link

When is it appropriate to use statistical models and probabilities for decision making ?

Younes Kamel, 5 Jul 2022 12:34 UTC

10 points

7 comments, 4 min read, LW link

(youneskamel.substack.com)

Goal Factoring

CFAR!Duncan, 5 Jul 2022 7:10 UTC

101 points

2 comments, 8 min read, LW link

Assorted thoughts about abstraction

Adam Zerner, 5 Jul 2022 6:40 UTC

16 points

9 comments, 7 min read, LW link

[AN #172] Sorry for the long hiatus!

Rohin Shah, 5 Jul 2022 6:20 UTC

54 points

0 comments, 3 min read, LW link

(mailchi.mp)

Outline: The Rectifying of Maps

hamnox, 5 Jul 2022 5:14 UTC

7 points

0 comments, 2 min read, LW link

[Question] Seeking opinions on the current and forward state of cryptocurrencies.

jmh, 5 Jul 2022 5:01 UTC

6 points

6 comments, 1 min read, LW link

ITT-passing and civility are good; “charity” is bad; steelmanning is niche

Rob Bensinger, 5 Jul 2022 0:15 UTC

165 points

37 comments, 6 min read, LW link, 1 review

Please help us communicate AI xrisk. It could save the world.

otto.barten, 4 Jul 2022 21:47 UTC

4 points

7 comments, 2 min read, LW link

Benchmark for successful concept extrapolation/avoiding goal misgeneralization

Stuart_Armstrong, 4 Jul 2022 20:48 UTC

83 points

12 comments, 4 min read, LW link

Procedural Executive Function, Part 1

DaystarEld, 4 Jul 2022 18:51 UTC

59 points

10 comments, 14 min read, LW link

(daystareld.com)

Anthropic’s SoLU (Softmax Linear Unit)

Joel Burget, 4 Jul 2022 18:38 UTC

21 points

1 comment, 4 min read, LW link

(transformer-circuits.pub)

Book Review: The Righteous Mind

ErnestScribbler, 4 Jul 2022 17:45 UTC

34 points

8 comments, 35 min read, LW link

My Most Likely Reason to Die Young is AI X-Risk

AISafetyIsNotLongtermist, 4 Jul 2022 17:08 UTC

61 points

24 comments, 4 min read, LW link

(forum.effectivealtruism.org)

Is General Intelligence “Compact”?

DragonGod, 4 Jul 2022 13:27 UTC

27 points

6 comments, 22 min read, LW link

Remaking EfficientZero (as best I can)

Hoagy, 4 Jul 2022 11:03 UTC

37 points

9 comments, 22 min read, LW link

We Need a Consolidated List of Bad AI Alignment Solutions

Double, 4 Jul 2022 6:54 UTC

9 points

14 comments, 1 min read, LW link

AI Forecasting: One Year In

jsteinhardt, 4 Jul 2022 5:10 UTC

132 points

12 comments, 6 min read, LW link

(bounded-regret.ghost.io)

A compressed take on recent disagreements

kman, 4 Jul 2022 4:39 UTC

33 points

9 comments, 1 min read, LW link

New US Senate Bill on X-Risk Mitigation [Linkpost]

Evan R. Murphy, 4 Jul 2022 1:25 UTC

35 points

12 comments, 1 min read, LW link

(www.hsgac.senate.gov)

Monthly Shorts 6/22

Celer, 3 Jul 2022 23:40 UTC

5 points

2 comments, 5 min read, LW link

(keller.substack.com)

Decision theory and dynamic inconsistency

paulfchristiano, 3 Jul 2022 22:20 UTC

82 points

33 comments, 10 min read, LW link

(sideways-view.com)

Five routes of access to scientific literature

DirectedEvolution, 3 Jul 2022 20:53 UTC

13 points

4 comments, 6 min read, LW link

Toni Kurz and the Insanity of Climbing Mountains

GeneSmith, 3 Jul 2022 20:51 UTC

293 points

73 comments, 11 min read, LW link, 2 reviews

Wonder and The Golden AI Rule

JeffreyK, 3 Jul 2022 18:21 UTC

0 points

4 comments, 6 min read, LW link

Nature abhors an immutable replicator… usually

MSRayne, 3 Jul 2022 15:08 UTC

28 points

10 comments, 3 min read, LW link

Post hoc justifications as Compression Algorithm

Johannes C. Mayer, 3 Jul 2022 5:02 UTC

8 points

0 comments, 1 min read, LW link

SOMA—A story about Consciousness

Johannes C. Mayer, 3 Jul 2022 4:46 UTC

10 points

0 comments, 1 min read, LW link

(www.youtube.com)

Sexual self-acceptance

Johannes C. Mayer, 3 Jul 2022 4:26 UTC

11 points

6 comments, 1 min read, LW link

Donohue, Levitt, Roe, and Wade: T-minus 20 years to a massive crime wave?

Paul Logan, 3 Jul 2022 3:03 UTC

−24 points

6 comments, 3 min read, LW link

(laulpogan.substack.com)

Can we achieve AGI Alignment by balancing multiple human objectives?

Ben Smith, 3 Jul 2022 2:51 UTC

11 points

1 comment, 4 min read, LW link

Trigger-Action Planning

CFAR!Duncan, 3 Jul 2022 1:42 UTC

92 points

14 comments, 13 min read, LW link, 2 reviews

[Question] Which one of these two academic routes should I take to end up in AI Safety?

Martín Soto, 3 Jul 2022 1:05 UTC

5 points

2 comments, 1 min read, LW link

Naive Hypotheses on AI Alignment

Shoshannah Tekofsky, 2 Jul 2022 19:03 UTC

98 points

29 comments, 5 min read, LW link

Follow along with Columbia EA’s Advanced AI Safety Fellowship!

RohanS, 2 Jul 2022 17:45 UTC

3 points

0 comments, 2 min read, LW link

(forum.effectivealtruism.org)

Welcome to Analogia! (Chapter 7)

Justin Bullock, 2 Jul 2022 17:04 UTC

5 points

0 comments, 11 min read, LW link

[Question] What about transhumans and beyond?

AlignmentMirror, 2 Jul 2022 13:58 UTC

7 points

6 comments, 1 min read, LW link

Goal-directedness: tackling complexity

Morgan_Rogers, 2 Jul 2022 13:51 UTC

8 points

0 comments, 38 min read, LW link

Literature recommendations July 2022

ChristianKl, 2 Jul 2022 9:14 UTC

17 points

9 comments, 1 min read, LW link

Deontological Evil

lsusr, 2 Jul 2022 6:57 UTC

47 points

4 comments, 2 min read, LW link

Could an AI Alignment Sandbox be useful?

Michael Soareverix, 2 Jul 2022 5:06 UTC

2 points

1 comment, 1 min read, LW link

Five views of Bayes’ Theorem

Adam Scherlis, 2 Jul 2022 2:25 UTC

38 points

4 comments, 1 min read, LW link

[Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan H, 2 Jul 2022 0:09 UTC

40 points

0 comments, 1 min read, LW link

(arxiv.org)

Agenty AGI – How Tempting?

PeterMcCluskey, 1 Jul 2022 23:40 UTC

22 points

3 comments, 5 min read, LW link

(www.bayesianinvestor.com)

AXRP Episode 16 - Preparing for Debate AI with Geoffrey Irving

DanielFilan, 1 Jul 2022 22:20 UTC

20 points

0 comments, 37 min read, LW link

[Question] Examples of practical implications of Judea Pearl’s Causality work

ChristianKl, 1 Jul 2022 20:58 UTC

23 points

6 comments, 1 min read, LW link

Minerva

Algon, 1 Jul 2022 20:06 UTC

36 points

6 comments, 2 min read, LW link

(ai.googleblog.com)