All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Please help us communicate AI xrisk. It could save the world.

otto.barten4 Jul 2022 21:47 UTC

4 points

7 comments2 min readLW link

Benchmark for successful concept extrapolation/avoiding goal misgeneralization

Stuart_Armstrong4 Jul 2022 20:48 UTC

83 points

12 comments4 min readLW link

Procedural Executive Function, Part 1

DaystarEld4 Jul 2022 18:51 UTC

59 points

10 comments14 min readLW link

(daystareld.com)

Anthropic’s SoLU (Softmax Linear Unit)

Joel Burget4 Jul 2022 18:38 UTC

21 points

1 comment4 min readLW link

(transformer-circuits.pub)

Book Review: The Righteous Mind

ErnestScribbler4 Jul 2022 17:45 UTC

34 points

8 comments35 min readLW link

My Most Likely Reason to Die Young is AI X-Risk

AISafetyIsNotLongtermist4 Jul 2022 17:08 UTC

61 points

24 comments4 min readLW link

(forum.effectivealtruism.org)

Is General Intelligence “Compact”?

DragonGod4 Jul 2022 13:27 UTC

27 points

6 comments22 min readLW link

Remaking EfficientZero (as best I can)

Hoagy4 Jul 2022 11:03 UTC

37 points

9 comments22 min readLW link

We Need a Consolidated List of Bad AI Alignment Solutions

Double4 Jul 2022 6:54 UTC

9 points

14 comments1 min readLW link

AI Forecasting: One Year In

jsteinhardt4 Jul 2022 5:10 UTC

132 points

12 comments6 min readLW link

(bounded-regret.ghost.io)

A compressed take on recent disagreements

kman4 Jul 2022 4:39 UTC

33 points

9 comments1 min readLW link

New US Senate Bill on X-Risk Mitigation [Linkpost]

Evan R. Murphy4 Jul 2022 1:25 UTC

35 points

12 comments1 min readLW link

(www.hsgac.senate.gov)

Monthly Shorts 6/22

Celer3 Jul 2022 23:40 UTC

5 points

2 comments5 min readLW link

(keller.substack.com)

Decision theory and dynamic inconsistency

paulfchristiano3 Jul 2022 22:20 UTC

82 points

33 comments10 min readLW link

(sideways-view.com)

Five routes of access to scientific literature

DirectedEvolution3 Jul 2022 20:53 UTC

13 points

4 comments6 min readLW link

Toni Kurz and the Insanity of Climbing Mountains

GeneSmith3 Jul 2022 20:51 UTC

293 points

73 comments11 min readLW link 2 reviews

Wonder and The Golden AI Rule

JeffreyK3 Jul 2022 18:21 UTC

0 points

4 comments6 min readLW link

Nature abhors an immutable replicator… usually

MSRayne3 Jul 2022 15:08 UTC

28 points

10 comments3 min readLW link

Post hoc justifications as Compression Algorithm

Johannes C. Mayer3 Jul 2022 5:02 UTC

8 points

0 comments1 min readLW link

SOMA—A story about Consciousness

Johannes C. Mayer3 Jul 2022 4:46 UTC

10 points

0 comments1 min readLW link

(www.youtube.com)

Sexual self-acceptance

Johannes C. Mayer3 Jul 2022 4:26 UTC

11 points

6 comments1 min readLW link

Donohue, Levitt, Roe, and Wade: T-minus 20 years to a massive crime wave?

Paul Logan3 Jul 2022 3:03 UTC

−24 points

6 comments3 min readLW link

(laulpogan.substack.com)

Can we achieve AGI Alignment by balancing multiple human objectives?

Ben Smith3 Jul 2022 2:51 UTC

11 points

1 comment4 min readLW link

Trigger-Action Planning

CFAR!Duncan3 Jul 2022 1:42 UTC

92 points

14 comments13 min readLW link 2 reviews

[Question] Which one of these two academic routes should I take to end up in AI Safety?

Martín Soto3 Jul 2022 1:05 UTC

5 points

2 comments1 min readLW link

Naive Hypotheses on AI Alignment

Shoshannah Tekofsky2 Jul 2022 19:03 UTC

98 points

29 comments5 min readLW link

Follow along with Columbia EA’s Advanced AI Safety Fellowship!

RohanS2 Jul 2022 17:45 UTC

3 points

0 comments2 min readLW link

(forum.effectivealtruism.org)

Welcome to Analogia! (Chapter 7)

Justin Bullock2 Jul 2022 17:04 UTC

5 points

0 comments11 min readLW link

[Question] What about transhumans and beyond?

AlignmentMirror2 Jul 2022 13:58 UTC

7 points

6 comments1 min readLW link

Goal-directedness: tackling complexity

Morgan_Rogers2 Jul 2022 13:51 UTC

8 points

0 comments38 min readLW link

Literature recommendations July 2022

ChristianKl2 Jul 2022 9:14 UTC

17 points

9 comments1 min readLW link

Deontological Evil

lsusr2 Jul 2022 6:57 UTC

47 points

4 comments2 min readLW link

Could an AI Alignment Sandbox be useful?

Michael Soareverix2 Jul 2022 5:06 UTC

2 points

1 comment1 min readLW link

Five views of Bayes’ Theorem

Adam Scherlis2 Jul 2022 2:25 UTC

38 points

4 comments1 min readLW link

[Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan H2 Jul 2022 0:09 UTC

40 points

0 comments1 min readLW link

(arxiv.org)

Agenty AGI – How Tempting?

PeterMcCluskey1 Jul 2022 23:40 UTC

22 points

3 comments5 min readLW link

(www.bayesianinvestor.com)

AXRP Episode 16 - Preparing for Debate AI with Geoffrey Irving

DanielFilan1 Jul 2022 22:20 UTC

20 points

0 comments37 min readLW link

[Question] Examples of practical implications of Judea Pearl’s Causality work

ChristianKl1 Jul 2022 20:58 UTC

23 points

6 comments1 min readLW link

Minerva

Algon1 Jul 2022 20:06 UTC

36 points

6 comments2 min readLW link

(ai.googleblog.com)

Disarming status

sano1 Jul 2022 20:00 UTC

−4 points

1 comment6 min readLW link

Paper: Forecasting world events with neural nets

Owain_Evans, Dan H and Joe Kwon

1 Jul 2022 19:40 UTC

39 points

3 comments4 min readLW link

Reframing the AI Risk

Thane Ruthenis1 Jul 2022 18:44 UTC

26 points

7 comments6 min readLW link

Who is this MSRayne person anyway?

MSRayne1 Jul 2022 17:32 UTC

36 points

30 comments11 min readLW link

Limerence Messes Up Your Rationality Real Bad, Yo

Raemon1 Jul 2022 16:53 UTC

135 points

41 comments3 min readLW link 2 reviews

[Link] On the paradox of tolerance in relation to fascism and online content moderation – Unstable Ontology

Kenny1 Jul 2022 16:43 UTC

5 points

0 comments1 min readLW link

Trends in GPU price-performance

Marius Hobbhahn and Tamay

1 Jul 2022 15:51 UTC

85 points

13 comments1 min readLW link 1 review

(epochai.org)

[Question] How to deal with non-schedulable one-off stimulus-response-pair-like situations when planning/organising projects?

mikbp1 Jul 2022 15:22 UTC

2 points

3 comments1 min readLW link

What Is The True Name of Modularity?

CallumMcDougall, Lucius Bushnaq and Avery

1 Jul 2022 14:55 UTC

39 points

10 comments12 min readLW link

Defining Optimization in a Deeper Way Part 1

J Bostock1 Jul 2022 14:03 UTC

7 points

0 comments2 min readLW link

Safetywashing

Adam Scholl1 Jul 2022 11:56 UTC

265 points

20 comments1 min readLW link 2 reviews