All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 234 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Naive Hypotheses on AI Alignment

Shoshannah Tekofsky2 Jul 2022 19:03 UTC

98 points

29 comments5 min readLW link

The Tree of Life: Stanford AI Alignment Theory of Change

Gabe M2 Jul 2022 18:36 UTC

25 points

0 comments14 min readLW link

Follow along with Columbia EA’s Advanced AI Safety Fellowship!

RohanS2 Jul 2022 17:45 UTC

3 points

0 comments2 min readLW link

(forum.effectivealtruism.org)

Welcome to Analogia! (Chapter 7)

Justin Bullock2 Jul 2022 17:04 UTC

5 points

0 comments11 min readLW link

[Question] What about transhumans and beyond?

AlignmentMirror2 Jul 2022 13:58 UTC

7 points

6 comments1 min readLW link

Goal-directedness: tackling complexity

Morgan_Rogers2 Jul 2022 13:51 UTC

8 points

0 comments38 min readLW link

Literature recommendations July 2022

ChristianKl2 Jul 2022 9:14 UTC

17 points

9 comments1 min readLW link

Deontological Evil

lsusr2 Jul 2022 6:57 UTC

46 points

4 comments2 min readLW link

Could an AI Alignment Sandbox be useful?

Michael Soareverix2 Jul 2022 5:06 UTC

2 points

1 comment1 min readLW link

Five views of Bayes’ Theorem

Adam Scherlis2 Jul 2022 2:25 UTC

38 points

4 comments1 min readLW link

[Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan H2 Jul 2022 0:09 UTC

40 points

0 comments1 min readLW link

(arxiv.org)

Agenty AGI – How Tempting?

PeterMcCluskey1 Jul 2022 23:40 UTC

22 points

3 comments5 min readLW link

(www.bayesianinvestor.com)

AXRP Episode 16 - Preparing for Debate AI with Geoffrey Irving

DanielFilan1 Jul 2022 22:20 UTC

20 points

0 comments37 min readLW link

[Question] Examples of practical implications of Judea Pearl’s Causality work

ChristianKl1 Jul 2022 20:58 UTC

23 points

6 comments1 min readLW link

Minerva

Algon1 Jul 2022 20:06 UTC

36 points

6 comments2 min readLW link

(ai.googleblog.com)

Disarming status

sano1 Jul 2022 20:00 UTC

−4 points

1 comment6 min readLW link

Paper: Forecasting world events with neural nets

Owain_Evans, Dan H and Joe Kwon

1 Jul 2022 19:40 UTC

39 points

3 comments4 min readLW link

Reframing the AI Risk

Thane Ruthenis1 Jul 2022 18:44 UTC

26 points

7 comments6 min readLW link

Who is this MSRayne person anyway?

MSRayne1 Jul 2022 17:32 UTC

32 points

30 comments11 min readLW link

Limerence Messes Up Your Rationality Real Bad, Yo

Raemon1 Jul 2022 16:53 UTC

134 points

41 comments3 min readLW link 2 reviews

[Link] On the paradox of tolerance in relation to fascism and online content moderation – Unstable Ontology

Kenny1 Jul 2022 16:43 UTC

5 points

0 comments1 min readLW link

Trends in GPU price-performance

Marius Hobbhahn and Tamay

1 Jul 2022 15:51 UTC

85 points

13 comments1 min readLW link 1 review

(epochai.org)

[Question] How to deal with non-schedulable one-off stimulus-response-pair-like situations when planning/organising projects?

mikbp1 Jul 2022 15:22 UTC

2 points

3 comments1 min readLW link

What Is The True Name of Modularity?

CallumMcDougall, Lucius Bushnaq and Avery

1 Jul 2022 14:55 UTC

39 points

10 comments12 min readLW link

Defining Optimization in a Deeper Way Part 1

J Bostock1 Jul 2022 14:03 UTC

7 points

0 comments2 min readLW link

Safetywashing

Adam Scholl1 Jul 2022 11:56 UTC

264 points

20 comments1 min readLW link 2 reviews

[Question] AGI alignment with what?

AlignmentMirror1 Jul 2022 10:22 UTC

6 points

10 comments1 min readLW link

Open & Welcome Thread—July 2022

Kaj_Sotala1 Jul 2022 7:47 UTC

20 points

61 comments1 min readLW link

[Question] What is the contrast to counterfactual reasoning?

Dominic Roser1 Jul 2022 7:39 UTC

5 points

10 comments1 min readLW link

Meiosis is all you need

Metacelsus1 Jul 2022 7:39 UTC

41 points

3 comments2 min readLW link

(denovo.substack.com)

[Question] How to Navigate Evaluating Politicized Research?

Davis_Kingsley1 Jul 2022 5:59 UTC

11 points

1 comment1 min readLW link

One is (almost) normal in base π

Adam Scherlis1 Jul 2022 4:05 UTC

14 points

0 comments1 min readLW link

(adam.scherlis.com)

AI safety university groups: a promising opportunity to reduce existential risk

mic1 Jul 2022 3:59 UTC

14 points

0 comments11 min readLW link

Looking back on my alignment PhD

TurnTrout1 Jul 2022 3:19 UTC

334 points

67 comments11 min readLW link

Selection processes for subagents

Ryan Kidd30 Jun 2022 23:57 UTC

36 points

2 comments9 min readLW link

[Question] Cryonics-adjacent question

Flaglandbase30 Jun 2022 23:03 UTC

12 points

3 comments1 min readLW link

Forecasts are not enough

Ege Erdil30 Jun 2022 22:00 UTC

44 points

5 comments5 min readLW link

Murphyjitsu: an Inner Simulator algorithm

CFAR!Duncan30 Jun 2022 21:50 UTC

72 points

24 comments11 min readLW link 2 reviews

GPT-3 Catching Fish in Morse Code

Megan Kinniment30 Jun 2022 21:22 UTC

117 points

27 comments8 min readLW link

Metacognition in the Rat

Jacob Falkovich30 Jun 2022 20:53 UTC

19 points

0 comments6 min readLW link

On viewquakes

Dalton Mabery30 Jun 2022 20:08 UTC

8 points

0 comments2 min readLW link

The Track Record of Futurists Seems … Fine

HoldenKarnofsky30 Jun 2022 19:40 UTC

94 points

25 comments12 min readLW link

(www.cold-takes.com)

Quick survey on AI alignment resources

frances_lorenz30 Jun 2022 19:09 UTC

14 points

0 comments1 min readLW link

[Linkpost] Solving Quantitative Reasoning Problems with Language Models

Yitz30 Jun 2022 18:58 UTC

76 points

15 comments2 min readLW link

(storage.googleapis.com)

Failing to fix a dangerous intersection

alyssavance30 Jun 2022 18:09 UTC

110 points

17 comments2 min readLW link

Most Functions Have Undesirable Global Extrema

En Kepeig30 Jun 2022 17:10 UTC

8 points

5 comments3 min readLW link

Hedonistic Isotopes:

Trozxzr30 Jun 2022 16:49 UTC

1 point

0 comments1 min readLW link

Abadarian Trades

David Udell30 Jun 2022 16:41 UTC

19 points

22 comments2 min readLW link

Covid 6/30/22: Vaccine Update Update

Zvi30 Jun 2022 14:00 UTC

32 points

6 comments12 min readLW link

(thezvi.wordpress.com)

[Question] How should I talk about optimal but not subgame-optimal play?

JamesFaville30 Jun 2022 13:58 UTC

5 points

1 comment3 min readLW link