All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Agenty AGI – How Tempting?

PeterMcCluskey1 Jul 2022 23:40 UTC

22 points

3 comments5 min readLW link

(www.bayesianinvestor.com)

AXRP Episode 16 - Preparing for Debate AI with Geoffrey Irving

DanielFilan1 Jul 2022 22:20 UTC

20 points

0 comments37 min readLW link

[Question] Examples of practical implications of Judea Pearl’s Causality work

ChristianKl1 Jul 2022 20:58 UTC

23 points

6 comments1 min readLW link

Minerva

Algon1 Jul 2022 20:06 UTC

36 points

6 comments2 min readLW link

(ai.googleblog.com)

Disarming status

sano1 Jul 2022 20:00 UTC

−4 points

1 comment6 min readLW link

Paper: Forecasting world events with neural nets

Owain_Evans, Dan H and Joe Kwon

1 Jul 2022 19:40 UTC

39 points

3 comments4 min readLW link

Reframing the AI Risk

Thane Ruthenis1 Jul 2022 18:44 UTC

26 points

7 comments6 min readLW link

Who is this MSRayne person anyway?

MSRayne1 Jul 2022 17:32 UTC

36 points

30 comments11 min readLW link

Limerence Messes Up Your Rationality Real Bad, Yo

Raemon1 Jul 2022 16:53 UTC

136 points

41 comments3 min readLW link 2 reviews

[Link] On the paradox of tolerance in relation to fascism and online content moderation – Unstable Ontology

Kenny1 Jul 2022 16:43 UTC

5 points

0 comments1 min readLW link

Trends in GPU price-performance

Marius Hobbhahn and Tamay

1 Jul 2022 15:51 UTC

85 points

13 comments1 min readLW link 1 review

(epochai.org)

[Question] How to deal with non-schedulable one-off stimulus-response-pair-like situations when planning/organising projects?

mikbp1 Jul 2022 15:22 UTC

2 points

3 comments1 min readLW link

What Is The True Name of Modularity?

CallumMcDougall, Lucius Bushnaq and Avery

1 Jul 2022 14:55 UTC

41 points

10 comments12 min readLW link

Defining Optimization in a Deeper Way Part 1

J Bostock1 Jul 2022 14:03 UTC

7 points

0 comments2 min readLW link

Safetywashing

Adam Scholl1 Jul 2022 11:56 UTC

266 points

20 comments1 min readLW link 2 reviews

[Question] AGI alignment with what?

AlignmentMirror1 Jul 2022 10:22 UTC

6 points

10 comments1 min readLW link

Open & Welcome Thread—July 2022

Kaj_Sotala1 Jul 2022 7:47 UTC

20 points

61 comments1 min readLW link

[Question] What is the contrast to counterfactual reasoning?

Dominic Roser1 Jul 2022 7:39 UTC

5 points

10 comments1 min readLW link

Meiosis is all you need

Metacelsus1 Jul 2022 7:39 UTC

41 points

3 comments2 min readLW link

(denovo.substack.com)

[Question] How to Navigate Evaluating Politicized Research?

Davis_Kingsley1 Jul 2022 5:59 UTC

11 points

1 comment1 min readLW link

One is (almost) normal in base π

Adam Scherlis1 Jul 2022 4:05 UTC

14 points

0 comments1 min readLW link

(adam.scherlis.com)

AI safety university groups: a promising opportunity to reduce existential risk

mic1 Jul 2022 3:59 UTC

14 points

0 comments11 min readLW link

Looking back on my alignment PhD

TurnTrout1 Jul 2022 3:19 UTC

334 points

67 comments11 min readLW link

Selection processes for subagents

Ryan Kidd30 Jun 2022 23:57 UTC

37 points

2 comments9 min readLW link

[Question] Cryonics-adjacent question

Flaglandbase30 Jun 2022 23:03 UTC

12 points

3 comments1 min readLW link

Forecasts are not enough

Ege Erdil30 Jun 2022 22:00 UTC

44 points

5 comments5 min readLW link

Murphyjitsu: an Inner Simulator algorithm

CFAR!Duncan30 Jun 2022 21:50 UTC

74 points

24 comments11 min readLW link 2 reviews

GPT-3 Catching Fish in Morse Code

Megan Kinniment30 Jun 2022 21:22 UTC

117 points

27 comments8 min readLW link

Metacognition in the Rat

Jacob Falkovich30 Jun 2022 20:53 UTC

19 points

0 comments6 min readLW link

On viewquakes

Dalton Mabery30 Jun 2022 20:08 UTC

8 points

0 comments2 min readLW link

The Track Record of Futurists Seems … Fine

HoldenKarnofsky30 Jun 2022 19:40 UTC

105 points

25 comments12 min readLW link

(www.cold-takes.com)

Quick survey on AI alignment resources

frances_lorenz30 Jun 2022 19:09 UTC

14 points

0 comments1 min readLW link

[Linkpost] Solving Quantitative Reasoning Problems with Language Models

Yitz30 Jun 2022 18:58 UTC

76 points

15 comments2 min readLW link

(storage.googleapis.com)

Failing to fix a dangerous intersection

alyssavance30 Jun 2022 18:09 UTC

110 points

17 comments2 min readLW link

Most Functions Have Undesirable Global Extrema

En Kepeig30 Jun 2022 17:10 UTC

8 points

5 comments3 min readLW link

Hedonistic Isotopes:

Trozxzr30 Jun 2022 16:49 UTC

1 point

0 comments1 min readLW link

Abadarian Trades

David Udell30 Jun 2022 16:41 UTC

19 points

22 comments2 min readLW link

Covid 6/30/22: Vaccine Update Update

Zvi30 Jun 2022 14:00 UTC

32 points

6 comments12 min readLW link

(thezvi.wordpress.com)

[Question] How should I talk about optimal but not subgame-optimal play?

JamesFaville30 Jun 2022 13:58 UTC

5 points

1 comment3 min readLW link

Formal Philosophy and Alignment Possible Projects

Daniel Herrmann30 Jun 2022 10:42 UTC

34 points

5 comments8 min readLW link

Bangalore LW/ACX Meetup in person

Aditya30 Jun 2022 7:21 UTC

5 points

2 comments1 min readLW link

Cultivating And Destroying Agency

hath30 Jun 2022 3:59 UTC

120 points

11 comments9 min readLW link

$500 bounty for alignment contest ideas

Orpheus1630 Jun 2022 1:56 UTC

29 points

5 comments2 min readLW link

any good rationalist guides to nutrition / healthy eating?

Ben A30 Jun 2022 0:50 UTC

7 points

15 comments1 min readLW link

A summary of every Replacing Guilt post

Orpheus1630 Jun 2022 0:46 UTC

35 points

3 comments10 min readLW link

(forum.effectivealtruism.org)

Gradient hacking: definitions and examples

Richard_Ngo29 Jun 2022 21:35 UTC

44 points

2 comments5 min readLW link

Progress links and tweets, 2022-06-29

jasoncrawford29 Jun 2022 21:33 UTC

9 points

0 comments1 min readLW link

(rootsofprogress.org)

[Question] Correcting human error vs doing exactly what you’re told—is there literature on this in context of general system design?

Jan Czechowski29 Jun 2022 21:30 UTC

6 points

0 comments1 min readLW link

Latent Adversarial Training

Adam Jermyn29 Jun 2022 20:04 UTC

58 points

13 comments5 min readLW link

Game Review: This Merchant Life

Zvi29 Jun 2022 18:30 UTC

20 points

0 comments13 min readLW link

(thezvi.wordpress.com)