Latacora might be of interest to some AI Safety organizations · NunoSempere · Nov 25, 2021, 11:57 PM · 14 points · 10 comments · 1 min read · LW link (www.latacora.com)
Christiano, Cotra, and Yudkowsky on AI progress · Eliezer Yudkowsky and Ajeya Cotra · Nov 25, 2021, 4:45 PM · 119 points · 95 comments · 66 min read · LW link
Covid 11/25: Another Thanksgiving · Zvi · Nov 25, 2021, 1:40 PM · 73 points · 9 comments · 21 min read · LW link (thezvi.wordpress.com)
Coordinating the Unequal Treaties · lsusr · Nov 25, 2021, 10:47 AM · 34 points · 4 comments · 2 min read · LW link
First Strike and Second Strike · lsusr · Nov 25, 2021, 9:23 AM · 28 points · 5 comments · 1 min read · LW link
You are way more fallible than you think · Shmi · Nov 25, 2021, 5:52 AM · 4 points · 14 comments · 2 min read · LW link
[Linkpost] Danger of motivatiogenesis in interdisciplinary work · particlemania · Nov 25, 2021, 12:13 AM · 9 points · 0 comments · 1 min read · LW link
Meetup for The Roots of Progress in San Diego, Dec 1 · jasoncrawford · Nov 24, 2021, 10:50 PM · 7 points · 0 comments · 1 min read · LW link (rootsofprogress.org)
Base Rates and Reference Classes · jsteinhardt · Nov 24, 2021, 10:30 PM · 20 points · 7 comments · 5 min read · LW link (bounded-regret.ghost.io)
Why do you need the story? · George3d6 · Nov 24, 2021, 8:26 PM · 52 points · 11 comments · 5 min read · LW link (cerebralab.com)
[AN #169]: Collaborating with humans without human data · Rohin Shah · Nov 24, 2021, 6:30 PM · 33 points · 0 comments · 8 min read · LW link (mailchi.mp)
Paxlovid Remains Illegal: 11/24 Update · Zvi · Nov 24, 2021, 1:40 PM · 54 points · 21 comments · 7 min read · LW link (thezvi.wordpress.com)
HIRING: Inform and shape a new project on AI safety at Partnership on AI · Madhulika Srikumar · Nov 24, 2021, 8:27 AM · 6 points · 0 comments · 1 min read · LW link
[Question] How much Bayesian evidence from rapid antigen and PCR tests? · mingyuan · Nov 24, 2021, 6:54 AM · 8 points · 4 comments · 1 min read · LW link
French long COVID study: Belief vs Infection · Bucky · Nov 23, 2021, 11:14 PM · 40 points · 11 comments · 5 min read · LW link
[Question] Cornell Meetup · Lionel Levine · Nov 23, 2021, 9:28 PM · 6 points · 4 comments · 1 min read · LW link
AI Tracker: monitoring current and near-future risks from superscale models · Edouard Harris and Jeremie Harris · Nov 23, 2021, 7:16 PM · 67 points · 13 comments · 3 min read · LW link (aitracker.org)
Laplace’s rule of succession · Ege Erdil · Nov 23, 2021, 3:48 PM · 52 points · 2 comments · 7 min read · LW link
AI Safety Needs Great Engineers · Andy Jones · Nov 23, 2021, 3:40 PM · 90 points · 43 comments · 4 min read · LW link
Slightly advanced decision theory 102: Four reasons not to be a (naive) utility maximizer · Jan · Nov 23, 2021, 11:02 AM · 10 points · 1 comment · 15 min read · LW link (universalprior.substack.com)
Use Tools For What They’re For · DirectedEvolution · Nov 23, 2021, 8:26 AM · 28 points · 14 comments · 8 min read · LW link
[Linkpost] Acquisition of Chess Knowledge in AlphaZero · Quintin Pope · Nov 23, 2021, 7:55 AM · 8 points · 1 comment · 1 min read · LW link
[Linkpost] Why Going to the Doctor Sucks (WaitButWhy) · mike_hawke · Nov 23, 2021, 3:02 AM · 5 points · 11 comments · 1 min read · LW link (waitbutwhy.com)
Integrating Three Models of (Human) Cognition · jbkjr · Nov 23, 2021, 1:06 AM · 40 points · 4 comments · 32 min read · LW link
Potential Alignment mental tool: Keeping track of the types · Donald Hobson · Nov 22, 2021, 8:05 PM · 29 points · 1 comment · 2 min read · LW link
Yudkowsky and Christiano discuss “Takeoff Speeds” · Eliezer Yudkowsky · Nov 22, 2021, 7:35 PM · 210 points · 176 comments · 60 min read · LW link · 1 review
Morally underdefined situations can be deadly · Stuart_Armstrong · Nov 22, 2021, 2:48 PM · 17 points · 8 comments · 2 min read · LW link
A Bayesian Aggregation Paradox · Jsevillamol · Nov 22, 2021, 10:39 AM · 87 points · 23 comments · 7 min read · LW link
[Question] Do factored sets elucidate anything about how to update everyday beliefs? · TekhneMakre · Nov 22, 2021, 6:51 AM · 5 points · 1 comment · 1 min read · LW link
Even if you’re right, you’re wrong · DanielFilan · Nov 22, 2021, 5:40 AM · 17 points · 5 comments · 1 min read · LW link (danielfilan.com)
The Meta-Puzzle · DanielFilan · Nov 22, 2021, 5:30 AM · 23 points · 27 comments · 3 min read · LW link (danielfilan.com)
Some real examples of gradient hacking · Oliver Sourbut · Nov 22, 2021, 12:11 AM · 15 points · 8 comments · 2 min read · LW link
“The Wisdom of the Lazy Teacher” · Richard_Kennaway · Nov 21, 2021, 9:11 PM · 16 points · 5 comments · 1 min read · LW link
Vitalik: Cryptoeconomics and X-Risk Researchers Should Listen to Each Other More · Emerson Spartz · Nov 21, 2021, 6:53 PM · 47 points · 9 comments · 5 min read · LW link
Giving Up On T-Mobile · jefftk · Nov 21, 2021, 4:50 PM · 13 points · 6 comments · 2 min read · LW link (www.jefftk.com)
From language to ethics by automated reasoning · Michele Campolo · Nov 21, 2021, 3:16 PM · 4 points · 4 comments · 6 min read · LW link
Split and Commit · Duncan Sabien (Inactive) · Nov 21, 2021, 6:27 AM · 191 points · 34 comments · 7 min read · LW link · 1 review
What’s the weirdest way to win this game? · Adam Scherlis · Nov 21, 2021, 5:18 AM · 9 points · 5 comments · 1 min read · LW link (adam.scherlis.com)
Eat the cute animals instead · Andrew Vlahos · Nov 21, 2021, 1:06 AM · −4 points · 2 comments · 1 min read · LW link
Chris Voss negotiation MasterClass: review · VipulNaik · Nov 20, 2021, 10:39 PM · 70 points · 15 comments · 33 min read · LW link
ACX Montreal Meetup Dec 4 2021 · E · Nov 20, 2021, 5:49 PM · 8 points · 0 comments · 1 min read · LW link
The Maker of MIND · Tomás B. · Nov 20, 2021, 4:28 PM · 112 points · 19 comments · 11 min read · LW link
South Bay ACX/LW Meetup—CHANGED LOCATION · IS · Nov 20, 2021, 2:42 PM · 11 points · 0 comments · 1 min read · LW link
The Emperor’s New Clothes: a story of motivated stupidity · David Hugh-Jones · Nov 20, 2021, 1:24 PM UTC · 10 points · 5 comments · 3 min read · LW link (wyclif.substack.com)
[Book Review] “Sorceror’s Apprentice” by Tahir Shah · lsusr · Nov 20, 2021, 11:29 AM UTC · 92 points · 11 comments · 7 min read · LW link
Competence/Confidence · Duncan Sabien (Inactive) · Nov 20, 2021, 8:59 AM UTC · 60 points · 19 comments · 1 min read · LW link
Awesome-github Post-Scarcity List · lorepieri · Nov 20, 2021, 8:47 AM UTC · 3 points · 6 comments · 1 min read · LW link
A Certain Formalization of Corrigibility Is VNM-Incoherent · TurnTrout · Nov 20, 2021, 12:30 AM UTC · 68 points · 24 comments · 8 min read · LW link
More detailed proposal for measuring alignment of current models · Beth Barnes · Nov 20, 2021, 12:03 AM UTC · 31 points · 0 comments · 8 min read · LW link
Ambitious Altruistic Software Engineering Efforts: Opportunities and Benefits · ozziegooen · Nov 19, 2021, 5:55 PM UTC · 42 points · 1 comment · 9 min read · LW link (forum.effectivealtruism.org)