All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 201820192020 2021 2022 2023 2024 2025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 222324 25 26 27 28 29 30 31

[Question] Can Bayes theorem represent infinite confusion?

Yoav Ravid22 Mar 2019 18:02 UTC

4 points

13 comments1 min readLW link

The Game Theory of Blackmail

Linda Linsefors22 Mar 2019 17:44 UTC

25 points

17 comments4 min readLW link

New Entry at the Stanford Encyclopedia of Philosophy on the Pragmatic Theory of Truth

Iwan Danilo22 Mar 2019 17:39 UTC

−3 points

1 comment1 min readLW link

(plato.stanford.edu)

South Bay SSC Meetup

David Friedman22 Mar 2019 3:10 UTC

2 points

0 comments1 min readLW link

Retrospective on a quantitative productivity logging attempt

etirabys22 Mar 2019 2:31 UTC

25 points

5 comments3 min readLW link

Declarative Mathematics

johnswentworth21 Mar 2019 19:05 UTC

59 points

10 comments3 min readLW link

The Main Sources of AI Risk?

Daniel Kokotajlo and Wei Dai

21 Mar 2019 18:28 UTC

128 points

29 comments2 min readLW link

[Link] IDA 9/14: The Scheme

RAISE21 Mar 2019 18:28 UTC

4 points

0 comments1 min readLW link

[Question] What should we expect from GPT-3?

avturchin21 Mar 2019 14:28 UTC

22 points

2 comments1 min readLW link

[Question] Tracking accuracy of personal forecasts

CheerfulWarrior20 Mar 2019 20:39 UTC

8 points

14 comments1 min readLW link

Criticism catalyzes analytical thinking in groups

rayraegah20 Mar 2019 16:27 UTC

10 points

0 comments1 min readLW link

Games in Kocherga club: Fallacymania, Tower of Chaos, Scientific Discovery

Alexander23020 Mar 2019 13:52 UTC

3 points

0 comments1 min readLW link

Moscow LW meetup in “Nauchka” library

Alexander23020 Mar 2019 13:49 UTC

3 points

0 comments1 min readLW link

[Question] What’s wrong with these analogies for understanding Informed Oversight and IDA?

Wei Dai20 Mar 2019 9:11 UTC

35 points

3 comments1 min readLW link

Alignment Newsletter #49

Rohin Shah20 Mar 2019 4:20 UTC

23 points

1 comment11 min readLW link

(mailchi.mp)

Some thoughts after reading Artificial Intelligence: A Modern Approach

swift_spiral19 Mar 2019 23:39 UTC

38 points

4 comments2 min readLW link

Rest Days vs Recovery Days

Unreal19 Mar 2019 22:37 UTC

242 points

36 comments6 min readLW link 1 review

Partial preferences and models

Stuart_Armstrong19 Mar 2019 16:29 UTC

12 points

9 comments2 min readLW link

IRL 3/8: Mitigating degeneracy: feature matching

RAISE18 Mar 2019 20:15 UTC

6 points

0 comments1 min readLW link

(app.grasple.com)

[Question] Is there a difference between uncertainty over your utility function and uncertainty over outcomes?

Chris_Leong18 Mar 2019 18:41 UTC

14 points

12 comments1 min readLW link

Ideas for a fact checking widget

Yoav Ravid18 Mar 2019 14:25 UTC

9 points

4 comments1 min readLW link

Implications of living within a Simulation

Tater18 Mar 2019 6:22 UTC

1 point

7 comments2 min readLW link

What failure looks like

paulfchristiano17 Mar 2019 20:18 UTC

448 points

55 comments8 min readLW link 2 reviews

Cryopreservation of Valia Zeldin

avturchin17 Mar 2019 19:15 UTC

19 points

0 comments1 min readLW link

(medium.com)

Insights from Munkres’ Topology

Rafael Harth17 Mar 2019 16:52 UTC

31 points

0 comments14 min readLW link

Motivational Meeting Place

Vincent B17 Mar 2019 16:17 UTC

8 points

1 comment3 min readLW link

[Question] Ask LW: Have you read Yudkowsky’s AI to Zombie book?

CaiwitzAzaria17 Mar 2019 13:31 UTC

10 points

20 comments1 min readLW link

[Question] What societies have ever had legal or accepted blackmail?

clone of saturn17 Mar 2019 9:16 UTC

33 points

23 comments1 min readLW link

[Question] How large is the fallout area of the biggest cobalt bomb we can build?

habryka17 Mar 2019 5:50 UTC

20 points

8 comments1 min readLW link

A cognitive intervention for wrist pain

rmoehn17 Mar 2019 5:26 UTC

28 points

24 comments6 min readLW link

Has “politics is the mind-killer” been a mind-killer?

SonnieBailey17 Mar 2019 3:05 UTC

31 points

26 comments3 min readLW link

Comparison of decision theories (with a focus on logical-counterfactual decision theories)

riceissa16 Mar 2019 21:15 UTC

82 points

20 comments10 min readLW link

Terrorism and Russell’s love of excitement

CaiwitzAzaria16 Mar 2019 6:53 UTC

−9 points

0 comments1 min readLW link

Boeing 737 MAX MCAS as an agent corrigibility failure

Shmi16 Mar 2019 1:46 UTC

60 points

3 comments1 min readLW link

Humans aren’t agents—what then for value learning?

Charlie Steiner15 Mar 2019 22:01 UTC

28 points

16 comments3 min readLW link

Privacy

Zvi15 Mar 2019 20:20 UTC

79 points

78 comments6 min readLW link

(thezvi.wordpress.com)

Active Curiosity vs Open Curiosity

Unreal15 Mar 2019 16:54 UTC

76 points

24 comments3 min readLW link

IDA 5-8/14: Approval Directed Agents

RAISE14 Mar 2019 23:58 UTC

4 points

0 comments1 min readLW link

(app.grasple.com)

Nashville SSC March Meetup

Dude McDude14 Mar 2019 19:37 UTC

1 point

0 comments1 min readLW link

Risk of Mass Human Suffering / Extinction due to Climate Emergency

willfranks14 Mar 2019 18:32 UTC

4 points

3 comments1 min readLW link

Speculations on Duo Standard

Zvi14 Mar 2019 14:30 UTC

9 points

2 comments8 min readLW link

(thezvi.wordpress.com)

Combining individual preference utility functions

Stuart_Armstrong14 Mar 2019 14:14 UTC

13 points

2 comments1 min readLW link

Mysteries, identity, and preferences over non-rewards

Stuart_Armstrong14 Mar 2019 13:52 UTC

14 points

1 comment1 min readLW link

Blackmailers are privateers in the war on hypocrisy

Benquo14 Mar 2019 8:13 UTC

27 points

23 comments5 min readLW link

(benjaminrosshoffman.com)

AI Safety Prerequisites Course: Basic abstract representations of computation

RAISE13 Mar 2019 19:38 UTC

28 points

2 comments1 min readLW link

Question: MIRI Corrigbility Agenda

algon3313 Mar 2019 19:38 UTC

15 points

11 comments1 min readLW link

A theory of human values

Stuart_Armstrong13 Mar 2019 15:22 UTC

28 points

13 comments7 min readLW link

[Question] Formalising continuous info cascades? [Info-cascade series]

Ben Pace and Bird Concept

13 Mar 2019 10:55 UTC

16 points

5 comments1 min readLW link

[Question] How large is the harm from info-cascades? [Info-cascade series]

Bird Concept and Ben Pace

13 Mar 2019 10:55 UTC

22 points

2 comments1 min readLW link

[Question] How can we respond to info-cascades? [Info-cascade series]

Bird Concept and Ben Pace

13 Mar 2019 10:55 UTC

14 points

12 comments1 min readLW link