All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 201720182019 2020 2021 2022 2023 2024 2025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

How safe “safe” AI development?

Gordon Seidoh Worley28 Feb 2018 23:21 UTC

9 points

1 comment1 min readLW link

Beyond algorithmic equivalence: self-modelling

Stuart_Armstrong28 Feb 2018 16:55 UTC

10 points

3 comments1 min readLW link

Beyond algorithmic equivalence: algorithmic noise

Stuart_Armstrong28 Feb 2018 16:55 UTC

10 points

4 comments2 min readLW link

Using the universal prior for logical uncertainty (retracted)

cousin_it28 Feb 2018 13:07 UTC

15 points

13 comments2 min readLW link

2/27/08 Update – Frontpage 3.0

Raemon28 Feb 2018 6:26 UTC

15 points

21 comments1 min readLW link

TDT for Humans

alkjash28 Feb 2018 5:40 UTC

27 points

7 comments5 min readLW link

(radimentary.wordpress.com)

Set Up for Success: Insights from ‘Naïve Set Theory’

TurnTrout28 Feb 2018 2:01 UTC

32 points

40 comments3 min readLW link

Intuition should be applied at the lowest possible level

Rafael Harth27 Feb 2018 22:58 UTC

10 points

9 comments1 min readLW link

The sad state of Rationality Zürich—Effective Altruism Zürich included

roland27 Feb 2018 14:51 UTC

−8 points

50 comments3 min readLW link

The worst trolley problem in the world

CronoDAS27 Feb 2018 3:56 UTC

1 point

1 comment1 min readLW link

Categories of Sacredness

Zvi27 Feb 2018 2:00 UTC

27 points

35 comments8 min readLW link

(thezvi.wordpress.com)

More on the Linear Utility Hypothesis and the Leverage Prior

AlexMennen26 Feb 2018 23:53 UTC

16 points

4 comments9 min readLW link

Goal Factoring

alkjash26 Feb 2018 23:30 UTC

27 points

4 comments2 min readLW link

(radimentary.wordpress.com)

Inconvenience Is Qualitatively Bad

Alicorn26 Feb 2018 23:27 UTC

85 points

52 comments2 min readLW link

The Hamming Problem of Group Rationality

PDV26 Feb 2018 18:59 UTC

6 points

36 comments1 min readLW link

Focusing

alkjash26 Feb 2018 6:10 UTC

20 points

22 comments3 min readLW link

(radimentary.wordpress.com)

Mapping the Archipelago

alkjash26 Feb 2018 5:09 UTC

14 points

24 comments1 min readLW link

Experimental Open Threads

Chris_Leong26 Feb 2018 3:13 UTC

22 points

5 comments1 min readLW link

Walkthrough of ‘Formalizing Convergent Instrumental Goals’

TurnTrout26 Feb 2018 2:20 UTC

13 points

2 comments10 min readLW link

Will AI See Sudden Progress?

KatjaGrace26 Feb 2018 0:41 UTC

27 points

11 comments1 min readLW link 1 review

Self-regulation of safety in AI research

Gordon Seidoh Worley25 Feb 2018 23:17 UTC

12 points

6 comments2 min readLW link

The abruptness of nuclear weapons

paulfchristiano25 Feb 2018 17:40 UTC

47 points

35 comments2 min readLW link

Likelihood of discontinuous progress around the development of AGI

vedevazz25 Feb 2018 15:13 UTC

4 points

2 comments1 min readLW link

(aiimpacts.org)

Open-Source Monasticism

Nathan Rosquist25 Feb 2018 13:52 UTC

26 points

7 comments4 min readLW link

Passing Troll Bridge

Diffractor25 Feb 2018 8:21 UTC

11 points

2 comments10 min readLW link

Three Miniatures

alkjash25 Feb 2018 5:40 UTC

23 points

13 comments3 min readLW link

(radimentary.wordpress.com)

Arguments about fast takeoff

paulfchristiano25 Feb 2018 4:53 UTC

103 points

68 comments2 min readLW link 1 review

(sideways-view.com)

Meta-tations on Moderation: Towards Public Archipelago

Raemon25 Feb 2018 3:59 UTC

81 points

176 comments23 min readLW link

Lessons from the Cold War on Information Hazards: Why Internal Communication is Critical

Gentzel24 Feb 2018 23:34 UTC

47 points

10 comments4 min readLW link

What we talk about when we talk about maximising utility

Richard_Ngo24 Feb 2018 22:33 UTC

14 points

18 comments4 min readLW link

Links with underscores

ShardPhoenix24 Feb 2018 11:32 UTC

2 points

3 comments1 min readLW link

A useful level distinction

Charlie Steiner24 Feb 2018 6:39 UTC

8 points

4 comments2 min readLW link

CoZE 2

alkjash24 Feb 2018 5:40 UTC

16 points

7 comments2 min readLW link

(radimentary.wordpress.com)

On Building Theories of History

Samo Burja23 Feb 2018 23:40 UTC

30 points

20 comments5 min readLW link

Mythic Mode

Valentine23 Feb 2018 22:45 UTC

71 points

82 comments9 min readLW link

The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

Gordon Seidoh Worley23 Feb 2018 21:42 UTC

5 points

8 comments1 min readLW link

(arxiv.org)

Two types of mathematician

drossbucket23 Feb 2018 19:26 UTC

64 points

41 comments4 min readLW link

June 2012: 0/33 Turing Award winners predict computers beating humans at go within next 10 years.

betterthanwell23 Feb 2018 11:25 UTC

18 points

13 comments2 min readLW link

Design 2

alkjash23 Feb 2018 6:20 UTC

19 points

18 comments3 min readLW link

(radimentary.wordpress.com)

AI Alignment and Phenomenal Consciousness

Gordon Seidoh Worley23 Feb 2018 1:21 UTC

9 points

0 comments6 min readLW link

(mapandterritory.org)

Explanation vs Rationalization

abramdemski22 Feb 2018 23:46 UTC

16 points

11 comments4 min readLW link

The map has gears. They don’t always turn.

abramdemski22 Feb 2018 20:16 UTC

24 points

0 comments1 min readLW link

The Intelligent Social Web

Valentine22 Feb 2018 18:55 UTC

250 points

113 comments12 min readLW link 2 reviews

The Three Stages Of Model Development

katerinjo22 Feb 2018 14:33 UTC

17 points

7 comments2 min readLW link

Pain, fear, sex, and higher order preferences

Stuart_Armstrong22 Feb 2018 11:30 UTC

5 points

3 comments1 min readLW link

TAPs 2

alkjash22 Feb 2018 5:10 UTC

25 points

6 comments3 min readLW link

(radimentary.wordpress.com)

Robustness to Scale

Scott Garrabrant21 Feb 2018 22:55 UTC

144 points

23 comments2 min readLW link 1 review

Don’t Condition on no Catastrophes

Scott Garrabrant21 Feb 2018 21:50 UTC

37 points

7 comments2 min readLW link

The Logic of Science: 2.2

mpr21 Feb 2018 17:28 UTC

9 points

3 comments1 min readLW link

(pulsarcoffee.com)

Yoda Timers 2

alkjash21 Feb 2018 7:40 UTC

29 points

27 comments3 min readLW link

(radimentary.wordpress.com)