Bet or update: fixing the will-to-wager assumption

cousin_it 7 Jun 2017 15:03 UTC

62 points

61 comments 1 min read LW link

New circumstances, new values?

Stuart_Armstrong 6 Jun 2017 8:20 UTC

11 points

14 comments 1 min read LW link

New circumstances, new values?

Stuart_Armstrong 6 Jun 2017 8:18 UTC

0 points

0 comments 1 min read LW link

Becoming a Better Community

Sable 6 Jun 2017 7:11 UTC

11 points

16 comments 5 min read LW link

Argument From Infinity

DragonGod 5 Jun 2017 21:33 UTC

0 points

19 comments 3 min read LW link

Mode Collapse and the Norm One Principle

tristanm 5 Jun 2017 21:30 UTC

28 points

13 comments 11 min read LW link

The Simple World Hypothesis

DragonGod 5 Jun 2017 19:34 UTC

4 points

15 comments 8 min read LW link

Cognitive Science/Psychology As a Neglected Approach to AI Safety

Kaj_Sotala 5 Jun 2017 13:55 UTC

8 points

5 comments 1 min read LW link

(effective-altruism.com)

Open thread, June 5 - June 11, 2017

Elo 5 Jun 2017 4:23 UTC

2 points

97 comments 1 min read LW link

Birth of a Stereotype

DragonGod 5 Jun 2017 3:29 UTC

0 points

13 comments 6 min read LW link

A Comment on Expected Utility Theory

DragonGod 5 Jun 2017 3:26 UTC

0 points

5 comments 4 min read LW link

Rationality as A Value Decider

DragonGod 5 Jun 2017 3:21 UTC

1 point

0 comments 8 min read LW link

Book Review: Weapons of Math Destruction

Zvi 4 Jun 2017 21:20 UTC

1 point

0 comments 16 min read LW link

Rationalist Seder: Dayenu, Lo Dayenu

Raemon 4 Jun 2017 20:55 UTC

7 points

2 comments 3 min read LW link

The Personal Growth Cycle

Gordon Seidoh Worley 4 Jun 2017 17:20 UTC

8 points

4 comments 5 min read LW link

(mapandterritory.org)

A new, better way to read the Sequences

Said Achmiz 4 Jun 2017 5:10 UTC

20 points

13 comments 1 min read LW link

Rationalist Seder: A Story of War

Raemon 3 Jun 2017 20:17 UTC

13 points

14 comments 2 min read LW link

Cooperative Oracles: Nonexploited Bargaining

Scott Garrabrant 3 Jun 2017 0:39 UTC

6 points

6 comments 3 min read LW link

Cooperative Oracles: Stratified Pareto Optima and Almost Stratified Pareto Optima

Scott Garrabrant 3 Jun 2017 0:38 UTC

5 points

8 comments 4 min read LW link

Cooperative Oracles: Introduction

Scott Garrabrant 3 Jun 2017 0:36 UTC

18 points

3 comments 2 min read LW link

Entangled Equilibria and the Twin Prisoners’ Dilemma

Scott Garrabrant 2 Jun 2017 22:09 UTC

5 points

2 comments 3 min read LW link

An algorithm with preferences: from zero to one variable

Stuart_Armstrong 2 Jun 2017 16:35 UTC

4 points

0 comments 1 min read LW link

Reward/value learning for reinforcement learning

Stuart_Armstrong 2 Jun 2017 16:34 UTC

0 points

2 comments 2 min read LW link

The best value indifference method (so far)

Stuart_Armstrong 2 Jun 2017 16:33 UTC

0 points

9 comments 5 min read LW link

How to judge moral learning failure

Stuart_Armstrong 2 Jun 2017 16:32 UTC

0 points

2 comments 2 min read LW link

Counterfactuals on POMDP

Stuart_Armstrong 2 Jun 2017 16:30 UTC

2 points

0 comments 2 min read LW link

Uninfluenceable learning agents

Stuart_Armstrong 2 Jun 2017 16:30 UTC

3 points

7 comments 1 min read LW link

Ontology, lost purposes, and instrumental goals

Stuart_Armstrong 2 Jun 2017 16:28 UTC

0 points

1 comment 1 min read LW link

Corrigibility thoughts I: caring about multiple things

Stuart_Armstrong 2 Jun 2017 16:27 UTC

2 points

0 comments 3 min read LW link

Corrigibility thoughts II: the robot operator

Stuart_Armstrong 2 Jun 2017 16:27 UTC

0 points

12 comments 2 min read LW link

Corrigibility thoughts III: manipulating versus deceiving

Stuart_Armstrong 2 Jun 2017 16:27 UTC

0 points

0 comments 1 min read LW link

The radioactive burrito and learning from positive examples

Stuart_Armstrong 2 Jun 2017 16:25 UTC

0 points

2 comments 1 min read LW link

Thoughts on Quantilizers

Stuart_Armstrong 2 Jun 2017 16:24 UTC

2 points

0 comments 2 min read LW link

Emergency learning

Stuart_Armstrong 2 Jun 2017 16:23 UTC

1 point

0 comments 4 min read LW link

Humans as a truth channel

Stuart_Armstrong 2 Jun 2017 16:22 UTC

1 point

0 comments 2 min read LW link

All the indifference designs

Stuart_Armstrong 2 Jun 2017 16:20 UTC

2 points

1 comment 4 min read LW link

Indifference and compensatory rewards

Stuart_Armstrong 2 Jun 2017 16:19 UTC

0 points

0 comments 1 min read LW link

Counterfactually uninfluenceable agents

Stuart_Armstrong 2 Jun 2017 16:17 UTC

11 points

0 comments 2 min read LW link

Translation “counterfactual”

Stuart_Armstrong 2 Jun 2017 16:16 UTC

0 points

0 comments 2 min read LW link

Understanding the important facts

Stuart_Armstrong 2 Jun 2017 16:15 UTC

0 points

0 comments 1 min read LW link

Low impact versus low side effects

Stuart_Armstrong 2 Jun 2017 16:14 UTC

1 point

0 comments 2 min read LW link

Agents that don’t become maximisers

Stuart_Armstrong 2 Jun 2017 16:13 UTC

0 points

0 comments 3 min read LW link

AI safety: three human problems and one AI issue

Stuart_Armstrong 2 Jun 2017 16:12 UTC

2 points

4 comments 3 min read LW link

Optimisation in manipulating humans: engineered fanatics vs yes-men

Stuart_Armstrong 2 Jun 2017 15:51 UTC

0 points

0 comments 2 min read LW link

Divergent preferences and meta-preferences

Stuart_Armstrong 2 Jun 2017 15:51 UTC

9 points

0 comments 3 min read LW link

Acausal trade: double decrease

Stuart_Armstrong 2 Jun 2017 15:33 UTC

10 points

3 comments 2 min read LW link

Acausal trade: different utilities, different trades

Stuart_Armstrong 2 Jun 2017 15:33 UTC

2 points

1 comment 3 min read LW link

Acausal trade: universal utility, or selling non-existence insurance too late

Stuart_Armstrong 2 Jun 2017 15:33 UTC

1 point

1 comment 3 min read LW link

Acausal trade: trade barriers

Stuart_Armstrong 2 Jun 2017 15:32 UTC

0 points

1 comment 2 min read LW link

Futarchy, Xrisks, and near misses

Stuart_Armstrong 2 Jun 2017 8:02 UTC

1 point

0 comments 1 min read LW link