Bet or update: fixing the will-to-wager assumption

cousin_it 7 Jun 2017 15:03 UTC
62 points
61 comments, 1 min read

New circumstances, new values?

Stuart_Armstrong 6 Jun 2017 8:20 UTC
11 points
14 comments, 1 min read

New circumstances, new values?

Stuart_Armstrong 6 Jun 2017 8:18 UTC
0 points
0 comments, 1 min read

Becoming a Better Community

Sable 6 Jun 2017 7:11 UTC
11 points
16 comments, 5 min read

Argument From Infinity

DragonGod 5 Jun 2017 21:33 UTC
0 points
19 comments, 3 min read

Mode Collapse and the Norm One Principle

tristanm 5 Jun 2017 21:30 UTC
28 points
13 comments, 11 min read

The Simple World Hypothesis

DragonGod 5 Jun 2017 19:34 UTC
4 points
15 comments, 8 min read

Cognitive Science/Psychology As a Neglected Approach to AI Safety

Kaj_Sotala 5 Jun 2017 13:55 UTC
8 points
5 comments, 1 min read
(effective-altruism.com)

Open thread, June 5 - June 11, 2017

Elo 5 Jun 2017 4:23 UTC
2 points
97 comments, 1 min read

Birth of a Stereotype

DragonGod 5 Jun 2017 3:29 UTC
0 points
13 comments, 6 min read

A Comment on Expected Utility Theory

DragonGod 5 Jun 2017 3:26 UTC
0 points
5 comments, 4 min read

Rationality as A Value Decider

DragonGod 5 Jun 2017 3:21 UTC
1 point
0 comments, 8 min read

Book Review: Weapons of Math Destruction

Zvi 4 Jun 2017 21:20 UTC
1 point
0 comments, 16 min read

Rationalist Seder: Dayenu, Lo Dayenu

Raemon 4 Jun 2017 20:55 UTC
7 points
2 comments, 3 min read

The Personal Growth Cycle

Gordon Seidoh Worley 4 Jun 2017 17:20 UTC
8 points
4 comments, 5 min read
(mapandterritory.org)

A new, better way to read the Sequences

Said Achmiz 4 Jun 2017 5:10 UTC
19 points
13 comments, 1 min read

Rationalist Seder: A Story of War

Raemon 3 Jun 2017 20:17 UTC
12 points
14 comments, 2 min read

Cooperative Oracles: Nonexploited Bargaining

Scott Garrabrant 3 Jun 2017 0:39 UTC
6 points
6 comments, 3 min read

Cooperative Oracles: Stratified Pareto Optima and Almost Stratified Pareto Optima

Scott Garrabrant 3 Jun 2017 0:38 UTC
5 points
8 comments, 4 min read

Cooperative Oracles: Introduction

Scott Garrabrant 3 Jun 2017 0:36 UTC
12 points
3 comments, 2 min read

Entangled Equilibria and the Twin Prisoners’ Dilemma

Scott Garrabrant 2 Jun 2017 22:09 UTC
5 points
2 comments, 3 min read

An algorithm with preferences: from zero to one variable

Stuart_Armstrong 2 Jun 2017 16:35 UTC
4 points
0 comments, 1 min read

Reward/value learning for reinforcement learning

Stuart_Armstrong 2 Jun 2017 16:34 UTC
0 points
2 comments, 2 min read

The best value indifference method (so far)

Stuart_Armstrong 2 Jun 2017 16:33 UTC
0 points
9 comments, 5 min read

How to judge moral learning failure

Stuart_Armstrong 2 Jun 2017 16:32 UTC
0 points
2 comments, 2 min read

Counterfactuals on POMDP

Stuart_Armstrong 2 Jun 2017 16:30 UTC
2 points
0 comments, 2 min read

Uninfluenceable learning agents

Stuart_Armstrong 2 Jun 2017 16:30 UTC
3 points
7 comments, 1 min read

Ontology, lost purposes, and instrumental goals

Stuart_Armstrong 2 Jun 2017 16:28 UTC
0 points
1 comment, 1 min read

Corrigibility thoughts I: caring about multiple things

Stuart_Armstrong 2 Jun 2017 16:27 UTC
2 points
0 comments, 3 min read

Corrigibility thoughts II: the robot operator

Stuart_Armstrong 2 Jun 2017 16:27 UTC
0 points
12 comments, 2 min read

Corrigibility thoughts III: manipulating versus deceiving

Stuart_Armstrong 2 Jun 2017 16:27 UTC
0 points
0 comments, 1 min read

The radioactive burrito and learning from positive examples

Stuart_Armstrong 2 Jun 2017 16:25 UTC
0 points
2 comments, 1 min read

Thoughts on Quantilizers

Stuart_Armstrong 2 Jun 2017 16:24 UTC
2 points
0 comments, 2 min read

Emergency learning

Stuart_Armstrong 2 Jun 2017 16:23 UTC
1 point
0 comments, 4 min read

Humans as a truth channel

Stuart_Armstrong 2 Jun 2017 16:22 UTC
1 point
0 comments, 2 min read

All the indifference designs

Stuart_Armstrong 2 Jun 2017 16:20 UTC
2 points
1 comment, 4 min read

Indifference and compensatory rewards

Stuart_Armstrong 2 Jun 2017 16:19 UTC
0 points
0 comments, 1 min read

Counterfactually uninfluenceable agents

Stuart_Armstrong 2 Jun 2017 16:17 UTC
11 points
0 comments, 2 min read

Translation “counterfactual”

Stuart_Armstrong 2 Jun 2017 16:16 UTC
0 points
0 comments, 2 min read

Understanding the important facts

Stuart_Armstrong 2 Jun 2017 16:15 UTC
0 points
0 comments, 1 min read

Low impact versus low side effects

Stuart_Armstrong 2 Jun 2017 16:14 UTC
1 point
0 comments, 2 min read

Agents that don’t become maximisers

Stuart_Armstrong 2 Jun 2017 16:13 UTC
0 points
0 comments, 3 min read

AI safety: three human problems and one AI issue

Stuart_Armstrong 2 Jun 2017 16:12 UTC
2 points
4 comments, 3 min read

Optimisation in manipulating humans: engineered fanatics vs yes-men

Stuart_Armstrong 2 Jun 2017 15:51 UTC
0 points
0 comments, 2 min read

Divergent preferences and meta-preferences

Stuart_Armstrong 2 Jun 2017 15:51 UTC
9 points
0 comments, 3 min read

Acausal trade: double decrease

Stuart_Armstrong 2 Jun 2017 15:33 UTC
10 points
3 comments, 2 min read

Acausal trade: different utilities, different trades

Stuart_Armstrong 2 Jun 2017 15:33 UTC
1 point
1 comment, 3 min read

Acausal trade: universal utility, or selling non-existence insurance too late

Stuart_Armstrong 2 Jun 2017 15:33 UTC
1 point
1 comment, 3 min read

Acausal trade: trade barriers

Stuart_Armstrong 2 Jun 2017 15:32 UTC
0 points
1 comment, 2 min read

Futarchy, Xrisks, and near misses

Stuart_Armstrong 2 Jun 2017 8:02 UTC
1 point
0 comments, 1 min read