Solving the AI Race Finalists

Gordon Seidoh Worley, 19 Jul 2018 21:04 UTC

24 points

0 comments, 1 min read, LW link

(medium.com)

“Artificial Intelligence” (new entry at Stanford Encyclopedia of Philosophy)

fortyeridania, 19 Jul 2018 9:48 UTC

5 points

8 comments, 1 min read, LW link

(plato.stanford.edu)

Discussion: Raising the Sanity Waterline

Chriswaterguy, 19 Jul 2018 2:12 UTC

2 points

0 comments, 1 min read, LW link

LW Update 2018-07-18 – AlignmentForum Bug Fixes

Raemon, 19 Jul 2018 2:10 UTC

13 points

0 comments, 1 min read, LW link

Generalized Kelly betting

Linda Linsefors, 19 Jul 2018 1:38 UTC

15 points

5 comments, 2 min read, LW link

Mechanism Design for AI

Tobias_Baumann, 18 Jul 2018 16:47 UTC

5 points

3 comments, 1 min read, LW link

(s-risks.org)

A Step-by-step Guide to Finding a (Good!) Therapist

squidious, 18 Jul 2018 1:50 UTC

46 points

6 comments, 9 min read, LW link

(opalsandbonobos.blogspot.com)

Simple Metaphor About Compressed Sensing

ryan_b, 17 Jul 2018 15:47 UTC

6 points

0 comments, 1 min read, LW link

Figuring out what Alice wants, part II

Stuart_Armstrong, 17 Jul 2018 13:59 UTC

17 points

0 comments, 5 min read, LW link

Figuring out what Alice wants, part I

Stuart_Armstrong, 17 Jul 2018 13:59 UTC

15 points

8 comments, 3 min read, LW link

How To Use Bureaucracies

Samo Burja, 17 Jul 2018 8:10 UTC

64 points

37 comments, 9 min read, LW link

(medium.com)

September CFAR Workshop

CFAR Team, 17 Jul 2018 3:16 UTC

20 points

0 comments, 1 min read, LW link

(AI alignment) Now is special

Andrew Quinn, 17 Jul 2018 1:50 UTC

2 points

0 comments, 1 min read, LW link

Look Under the Light Post

Gordon Seidoh Worley, 16 Jul 2018 22:19 UTC

22 points

8 comments, 4 min read, LW link

Alignment Newsletter #15: 07/16/18

Rohin Shah, 16 Jul 2018 16:10 UTC

42 points

0 comments, 15 min read, LW link

(mailchi.mp)

Compact vs. Wide Models

Vaniver, 16 Jul 2018 4:09 UTC

31 points

5 comments, 3 min read, LW link

Probabilistic decision-making as an anxiety-reduction technique

RationallyDense, 16 Jul 2018 3:51 UTC

8 points

4 comments, 1 min read, LW link

Buridan’s ass in coordination games

jessicata, 16 Jul 2018 2:51 UTC

53 points

26 comments, 10 min read, LW link

Research Debt

Elizabeth, 15 Jul 2018 19:36 UTC

28 points

2 comments, 1 min read, LW link

(distill.pub)

An optimistic explanation of the outrage epidemic

chaosmage, 15 Jul 2018 14:35 UTC

18 points

5 comments, 3 min read, LW link

Announcement: AI alignment prize round 3 winners and next round

cousin_it, 15 Jul 2018 7:40 UTC

93 points

7 comments, 1 min read, LW link

Meetup Cookbook

maia, 14 Jul 2018 22:26 UTC

75 points

7 comments, 1 min read, LW link

(tigrennatenn.neocities.org)

Expected Pain Parameters

Alicorn, 14 Jul 2018 19:30 UTC

87 points

12 comments, 2 min read, LW link

Boltzmann Brains and Within-model vs. Between-models Probability

Charlie Steiner, 14 Jul 2018 9:52 UTC

15 points

12 comments, 3 min read, LW link

[1607.08289] “Mammalian Value Systems” (as a starting point for human value system model created by IRL agent)

avturchin, 14 Jul 2018 9:46 UTC

9 points

9 comments, 1 min read, LW link

(arxiv.org)

Generating vs Recognizing

lifelonglearner, 14 Jul 2018 5:10 UTC

15 points

3 comments, 4 min read, LW link

LW Update 2018-7-14 – Styling Rework, CommentsItem, Performance

Raemon, 14 Jul 2018 1:13 UTC

30 points

0 comments, 1 min read, LW link

Secondary Stressors and Tactile Ambition

lionhearted (Sebastian Marshall), 13 Jul 2018 0:26 UTC

16 points

16 comments, 4 min read, LW link

A Sarno-Hanson Synthesis

moridinamael, 12 Jul 2018 16:13 UTC

52 points

15 comments, 4 min read, LW link

Probability is a model, frequency is an observation: Why both halfers and thirders are correct in the Sleeping Beauty problem.

Shmi, 12 Jul 2018 6:52 UTC

26 points

34 comments, 2 min read, LW link

What does the stock market tell us about AI timelines?

Tobias_Baumann, 12 Jul 2018 6:05 UTC

6 points

5 comments, 1 min read, LW link

(s-risks.org)

An Agent is a Worldline in Tegmark V

komponisto, 12 Jul 2018 5:12 UTC

24 points

12 comments, 2 min read, LW link

Washington, D.C.: What If

RobinZ, 12 Jul 2018 4:30 UTC

9 points

0 comments, 1 min read, LW link

Are pre-specified utility functions about the real world possible in principle?

mlogan, 11 Jul 2018 18:46 UTC

24 points

7 comments, 4 min read, LW link

Melatonin: Much More Than You Wanted To Know

Scott Alexander, 11 Jul 2018 17:40 UTC

128 points

17 comments, 15 min read, LW link

(slatestarcodex.com)

Monk Treehouse: some problems defining simulation

dranorter, 11 Jul 2018 7:35 UTC

6 points

1 comment, 5 min read, LW link

Mathematical Mindset

komponisto, 11 Jul 2018 3:03 UTC

56 points

5 comments, 2 min read, LW link

Decision-theoretic problems and Theories; An (Incomplete) comparative list

somervta, 11 Jul 2018 2:59 UTC

36 points

0 comments, 1 min read, LW link

(docs.google.com)

Agents That Learn From Human Behavior Can’t Learn Human Values That Humans Haven’t Learned Yet

steven0461, 11 Jul 2018 2:59 UTC

29 points

11 comments, 1 min read, LW link

On the Role of Counterfactuals in Learning

Max Kanwal, 11 Jul 2018 2:45 UTC

13 points

2 comments, 3 min read, LW link

Clarifying Consequentialists in the Solomonoff Prior

Vlad Mikulik, 11 Jul 2018 2:35 UTC

20 points

16 comments, 6 min read, LW link

Complete Class: Consequentialist Foundations

abramdemski, 11 Jul 2018 1:57 UTC

58 points

37 comments, 13 min read, LW link

Conditions under which misaligned subagents can (not) arise in classifiers

anon1, 11 Jul 2018 1:52 UTC

12 points

2 comments, 2 min read, LW link

No, I won’t go there, it feels like you’re trying to Pascal-mug me

Rupert, 11 Jul 2018 1:37 UTC

9 points

0 comments, 2 min read, LW link

Conceptual problems with utility functions

Dacyn, 11 Jul 2018 1:29 UTC

22 points

12 comments, 2 min read, LW link

Dependent Type Theory and Zero-Shot Reasoning

evhub, 11 Jul 2018 1:16 UTC

27 points

3 comments, 5 min read, LW link

A comment on the IDA-AlphaGoZero metaphor; capabilities versus alignment

AlexMennen, 11 Jul 2018 1:03 UTC

40 points

1 comment, 1 min read, LW link

Bounding Goodhart’s Law

eric_langlois, 11 Jul 2018 0:46 UTC

43 points

2 comments, 5 min read, LW link

Mechanistic Transparency for Machine Learning

DanielFilan, 11 Jul 2018 0:34 UTC

55 points

9 comments, 4 min read, LW link

An environment for studying counterfactuals

Nisan, 11 Jul 2018 0:14 UTC

15 points

6 comments, 3 min read, LW link