Alignment Newsletter #34

Rohin Shah · 26 Nov 2018 23:10 UTC

24 points

0 comments · 10 min read · LW link

(mailchi.mp)

Boltzmann Brains, Simulations and self refuting hypothesis

Donald Hobson · 26 Nov 2018 19:09 UTC

1 point

9 comments · 1 min read · LW link

Quantum Mechanics, Nothing to do with Consciousness

Donald Hobson · 26 Nov 2018 18:59 UTC

5 points

27 comments · 3 min read · LW link

Status model

Bucky · 26 Nov 2018 15:05 UTC

26 points

7 comments · 3 min read · LW link

Humans Consulting HCH

paulfchristiano · 25 Nov 2018 23:18 UTC

38 points

9 comments · 1 min read · LW link

Approval-directed bootstrapping

paulfchristiano · 25 Nov 2018 23:18 UTC

24 points

0 comments · 1 min read · LW link

How rapidly are GPUs improving in price performance?

gallabytes · 25 Nov 2018 19:54 UTC

31 points

9 comments · 1 min read · LW link

(mediangroup.org)

Values Weren’t Complex, Once.

Davidmanheim · 25 Nov 2018 9:17 UTC

36 points

13 comments · 2 min read · LW link

A culture of exploitation?

Bae's Theorem · 24 Nov 2018 22:00 UTC

1 point

3 comments · 1 min read · LW link

Fixed Point Discussion

Scott Garrabrant · 24 Nov 2018 20:53 UTC

46 points

3 comments · 4 min read · LW link

Four factors that moderate the intensity of emotions

Ruby · 24 Nov 2018 20:40 UTC

65 points

11 comments · 8 min read · LW link

deluks917 on Online Weirdos

Jacob Falkovich · 24 Nov 2018 17:03 UTC

24 points

3 comments · 10 min read · LW link

[Montreal] Towards High-Assurance Advanced AI Systems by Richard Mallah

Mati_Roy · 24 Nov 2018 6:24 UTC

3 points

0 comments · 1 min read · LW link

Upcoming: Open Questions

Raemon · 24 Nov 2018 1:39 UTC

41 points

7 comments · 2 min read · LW link

A Dragon Confronts the Terasem Movement

Alephywr · 24 Nov 2018 1:31 UTC

−4 points

10 comments · 25 min read · LW link

(dancefighterredux.wordpress.com)

What if people simply forecasted your future choices?

ozziegooen · 23 Nov 2018 10:52 UTC

16 points

6 comments · 6 min read · LW link

Oversight of Unsafe Systems via Dynamic Safety Envelopes

Davidmanheim · 23 Nov 2018 8:37 UTC

10 points

2 comments · 2 min read · LW link

On MIRI’s new research directions

Rob Bensinger · 22 Nov 2018 23:42 UTC

53 points

12 comments · 1 min read · LW link

(intelligence.org)

LW Update 2018-11-22 – Abridged Comments

Raemon · 22 Nov 2018 22:11 UTC

11 points

16 comments · 1 min read · LW link

Approval-directed agents

paulfchristiano · 22 Nov 2018 21:15 UTC

31 points

10 comments · 15 min read · LW link

Believing others’ priors

rk · 22 Nov 2018 20:44 UTC

8 points

19 comments · 7 min read · LW link

Speculative Evopsych, Ep. 1

Optimization Process · 22 Nov 2018 19:00 UTC

41 points

10 comments · 1 min read · LW link

If You Want to Win, Stop Conceding

Davis_Kingsley · 22 Nov 2018 18:10 UTC

47 points

15 comments · 3 min read · LW link

Review: Artifact

Zvi · 22 Nov 2018 15:00 UTC

21 points

3 comments · 13 min read · LW link

(thezvi.wordpress.com)

Perspective Reasoning and the Sleeping Beauty Problem

dadadarren · 22 Nov 2018 11:55 UTC

6 points

10 comments · 2 min read · LW link

The Semantic Man

namespace · 22 Nov 2018 8:38 UTC

19 points

4 comments · 1 min read · LW link

(www.generalsemantics.org)

Jesus Made Me Rational (An Introduction)

Motasaurus · 22 Nov 2018 5:09 UTC

−14 points

56 comments · 3 min read · LW link

Iteration Fixed Point Exercises

Scott Garrabrant and SamEisenstat

22 Nov 2018 0:35 UTC

34 points

13 comments · 3 min read · LW link

Suggestion: New material shouldn’t be released too fast

Chris_Leong · 21 Nov 2018 16:39 UTC

23 points

7 comments · 1 min read · LW link

EA Bristol Strategy Meeting

thegreatnick · 21 Nov 2018 10:57 UTC

1 point

0 comments · 1 min read · LW link

Rationality Café No. 6 - The Sequences, Part 1; Section B Repeat

thegreatnick · 21 Nov 2018 10:54 UTC

8 points

2 comments · 1 min read · LW link

EA Funds: Long-Term Future fund is open to applications until November 24th (this Saturday)

habryka · 21 Nov 2018 3:39 UTC

37 points

0 comments · 1 min read · LW link

Incorrect hypotheses point to correct observations

Kaj_Sotala · 20 Nov 2018 21:10 UTC

183 points

40 comments · 4 min read · LW link

(kajsotala.fi)

Preschool: Much Less Than You Wanted To Know

Zvi · 20 Nov 2018 19:30 UTC

65 points

15 comments · 2 min read · LW link

(thezvi.wordpress.com)

New safety research agenda: scalable agent alignment via reward modeling

Vika · 20 Nov 2018 17:29 UTC

34 points

12 comments · 1 min read · LW link

(medium.com)

Prosaic AI alignment

paulfchristiano · 20 Nov 2018 13:56 UTC

49 points

10 comments · 8 min read · LW link

Moscow LW meetup in “Nauchka” library

Alexander230 · 20 Nov 2018 12:19 UTC

2 points

0 comments · 1 min read · LW link

[Insert clever intro here]

Bae's Theorem · 20 Nov 2018 3:26 UTC

18 points

13 comments · 1 min read · LW link

Alignment Newsletter #33

Rohin Shah · 19 Nov 2018 17:20 UTC

23 points

0 comments · 9 min read · LW link

(mailchi.mp)

Games in Kocherga club: Fallacymania, Tower of Chaos, Scientific Discovery

Alexander230 · 19 Nov 2018 14:23 UTC

2 points

0 comments · 1 min read · LW link

Letting Others Be Vulnerable

lifelonglearner · 19 Nov 2018 2:59 UTC

34 points

6 comments · 7 min read · LW link

Clickbait might not be destroying our general Intelligence

Donald Hobson · 19 Nov 2018 0:13 UTC

25 points

13 comments · 2 min read · LW link

South Bay Meetup 12/8

DavidFriedman · 19 Nov 2018 0:04 UTC

3 points

0 comments · 1 min read · LW link

[Link] “They go together: Freedom, Prosperity, and Big Government”

CronoDAS · 18 Nov 2018 16:51 UTC

11 points

3 comments · 1 min read · LW link

Collaboration-by-Design versus Emergent Collaboration

Davidmanheim · 18 Nov 2018 7:22 UTC

11 points

2 comments · 2 min read · LW link

Diagonalization Fixed Point Exercises

Scott Garrabrant and SamEisenstat

18 Nov 2018 0:31 UTC

46 points

26 comments · 3 min read · LW link

Ia! Ia! Extradimensional Cephalopod Nafl’fhtagn!

ExCeph · 17 Nov 2018 23:00 UTC

14 points

5 comments · 1 min read · LW link

Effective Altruism, YouTube, and AI (talk by Lê Nguyên Hoang)

Paperclip Minimizer · 17 Nov 2018 19:21 UTC

3 points

0 comments · 1 min read · LW link

(www.youtube.com)

An unaligned benchmark

paulfchristiano · 17 Nov 2018 15:51 UTC

37 points

0 comments · 9 min read · LW link

On Rigorous Error Handling

Martin Sustrik · 17 Nov 2018 9:20 UTC

13 points

4 comments · 6 min read · LW link

(250bpm.com)