Verification and Transparency

DanielFilan · Aug 8, 2019, 1:50 AM
35 points
6 comments · 2 min read · LW link
(danielfilan.com)

AI Alignment Open Thread August 2019

habryka · Aug 4, 2019, 10:09 PM
35 points
96 comments · 1 min read · LW link

AI Forecasting Resolution Council (Forecasting infrastructure, part 2)

Aug 29, 2019, 5:35 PM
35 points
2 comments · 3 min read · LW link

[Question] What authors consistently give accurate pictures of complex topics they discuss?

seez · Aug 21, 2019, 12:09 AM
34 points
3 comments · 1 min read · LW link

[Site Update] Weekly/Monthly/Yearly on All Posts

Raemon · Aug 2, 2019, 12:39 AM
33 points
7 comments · 1 min read · LW link

“Can We Survive Technology” by von Neumann

Ben Pace · Aug 18, 2019, 6:58 PM
33 points
2 comments · 1 min read · LW link
(geosci.uchicago.edu)

[Question] What experiments would demonstrate “upper limits of augmented working memory?”

Raemon · Aug 15, 2019, 10:09 PM
33 points
6 comments · 2 min read · LW link

AI Alignment Writing Day Roundup #1

Ben Pace · Aug 30, 2019, 1:26 AM
32 points
12 comments · 1 min read · LW link

Calibrating With Cards

lifelonglearner · Aug 8, 2019, 6:44 AM
32 points
3 comments · 3 min read · LW link

Distance Functions are Hard

Grue_Slinky · Aug 13, 2019, 5:33 PM
31 points
19 comments · 6 min read · LW link

Don’t Pull a Broken Chain

johnswentworth · Aug 28, 2019, 1:21 AM
31 points
6 comments · 5 min read · LW link

[Question] What explanatory power does Kahneman’s System 2 possess?

Richard_Ngo · Aug 12, 2019, 3:23 PM
31 points
2 comments · 1 min read · LW link

When do utility functions constrain?

Hoagy · Aug 23, 2019, 5:19 PM
30 points
8 comments · 7 min read · LW link

Self-Supervised Learning and AGI Safety

Steven Byrnes · Aug 7, 2019, 2:21 PM
30 points
9 comments · 12 min read · LW link

Help forecast study replication in this social science prediction market

rosiecam · Aug 7, 2019, 6:18 PM
29 points
3 comments · 1 min read · LW link

A Survey of Early Impact Measures

Matthew Barnett · Aug 6, 2019, 1:22 AM
29 points
0 comments · 8 min read · LW link

[Question] Could we solve this email mess if we all moved to paid emails?

Bird Concept · Aug 11, 2019, 4:31 PM
29 points
50 comments · 4 min read · LW link

Inspection Paradox as a Driver of Group Separation

Shmi · Aug 17, 2019, 9:47 PM
29 points
0 comments · 1 min read · LW link

[Question] What are the reasons to *not* consider reducing AI-Xrisk the highest priority cause?

David Scott Krueger (formerly: capybaralet) · Aug 20, 2019, 9:45 PM
29 points
27 comments · 1 min read · LW link

Predicted AI alignment event/meeting calendar

rmoehn · Aug 14, 2019, 7:14 AM
29 points
14 comments · 1 min read · LW link

Announcement: Writing Day Today (Thursday)

Ben Pace · Aug 22, 2019, 4:48 AM
29 points
5 comments · 1 min read · LW link

GPT-2: 6-Month Follow-Up

lifelonglearner · Aug 21, 2019, 5:06 AM
28 points
1 comment · 1 min read · LW link

“Designing agent incentives to avoid reward tampering”, DeepMind

gwern · Aug 14, 2019, 4:57 PM
28 points
15 comments · LW link
(medium.com)

[AN #62] Are adversarial examples caused by real but imperceptible features?

Rohin Shah · Aug 22, 2019, 5:10 PM
28 points
10 comments · 9 min read · LW link
(mailchi.mp)

Algorithmic Similarity

LukasM · Aug 23, 2019, 4:39 PM
28 points
10 comments · 11 min read · LW link

[Question] What is the state of the ego depletion field?

Eli Tyre · Aug 9, 2019, 8:30 PM
27 points
10 comments · 1 min read · LW link

[Question] Why are the people who could be doing safety research, but aren’t, doing something else?

Adam Scholl · Aug 29, 2019, 8:51 AM
27 points
19 comments · 1 min read · LW link

Raph Koster on Virtual Worlds vs Games (notes)

Raemon · Aug 18, 2019, 7:01 PM
26 points
8 comments · 2 min read · LW link

Reversible changes: consider a bucket of water

Stuart_Armstrong · Aug 26, 2019, 10:55 PM
25 points
18 comments · 2 min read · LW link

Project Proposal: Considerations for trading off capabilities and safety impacts of AI research

David Scott Krueger (formerly: capybaralet) · Aug 6, 2019, 10:22 PM
25 points
11 comments · 2 min read · LW link

Inversion of theorems into definitions when generalizing

riceissa · Aug 4, 2019, 5:44 PM
25 points
3 comments · 5 min read · LW link

Goodhart’s Curse and Limitations on AI Alignment

Gordon Seidoh Worley · Aug 19, 2019, 7:57 AM
25 points
18 comments · 10 min read · LW link

Why Gradients Vanish and Explode

Matthew Barnett · Aug 9, 2019, 2:54 AM
25 points
9 comments · 3 min read · LW link

Which of these five AI alignment research project ideas are no good?

rmoehn · Aug 8, 2019, 7:17 AM
25 points
13 comments · 1 min read · LW link

[Question] Why do humans not have built-in neural i/o channels?

Richard_Ngo · Aug 8, 2019, 1:09 PM
25 points
23 comments · 1 min read · LW link

Negative “eeny meeny miny moe”

jefftk · Aug 20, 2019, 2:48 AM
25 points
6 comments · 1 min read · LW link

A Primer on Matrix Calculus, Part 1: Basic review

Matthew Barnett · Aug 12, 2019, 11:44 PM
25 points
4 comments · 7 min read · LW link

Emotions are not beliefs

Chris_Leong · Aug 7, 2019, 6:27 AM
25 points
2 comments · 2 min read · LW link

Implications of Quantum Computing for Artificial Intelligence Alignment Research

Aug 22, 2019, 10:33 AM
24 points
3 comments · 13 min read · LW link

Understanding understanding

mthq · Aug 23, 2019, 6:10 PM
24 points
1 comment · 2 min read · LW link

July 2019 gwern.net newsletter

gwern · Aug 1, 2019, 4:19 PM
23 points
0 comments · LW link
(www.gwern.net)

[Site Update] Behind the scenes data-layer and caching improvements

habryka · Aug 7, 2019, 12:49 AM
23 points
3 comments · 1 min read · LW link

Cartographic Processes

johnswentworth · Aug 27, 2019, 8:02 PM
23 points
3 comments · 4 min read · LW link

[Question] Do you do weekly or daily reviews? What are they like?

benwr · Aug 5, 2019, 1:23 AM
23 points
8 comments · 1 min read · LW link

Practical consequences of impossibility of value learning

Stuart_Armstrong · 2 Aug 2019 23:06 UTC
23 points
13 comments · 3 min read · LW link

A Primer on Matrix Calculus, Part 2: Jacobians and other fun

Matthew Barnett · 15 Aug 2019 1:13 UTC
22 points
7 comments · 7 min read · LW link

In defense of Oracle (“Tool”) AI research

Steven Byrnes · 7 Aug 2019 19:14 UTC
22 points
11 comments · 4 min read · LW link

Four Ways An Impact Measure Could Help Alignment

Matthew Barnett · 8 Aug 2019 0:10 UTC
21 points
1 comment · 9 min read · LW link

[Question] Is LW making progress?

zulupineapple · 24 Aug 2019 0:32 UTC
21 points
11 comments · 1 min read · LW link

Problems with AI debate

Stuart_Armstrong · 26 Aug 2019 19:21 UTC
21 points
3 comments · 5 min read · LW link