All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Abstraction sacrifices causal clarity

Marv K31 Jul 2022 19:24 UTC

2 points

0 comments3 min readLW link

Time-logging programs and/or spreadsheets (2022)

mikbp31 Jul 2022 18:18 UTC

3 points

3 comments1 min readLW link

Conservatism is a rational response to epistemic uncertainty

contrarianbrit31 Jul 2022 18:04 UTC

2 points

11 comments9 min readLW link

(thomasprosser.substack.com)

South Bay ACX/LW Meetup

IS31 Jul 2022 15:30 UTC

2 points

0 comments1 min readLW link

Perverse Independence Incentives

jefftk31 Jul 2022 14:40 UTC

61 points

3 comments1 min readLW link

(www.jefftk.com)

Wolfram Research v Cook

Kenny31 Jul 2022 13:35 UTC

7 points

3 comments8 min readLW link

Wanted: Notation for credal resilience

peter_hartree31 Jul 2022 7:35 UTC

21 points

12 comments1 min readLW link

Anatomy of a Dating Document

squidious31 Jul 2022 2:40 UTC

31 points

24 comments4 min readLW link

(opalsandbonobos.blogspot.com)

chinchilla’s wild implications

nostalgebraist31 Jul 2022 1:18 UTC

425 points

129 comments10 min readLW link 1 review

AGI-level reasoner will appear sooner than an agent; what the humanity will do with this reasoner is critical

Roman Leventov30 Jul 2022 20:56 UTC

24 points

10 comments1 min readLW link

[Question] What job should I do?

Tom Paine30 Jul 2022 9:15 UTC

2 points

8 comments1 min readLW link

How transparency changed over time

ViktoriaMalyasova30 Jul 2022 4:36 UTC

21 points

0 comments6 min readLW link

Translating between Latent Spaces

JamesH, Jeremy Gillen and NickyP

30 Jul 2022 3:25 UTC

27 points

2 comments8 min readLW link

Drexler’s Nanotech Forecast

PeterMcCluskey30 Jul 2022 0:45 UTC

25 points

28 comments3 min readLW link

(www.bayesianinvestor.com)

Humans Reflecting on HRH

leogao29 Jul 2022 21:56 UTC

27 points

4 comments2 min readLW link

Comparing Four Approaches to Inner Alignment

Lucas Teixeira29 Jul 2022 21:06 UTC

38 points

1 comment9 min readLW link

Questions for a Theory of Narratives

Marv K29 Jul 2022 19:31 UTC

5 points

4 comments4 min readLW link

Focusing

CFAR!Duncan29 Jul 2022 19:15 UTC

132 points

25 comments14 min readLW link

Conjecture: Internal Infohazard Policy

Connor Leahy, Sid Black, Chris Scammell and Andrea_Miotti

29 Jul 2022 19:07 UTC

130 points

6 comments19 min readLW link

Abstracting The Hardness of Alignment: Unbounded Atomic Optimization

adamShimi29 Jul 2022 18:59 UTC

75 points

3 comments16 min readLW link

Bucket Errors

CFAR!Duncan29 Jul 2022 18:50 UTC

46 points

8 comments11 min readLW link

Distillation Contest—Results and Recap

Aris29 Jul 2022 17:40 UTC

34 points

0 comments7 min readLW link

The generalized Sierpinski-Mazurkiewicz theorem.

Donald Hobson29 Jul 2022 0:12 UTC

11 points

4 comments1 min readLW link

The Conversations We Make Space For

Severin T. Seehrich28 Jul 2022 21:37 UTC

21 points

0 comments3 min readLW link

Defining Optimization in a Deeper Way Part 4

J Bostock28 Jul 2022 17:02 UTC

7 points

0 comments5 min readLW link

Covid 7/28/22: Ruining It For Everyone

Zvi28 Jul 2022 15:10 UTC

27 points

7 comments12 min readLW link

(thezvi.wordpress.com)

Monkeypox Post #2

Zvi28 Jul 2022 13:20 UTC

36 points

3 comments6 min readLW link

(thezvi.wordpress.com)

For Better Commenting, Stop Out Loud

DirectedEvolution28 Jul 2022 1:39 UTC

18 points

30 comments1 min readLW link

Seeking beta readers who are ignorant of biology but knowledgeable about AI safety

Holly_Elmore27 Jul 2022 23:02 UTC

11 points

6 comments1 min readLW link

Principles of Privacy for Alignment Research

johnswentworth27 Jul 2022 19:53 UTC

74 points

31 comments7 min readLW link

Moral strategies at different capability levels

Richard_Ngo27 Jul 2022 18:50 UTC

131 points

15 comments5 min readLW link

(thinkingcomplete.blogspot.com)

Progress links and tweets, 2022-07-27

jasoncrawford27 Jul 2022 17:20 UTC

18 points

0 comments1 min readLW link

(rootsofprogress.org)

Quantum Advantage in Learning from Experiments

Dennis Towne27 Jul 2022 15:49 UTC

5 points

5 comments1 min readLW link

(ai.googleblog.com)

Levels of Pluralism

adamShimi27 Jul 2022 9:35 UTC

37 points

0 comments14 min readLW link

Human trials for the Marburg vaccine: funding opportunity?

americanwalrus27 Jul 2022 5:53 UTC

3 points

0 comments1 min readLW link

(www.independent.co.uk)

[Question] “Fanatical” Longtermists: Why is Pascal’s Wager wrong?

Yitz27 Jul 2022 4:16 UTC

3 points

7 comments1 min readLW link

Unifying Bargaining Notions (2/2)

Diffractor27 Jul 2022 3:40 UTC

123 points

20 comments21 min readLW link

AGI ruin scenarios are likely (and disjunctive)

So8res27 Jul 2022 3:21 UTC

179 points

38 comments6 min readLW link

Technocracy and the Space Age

jasoncrawford26 Jul 2022 23:14 UTC

25 points

5 comments2 min readLW link

(rootsofprogress.org)

«Boundaries», Part 1: a key missing concept from utility theory

Andrew_Critch26 Jul 2022 23:03 UTC

161 points

33 comments7 min readLW link

Incoherence of unbounded selfishness

emmab26 Jul 2022 22:27 UTC

−6 points

2 comments1 min readLW link

«Boundaries» Sequence (Index Post)

Andrew_Critch26 Jul 2022 19:12 UTC

25 points

1 comment1 min readLW link

Active Inference as a formalisation of instrumental convergence

Roman Leventov26 Jul 2022 17:55 UTC

12 points

2 comments3 min readLW link

(direct.mit.edu)

NeurIPS ML Safety Workshop 2022

Dan H26 Jul 2022 15:28 UTC

72 points

2 comments1 min readLW link

(neurips2022.mlsafety.org)

AI ethics vs AI alignment

Wei Dai26 Jul 2022 13:08 UTC

8 points

1 comment1 min readLW link

Utility functions and probabilities are entangled

Thomas Kwa26 Jul 2022 5:36 UTC

15 points

5 comments1 min readLW link

How Promising is Theoretical Research on Rationality? Seeking Career Advice

Aspirant22326 Jul 2022 1:08 UTC

3 points

3 comments3 min readLW link

Prediction markets meetup/coworking (hosted by Manifold Markets)

Sinclair Chen and Austin Chen

26 Jul 2022 0:14 UTC

2 points

0 comments1 min readLW link

Alignment being impossible might be better than it being really difficult

Martín Soto25 Jul 2022 23:57 UTC

13 points

2 comments2 min readLW link

[Question] How optimistic should we be about AI figuring out how to interpret itself?

oh5432125 Jul 2022 22:09 UTC

3 points

1 comment1 min readLW link