Seeking beta readers who are ignorant of biology but knowledgeable about AI safety

Holly_Elmore 27 Jul 2022 23:02 UTC

11 points

6 comments 1 min read LW link

Principles of Privacy for Alignment Research

johnswentworth 27 Jul 2022 19:53 UTC

74 points

31 comments 7 min read LW link

Moral strategies at different capability levels

Richard_Ngo 27 Jul 2022 18:50 UTC

131 points

15 comments 5 min read LW link

(thinkingcomplete.blogspot.com)

Progress links and tweets, 2022-07-27

jasoncrawford 27 Jul 2022 17:20 UTC

18 points

0 comments 1 min read LW link

(rootsofprogress.org)

Quantum Advantage in Learning from Experiments

Dennis Towne 27 Jul 2022 15:49 UTC

5 points

5 comments 1 min read LW link

(ai.googleblog.com)

Levels of Pluralism

adamShimi 27 Jul 2022 9:35 UTC

37 points

0 comments 14 min read LW link

Human trials for the Marburg vaccine: funding opportunity?

americanwalrus 27 Jul 2022 5:53 UTC

3 points

0 comments 1 min read LW link

(www.independent.co.uk)

[Question] “Fanatical” Longtermists: Why is Pascal’s Wager wrong?

Yitz 27 Jul 2022 4:16 UTC

3 points

7 comments 1 min read LW link

Unifying Bargaining Notions (2/2)

Diffractor 27 Jul 2022 3:40 UTC

123 points

20 comments 21 min read LW link

AGI ruin scenarios are likely (and disjunctive)

So8res 27 Jul 2022 3:21 UTC

179 points

38 comments 6 min read LW link

Technocracy and the Space Age

jasoncrawford 26 Jul 2022 23:14 UTC

25 points

5 comments 2 min read LW link

(rootsofprogress.org)

«Boundaries», Part 1: a key missing concept from utility theory

Andrew_Critch 26 Jul 2022 23:03 UTC

161 points

33 comments 7 min read LW link

Incoherence of unbounded selfishness

emmab 26 Jul 2022 22:27 UTC

−6 points

2 comments 1 min read LW link

«Boundaries» Sequence (Index Post)

Andrew_Critch 26 Jul 2022 19:12 UTC

25 points

1 comment 1 min read LW link

Active Inference as a formalisation of instrumental convergence

Roman Leventov 26 Jul 2022 17:55 UTC

12 points

2 comments 3 min read LW link

(direct.mit.edu)

NeurIPS ML Safety Workshop 2022

Dan H 26 Jul 2022 15:28 UTC

72 points

2 comments 1 min read LW link

(neurips2022.mlsafety.org)

AI ethics vs AI alignment

Wei Dai 26 Jul 2022 13:08 UTC

8 points

1 comment 1 min read LW link

Utility functions and probabilities are entangled

Thomas Kwa 26 Jul 2022 5:36 UTC

15 points

5 comments 1 min read LW link

How Promising is Theoretical Research on Rationality? Seeking Career Advice

Aspirant223 26 Jul 2022 1:08 UTC

3 points

3 comments 3 min read LW link

Prediction markets meetup/coworking (hosted by Manifold Markets)

Sinclair Chen and Austin Chen

26 Jul 2022 0:14 UTC

2 points

0 comments 1 min read LW link

Alignment being impossible might be better than it being really difficult

Martín Soto 25 Jul 2022 23:57 UTC

13 points

2 comments 2 min read LW link

[Question] How optimistic should we be about AI figuring out how to interpret itself?

oh54321 25 Jul 2022 22:09 UTC

3 points

1 comment 1 min read LW link

Protectionism in One Country: How Industrial Policy Worked in Canada

Davis Kedrosky 25 Jul 2022 22:08 UTC

5 points

0 comments 16 min read LW link

(daviskedrosky.substack.com)

Mistakes as agency

pchvykov 25 Jul 2022 16:17 UTC

12 points

8 comments 4 min read LW link

My Bitcoin Thesis @2022 - Part 1

aysajan 25 Jul 2022 15:49 UTC

8 points

6 comments 13 min read LW link

The Reader’s Guide to Optimal Monetary Policy

Ege Erdil 25 Jul 2022 15:10 UTC

58 points

10 comments 14 min read LW link

AGI Safety Needs People With All Skillsets!

Severin T. Seehrich 25 Jul 2022 13:32 UTC

28 points

0 comments 2 min read LW link

[Question] Is there any evidence that handwashing does anything to prevent COVID?

mukashi 25 Jul 2022 7:34 UTC

4 points

3 comments 1 min read LW link

Opening Session Tips & Advice

CFAR!Duncan 25 Jul 2022 3:57 UTC

100 points

3 comments 14 min read LW link 1 review

How much should we worry about mesa-optimization challenges?

sudo 25 Jul 2022 3:56 UTC

4 points

13 comments 2 min read LW link

[Question] Does agent foundations cover all future ML systems?

Jonas Hallgren 25 Jul 2022 1:17 UTC

4 points

0 comments 1 min read LW link

Unifying Bargaining Notions (1/2)

Diffractor 25 Jul 2022 0:28 UTC

218 points

41 comments 16 min read LW link

Reward is not the optimization target

TurnTrout 25 Jul 2022 0:03 UTC

387 points

128 comments 10 min read LW link 3 reviews

Brainstorm of things that could force an AI team to burn their lead

So8res 24 Jul 2022 23:58 UTC

136 points

8 comments 13 min read LW link

Finding Skeletons on Rashomon Ridge

David Udell, Peter S. Park and NickyP

24 Jul 2022 22:31 UTC

30 points

2 comments 7 min read LW link

Gathering Information you won’t use directly is often useful

Johannes C. Mayer 24 Jul 2022 21:21 UTC

6 points

1 comment 1 min read LW link

[Question] Impact of “‘Let’s think step by step’ is all you need”?

yrimon 24 Jul 2022 20:59 UTC

20 points

2 comments 1 min read LW link

The Most Important Century: The Animation

Writer and Matthew Barnett

24 Jul 2022 20:58 UTC

46 points

2 comments 20 min read LW link

(youtu.be)

Hiring Programmers in Academia

jefftk 24 Jul 2022 20:20 UTC

54 points

19 comments 2 min read LW link

(www.jefftk.com)

Less Wrong Budapest July 30th Meetup

Richard Horvath 24 Jul 2022 19:07 UTC

2 points

0 comments 1 min read LW link

Relationship between subjective experience and intelligence?

Q Home 24 Jul 2022 9:10 UTC

5 points

4 comments 9 min read LW link

Double Crux

CFAR!Duncan 24 Jul 2022 6:34 UTC

61 points

9 comments 11 min read LW link

Example Meetup Description

Julius 24 Jul 2022 5:38 UTC

6 points

0 comments 2 min read LW link

Eavesdropping on Aliens: A Data Decoding Challenge

anonymousaisafety 24 Jul 2022 4:35 UTC

49 points

9 comments 4 min read LW link

Information theoretic model analysis may not lend much insight, but we may have been doing them wrong!

Garrett Baker 24 Jul 2022 0:42 UTC

7 points

0 comments 10 min read LW link

What’s next for instrumental rationality?

Andrew_Critch 23 Jul 2022 22:55 UTC

63 points

7 comments 1 min read LW link

Easy guide for running a local Rationality meetup

nsokolsky 23 Jul 2022 22:52 UTC

13 points

1 comment 6 min read LW link

Curating “The Epistemic Sequences” (list v.0.1)

Andrew_Critch 23 Jul 2022 22:17 UTC

65 points

12 comments 7 min read LW link

Room Opening

jefftk 23 Jul 2022 21:00 UTC

8 points

3 comments 4 min read LW link

(www.jefftk.com)

A Bias Against Altruism

Lone Pine 23 Jul 2022 20:44 UTC

58 points

30 comments 2 min read LW link