Feeling Old: Leaving your 20s in the 2020s

squidious 22 Nov 2022 22:50 UTC

37 points

3 comments · 1 min read · LW link

(opalsandbonobos.blogspot.com)

Brute-forcing the universe: a non-standard shot at diamond alignment

Martín Soto 22 Nov 2022 22:36 UTC

9 points

2 comments · 20 min read · LW link

Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility

Orpheus16 and Olive Branch

22 Nov 2022 22:19 UTC

74 points

20 comments · 4 min read · LW link

ACX Zurich November Meetup

MB 22 Nov 2022 21:41 UTC

1 point

0 comments · 1 min read · LW link

Human-level Full-Press Diplomacy (some bare facts).

Cleo Nardo 22 Nov 2022 20:59 UTC

50 points

7 comments · 3 min read · LW link

[Question] How does late-2022 COVID transmissibility drop over time?

Daniel Dewey 22 Nov 2022 19:54 UTC

8 points

2 comments · 1 min read · LW link

AI will change the world, but won’t take it over by playing “3-dimensional chess”.

Boaz Barak and benedelman

22 Nov 2022 18:57 UTC

135 points

97 comments · 24 min read · LW link

Progress links and tweets, 2022-11-22

jasoncrawford 22 Nov 2022 17:39 UTC

17 points

0 comments · 1 min read · LW link

(rootsofprogress.org)

Tyranny of the Epistemic Majority

Scott Garrabrant 22 Nov 2022 17:19 UTC

206 points

14 comments · 9 min read · LW link · 1 review

A Walkthrough of In-Context Learning and Induction Heads (w/ Charles Frye) Part 1 of 2

Neel Nanda 22 Nov 2022 17:12 UTC

20 points

0 comments · 1 min read · LW link

(www.youtube.com)

Simple Improvement to College Football Overtime Rules

Zvi 22 Nov 2022 17:00 UTC

10 points

0 comments · 1 min read · LW link

(thezvi.wordpress.com)

Meta AI announces Cicero: Human-Level Diplomacy play (with dialogue)

Jacy Reese Anthis 22 Nov 2022 16:50 UTC

93 points

64 comments · 1 min read · LW link

(www.science.org)

Austin LW meetup notes: The FTX Affair

jchan 22 Nov 2022 14:01 UTC

20 points

3 comments · 16 min read · LW link

Motivated Cognition and the Multiverse of Truth

Q Home 22 Nov 2022 12:51 UTC

8 points

16 comments · 24 min read · LW link

LessWrong readers are invited to apply to the Lurkshop

Jonas V and GradientDissenter

22 Nov 2022 9:19 UTC

101 points

41 comments · 3 min read · LW link

Gaoxing Guy

Alok Singh 22 Nov 2022 1:50 UTC

3 points

1 comment · 1 min read · LW link

(alok.github.io)

Miscellaneous First-Pass Alignment Thoughts

NickGabs 21 Nov 2022 21:23 UTC

12 points

4 comments · 10 min read · LW link

[Hebbian Natural Abstractions] Introduction

Samuel Nellessen and Jan

21 Nov 2022 20:34 UTC

34 points

3 comments · 4 min read · LW link

(www.snellessen.com)

Utilitarianism Meets Egalitarianism

Scott Garrabrant 21 Nov 2022 19:00 UTC

124 points

16 comments · 6 min read · LW link · 1 review

Interview with Matt Freeman

Evenflair 21 Nov 2022 18:17 UTC

15 points

0 comments · 1 min read · LW link

(overcast.fm)

Here’s the exit.

Valentine 21 Nov 2022 18:07 UTC

163 points

187 comments · 10 min read · LW link · 5 reviews

[Question] Benefits/Risks of Scott Aaronson’s Orthodox/Reform Framing for AI Alignment

Jeremyy 21 Nov 2022 17:54 UTC

2 points

1 comment · 1 min read · LW link

(scottaaronson.blog)

[ASoT] Reflectivity in Narrow AI

Ulisse Mini 21 Nov 2022 0:51 UTC

6 points

1 comment · 1 min read · LW link

Scott Aaronson on “Reform AI Alignment”

Shmi 20 Nov 2022 22:20 UTC

39 points

17 comments · 1 min read · LW link

(scottaaronson.blog)

On Morality, Ethics, and all that Jazz

Delen Heisman 20 Nov 2022 20:00 UTC

4 points

4 comments · 2 min read · LW link

(delen.substack.com)

Limits to the Controllability of AGI

Roman_Yampolskiy, Remmelt Ellen and Karl von Wendt

20 Nov 2022 19:18 UTC

11 points

2 comments · 9 min read · LW link

Career Scouting: Dentistry

koratkar 20 Nov 2022 15:55 UTC

70 points

5 comments · 5 min read · LW link

(careerscouting.substack.com)

Decision Theory but also Ghosts

eva_ 20 Nov 2022 13:24 UTC

26 points

26 comments · 10 min read · LW link

ARC paper: Formalizing the presumption of independence

Erik Jenner 20 Nov 2022 1:22 UTC

97 points

2 comments · 2 min read · LW link

(arxiv.org)

Update to Mysteries of mode collapse: text-davinci-002 not RLHF

janus 19 Nov 2022 23:51 UTC

71 points

8 comments · 2 min read · LW link

Make the Drought Evaporate!

AnthonyRepetto 19 Nov 2022 23:41 UTC

32 points

25 comments · 3 min read · LW link

Elastic Productivity Tools

Simon Berens 19 Nov 2022 21:59 UTC

76 points

8 comments · 2 min read · LW link

(simonberens.me)

A Short Dialogue on the Meaning of Reward Functions

Leon Lang, Quintin Pope and peligrietzer

19 Nov 2022 21:04 UTC

45 points

0 comments · 3 min read · LW link

By Default, GPTs Think In Plain Sight

Fabien Roger 19 Nov 2022 19:15 UTC

90 points

36 comments · 9 min read · LW link

Review: Bayesian Statistics the Fun Way by Will Kurt

matto 19 Nov 2022 18:52 UTC

4 points

2 comments · 2 min read · LW link

[Question] How does acausal trade work in a deterministic multiverse?

sisyphus 19 Nov 2022 1:50 UTC

2 points

13 comments · 1 min read · LW link

Choosing the right dish

Adam Zerner 19 Nov 2022 1:38 UTC

38 points

7 comments · 8 min read · LW link

Reflective Consequentialism

Adam Zerner 18 Nov 2022 23:56 UTC

21 points

14 comments · 4 min read · LW link

Value Created vs. Value Extracted

Sable 18 Nov 2022 21:34 UTC

9 points

6 comments · 6 min read · LW link

(affablyevil.substack.com)

The Disastrously Confident And Inaccurate AI

Sharat Jacob Jacob 18 Nov 2022 19:06 UTC

13 points

0 comments · 13 min read · LW link

How AI Fails Us: A non-technical view of the Alignment Problem

testingthewaters 18 Nov 2022 19:02 UTC

7 points

1 comment · 2 min read · LW link

(ethics.harvard.edu)

[Question] Is there any policy for a fair treatment of AIs whose friendliness is in doubt?

nahoj 18 Nov 2022 19:01 UTC

16 points

10 comments · 1 min read · LW link

Distillation of “How Likely Is Deceptive Alignment?”

NickGabs 18 Nov 2022 16:31 UTC

24 points

4 comments · 10 min read · LW link

Contra Chords

jefftk 18 Nov 2022 16:20 UTC

12 points

1 comment · 7 min read · LW link

(www.jefftk.com)

[Question] Updates on scaling laws for foundation models from ‘Transcending Scaling Laws with 0.1% Extra Compute’

Nick_Greig 18 Nov 2022 12:46 UTC

15 points

2 comments · 1 min read · LW link

Halifax, NS – Monthly Rationalist, EA, and ACX Meetup

Ideopunk 18 Nov 2022 11:45 UTC

10 points

0 comments · 1 min read · LW link

Introducing The Logical Foundation, an EA-Aligned Nonprofit with a Plan to End Poverty With Guaranteed Income

Michael Simm 18 Nov 2022 8:13 UTC

9 points

23 comments · 24 min read · LW link

My Deontology Says Narrow-Mindedness is Always Wrong

LVSN 18 Nov 2022 6:11 UTC

6 points

2 comments · 1 min read · LW link

AI Ethics != Ai Safety

Dentin 18 Nov 2022 3:02 UTC

2 points

0 comments · 1 min read · LW link

Don’t design agents which exploit adversarial inputs

TurnTrout and Garrett Baker

18 Nov 2022 1:48 UTC

72 points

64 comments · 12 min read · LW link