All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 262728 29 30

Don’t align agents to evaluations of plans

TurnTrout26 Nov 2022 21:16 UTC

48 points

49 comments18 min readLW link

[Question] What videos should Rational Animations make?

Writer26 Nov 2022 20:28 UTC

30 points

24 comments1 min readLW link

The First Filter

adamShimi and Gabriel Alfour

26 Nov 2022 19:37 UTC

67 points

5 comments1 min readLW link

Respecting your Local Preferences

Scott Garrabrant26 Nov 2022 19:04 UTC

84 points

1 comment4 min readLW link

[Question] Opinions on the sleep synaptic homeostasis hypothesis?

Angela Richardson26 Nov 2022 19:01 UTC

3 points

0 comments1 min readLW link

Why square errors?

Aprillion26 Nov 2022 13:40 UTC

41 points

11 comments2 min readLW link

[Question] Assuming that at least one religion is true, what would you expect it to be?

risedive26 Nov 2022 8:34 UTC

−9 points

9 comments1 min readLW link

Three Alignment Schemas & Their Problems

Shoshannah Tekofsky26 Nov 2022 4:25 UTC

19 points

1 comment6 min readLW link

The many types of blog posts

Adam Zerner26 Nov 2022 3:57 UTC

10 points

2 comments4 min readLW link

New Frontiers in Mojibake

Adam Scherlis26 Nov 2022 2:37 UTC

60 points

7 comments6 min readLW link 1 review

(adam.scherlis.com)

Semi-conductor/AI Stock Discussion.

sapphire25 Nov 2022 23:35 UTC

28 points

25 comments1 min readLW link

NEFFA Should Allow Small Children

jefftk25 Nov 2022 23:00 UTC

10 points

2 comments2 min readLW link

(www.jefftk.com)

Podcast: Shoshannah Tekofsky on skilling up in AI safety, visiting Berkeley, and developing novel research ideas

Orpheus1625 Nov 2022 20:47 UTC

37 points

2 comments9 min readLW link

The man and the tool

pedroalvarado25 Nov 2022 19:51 UTC

−1 points

0 comments4 min readLW link

[Question] What AI newsletters or substacks about AI do you recommend?

wunan25 Nov 2022 19:29 UTC

6 points

1 comment1 min readLW link

Mechanistic anomaly detection and ELK

paulfchristiano25 Nov 2022 18:50 UTC

138 points

22 comments21 min readLW link

(ai-alignment.com)

The Least Controversial Application of Geometric Rationality

Scott Garrabrant25 Nov 2022 16:50 UTC

60 points

22 comments4 min readLW link

Planes are still decades away from displacing most bird jobs

guzey25 Nov 2022 16:49 UTC

172 points

14 comments3 min readLW link

Take part in our giant study of cognitive abilities and get a customized report of your strengths and weaknesses!

spencerg25 Nov 2022 16:28 UTC

8 points

1 comment1 min readLW link

(www.guidedtrack.com)

Guardian AI (Misaligned systems are all around us.)

Jessica Rumbelow25 Nov 2022 15:55 UTC

15 points

6 comments2 min readLW link

Intuitions by ML researchers may get progressively worse concerning likely candidates for transformative AI

Viktor Rehnberg25 Nov 2022 15:49 UTC

7 points

0 comments2 min readLW link

Refining the Sharp Left Turn threat model, part 2: applying alignment techniques

Vika, Vikrant Varma, Ramana Kumar and Rohin Shah

25 Nov 2022 14:36 UTC

39 points

9 comments6 min readLW link

(vkrakovna.wordpress.com)

[Question] Who holds all the USDT?

ChristianKl25 Nov 2022 11:58 UTC

17 points

6 comments1 min readLW link

Fair Collective Efficient Altruism

Jobst Heitzig25 Nov 2022 9:38 UTC

2 points

1 comment5 min readLW link

[Question] If humanity one day discovers that it is a form of disease that threatens to destroy the universe, should it allow itself to be shut down?

Shmi25 Nov 2022 8:27 UTC

4 points

12 comments1 min readLW link

Could a single alien message destroy us?

Writer and Matthew Barnett

25 Nov 2022 7:32 UTC

62 points

23 comments6 min readLW link

(youtu.be)

How do I start a programming career in the West?

Lao Mein25 Nov 2022 6:37 UTC

38 points

7 comments2 min readLW link

The AI Safety community has four main work groups, Strategy, Governance, Technical and Movement Building

peterslattery25 Nov 2022 3:45 UTC

1 point

0 comments6 min readLW link

Less Successful Cider Adventures

jefftk25 Nov 2022 1:50 UTC

11 points

1 comment1 min readLW link

(www.jefftk.com)

Gliders in Language Models

Alexandre Variengien25 Nov 2022 0:38 UTC

30 points

11 comments10 min readLW link

On Kelly and altruism

philh24 Nov 2022 23:40 UTC

17 points

6 comments12 min readLW link

(reasonableapproximation.net)

Open technical problem: A Quinean proof of Löb’s theorem, for an easier cartoon guide

Andrew_Critch24 Nov 2022 21:16 UTC

58 points

35 comments3 min readLW link 1 review

[Question] Historical examples of people gaining unusual cognitive abilities?

Nicholas Kross24 Nov 2022 19:01 UTC

8 points

2 comments1 min readLW link

Corrigibility Via Thought-Process Deference

Thane Ruthenis24 Nov 2022 17:06 UTC

18 points

5 comments9 min readLW link

Geometric Exploration, Arithmetic Exploitation

Scott Garrabrant24 Nov 2022 15:36 UTC

142 points

5 comments7 min readLW link

What I Learned Running Refine

adamShimi24 Nov 2022 14:49 UTC

108 points

5 comments4 min readLW link

Covid 11/24/22: Thanks for Good Health

Zvi24 Nov 2022 13:00 UTC

26 points

4 comments8 min readLW link

(thezvi.wordpress.com)

Clarifying wireheading terminology

leogao24 Nov 2022 4:53 UTC

68 points

7 comments1 min readLW link

LW Beta Feature: Side-Comments

jimrandomh24 Nov 2022 1:55 UTC

104 points

47 comments1 min readLW link

Against “Classic Style”

Cleo Nardo23 Nov 2022 22:10 UTC

71 points

31 comments4 min readLW link

South Bay ACX/LW Meetup

IS23 Nov 2022 22:05 UTC

2 points

0 comments1 min readLW link

Meme Dialects

jefftk23 Nov 2022 21:30 UTC

34 points

1 comment2 min readLW link

(www.jefftk.com)

[Question] When do you visualize (or not) while doing math?

Alex_Altair23 Nov 2022 20:15 UTC

22 points

9 comments1 min readLW link

When AI solves a game, focus on the game’s mechanics, not its theme.

Cleo Nardo23 Nov 2022 19:16 UTC

89 points

7 comments2 min readLW link

The Geometric Expectation

Scott Garrabrant23 Nov 2022 18:05 UTC

176 points

22 comments4 min readLW link

“Far Coordination”

DragonGod23 Nov 2022 17:14 UTC

6 points

17 comments9 min readLW link

Conjecture Second Hiring Round

Connor Leahy, Sid Black, Gabriel Alfour and Chris Scammell

23 Nov 2022 17:11 UTC

92 points

0 comments1 min readLW link

Conjecture: a retrospective after 8 months of work

Connor Leahy, Sid Black, Gabriel Alfour and Chris Scammell

23 Nov 2022 17:10 UTC

180 points

9 comments8 min readLW link

Against a General Factor of Doom

Jeffrey Heninger23 Nov 2022 16:50 UTC

63 points

19 comments4 min readLW link 1 review

(aiimpacts.org)

Injecting some numbers into the AGI debate—by Boaz Barak

Jsevillamol23 Nov 2022 16:10 UTC

12 points

0 comments3 min readLW link

(windowsontheory.org)