The national security dimension of OpenAI’s leadership struggle

Mitchell_Porter 20 Nov 2023 23:57 UTC

3 points

3 comments 2 min read LW link

[Question] What will you think about the Current Thing in a year?

mike_hawke 20 Nov 2023 22:39 UTC

21 points

0 comments 2 min read LW link

Metaculus Introduces New Forecast Scores, New Leaderboard & Medals

ChristianWilliams 20 Nov 2023 20:33 UTC

15 points

2 comments 3 min read LW link

(www.metaculus.com)

[Question] “Useless Box” AGI

Cago 20 Nov 2023 19:07 UTC

1 point

2 comments 1 min read LW link

[Question] Advice on choosing an alcohol rehab center?

Slingshot9271 20 Nov 2023 18:46 UTC

2 points

1 comment 1 min read LW link

Agent Boundaries Aren’t Markov Blankets. [Unless they’re non-causal; see comments.]

abramdemski 20 Nov 2023 18:23 UTC

83 points

11 comments 2 min read LW link

Navigating emotions in an uncertain & confusing world

Orpheus16 20 Nov 2023 18:16 UTC

42 points

1 comment 4 min read LW link

OpenAI: Facts from a Weekend

Zvi 20 Nov 2023 15:30 UTC

272 points

166 comments 9 min read LW link

(thezvi.wordpress.com)

OpenAI Staff (including Sutskever) Threaten to Quit Unless Board Resigns

Seth Herd 20 Nov 2023 14:20 UTC

52 points

28 comments 1 min read LW link

(www.wired.com)

Ilya: The AI scientist shaping the world

David Varga 20 Nov 2023 13:09 UTC

11 points

0 comments 4 min read LW link

[Linkpost] OpenAI’s Interim CEO’s views on AI x-risk

Bogdan Ionut Cirstea 20 Nov 2023 13:00 UTC

9 points

0 comments 1 min read LW link

A Girardian interpretation of the Altman affair, it’s on my to-do list

Bill Benzon 20 Nov 2023 12:21 UTC

3 points

0 comments 1 min read LW link

[Question] How did you integrate voice-to-text AI into your workflow?

ChristianKl 20 Nov 2023 12:01 UTC

28 points

12 comments 1 min read LW link

Short film adaptation of the essay “The Simple Truth” [eng sub]

bayesyatina 20 Nov 2023 11:42 UTC

18 points

4 comments 1 min read LW link

For Civilization and Against Niceness

Gabriel Alfour 20 Nov 2023 10:56 UTC

49 points

14 comments 8 min read LW link

(cognition.cafe)

“Optimists always win!” is the biggest survivorship bias

Yunfan Ye 20 Nov 2023 8:53 UTC

8 points

0 comments 2 min read LW link

Sam Altman, Greg Brockman and others from OpenAI join Microsoft

Ozyrus 20 Nov 2023 8:23 UTC

58 points

15 comments 1 min read LW link

(twitter.com)

Emmett Shear to be interim CEO of OpenAI

Max H 20 Nov 2023 5:40 UTC

21 points

5 comments 1 min read LW link

(www.theverge.com)

[Question] Where can I learn about algorithmic transformation of AI prompts?

denyeverywhere 20 Nov 2023 4:35 UTC

0 points

1 comment 1 min read LW link

Extreme website and app blocking

tbenthompson 20 Nov 2023 3:53 UTC

11 points

1 comment 4 min read LW link

(tbenthompson.com)

Am I going insane or is the quality of education at top universities shockingly low?

ChrisRumanov 20 Nov 2023 3:53 UTC

26 points

30 comments 2 min read LW link

Residential Demolition Tooling

jefftk 20 Nov 2023 3:20 UTC

16 points

1 comment 3 min read LW link

(www.jefftk.com)

Aaron Silverbook on anti-cavity bacteria

DanielFilan 20 Nov 2023 3:06 UTC

31 points

3 comments 1 min read LW link

(youtu.be)

Cheap Model → Big Model design

Maxwell Peterson 19 Nov 2023 22:50 UTC

15 points

2 comments 7 min read LW link

Human-like systematic generalization through a meta-learning neural network

Burny 19 Nov 2023 21:41 UTC

8 points

0 comments 2 min read LW link

(twitter.com)

Alignment is Hard: An Uncomputable Alignment Problem

Alexander Bistagne 19 Nov 2023 19:38 UTC

−6 points

4 comments 1 min read LW link

(github.com)

New paper shows truthfulness & instruction-following don’t generalize by default

joshc 19 Nov 2023 19:27 UTC

60 points

0 comments 4 min read LW link

In favour of a sovereign state of Gaza

Yair Halberstadt 19 Nov 2023 16:08 UTC

8 points

3 comments 4 min read LW link

My Criticism of Singular Learning Theory

Joar Skalse 19 Nov 2023 15:19 UTC

83 points

56 comments 12 min read LW link

“Why can’t you just turn it off?”

Roko 19 Nov 2023 14:46 UTC

48 points

25 comments 1 min read LW link

Spaciousness In Partner Dance: A Naturalism Demo

LoganStrohl 19 Nov 2023 7:00 UTC

78 points

6 comments 19 min read LW link 1 review

When Will AIs Develop Long-Term Planning?

PeterMcCluskey 19 Nov 2023 0:08 UTC

18 points

5 comments 4 min read LW link

(bayesianinvestor.com)

Killswitch

Junio 18 Nov 2023 22:53 UTC

2 points

0 comments 3 min read LW link

Superalignment

Douglas_Reay 18 Nov 2023 22:37 UTC

−4 points

4 comments 1 min read LW link

(openai.com)

Predictable Defect-Cooperate?

quetzal_rainbow 18 Nov 2023 15:38 UTC

7 points

1 comment 2 min read LW link

I think I’m just confused. Once a model exists, how do you “red-team” it to see whether it’s safe. Isn’t it already dangerous?

FTPickle 18 Nov 2023 14:16 UTC

21 points

13 comments 1 min read LW link

AI Safety Camp 2024

Linda Linsefors 18 Nov 2023 10:37 UTC

15 points

1 comment 4 min read LW link

(aisafety.camp)

Post-EAG Music Party

jefftk 18 Nov 2023 3:00 UTC

14 points

2 comments 2 min read LW link

(www.jefftk.com)

Letter to a Sonoma County Jail Cell

MadHatter 18 Nov 2023 2:24 UTC

4 points

1 comment 1 min read LW link

(open.substack.com)

1. A Sense of Fairness: Deconfusing Ethics

RogerDearnaley 17 Nov 2023 20:55 UTC

19 points

10 comments 15 min read LW link

Sam Altman fired from OpenAI

LawrenceC 17 Nov 2023 20:42 UTC

192 points

75 comments 1 min read LW link

(openai.com)

On the lethality of biased human reward ratings

Eli Tyre and johnswentworth 17 Nov 2023 18:59 UTC

48 points

10 comments 37 min read LW link

Coup probes: Catching catastrophes with probes trained off-policy

Fabien Roger 17 Nov 2023 17:58 UTC

95 points

9 comments 11 min read LW link 1 review

On Lies and Liars

Gabriel Alfour 17 Nov 2023 17:13 UTC

31 points

4 comments 14 min read LW link

(cognition.cafe)

Classifying representations of sparse autoencoders (SAEs)

Annah 17 Nov 2023 13:54 UTC

15 points

6 comments 2 min read LW link

R&D is a Huge Externality, So Why Do Markets Do So Much of it?

Maxwell Tabarrok 17 Nov 2023 13:14 UTC

15 points

14 comments 3 min read LW link

(maximumprogress.substack.com)

On excluding dangerous information from training

ShayBenMoshe 17 Nov 2023 11:14 UTC

23 points

5 comments 3 min read LW link

The dangers of reproducing while old

garymm 17 Nov 2023 5:55 UTC

23 points

6 comments 1 min read LW link

(www.garymm.org)

I put odds on ends with Nathan Young

KatjaGrace 17 Nov 2023 5:40 UTC

8 points

0 comments 1 min read LW link

(worldspiritsockpuppet.com)

Debate helps supervise human experts [Paper]

habryka 17 Nov 2023 5:25 UTC

29 points

6 comments 1 min read LW link

(github.com)