Employed bottlenecks?

yanni kyriacos

30 May 2024 23:59 UTC

4 points

1 comment

1 min read

Duckbill Masks Better?

jefftk

30 May 2024 23:40 UTC

20 points

3 comments

1 min read

(www.jefftk.com)

OpenAI: Helen Toner Speaks

Zvi

30 May 2024 21:10 UTC

86 points

8 comments

13 min read

(thezvi.wordpress.com)

Non-Disparagement Canaries for OpenAI

aysja and Adam Scholl

30 May 2024 19:20 UTC

291 points

51 comments

2 min read

Clarifying METR’s Auditing Role

Beth Barnes

30 May 2024 18:41 UTC

108 points

1 comment

2 min read

A civilization ran by amateurs

Olli Järviniemi

30 May 2024 17:57 UTC

66 points

8 comments

6 min read

One week left to apply for the Roots of Progress Blog-Building Intensive

jasoncrawford

30 May 2024 16:55 UTC

8 points

0 comments

3 min read

(rootsofprogress.org)

Getting started with AI Alignment research: how to reproduce an experiment from research paper

Alexander230

30 May 2024 14:51 UTC

3 points

0 comments

3 min read

AI #66: Oh to Be Less Online

Zvi

30 May 2024 14:20 UTC

37 points

6 comments

56 min read

(thezvi.wordpress.com)

The 27 papers

WitheringWeights

30 May 2024 8:46 UTC

19 points

2 comments

1 min read

The Market Singularity: A New Perspective

azsantosk

30 May 2024 7:05 UTC

1 point

0 comments

15 min read

Awakening

lsusr

30 May 2024 7:03 UTC

130 points

80 comments

9 min read

Value Claims (In Particular) Are Usually Bullshit

johnswentworth

30 May 2024 6:26 UTC

151 points

18 comments

2 min read

The Pearly Gates

lsusr

30 May 2024 4:01 UTC

137 points

6 comments

3 min read

AXRP Episode 32 - Understanding Agency with Jan Kulveit

DanielFilan

30 May 2024 3:50 UTC

20 points

0 comments

53 min read

US Presidential Election: Tractability, Importance, and Urgency

kuhanj

29 May 2024 23:52 UTC

42 points

2 comments

3 min read

Thoughts on SB-1047

ryan_greenblatt

29 May 2024 23:26 UTC

60 points

1 comment

11 min read

How I designed my own writing system, VJScript

vkethana

29 May 2024 23:18 UTC

2 points

1 comment

1 min read

(www.vkethana.com)

AI and integrity

Nathan Young

29 May 2024 20:45 UTC

10 points

0 comments

2 min read

(nathanpmyoung.substack.com)

MIRI 2024 Communications Strategy

Gretta Duleba

29 May 2024 19:33 UTC

325 points

218 comments

7 min read

2024 Summer AI Safety Intro Fellowship and Socials in Boston

KevinWei

29 May 2024 18:27 UTC

8 points

0 comments

1 min read

Apollo Research 1-year update

Marius Hobbhahn, Lee Sharkey, Lucius Bushnaq, Dan Braun, Mikita Balesni, Jérémy Scheurer, Nicholas Goldowsky-Dill, StefanHex, jake_mendel, AlexMeinke and rusheb

29 May 2024 17:44 UTC

93 points

0 comments

7 min read

Response to nostalgebraist: proudly waving my moral-antirealist battle flag

Steven Byrnes

29 May 2024 16:48 UTC

118 points

34 comments

11 min read

Looking beyond Everett in multiversal views of LLMs

kromem

29 May 2024 12:35 UTC

10 points

0 comments

8 min read

[Question] Inviting discussion of “Beat AI: A contest using philosophical concepts”

David James

29 May 2024 11:55 UTC

2 points

1 comment

1 min read

AI companies’ commitments

Zach Stein-Perlman

29 May 2024 11:00 UTC

36 points

0 comments

1 min read

One way violinists fail

Solenoid_Entity

29 May 2024 4:08 UTC

67 points

18 comments

3 min read

Hardshipification

Jonathan Moregård

28 May 2024 20:02 UTC

91 points

17 comments

2 min read

(honestliving.substack.com)

When Are Circular Definitions A Problem?

johnswentworth

28 May 2024 20:00 UTC

68 points

15 comments

3 min read

Notes on Gracefulness

David Gross

28 May 2024 18:40 UTC

20 points

2 comments

25 min read

[Question] What’s a better term now that “AGI” is too vague?

Seth Herd

28 May 2024 18:02 UTC

15 points

9 comments

2 min read

Reward hacking behavior can generalize across tasks

Kei Nishimura-Gasparian, Isaac Dunn, Henry Sleight, Miles Turpin, evhub, Carson Denison and Ethan Perez

28 May 2024 16:33 UTC

86 points

5 comments

21 min read

Quick Advice on Writing Essays

Niko_McCarty

28 May 2024 15:02 UTC

11 points

0 comments

3 min read

(www.nikomccarty.com)

[Linkpost] The Expressive Capacity of State Space Models: A Formal Language Perspective

Bogdan Ionut Cirstea

28 May 2024 13:49 UTC

4 points

3 comments

1 min read

(arxiv.org)

OpenAI: Fallout

Zvi

28 May 2024 13:20 UTC

204 points

25 comments

36 min read

(thezvi.wordpress.com)

2024 State of the AI Regulatory Landscape

Deric Cheng and Elliot Mckernon

28 May 2024 11:59 UTC

30 points

0 comments

2 min read

(www.convergenceanalysis.org)

Finding Backward Chaining Circuits in Transformers Trained on Tree Search

abhayesian, Jannik Brinkmann and Victor Levoso

28 May 2024 5:29 UTC

53 points

1 comment

9 min read

(arxiv.org)

[Question] How to get nerds fascinated about mysterious chronic illness research?

riceissa

27 May 2024 22:58 UTC

95 points

50 comments

2 min read

Understanding Gödel’s completeness theorem

jessicata

27 May 2024 18:55 UTC

40 points

0 comments

15 min read

(unstableontology.com)

Publicly disclosing compute expenditure daily as a safety regulation

teraflipflop

27 May 2024 18:28 UTC

−4 points

0 comments

2 min read

Intransitive Trust

Screwtape

27 May 2024 16:55 UTC

46 points

15 comments

10 min read

Overview of introductory resources in AI Governance

Lucie Philippon

27 May 2024 16:21 UTC

19 points

0 comments

6 min read

I am the Golden Gate Bridge

Zvi

27 May 2024 14:40 UTC

95 points

6 comments

27 min read

(thezvi.wordpress.com)

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman

27 May 2024 13:00 UTC

206 points

21 comments

2 min read

Real Life Sort by Controversial

Elo

27 May 2024 12:22 UTC

5 points

19 comments

20 min read

Julia Tasks 101

SatvikBeri

27 May 2024 11:32 UTC

1 point

0 comments

4 min read

Debates how to defeat aging: Aubrey de Grey vs. Peter Fedichev.

avturchin

27 May 2024 10:25 UTC

18 points

0 comments

1 min read

Being against involuntary death and being open to change are compatible

Andy_McKenzie

27 May 2024 6:37 UTC

39 points

5 comments

2 min read

If you’re an AI Safety movement builder consider asking your members these questions in an interview

yanni kyriacos

27 May 2024 5:46 UTC

4 points

0 comments

2 min read

Book review: Everything Is Predictable

PeterMcCluskey

27 May 2024 3:33 UTC

46 points

1 comment

2 min read

(bayesianinvestor.com)