All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar AprMayJun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 789 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Deep Honesty

Aletheophile7 May 2024 20:31 UTC

166 points

26 comments9 min readLW link

Let’s Design A School, Part 2.2 School as Education—The Curriculum (General)

Sable7 May 2024 19:22 UTC

25 points

3 comments12 min readLW link

(affablyevil.substack.com)

Designing for a single purpose

Itay Dreyfus7 May 2024 14:11 UTC

48 points

12 comments10 min readLW link

(productidentity.co)

Reviewing the Structure of Current AI Regulations

Deric Cheng and Elliot Mckernon

7 May 2024 12:34 UTC

29 points

0 comments13 min readLW link

reflections on smileys and how to make society’s interpretive priors more charitable

Emrik7 May 2024 11:20 UTC

17 points

0 comments1 min readLW link

Virtual Book Club on Nick Bostrom’s “Deep Utopia: Life and Meaning in a Solved World”

elte7 May 2024 9:57 UTC

5 points

0 comments1 min readLW link

Virtual Book Club on Nick Bostrom’s “Deep Utopia: Life and Meaning in a Solved World”

elte7 May 2024 9:55 UTC

1 point

0 comments1 min readLW link

[Question] What is a community that has changed their behaviour without strife?

Nathan Young7 May 2024 9:24 UTC

12 points

6 comments1 min readLW link

Mental Masturbation and the Intellectual Comfort Zone

Declan Molony7 May 2024 5:47 UTC

42 points

2 comments2 min readLW link

AXRP Episode 31 - Singular Learning Theory with Daniel Murfet

DanielFilan7 May 2024 3:50 UTC

72 points

4 comments71 min readLW link

How do open AI models affect incentive to race?

jessicata7 May 2024 0:33 UTC

60 points

13 comments3 min readLW link

(unstablerontology.substack.com)

Rapid capability gain around supergenius level seems probable even without intelligence needing to improve intelligence

Towards_Keeperhood and Davanchama

6 May 2024 17:09 UTC

48 points

17 comments4 min readLW link

Observations on Teaching for Four Weeks

ClareChiaraVincent6 May 2024 16:55 UTC

52 points

14 comments3 min readLW link

[Question] Orthogonality Thesis burden of proof

Donatas Lučiūnas6 May 2024 16:21 UTC

−18 points

4 comments1 min readLW link

GDP per capita in 2050

Hauke Hillebrandt6 May 2024 15:14 UTC

29 points

8 comments17 min readLW link

(hauke.substack.com)

an effective ai safety initiative

Logan Zoellner6 May 2024 7:53 UTC

3 points

9 comments3 min readLW link

Uncovering Deceptive Tendencies in Language Models: A Simulated Company AI Assistant

Olli Järviniemi and evhub

6 May 2024 7:07 UTC

95 points

13 comments1 min readLW link

(arxiv.org)

Biorisk is an Unhelpful Analogy for AI Risk

Davidmanheim6 May 2024 6:20 UTC

4 points

17 comments3 min readLW link

Some Problems with Ordinal Optimization Frame

Mateusz Bagiński6 May 2024 5:28 UTC

9 points

0 comments7 min readLW link

Accidental Electronic Instrument

jefftk6 May 2024 2:10 UTC

15 points

6 comments2 min readLW link

(www.jefftk.com)

PauseAI Global Protest for the Seoul AI Safety Summit

Patodesu and Holly_Elmore

5 May 2024 22:12 UTC

1 point

0 comments1 min readLW link

Explaining a Math Magic Trick

Robert_AIZI5 May 2024 19:41 UTC

103 points

10 comments5 min readLW link

[Question] Does reducing the amount of RL for a given capability level make AI safer?

Chris_Leong5 May 2024 17:04 UTC

43 points

22 comments1 min readLW link

Haymarket at Closing Time

jefftk5 May 2024 2:40 UTC

15 points

2 comments2 min readLW link

(www.jefftk.com)

introduction to cancer vaccines

bhauth5 May 2024 1:06 UTC

113 points

19 comments5 min readLW link

(www.bhauth.com)

Some Experiments I’d Like Someone To Try With An Amnestic

johnswentworth4 May 2024 22:04 UTC

48 points

33 comments3 min readLW link

Introducing AI-Powered Audiobooks of Rational Fiction Classics

Askwho4 May 2024 17:32 UTC

68 points

14 comments1 min readLW link

S-Risks: Fates Worse Than Extinction

aggliu and Writer

4 May 2024 15:30 UTC

54 points

2 comments6 min readLW link

(youtu.be)

Shannon Vallor’s “technomoral virtues”

David Gross4 May 2024 14:48 UTC

15 points

1 comment5 min readLW link

Conserved Quantities (Stat Mech Part 2)

J Bostock4 May 2024 13:40 UTC

16 points

0 comments5 min readLW link

If you are assuming Software works well you are dead

Johannes C. Mayer4 May 2024 12:54 UTC

0 points

12 comments1 min readLW link

CCS on compound sentences

Artem Karpov4 May 2024 12:23 UTC

6 points

0 comments9 min readLW link

Now THIS is forecasting: understanding Epoch’s Direct Approach

Elliot Mckernon and Zershaaneh Qureshi

4 May 2024 12:06 UTC

65 points

4 comments19 min readLW link

OHGOOD: A coordination body for compute governance

Adam Jones4 May 2024 12:03 UTC

5 points

2 comments16 min readLW link

(adamjones.me)

My hour of memoryless lucidity

Eric Neyman4 May 2024 1:40 UTC

381 points

38 comments5 min readLW link 1 review

(ericneyman.wordpress.com)

Extra Tall Crib

jefftk4 May 2024 0:00 UTC

5 points

9 comments1 min readLW link

(www.jefftk.com)

Get your tickets to Manifest 2024 by May 13th!

Saul Munn3 May 2024 23:57 UTC

18 points

0 comments1 min readLW link

Embodiment

A*3 May 2024 20:06 UTC

4 points

0 comments1 min readLW link

(Geometrically) Maximal Lottery-Lotteries Exist

Lorxus3 May 2024 19:29 UTC

13 points

11 comments26 min readLW link

[Question] Were there any ancient rationalists?

OliverHH3 May 2024 18:26 UTC

12 points

3 comments1 min readLW link

Key takeaways from our EA and alignment research surveys

Cameron Berg, Kvee, florin_pop and Trent Hodgeson

3 May 2024 18:10 UTC

114 points

10 comments21 min readLW link

“AI Safety for Fleshy Humans” an AI Safety explainer by Nicky Case

habryka3 May 2024 18:10 UTC

92 points

12 comments4 min readLW link

(aisafety.dance)

AI Clarity: An Initial Research Agenda

Justin Bullock, Corin Katzke, Zershaaneh Qureshi and David_Kristoffersson

3 May 2024 13:54 UTC

18 points

1 comment8 min readLW link

Apply to ESPR & PAIR, Rationality and AI Camps for Ages 16-21

Anna Gajdova3 May 2024 12:36 UTC

58 points

5 comments1 min readLW link

On precise out-of-context steering

Olli Järviniemi3 May 2024 9:41 UTC

9 points

6 comments3 min readLW link

LLM+Planners hybridisation for friendly AGI

installgentoo3 May 2024 8:40 UTC

7 points

2 comments1 min readLW link

Mechanistic Interpretability Workshop Happening at ICML 2024!

Neel Nanda, LawrenceC and fbarez

3 May 2024 1:18 UTC

48 points

6 comments1 min readLW link

Weekly newsletter for AI safety events and training programs

Bryce Robertson3 May 2024 0:33 UTC

29 points

0 comments1 min readLW link

CCS: Counterfactual Civilization Simulation

Morphism2 May 2024 22:54 UTC

3 points

0 comments2 min readLW link

Let’s Design A School, Part 2.1 School as Education—Structure

Sable2 May 2024 22:04 UTC

26 points

3 comments10 min readLW link

(affablyevil.substack.com)