Get your tickets to Manifest 2024 by May 13th!

Saul Munn, May 3, 2024, 11:57 PM
18 points
0 comments, LW link

Embodiment

A*, May 3, 2024, 8:06 PM
4 points
0 comments, 1 min read, LW link

(Geometrically) Maximal Lottery-Lotteries Exist

Lorxus, May 3, 2024, 7:29 PM
13 points
11 comments, 26 min read, LW link

[Question] Were there any ancient rationalists?

OliverHayman, May 3, 2024, 6:26 PM
11 points
3 comments, 1 min read, LW link

Key takeaways from our EA and alignment research surveys

May 3, 2024, 6:10 PM
112 points
10 comments, 21 min read, LW link

“AI Safety for Fleshy Humans”: an AI Safety explainer by Nicky Case

habryka, May 3, 2024, 6:10 PM
90 points
11 comments, 4 min read, LW link
(aisafety.dance)

AI Clarity: An Initial Research Agenda

May 3, 2024, 1:54 PM
18 points
1 comment, 8 min read, LW link

Apply to ESPR & PAIR, Rationality and AI Camps for Ages 16-21

Anna Gajdova, May 3, 2024, 12:36 PM
58 points
5 comments, 1 min read, LW link

On precise out-of-context steering

Olli Järviniemi, May 3, 2024, 9:41 AM
9 points
6 comments, 3 min read, LW link

LLM+Planners hybridisation for friendly AGI

installgentoo, May 3, 2024, 8:40 AM
7 points
2 comments, 1 min read, LW link

Mechanistic Interpretability Workshop Happening at ICML 2024!

May 3, 2024, 1:18 AM
48 points
6 comments, 1 min read, LW link

Weekly newsletter for AI safety events and training programs

Bryce Robertson, May 3, 2024, 12:33 AM
29 points
0 comments, 1 min read, LW link

CCS: Counterfactual Civilization Simulation

Morphism, May 2, 2024, 10:54 PM
3 points
0 comments, 2 min read, LW link

Let’s Design A School, Part 2.1: School as Education—Structure

Sable, May 2, 2024, 10:04 PM
26 points
2 comments, 10 min read, LW link
(affablyevil.substack.com)

Why I’m not doing PauseAI

kwiat.dev, May 2, 2024, 10:00 PM
−8 points
5 comments, 4 min read, LW link

AI #61: Meta Trouble

Zvi, May 2, 2024, 6:40 PM
29 points
0 comments, 52 min read, LW link
(thezvi.wordpress.com)

Why is AGI/ASI Inevitable?

DeathlessAmaranth, May 2, 2024, 6:27 PM
14 points
6 comments, 1 min read, LW link

AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate

May 2, 2024, 4:12 PM
6 points
0 comments, 8 min read, LW link
(newsletter.safe.ai)

AI Salon: Trustworthy AI Futures #1

Ian Eisenberg, May 2, 2024, 4:07 PM
1 point
0 comments, 1 min read, LW link

How to write Pseudocode and why you should

Johannes C. Mayer, May 2, 2024, 3:53 PM
8 points
5 comments, 3 min read, LW link

AI #62: Too Soon to Tell

Zvi, May 2, 2024, 3:40 PM
30 points
8 comments, 31 min read, LW link
(thezvi.wordpress.com)

Whiteboard Program Tracing: Debug a Program Before you have the Code

Johannes C. Mayer, May 2, 2024, 3:30 PM
3 points
0 comments, 1 min read, LW link

[Question] Which skincare products are evidence-based?

Vanessa Kosoy, May 2, 2024, 3:22 PM
120 points
48 comments, 1 min read, LW link

Q&A on Proposed SB 1047

Zvi, May 2, 2024, 3:10 PM
74 points
8 comments, 44 min read, LW link
(thezvi.wordpress.com)

[Question] What are the Activities that make up your Research Process?

Johannes C. Mayer, May 2, 2024, 3:01 PM
4 points
0 comments, 1 min read, LW link

[Question] How do you Select the Right Research Activity in the Right Moment?

Johannes C. Mayer, May 2, 2024, 2:45 PM
6 points
1 comment, 1 min read, LW link

[Question] Can stealth aircraft be detected optically?

Yair Halberstadt, May 2, 2024, 7:47 AM
20 points
27 comments, 1 min read, LW link

An explanation of evil in an organized world

KatjaGrace, May 2, 2024, 5:20 AM
26 points
9 comments, 2 min read, LW link
(worldspiritsockpuppet.com)

Why I stopped working on AI safety

jbkjr, May 2, 2024, 5:08 AM
−5 points
0 comments, 4 min read, LW link
(jbkjr.me)

[Linkpost] Silver Bulletin: For most people, politics is about fitting in

Gunnar_Zarncke, May 1, 2024, 6:12 PM
18 points
4 comments, 1 min read, LW link
(www.natesilver.net)

Launching applications for AI Safety Careers Course India 2024

Axiom_Futures, May 1, 2024, 5:55 PM
4 points
1 comment, 1 min read, LW link

[Question] Shane Legg’s necessary properties for every AGI Safety plan

jacquesthibs, May 1, 2024, 5:15 PM
58 points
12 comments, 1 min read, LW link

KAN: Kolmogorov-Arnold Networks

Gunnar_Zarncke, May 1, 2024, 4:50 PM
18 points
15 comments, 1 min read, LW link
(arxiv.org)

Manifund Q1 Retro: Learnings from impact certs

Austin Chen, May 1, 2024, 4:48 PM
40 points
1 comment, LW link

ACX Covid Origins Post convinced readers

ErnestScribbler, May 1, 2024, 1:06 PM
77 points
7 comments, 2 min read, LW link

LessWrong Community Weekend 2024, open for applications

May 1, 2024, 10:18 AM
79 points
2 comments, 7 min read, LW link

Take SCIFs, it’s dangerous to go alone

May 1, 2024, 8:02 AM
42 points
1 comment, 3 min read, LW link

AXRP Episode 30 - AI Security with Jeffrey Ladish

DanielFilan, May 1, 2024, 2:50 AM
25 points
0 comments, 79 min read, LW link

Neuro/BCI/WBE for Safe AI Workshop

Allison Duettmann, May 1, 2024, 12:46 AM
3 points
0 comments, 1 min read, LW link

AGI: Cryptography, Security & Multipolar Scenarios Workshop

Allison Duettmann, May 1, 2024, 12:42 AM
8 points
1 comment, 1 min read, LW link

The formal goal is a pointer

Morphism, May 1, 2024, 12:27 AM
20 points
10 comments, 1 min read, LW link

Arch-anarchy: Theory and practice

Peter lawless, Apr 30, 2024, 11:20 PM
−6 points
0 comments, 2 min read, LW link

“Open Source AI” is a lie, but it doesn’t have to be

jacobhaimes, Apr 30, 2024, 11:10 PM
19 points
5 comments, 6 min read, LW link
(jacob-haimes.github.io)

Questions for labs

Zach Stein-Perlman, Apr 30, 2024, 10:15 PM
77 points
11 comments, 8 min read, LW link

Reality comprehensibility: are there illogical things in reality?

DDthinker, Apr 30, 2024, 9:30 PM
−3 points
0 comments, 10 min read, LW link

Mechanistically Eliciting Latent Behaviors in Language Models

Apr 30, 2024, 6:51 PM
210 points
43 comments, 45 min read, LW link

[Question] What is the easiest/funnest way to build up a comprehensive understanding of AI and AI Safety?

Jordan Arel, Apr 30, 2024, 6:41 PM
4 points
2 comments, 1 min read, LW link

Transcoders enable fine-grained interpretable circuit analysis for language models

Apr 30, 2024, 5:58 PM
74 points
14 comments, 17 min read, LW link

Announcing the 2024 Roots of Progress Blog-Building Intensive

jasoncrawford, Apr 30, 2024, 5:37 PM
14 points
0 comments, 2 min read, LW link
(rootsofprogress.org)

The Intentional Stance, LLMs Edition

Eleni Angelou, Apr 30, 2024, 5:12 PM
30 points
3 comments, 8 min read, LW link