Page 2
[Question] Were there any ancient rationalists?
OliverHayman · May 3, 2024, 6:26 PM · 11 points · 3 comments · 1 min read · LW link

Key takeaways from our EA and alignment research surveys
Cameron Berg, Judd Rosenblatt, florin_pop and AE Studio · May 3, 2024, 6:10 PM · 112 points · 10 comments · 21 min read · LW link

“AI Safety for Fleshy Humans” an AI Safety explainer by Nicky Case
habryka · May 3, 2024, 6:10 PM · 90 points · 11 comments · 4 min read · LW link · (aisafety.dance)

AI Clarity: An Initial Research Agenda
Justin Bullock, Corin Katzke, Zershaaneh Qureshi and David_Kristoffersson · May 3, 2024, 1:54 PM · 18 points · 1 comment · 8 min read · LW link

Apply to ESPR & PAIR, Rationality and AI Camps for Ages 16-21
Anna Gajdova · May 3, 2024, 12:36 PM · 58 points · 5 comments · 1 min read · LW link

On precise out-of-context steering
Olli Järviniemi · May 3, 2024, 9:41 AM · 9 points · 6 comments · 3 min read · LW link

LLM+Planners hybridisation for friendly AGI
installgentoo · May 3, 2024, 8:40 AM · 7 points · 2 comments · 1 min read · LW link

Mechanistic Interpretability Workshop Happening at ICML 2024!
Neel Nanda, LawrenceC and Fazl · May 3, 2024, 1:18 AM · 48 points · 6 comments · 1 min read · LW link

Weekly newsletter for AI safety events and training programs
Bryce Robertson · May 3, 2024, 12:33 AM · 29 points · 0 comments · 1 min read · LW link

CCS: Counterfactual Civilization Simulation
Morphism · May 2, 2024, 10:54 PM · 3 points · 0 comments · 2 min read · LW link

Let’s Design A School, Part 2.1 School as Education—Structure
Sable · May 2, 2024, 10:04 PM · 26 points · 2 comments · 10 min read · LW link · (affablyevil.substack.com)

Why I’m not doing PauseAI
kwiat.dev · May 2, 2024, 10:00 PM · −8 points · 5 comments · 4 min read · LW link

AI #61: Meta Trouble
Zvi · May 2, 2024, 6:40 PM · 29 points · 0 comments · 52 min read · LW link · (thezvi.wordpress.com)

Why is AGI/ASI Inevitable?
DeathlessAmaranth · May 2, 2024, 6:27 PM · 14 points · 6 comments · 1 min read · LW link

AISN #34: New Military AI Systems Plus, AI Labs Fail to Uphold Voluntary Commitments to UK AI Safety Institute, and New AI Policy Proposals in the US Senate
Corin Katzke and Dan H · May 2, 2024, 4:12 PM · 6 points · 0 comments · 8 min read · LW link · (newsletter.safe.ai)

Ai Salon: Trustworthy AI Futures #1
Ian Eisenberg · May 2, 2024, 4:07 PM · 1 point · 0 comments · 1 min read · LW link

How to write Pseudocode and why you should
Johannes C. Mayer · May 2, 2024, 3:53 PM · 8 points · 5 comments · 3 min read · LW link

AI #62: Too Soon to Tell
Zvi · May 2, 2024, 3:40 PM · 30 points · 8 comments · 31 min read · LW link · (thezvi.wordpress.com)
Whiteboard Program Tracing: Debug a Program Before you have the Code
Johannes C. Mayer · May 2, 2024, 3:30 PM · 3 points · 0 comments · 1 min read · LW link
[Question] Which skincare products are evidence-based?
Vanessa Kosoy · May 2, 2024, 3:22 PM · 120 points · 48 comments · 1 min read · LW link

Q&A on Proposed SB 1047
Zvi · May 2, 2024, 3:10 PM · 74 points · 8 comments · 44 min read · LW link · (thezvi.wordpress.com)

[Question] What are the Activities that make up your Research Process?
Johannes C. Mayer · May 2, 2024, 3:01 PM · 4 points · 0 comments · 1 min read · LW link
[Question] How do you Select the Right Research Activity in the Right Moment?
Johannes C. Mayer · May 2, 2024, 2:45 PM · 6 points · 1 comment · 1 min read · LW link
[Question] Can stealth aircraft be detected optically?
Yair Halberstadt · May 2, 2024, 7:47 AM · 20 points · 27 comments · 1 min read · LW link

An explanation of evil in an organized world
KatjaGrace · May 2, 2024, 5:20 AM · 26 points · 9 comments · 2 min read · LW link · (worldspiritsockpuppet.com)

Why I stopped working on AI safety
jbkjr · May 2, 2024, 5:08 AM · −5 points · 0 comments · 4 min read · LW link · (jbkjr.me)

[Linkpost] Silver Bulletin: For most people, politics is about fitting in
Gunnar_Zarncke · May 1, 2024, 6:12 PM · 18 points · 4 comments · 1 min read · LW link · (www.natesilver.net)

Launching applications for AI Safety Careers Course India 2024
Axiom_Futures · May 1, 2024, 5:55 PM · 4 points · 1 comment · 1 min read · LW link

[Question] Shane Legg’s necessary properties for every AGI Safety plan
jacquesthibs · May 1, 2024, 5:15 PM · 58 points · 12 comments · 1 min read · LW link

KAN: Kolmogorov-Arnold Networks
Gunnar_Zarncke · May 1, 2024, 4:50 PM · 18 points · 15 comments · 1 min read · LW link · (arxiv.org)

Manifund Q1 Retro: Learnings from impact certs
Austin Chen · May 1, 2024, 4:48 PM · 40 points · 1 comment · LW link

ACX Covid Origins Post convinced readers
ErnestScribbler · May 1, 2024, 1:06 PM · 77 points · 7 comments · 2 min read · LW link

LessWrong Community Weekend 2024, open for applications
UnplannedCauliflower and jt · May 1, 2024, 10:18 AM · 79 points · 2 comments · 7 min read · LW link

Take SCIFs, it’s dangerous to go alone
latterframe, Jeffrey Ladish and schroederdewitt · May 1, 2024, 8:02 AM · 42 points · 1 comment · 3 min read · LW link

AXRP Episode 30 - AI Security with Jeffrey Ladish
DanielFilan · May 1, 2024, 2:50 AM · 25 points · 0 comments · 79 min read · LW link

Neuro/BCI/WBE for Safe AI Workshop
Allison Duettmann · May 1, 2024, 12:46 AM · 3 points · 0 comments · 1 min read · LW link

AGI: Cryptography, Security & Multipolar Scenarios Workshop
Allison Duettmann · May 1, 2024, 12:42 AM · 8 points · 1 comment · 1 min read · LW link

The formal goal is a pointer
Morphism · May 1, 2024, 12:27 AM · 20 points · 10 comments · 1 min read · LW link
Arch-anarchy: Theory and practice
Peter lawless · Apr 30, 2024, 11:20 PM · −6 points · 0 comments · 2 min read · LW link
“Open Source AI” is a lie, but it doesn’t have to be
jacobhaimes · Apr 30, 2024, 11:10 PM · 19 points · 5 comments · 6 min read · LW link · (jacob-haimes.github.io)

Questions for labs
Zach Stein-Perlman · Apr 30, 2024, 10:15 PM · 77 points · 11 comments · 8 min read · LW link

Reality comprehensibility: are there illogical things in reality?
DDthinker · Apr 30, 2024, 9:30 PM · −3 points · 0 comments · 10 min read · LW link

Mechanistically Eliciting Latent Behaviors in Language Models
Andrew Mack and TurnTrout · Apr 30, 2024, 6:51 PM · 210 points · 43 comments · 45 min read · LW link

[Question] What is the easiest/funnest way to build up a comprehensive understanding of AI and AI Safety?
Jordan Arel · Apr 30, 2024, 6:41 PM · 4 points · 2 comments · 1 min read · LW link

Transcoders enable fine-grained interpretable circuit analysis for language models
Jacob Dunefsky, Philippe Chlenski and Neel Nanda · Apr 30, 2024, 5:58 PM · 74 points · 14 comments · 17 min read · LW link

Announcing the 2024 Roots of Progress Blog-Building Intensive
jasoncrawford · Apr 30, 2024, 5:37 PM · 14 points · 0 comments · 2 min read · LW link · (rootsofprogress.org)

The Intentional Stance, LLMs Edition
Eleni Angelou · Apr 30, 2024, 5:12 PM · 30 points · 3 comments · 8 min read · LW link

Introducing AI Lab Watch
Zach Stein-Perlman · Apr 30, 2024, 5:00 PM · 225 points · 30 comments · 1 min read · LW link · (ailabwatch.org)
Why I’m doing PauseAI
Joseph Miller · Apr 30, 2024, 4:21 PM UTC · 108 points · 16 comments · 4 min read · LW link

LLMs could be as conscious as human emulations, potentially
Canaletto · Apr 30, 2024, 11:36 AM UTC · 15 points · 15 comments · 3 min read · LW link