All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar AprMayJun Jul Aug Sep Oct Nov Dec

All 123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

[Linkpost] Silver Bulletin: For most people, politics is about fitting in

Gunnar_Zarncke1 May 2024 18:12 UTC

18 points

4 comments1 min readLW link

(www.natesilver.net)

Launching applications for AI Safety Careers Course India 2024

Axiom_Futures1 May 2024 17:55 UTC

4 points

1 comment1 min readLW link

[Question] Shane Legg’s necessary properties for every AGI Safety plan

jacquesthibs1 May 2024 17:15 UTC

58 points

12 comments1 min readLW link

KAN: Kolmogorov-Arnold Networks

Gunnar_Zarncke1 May 2024 16:50 UTC

18 points

15 comments1 min readLW link

(arxiv.org)

Manifund Q1 Retro: Learnings from impact certs

Austin Chen1 May 2024 16:48 UTC

40 points

1 comment15 min readLW link

ACX Covid Origins Post convinced readers

ErnestScribbler1 May 2024 13:06 UTC

77 points

7 comments2 min readLW link

LessWrong Community Weekend 2024, open for applications

UnplannedCauliflower and jt

1 May 2024 10:18 UTC

79 points

2 comments7 min readLW link

Take SCIFs, it’s dangerous to go alone

latterframe, Jeffrey Ladish and schroederdewitt

1 May 2024 8:02 UTC

43 points

1 comment3 min readLW link

AXRP Episode 30 - AI Security with Jeffrey Ladish

DanielFilan1 May 2024 2:50 UTC

25 points

0 comments79 min readLW link

Neuro/BCI/WBE for Safe AI Workshop

Allison Duettmann1 May 2024 0:46 UTC

3 points

0 comments1 min readLW link

AGI: Cryptography, Security & Multipolar Scenarios Workshop

Allison Duettmann1 May 2024 0:42 UTC

8 points

1 comment1 min readLW link

The formal goal is a pointer

Morphism1 May 2024 0:27 UTC

25 points

10 comments1 min readLW link

“Open Source AI” is a lie, but it doesn’t have to be

jacobhaimes30 Apr 2024 23:10 UTC

19 points

5 comments6 min readLW link

(jacob-haimes.github.io)

Questions for labs

Zach Stein-Perlman30 Apr 2024 22:15 UTC

77 points

11 comments8 min readLW link

Reality comprehensibility: are there illogical things in reality?

DDthinker30 Apr 2024 21:30 UTC

−3 points

0 comments10 min readLW link

Mechanistically Eliciting Latent Behaviors in Language Models

Andrew Mack and TurnTrout

30 Apr 2024 18:51 UTC

225 points

44 comments45 min readLW link 1 review

[Question] What is the easiest/funnest way to build up a comprehensive understanding of AI and AI Safety?

Jordan Arel30 Apr 2024 18:41 UTC

4 points

2 comments1 min readLW link

Transcoders enable fine-grained interpretable circuit analysis for language models

Jacob Dunefsky, Philippe Chlenski and Neel Nanda

30 Apr 2024 17:58 UTC

76 points

14 comments17 min readLW link

Announcing the 2024 Roots of Progress Blog-Building Intensive

jasoncrawford30 Apr 2024 17:37 UTC

14 points

0 comments2 min readLW link

(rootsofprogress.org)

The Intentional Stance, LLMs Edition

Eleni Angelou30 Apr 2024 17:12 UTC

36 points

5 comments8 min readLW link

Introducing AI Lab Watch

Zach Stein-Perlman30 Apr 2024 17:00 UTC

226 points

31 comments1 min readLW link

(ailabwatch.org)

Why I’m doing PauseAI

Joseph Miller30 Apr 2024 16:21 UTC

113 points

16 comments4 min readLW link

LLMs could be as conscious as human emulations, potentially

Canaletto30 Apr 2024 11:36 UTC

15 points

15 comments3 min readLW link

An interesting mathematical model of how LLMs work

Bill Benzon30 Apr 2024 11:01 UTC

5 points

0 comments1 min readLW link

Towards Multimodal Interpretability: Learning Sparse Interpretable Features in Vision Transformers

hugofry29 Apr 2024 20:57 UTC

94 points

9 comments11 min readLW link

Towards a formalization of the agent structure problem

Alex_Altair29 Apr 2024 20:28 UTC

56 points

6 comments14 min readLW link

Ironing Out the Squiggles

Zack_M_Davis29 Apr 2024 16:13 UTC

170 points

37 comments11 min readLW link

Super additivity of consciousness

Arturo Macias29 Apr 2024 15:41 UTC

−2 points

13 comments2 min readLW link

AISC9 has ended and there will be an AISC10

Linda Linsefors29 Apr 2024 10:53 UTC

75 points

4 comments2 min readLW link

Open-Source AI: A Regulatory Review

Elliot Mckernon and Deric Cheng

29 Apr 2024 10:10 UTC

18 points

0 comments8 min readLW link

Big-endian is better than little-endian

Menotim29 Apr 2024 2:30 UTC

38 points

18 comments3 min readLW link

The Prop-room and Stage Cognitive Architecture

Robert Kralisch29 Apr 2024 0:48 UTC

14 points

4 comments14 min readLW link

How are Simulators and Agents related?

Robert Kralisch29 Apr 2024 0:22 UTC

6 points

0 comments7 min readLW link

Extended Embodiment

Robert Kralisch29 Apr 2024 0:18 UTC

8 points

1 comment3 min readLW link

Referential Containment

Robert Kralisch29 Apr 2024 0:16 UTC

2 points

4 comments3 min readLW link

Disentangling Competence and Intelligence

Robert Kralisch29 Apr 2024 0:12 UTC

23 points

7 comments6 min readLW link

List your AI X-Risk cruxes!

Aryeh Englander28 Apr 2024 18:26 UTC

42 points

7 comments2 min readLW link

Things I tell myself to be more agentic

DMMF28 Apr 2024 17:44 UTC

10 points

0 comments3 min readLW link

(danfrank.ca)

Estimating the Number of Players from Game Result Percentages

Daniel L28 Apr 2024 17:42 UTC

1 point

2 comments1 min readLW link

The Science Algorithm—AISC 2024 Final Presentation

Johannes C. Mayer28 Apr 2024 14:55 UTC

4 points

0 comments1 min readLW link

(www.youtube.com)

[Aspiration-based designs] Outlook: dealing with complexity

Jobst Heitzig, jossoliver, thomasfinn and Simon Dima

28 Apr 2024 13:06 UTC

13 points

3 comments2 min readLW link

[Aspiration-based designs] 3. Performance and safety criteria, and aspiration intervals

Jobst Heitzig28 Apr 2024 13:04 UTC

10 points

0 comments12 min readLW link

[Aspiration-based designs] 2. Formal framework, basic algorithm

Jobst Heitzig, Simon Dima and Simon Fischer

28 Apr 2024 13:02 UTC

18 points

2 comments16 min readLW link

[Aspiration-based designs] 1. Informal introduction

B Jacobs, Jobst Heitzig, Simon Fischer and Simon Dima

28 Apr 2024 13:00 UTC

44 points

4 comments8 min readLW link

Playing Northboro with Lily and Rick

jefftk28 Apr 2024 2:40 UTC

10 points

1 comment2 min readLW link

(www.jefftk.com)

Release of UN’s draft related to the governance of AI (a summary of the Simon Institute’s response)

Sebastian Schmidt27 Apr 2024 18:34 UTC

7 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Mercy to the Machine: Thoughts & Rights

False Name27 Apr 2024 16:36 UTC

9 points

5 comments17 min readLW link

Constructability: Plainly-coded AGIs may be feasible in the near future

Épiphanie Gédéon and Charbel-Raphaël

27 Apr 2024 16:04 UTC

91 points

15 comments13 min readLW link

So What’s Up With PUFAs Chemically?

J Bostock27 Apr 2024 13:32 UTC

57 points

25 comments6 min readLW link

Link: Let’s Think Dot by Dot: Hidden Computation in Transformer Language Models by Jacob Pfau, William Merrill & Samuel R. Bowman

Chris_Leong27 Apr 2024 13:22 UTC

12 points

0 comments1 min readLW link

(twitter.com)