“Real summer”?

duck_master

26 Aug 2024 22:11 UTC

2 points

0 comments, 1 min read, LW link

Metaculus’s ‘Minitaculus’ Experiments — Collaborate With Us

ChristianWilliams

26 Aug 2024 20:44 UTC

7 points

0 comments, 1 min read, LW link

(www.metaculus.com)

My Apartment Art Commission Process

jenn

26 Aug 2024 18:36 UTC

37 points

4 comments, 7 min read, LW link

(jenn.site)

My (current) model of what an AI governance researcher does

Johan de Kock

26 Aug 2024 17:58 UTC

1 point

2 comments, 5 min read, LW link

Would catching your AIs trying to escape convince AI developers to slow down or undeploy?

Buck

26 Aug 2024 16:46 UTC

325 points

78 comments, 4 min read, LW link, 1 review

… Wait, our models of semantics should inform fluid mechanics?!?

johnswentworth and David Lorell

26 Aug 2024 16:38 UTC

64 points

18 comments, 4 min read, LW link

Day Zero Antivirals for Future Pandemics

Niko_McCarty

26 Aug 2024 15:18 UTC

22 points

2 comments, 10 min read, LW link

(www.asimov.press)

Molecular dynamics data will be essential for the next generation of ML protein models

Abhishaike Mahajan

26 Aug 2024 14:50 UTC

9 points

0 comments, 11 min read, LW link

(www.owlposting.com)

My lukewarm take on GLP-1 agonists

George3d6

26 Aug 2024 12:34 UTC

16 points

0 comments, 1 min read, LW link

(cerebralab.com)

Interview with Robert Kralisch on Simulators

WillPetillo

26 Aug 2024 5:49 UTC

17 points

0 comments, 75 min read, LW link

One person’s worth of mental energy for AI doom aversion jobs. What should I do?

Lorec

26 Aug 2024 1:29 UTC

9 points

17 comments, 1 min read, LW link

Secular interpretations of core perennialist claims

zhukeepa

25 Aug 2024 23:41 UTC

84 points

33 comments, 14 min read, LW link

Darwinian Traps and Existential Risks

KristianRonn

25 Aug 2024 22:37 UTC

85 points

14 comments, 10 min read, LW link

DIY LessWrong Jewelry

Fluffnutt

25 Aug 2024 21:33 UTC

33 points

0 comments, 1 min read, LW link

Meta: On viewing the latest LW posts

quiet_NaN

25 Aug 2024 19:31 UTC

5 points

2 comments, 1 min read, LW link

you should probably eat oatmeal sometimes

bhauth

25 Aug 2024 14:50 UTC

47 points

35 comments, 3 min read, LW link, 1 review

(bhauth.com)

Referendum Mechanics in a Marketplace of Ideas

Martin Sustrik

25 Aug 2024 8:30 UTC

57 points

2 comments, 5 min read, LW link

(250bpm.substack.com)

Please stop using mediocre AI art in your posts

Raemon

25 Aug 2024 0:13 UTC

119 points

25 comments, 2 min read, LW link

AXRP Episode 35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization

DanielFilan

24 Aug 2024 22:30 UTC

21 points

0 comments, 74 min read, LW link

The top 30 books to expand the capabilities of AI: a biased reading list

Jonathan Mugan

24 Aug 2024 21:48 UTC

−6 points

0 comments, 16 min read, LW link

The Ap Distribution

criticalpoints

24 Aug 2024 21:45 UTC

22 points

8 comments, 3 min read, LW link

(eregis.github.io)

What is it to solve the alignment problem? (Notes)

Joe Carlsmith

24 Aug 2024 21:19 UTC

67 points

18 comments, 53 min read, LW link

Examine self modification as an intuition provider for the concept of consciousness

Canaletto

24 Aug 2024 20:48 UTC

−4 points

2 comments, 10 min read, LW link

[Question] Looking to interview AI Safety researchers for a book

jeffreycaruso

24 Aug 2024 19:57 UTC

14 points

0 comments, 1 min read, LW link

Perplexity wins my AI race

Elizabeth

24 Aug 2024 19:20 UTC

107 points

12 comments, 10 min read, LW link

(acesounderglass.com)

Why should anyone boot you up?

onur

24 Aug 2024 17:51 UTC

−1 points

5 comments, 3 min read, LW link

(solmaz.io)

Understanding Hidden Computations in Chain-of-Thought Reasoning

Ram Bharadwaj

24 Aug 2024 16:35 UTC

6 points

1 comment, 1 min read, LW link

August 2024 Time Tracking

jefftk

24 Aug 2024 13:50 UTC

22 points

0 comments, 3 min read, LW link

(www.jefftk.com)

Training a Sparse Autoencoder in < 30 minutes on 16GB of VRAM using an S3 cache

Louka Ewington-Pitsos

24 Aug 2024 7:39 UTC

17 points

0 comments, 5 min read, LW link

[Question] Looking for intuitions to extend bargaining notions

ProgramCrafter

24 Aug 2024 5:00 UTC

13 points

0 comments, 1 min read, LW link

Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs

Michaël Trazzi

24 Aug 2024 4:30 UTC

56 points

0 comments, 5 min read, LW link

[Question] Developing Positive Habits through Video Games

pzas

24 Aug 2024 3:47 UTC

1 point

5 comments, 1 min read, LW link

“Can AI Scaling Continue Through 2030?”, Epoch AI (yes)

gwern

24 Aug 2024 1:40 UTC

137 points

5 comments, 3 min read, LW link, 1 review

(epochai.org)

What’s important in “AI for epistemics”?

Lukas Finnveden

24 Aug 2024 1:27 UTC

50 points

2 comments, 28 min read, LW link

(www.forethought.org)

Showing SAE Latents Are Not Atomic Using Meta-SAEs

Bart Bussmann, Michael Pearce, Patrick Leask, Joseph Bloom, Lee Sharkey and Neel Nanda

24 Aug 2024 0:56 UTC

73 points

10 comments, 20 min read, LW link

Using ideologically-charged language to get gpt-3.5-turbo to disobey it’s system prompt: a demo

Milan W

24 Aug 2024 0:13 UTC

3 points

0 comments, 6 min read, LW link

Crafting Polysemantic Transformer Benchmarks with Known Circuits

Evan Anders and Adrià Garriga-alonso

23 Aug 2024 22:03 UTC

17 points

0 comments, 25 min read, LW link

[Question] What is an appropriate sample size when surveying billions of data points?

Blake

23 Aug 2024 21:54 UTC

1 point

2 comments, 1 min read, LW link

Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations with MDL-SAEs

Kola Ayonrinde, Michael Pearce and Lee Sharkey

23 Aug 2024 18:52 UTC

43 points

8 comments, 16 min read, LW link

How I started believing religion might actually matter for rationality and moral philosophy

zhukeepa

23 Aug 2024 17:40 UTC

127 points

47 comments, 7 min read, LW link, 3 reviews

[Question] What do you expect AI capabilities may look like in 2028?

nonzerosum

23 Aug 2024 16:59 UTC

9 points

5 comments, 1 min read, LW link

Invitation to lead a project at AI Safety Camp (Virtual Edition, 2025)

Linda Linsefors, Remmelt Ellen and Robert Kralisch

23 Aug 2024 14:18 UTC

17 points

2 comments, 4 min read, LW link

If we solve alignment, do we die anyway?

Seth Herd

23 Aug 2024 13:13 UTC

85 points

132 comments, 4 min read, LW link

What’s going on with Per-Component Weight Updates?

4gate

22 Aug 2024 21:22 UTC

1 point

0 comments, 6 min read, LW link

Interoperable High Level Structures: Early Thoughts on Adjectives

johnswentworth and David Lorell

22 Aug 2024 21:12 UTC

55 points

2 comments, 7 min read, LW link

Interest poll: A time-waster blocker for desktop Linux programs

nahoj

22 Aug 2024 20:44 UTC

4 points

5 comments, 1 min read, LW link

Turning 22 in the Pre-Apocalypse

testingthewaters

22 Aug 2024 20:28 UTC

37 points

14 comments, 24 min read, LW link

(utilityhotbar.github.io)

A Robust Natural Latent Over A Mixed Distribution Is Natural Over The Distributions Which Were Mixed

johnswentworth and David Lorell

22 Aug 2024 19:19 UTC

42 points

4 comments, 4 min read, LW link

A primer on the current state of longevity research

Abhishaike Mahajan

22 Aug 2024 17:14 UTC

112 points

8 comments, 14 min read, LW link

(www.owlposting.com)

Some reasons to start a project to stop harmful AI

Remmelt

22 Aug 2024 16:23 UTC

5 points

0 comments, 2 min read, LW link