All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 161718 19 20 21 22 23 24 25 26 27 28 29 30 31

The Cognitive Bootcamp Agreement

Raemon16 Oct 2024 23:24 UTC

36 points

1 comment8 min readLW link 1 review

Bitter lessons about lucid dreaming

avturchin16 Oct 2024 21:27 UTC

84 points

63 comments2 min readLW link

Towards Quantitative AI Risk Management

Henry Papadatos and simeon_c

16 Oct 2024 19:26 UTC

28 points

1 comment6 min readLW link

Why Academia is Mostly Not Truth-Seeking

Zero Contradictions16 Oct 2024 19:14 UTC

−7 points

6 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

Launching Adjacent News

Lucas Kohorst16 Oct 2024 17:58 UTC

24 points

0 comments4 min readLW link

[Question] Interest in Leetcode, but for Rationality?

Gregory 16 Oct 2024 17:54 UTC

76 points

20 comments2 min readLW link

Request for advice: Research for Conversational Game Theory for LLMs

Rome Viharo16 Oct 2024 17:53 UTC

10 points

0 comments1 min readLW link

Why humans won’t control superhuman AIs.

Spiritus Dei16 Oct 2024 16:48 UTC

−11 points

1 comment6 min readLW link

Against empathy-by-default

Steven Byrnes16 Oct 2024 16:38 UTC

63 points

25 comments9 min readLW link

cancer rates after gene therapy

bhauth16 Oct 2024 15:32 UTC

54 points

2 comments3 min readLW link

(bhauth.com)

Monthly Roundup #23: October 2024

Zvi16 Oct 2024 13:50 UTC

39 points

13 comments50 min readLW link

(thezvi.wordpress.com)

[Question] Change My Mind: Thirders in “Sleeping Beauty” are Just Doing Epistemology Wrong

DragonGod16 Oct 2024 10:20 UTC

8 points

67 comments6 min readLW link

[Question] After uploading your consciousness...

Jinge Wang16 Oct 2024 3:52 UTC

−2 points

0 comments1 min readLW link

The ELYSIUM Proposal - Extrapolated voLitions Yielding Separate Individualized Utopias for Mankind

Roko16 Oct 2024 1:24 UTC

9 points

18 comments1 min readLW link

(transhumanaxiology.substack.com)

Bellevue Meetup

Cedar16 Oct 2024 1:07 UTC

3 points

0 comments1 min readLW link

Singular Learning Theory for Dummies

Rahul Chand15 Oct 2024 21:13 UTC

5 points

0 comments8 min readLW link

Distillation Of DeepSeek-Prover V1.5

IvanLin15 Oct 2024 18:53 UTC

4 points

1 comment3 min readLW link

Improving Model-Written Evals for AI Safety Benchmarking

Sunishchal Dev and Marius Hobbhahn

15 Oct 2024 18:25 UTC

30 points

0 comments18 min readLW link

Taking nonlogical concepts seriously

Kris Brown15 Oct 2024 18:16 UTC

7 points

5 comments18 min readLW link

(topos.site)

Rashomon—A newsbetting site

ideasthete15 Oct 2024 18:15 UTC

23 points

8 comments1 min readLW link

On the Practical Applications of Interpretability

Nick Jiang15 Oct 2024 17:18 UTC

5 points

1 comment7 min readLW link

Anthropic’s updated Responsible Scaling Policy

Zac Hatfield-Dodds15 Oct 2024 16:46 UTC

38 points

3 comments3 min readLW link

(www.anthropic.com)

[Question] When is reward ever the optimization target?

Noosphere8915 Oct 2024 15:09 UTC

37 points

17 comments1 min readLW link

An Opinionated Evals Reading List

Marius Hobbhahn and Jérémy Scheurer

15 Oct 2024 14:38 UTC

65 points

0 comments13 min readLW link

(www.apolloresearch.ai)

Anthropic rewrote its RSP

Zach Stein-Perlman15 Oct 2024 14:25 UTC

46 points

19 comments6 min readLW link

[Intuitive self-models] 5. Dissociative Identity (Multiple Personality) Disorder

Steven Byrnes15 Oct 2024 13:31 UTC

60 points

9 comments11 min readLW link

Economics Roundup #4

Zvi15 Oct 2024 13:20 UTC

19 points

4 comments25 min readLW link

(thezvi.wordpress.com)

[Question] Is School of Thought related to the Rationality Community?

Shoshannah Tekofsky15 Oct 2024 12:41 UTC

8 points

12 comments1 min readLW link

Inverse Problems In Everyday Life

silentbob15 Oct 2024 11:42 UTC

14 points

2 comments8 min readLW link

Thinking LLMs: General Instruction Following with Thought Generation

Bogdan Ionut Cirstea15 Oct 2024 9:21 UTC

7 points

0 comments1 min readLW link

(arxiv.org)

Thoughts On the Nature of Capability Elicitation via Fine-tuning

Theodore Chapman15 Oct 2024 8:39 UTC

8 points

0 comments8 min readLW link

Minimal Motivation of Natural Latents

johnswentworth and David Lorell

14 Oct 2024 22:51 UTC

47 points

14 comments3 min readLW link

How long should political (and other) terms be?

ohmurphy14 Oct 2024 21:38 UTC

5 points

0 comments1 min readLW link

(ohmurphy.substack.com)

Examples of How I Use LLMs

jefftk14 Oct 2024 17:10 UTC

31 points

2 comments2 min readLW link

(www.jefftk.com)

It’s important to know when to stop: Mechanistic Exploration of Gemma 2 List Generation

Gerard Boxo14 Oct 2024 17:04 UTC

9 points

0 comments6 min readLW link

(gboxo.github.io)

[Question] LW resources on childhood experiences?

nahir9159514 Oct 2024 17:04 UTC

10 points

7 comments1 min readLW link

Free Will, Neurotypical Dominance, and the Path to ASI and Neuralinks: Evolving Beyond Scarcity

j_passeri14 Oct 2024 16:54 UTC

−1 points

3 comments3 min readLW link

Breakthroughs, Neurodivergence, and Working Outside the System

j_passeri14 Oct 2024 16:54 UTC

2 points

3 comments2 min readLW link

The case for unlearning that removes information from LLM weights

Fabien Roger14 Oct 2024 14:08 UTC

103 points

20 comments6 min readLW link

Circuits in Superposition: Compressing many small neural networks into one

Lucius Bushnaq and jake_mendel

14 Oct 2024 13:06 UTC

131 points

9 comments13 min readLW link

Beyond Defensive Technology

edgecase6414 Oct 2024 11:34 UTC

11 points

1 comment10 min readLW link

Why Stop AI is barricading OpenAI

Remmelt14 Oct 2024 7:12 UTC

−16 points

32 comments6 min readLW link

(docs.google.com)

The Explore vs. Exploit Dilemma

nathanjzhao14 Oct 2024 6:20 UTC

1 point

0 comments1 min readLW link

(nathanzhao.cc)

AI Alignment via Slow Substrates: Early Empirical Results With StarCraft II

Lester Leong14 Oct 2024 4:05 UTC

60 points

9 comments12 min readLW link

some questionable space launch guns

bhauth13 Oct 2024 22:52 UTC

17 points

0 comments4 min readLW link

(bhauth.com)

[Question] What are your favorite books or blogs that are out of print, or whose domains have expired (especially if they also aren’t on LibGen/Wayback/etc, or on Amazon)?

Arjun Panickssery13 Oct 2024 20:21 UTC

13 points

4 comments1 min readLW link

The Hopium Wars: the AGI Entente Delusion

Max Tegmark13 Oct 2024 17:00 UTC

236 points

60 comments9 min readLW link

Parental Writing Selection Bias

jefftk13 Oct 2024 14:00 UTC

54 points

4 comments1 min readLW link 1 review

(www.jefftk.com)

Personal Philosophy

Xor13 Oct 2024 3:01 UTC

3 points

0 comments2 min readLW link

Contagious Beliefs—Simulating Political Alignment

James Stephen Brown13 Oct 2024 0:27 UTC

8 points

0 comments2 min readLW link

(nonzerosum.games)