Compelling Villains and Coherent Values

Cole Wyeth 6 Oct 2024 19:53 UTC

42 points

4 comments 4 min read LW link

To Be Born in a Bag

Niko_McCarty 6 Oct 2024 17:21 UTC

19 points

1 comment 16 min read LW link

(www.asimov.press)

Whimsical Thoughts on an AI Notepad: Exploring Non-Invasive Neural Integration via Viral and Stem Cell Pathways

Pug stanky 6 Oct 2024 16:37 UTC

1 point

2 comments 4 min read LW link

Why I’m not a Bayesian

Richard_Ngo 6 Oct 2024 15:22 UTC

252 points

114 comments 10 min read LW link 3 reviews

(www.mindthefuture.info)

European Progress Conference

Martin Sustrik 6 Oct 2024 11:10 UTC

30 points

11 comments 3 min read LW link

(250bpm.substack.com)

Open Thread Fall 2024

habryka 5 Oct 2024 22:28 UTC

44 points

194 comments 1 min read LW link

[Question] Seeking AI Alignment Tutor/Advisor: $100–150/hr

MrThink 5 Oct 2024 21:28 UTC

28 points

3 comments 2 min read LW link

Interpretability of SAE Features Representing Check in ChessGPT

Jonathan Kutasov 5 Oct 2024 20:43 UTC

27 points

2 comments 8 min read LW link

2024 Election Forecasting Contest

mike20731 5 Oct 2024 20:43 UTC

4 points

0 comments 1 min read LW link

(www.mikesblog.net)

5 ways to improve CoT faithfulness

Caleb Biddulph 5 Oct 2024 20:17 UTC

46 points

40 comments 6 min read LW link

Consciousness As Recursive Reflections

Gunnar_Zarncke 5 Oct 2024 20:00 UTC

7 points

2 comments 1 min read LW link

(www.astralcodexten.com)

Musings on Text Data Wall (Oct 2024)

Vladimir_Nesov 5 Oct 2024 19:00 UTC

42 points

2 comments 5 min read LW link

Apply to the Cooperative AI PhD Fellowship by October 14th!

Lewis Hammond 5 Oct 2024 12:41 UTC

23 points

0 comments 1 min read LW link

AISafety.info: What is the “natural abstractions hypothesis”?

Algon 5 Oct 2024 12:31 UTC

38 points

2 comments 3 min read LW link

(aisafety.info)

ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct

25Hour and submarat

5 Oct 2024 11:30 UTC

34 points

2 comments 8 min read LW link

Exploring SAE features in LLMs with definition trees and token lists

mwatkins 4 Oct 2024 22:15 UTC

46 points

5 comments 6 min read LW link

AXRP Episode 37 - Jaime Sevilla on Forecasting AI

DanielFilan 4 Oct 2024 21:00 UTC

21 points

3 comments 56 min read LW link

[Question] Seeking Solutions for Aggregating Classifier Outputs

Saeid Ghafouri 4 Oct 2024 17:39 UTC

−1 points

0 comments 1 min read LW link

Amoeba roles in tech

Sindhu Shivaprasad 4 Oct 2024 17:25 UTC

12 points

0 comments 4 min read LW link

LASR Labs Spring 2025 applications are open!

Erin Robertson, Charlie Griffin, joehardie, Justin Olive and LASR Labs

4 Oct 2024 13:44 UTC

38 points

0 comments 4 min read LW link

(Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need

Sodium 3 Oct 2024 19:11 UTC

41 points

17 comments 17 min read LW link

Does natural selection favor AIs over humans?

cdkg 3 Oct 2024 18:47 UTC

20 points

1 comment 1 min read LW link

(link.springer.com)

What Hayek Taught Us About Nature

Ground Truth Data 3 Oct 2024 18:20 UTC

−1 points

6 comments 2 min read LW link

Biasing VLM Response with Visual Stimuli

Jaehyuk Lim 3 Oct 2024 18:04 UTC

5 points

0 comments 8 min read LW link

AI #84: Better Than a Podcast

Zvi 3 Oct 2024 15:00 UTC

56 points

7 comments 52 min read LW link

(thezvi.wordpress.com)

[Question] If I have some money, whom should I donate it to in order to reduce expected P(doom) the most?

KvmanThinking 3 Oct 2024 11:31 UTC

35 points

37 comments 1 min read LW link

Shutting down all competing AI projects might not buy a lot of time due to Internal Time Pressure

ThomasCederborg 3 Oct 2024 0:01 UTC

12 points

7 comments 12 min read LW link

“25 Lessons from 25 Years of Marriage” by honorary rationalist Ferrett Steinmetz

CronoDAS 2 Oct 2024 22:42 UTC

24 points

2 comments 1 min read LW link

(theferrett.substack.com)

MIT FutureTech are hiring for a Head of Operations role

peterslattery 2 Oct 2024 17:11 UTC

8 points

0 comments 4 min read LW link

Can AI Quantity beat AI Quality?

Gianluca Calcagni 2 Oct 2024 15:21 UTC

2 points

0 comments 5 min read LW link

[Intuitive self-models] 3. The Active Self

Steven Byrnes 2 Oct 2024 15:20 UTC

80 points

46 comments 27 min read LW link

AI Safety University Organizing: Early Takeaways from Thirteen Groups

agucova 2 Oct 2024 15:14 UTC

32 points

0 comments 9 min read LW link

Three main arguments that AI will save humans and one meta-argument

avturchin 2 Oct 2024 11:39 UTC

9 points

8 comments 2 min read LW link

Should we abstain from voting? (In nondeterministic elections)

B Jacobs 2 Oct 2024 10:07 UTC

5 points

8 comments 4 min read LW link

(bobjacobs.substack.com)

AI Safety at the Frontier: Paper Highlights, September ’24

gasteigerjo 2 Oct 2024 9:49 UTC

13 points

0 comments 7 min read LW link

(aisafetyfrontier.substack.com)

Self-Help Corner: Loop Detection

adamShimi 2 Oct 2024 8:33 UTC

88 points

6 comments 2 min read LW link

(formethods.substack.com)

The murderous shortcut: a toy model of instrumental convergence

Thomas Kwa 2 Oct 2024 6:48 UTC

37 points

0 comments 2 min read LW link

Switching to a Yamaha P-121 Keyboard

jefftk 2 Oct 2024 2:20 UTC

11 points

0 comments 2 min read LW link

(www.jefftk.com)

Foresight Vision Weekend 2024

Allison Duettmann 1 Oct 2024 21:59 UTC

8 points

0 comments 1 min read LW link

Happy simulations

FateGrinder 1 Oct 2024 21:05 UTC

−5 points

0 comments 2 min read LW link

Three Subtle Examples of Data Leakage

abstractapplic 1 Oct 2024 20:45 UTC

182 points

17 comments 4 min read LW link 1 review

AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary

Corin Katzke, Julius, Alexa Pan, andrewz and Dan H

1 Oct 2024 20:35 UTC

8 points

0 comments 6 min read LW link

(newsletter.safe.ai)

Retrieval Augmented Genesis

João Ribeiro Medeiros 1 Oct 2024 20:18 UTC

6 points

0 comments 29 min read LW link

Likelihood calculation with duobels

Martin Gerdes 1 Oct 2024 16:21 UTC

5 points

0 comments 6 min read LW link

Is Text Watermarking a lost cause?

egor.timatkov 1 Oct 2024 16:20 UTC

17 points

13 comments 10 min read LW link

Information dark matter

Logan Kieller 1 Oct 2024 15:05 UTC

36 points

4 comments 28 min read LW link

(logankieller.substack.com)

Conventional footnotes considered harmful

dkl9 1 Oct 2024 14:54 UTC

25 points

16 comments 1 min read LW link

(dkl9.net)

Newsom Vetoes SB 1047

Zvi 1 Oct 2024 12:20 UTC

85 points

6 comments 32 min read LW link

(thezvi.wordpress.com)

Will AI and Humanity Go to War?

Simon Goldstein 1 Oct 2024 6:35 UTC

17 points

4 comments 6 min read LW link

[Question] AMA: International School Student in China

Novice 1 Oct 2024 6:00 UTC

5 points

0 comments 1 min read LW link