All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Care Doesn’t Scale

stavrosOct 28, 2024, 11:57 AM

27 points

1 comment1 min readLW link

(stevenscrawls.com)

Your memory eventually drives confidence in each hypothesis to 1 or 0

Crazy philosopherOct 28, 2024, 9:00 AM

3 points

6 comments1 min readLW link

Nerdtrition: simple diets via spreadsheet abuse

dkl9Oct 27, 2024, 9:45 PM

8 points

0 comments3 min readLW link

(dkl9.net)

AGI Fermi Paradox

jrincaycOct 27, 2024, 8:14 PM

0 points

2 comments2 min readLW link

Substituting Talkbox for Breath Controller

jefftkOct 27, 2024, 7:10 PM

11 points

0 comments1 min readLW link

(www.jefftk.com)

Open Source Replication of Anthropic’s Crosscoder paper for model-diffing

Connor Kissane, robertzk, Arthur Conmy and Neel Nanda

Oct 27, 2024, 6:46 PM

48 points

4 comments5 min readLW link

Hiring a writer to co-author with me (Spencer Greenberg for ClearerThinking.org)

spencergOct 27, 2024, 5:34 PM

16 points

0 comments LW link

Interview with Bill O’Rourke—Russian Corruption, Putin, Applied Ethics, and More

JohnGreerOct 27, 2024, 5:11 PM

3 points

0 comments6 min readLW link

On Shifgrethor

JustisMillsOct 27, 2024, 3:30 PM

67 points

18 comments2 min readLW link

(justismills.substack.com)

The hostile telepaths problem

ValentineOct 27, 2024, 3:26 PM

383 points

89 comments15 min readLW link

[Question] What are some good ways to form opinions on controversial subjects in the current and upcoming era?

Terence CoelhoOct 27, 2024, 2:33 PM

9 points

21 comments1 min readLW link

Video lectures on the learning-theoretic agenda

Vanessa KosoyOct 27, 2024, 12:01 PM

75 points

0 comments1 min readLW link

(www.youtube.com)

Dario Amodei’s “Machines of Loving Grace” sound incredibly dangerous, for Humans

Super AGIOct 27, 2024, 5:05 AM

8 points

1 comment1 min readLW link

Electrostatic Airships?

DaemonicSigilOct 27, 2024, 4:32 AM

64 points

13 comments3 min readLW link

(pbement.com)

A suite of Vision Sparse Autoencoders

Louka Ewington-Pitsos and RRGoyal

Oct 27, 2024, 4:05 AM

25 points

0 comments1 min readLW link

Ways to think about alignment

Abhimanyu Pallavi SudhirOct 27, 2024, 1:40 AM

6 points

0 comments4 min readLW link

[Question] Is there a CFAR handbook audio option?

FinalFormal2Oct 26, 2024, 5:08 PM

16 points

0 comments1 min readLW link

Retrieval Augmented Genesis II — Holy Texts Semantics Analysis

João Ribeiro MedeirosOct 26, 2024, 5:00 PM

−1 points

0 comments11 min readLW link

A superficially plausible promising alternate Earth without lockstep

LorecOct 26, 2024, 4:04 PM

−2 points

3 comments4 min readLW link

Galatea and the windup toy

Nicolas VillarrealOct 26, 2024, 2:52 PM

−3 points

0 comments13 min readLW link

(nicolasdvillarreal.substack.com)

Why is there Nothing rather than Something?

Logan ZoellnerOct 26, 2024, 12:37 PM

27 points

3 comments4 min readLW link

The Summoned Heroine’s Prediction Markets Keep Providing Financial Services To The Demon King!

abstractapplicOct 26, 2024, 12:34 PM

164 points

16 comments7 min readLW link

AI Safety Camp 10

Robert Kralisch, Linda Linsefors and Remmelt

Oct 26, 2024, 11:08 AM

38 points

9 comments18 min readLW link

Arithmetic Models: Better Than You Think

kqrOct 26, 2024, 9:42 AM

28 points

4 comments11 min readLW link

(entropicthoughts.com)

The Case For Bullying

Alexej GerstmaierOct 26, 2024, 4:56 AM

−50 points

8 comments1 min readLW link

(lexposedtruth.com)

Is the Power Grid Sustainable?

jefftkOct 26, 2024, 2:30 AM

36 points

38 comments2 min readLW link

(www.jefftk.com)

[Question] (i no longer endorse this post) - cryonics is a pascal’s mugging?

KvmanThinkingOct 25, 2024, 11:24 PM

−12 points

4 comments1 min readLW link

A Case for Conscious Significance rather than Free Will.

James Stephen BrownOct 25, 2024, 11:20 PM

10 points

2 comments6 min readLW link

Introducing Kairos: a new AI safety fieldbuilding organization (the new home for SPAR and FSP)

agucovaOct 25, 2024, 9:59 PM

14 points

0 comments LW link

Brief analysis of OP Technical AI Safety Funding

22tomOct 25, 2024, 7:37 PM

76 points

5 comments1 min readLW link

UK AISI: Early lessons from evaluating frontier AI systems

Zach Stein-PerlmanOct 25, 2024, 7:00 PM

26 points

0 comments2 min readLW link

(www.aisi.gov.uk)

Lab governance reading list

Zach Stein-PerlmanOct 25, 2024, 6:00 PM

20 points

3 comments1 min readLW link

Enabling New Applications with Today’s Mechanistic Interpretability Toolkit

ananya_joshiOct 25, 2024, 5:53 PM

3 points

0 comments3 min readLW link

OpenAI’s cybersecurity is probably regulated by NIS Regulations

Adam JonesOct 25, 2024, 11:06 AM

11 points

2 comments2 min readLW link

(adamjones.me)

Linkpost: Memorandum on Advancing the United States’ Leadership in Artificial Intelligence

NisanOct 25, 2024, 4:37 AM

60 points

2 comments1 min readLW link

(www.whitehouse.gov)

Making a Pedalboard

jefftkOct 25, 2024, 12:10 AM

10 points

0 comments1 min readLW link

(www.jefftk.com)

What You Can Give Instead of Advice

Karl FaulksOct 24, 2024, 11:10 PM

13 points

2 comments1 min readLW link

[Question] is it possible to comment anonymously on a post?

KvmanThinkingOct 24, 2024, 10:24 PM

2 points

2 comments1 min readLW link

Logical Proof for the Emergence and Substrate Independence of Sentience

rifeOct 24, 2024, 9:08 PM

4 points

31 comments1 min readLW link

(awakenmoon.ai)

Against Job Boards: Human Capital and the Legibility Trap

vaishnav92Oct 24, 2024, 8:50 PM

6 points

1 comment5 min readLW link

IAPS: Mapping Technical Safety Research at AI Companies

Zach Stein-PerlmanOct 24, 2024, 8:30 PM

42 points

13 comments LW link

(www.iaps.ai)

Our Digital and Biological Children

EneaszOct 24, 2024, 6:36 PM

28 points

0 comments3 min readLW link

(deathisbad.substack.com)

Reflections on the Metastrategies Workshop

gw24 Oct 2024 18:30 UTC

41 points

5 comments11 min readLW link

How Should We Measure Intelligence Models: Why Use Frequency of Elemental Information Operations

hwj2024 Oct 2024 16:54 UTC

1 point

0 comments5 min readLW link

Meta AI (FAIR) latest paper integrates system-1 and system-2 thinking into reasoning models.

happy friday24 Oct 2024 16:54 UTC

8 points

0 comments1 min readLW link

Balancing Label Quantity and Quality for Scalable Elicitation

Alex Mallen24 Oct 2024 16:49 UTC

31 points

1 comment2 min readLW link

Claude Sonnet 3.5.1 and Haiku 3.5

Zvi24 Oct 2024 14:50 UTC

51 points

9 comments16 min readLW link

(thezvi.wordpress.com)

Big tech transitions are slow (with implications for AI)

jasoncrawford24 Oct 2024 14:25 UTC

36 points

16 comments4 min readLW link

(blog.rootsofprogress.org)

Derivative AT a discontinuity

Alok Singh24 Oct 2024 2:48 UTC

9 points

5 comments10 min readLW link

how to rapidly assimilate new information

dhruvmethi24 Oct 2024 2:18 UTC

9 points

3 comments8 min readLW link