All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 151617 18 19 20 21 22 23 24 25 26 27 28 29 30

Hyperpolation

Gunnar_Zarncke15 Sep 2024 21:37 UTC

23 points

6 comments1 min readLW link

(arxiv.org)

[Question] If I wanted to spend WAY more on AI, what would I spend it on?

Logan Zoellner15 Sep 2024 21:24 UTC

53 points

16 comments1 min readLW link

Superintelligence Can’t Solve the Problem of Deciding What You’ll Do

Vladimir_Nesov15 Sep 2024 21:03 UTC

30 points

11 comments1 min readLW link

For Limited Superintelligences, Epistemic Exclusion is Harder than Robustness to Logical Exploitation

Lorec15 Sep 2024 20:49 UTC

3 points

9 comments3 min readLW link

Why I funded PIBBSS

Ryan Kidd15 Sep 2024 19:56 UTC

116 points

21 comments3 min readLW link

Thirty random thoughts about AI alignment

Lysandre Terrisse15 Sep 2024 16:24 UTC

6 points

1 comment29 min readLW link

Proveably Safe Self Driving Cars [Modulo Assumptions]

Davidmanheim15 Sep 2024 13:58 UTC

27 points

29 comments8 min readLW link

SCP Foundation—Anti memetic Division Hub

landscape_kiwi15 Sep 2024 13:40 UTC

6 points

1 comment1 min readLW link

(scp-wiki.wikidot.com)

Did Christopher Hitchens change his mind about waterboarding?

Isaac King15 Sep 2024 8:28 UTC

176 points

24 comments7 min readLW link

Not every accommodation is a Curb Cut Effect: The Handicapped Parking Effect, the Clapper Effect, and more

Michael Cohn15 Sep 2024 5:27 UTC

80 points

39 comments10 min readLW link

(perplexedguide.net)

AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space

Bogdan Ionut Cirstea14 Sep 2024 23:23 UTC

17 points

1 comment1 min readLW link

(arxiv.org)

How you can help pass important AI legislation with 10 minutes of effort

ThomasW14 Sep 2024 22:10 UTC

59 points

2 comments2 min readLW link

[Question] Calibration training for ‘percentile rankings’?

david reinstein14 Sep 2024 21:51 UTC

3 points

0 comments2 min readLW link

OpenAI o1, Llama 4, and AlphaZero of LLMs

Vladimir_Nesov14 Sep 2024 21:27 UTC

83 points

25 comments1 min readLW link

Forever Leaders

paksa14 Sep 2024 20:55 UTC

6 points

9 comments1 min readLW link

Emergent Authorship: Creativity à la Communing

gswonk14 Sep 2024 19:02 UTC

1 point

0 comments3 min readLW link

Compression Moves for Prediction

adamShimi14 Sep 2024 17:51 UTC

20 points

0 comments7 min readLW link

(epistemologicalfascinations.substack.com)

Avoiding the Bog of Moral Hazard for AI

Nathan Helm-Burger13 Sep 2024 21:24 UTC

19 points

13 comments2 min readLW link

[Question] If I ask an LLM to think step by step, how big are the steps?

ryan_b13 Sep 2024 20:30 UTC

7 points

1 comment1 min readLW link

Estimating Tail Risk in Neural Networks

Mark Xu13 Sep 2024 20:00 UTC

68 points

9 comments23 min readLW link

(www.alignment.org)

If-Then Commitments for AI Risk Reduction [by Holden Karnofsky]

habryka13 Sep 2024 19:38 UTC

28 points

0 comments20 min readLW link

(carnegieendowment.org)

Can startups be impactful in AI safety?

Esben Kran and Archana Vaidheeswaran

13 Sep 2024 19:00 UTC

15 points

0 comments6 min readLW link

I just can’t agree with AI safety. Why am I wrong?

Ya Polkovnik13 Sep 2024 17:48 UTC

0 points

5 comments2 min readLW link

Keeping it (less than) real: Against ℶ₂ possible people or worlds

quiet_NaN13 Sep 2024 17:29 UTC

17 points

3 comments9 min readLW link

Why I’m bearish on mechanistic interpretability: the shards are not in the network

tailcalled13 Sep 2024 17:09 UTC

24 points

40 comments1 min readLW link

Increasing the Span of the Set of Ideas

Jeffrey Heninger13 Sep 2024 15:52 UTC

7 points

1 comment9 min readLW link

How difficult is AI Alignment?

Sammy Martin13 Sep 2024 15:47 UTC

46 points

6 comments23 min readLW link

The Great Data Integration Schlep

sarahconstantin13 Sep 2024 15:40 UTC

285 points

20 comments9 min readLW link

(sarahconstantin.substack.com)

“Real AGI”

Seth Herd13 Sep 2024 14:13 UTC

20 points

20 comments3 min readLW link

AI, centralization, and the One Ring

owencb13 Sep 2024 14:00 UTC

82 points

14 comments8 min readLW link 1 review

(strangecities.substack.com)

Evidence against Learned Search in a Chess-Playing Neural Network

p.b.13 Sep 2024 11:59 UTC

57 points

3 comments6 min readLW link

My career exploration: Tools for building confidence

lynettebye13 Sep 2024 11:37 UTC

21 points

0 comments20 min readLW link

Contra papers claiming superhuman AI forecasting

nikos, Peter Mühlbacher, Lawrence Phillips and dschwarz

12 Sep 2024 18:10 UTC

182 points

16 comments7 min readLW link

OpenAI o1

Zach Stein-Perlman12 Sep 2024 17:30 UTC

146 points

41 comments1 min readLW link

How to Give in to Threats (without incentivizing them)

Mikhail Samin12 Sep 2024 15:55 UTC

75 points

33 comments5 min readLW link

Open Problems in AIXI Agent Foundations

Cole Wyeth12 Sep 2024 15:38 UTC

42 points

2 comments10 min readLW link

On the destruction of America’s best high school

Chris_Leong12 Sep 2024 15:30 UTC

−6 points

7 comments1 min readLW link

(scottaaronson.blog)

Optimising under arbitrarily many constraint equations

dkl912 Sep 2024 14:59 UTC

6 points

0 comments3 min readLW link

(dkl9.net)

AI #81: Alpha Proteo

Zvi12 Sep 2024 13:00 UTC

59 points

3 comments35 min readLW link

(thezvi.wordpress.com)

[Question] When can I be numerate?

FinalFormal212 Sep 2024 4:05 UTC

22 points

4 comments1 min readLW link

A Nonconstructive Existence Proof of Aligned Superintelligence

Roko12 Sep 2024 3:20 UTC

0 points

80 comments1 min readLW link

(transhumanaxiology.substack.com)

Collapsing the Belief/Knowledge Distinction

Jeremias11 Sep 2024 21:24 UTC

−7 points

8 comments1 min readLW link

Programming Refusal with Conditional Activation Steering

Bruce W. Lee11 Sep 2024 20:57 UTC

41 points

0 comments11 min readLW link

(brucewlee.com)

Checking public figures on whether they “answered the question” quick analysis from Harris/Trump debate, and a proposal

david reinstein11 Sep 2024 20:25 UTC

8 points

4 comments1 min readLW link

(open.substack.com)

AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics

Corin Katzke, Corin Katzke, Julius, andrewz and Dan H

11 Sep 2024 19:14 UTC

5 points

1 comment5 min readLW link

(newsletter.safe.ai)

Refactoring cryonics as structural brain preservation

Andy_McKenzie11 Sep 2024 18:36 UTC

108 points

14 comments3 min readLW link

[Question] Is this a Pivotal Weak Act? Creating bacteria that decompose metal

doomyeser11 Sep 2024 18:07 UTC

9 points

9 comments3 min readLW link

How to discover the nature of sentience, and ethics

Gustavo Ramires11 Sep 2024 17:22 UTC

4 points

5 comments5 min readLW link

Seeking Mechanism Designer for Research into Internalizing Catastrophic Externalities

c.trout11 Sep 2024 15:09 UTC

24 points

2 comments3 min readLW link

Could Things Be Very Different?—How Historical Inertia Might Blind Us To Optimal Solutions

James Stephen Brown11 Sep 2024 9:53 UTC

5 points

0 comments8 min readLW link

(nonzerosum.games)