Archive
AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space
Bogdan Ionut Cirstea · Sep 14, 2024, 11:23 PM · 17 points · 1 comment · 1 min read · LW link (arxiv.org)

How you can help pass important AI legislation with 10 minutes of effort
ThomasW · Sep 14, 2024, 10:10 PM · 59 points · 2 comments · 2 min read · LW link

[Question] Calibration training for ‘percentile rankings’?
david reinstein · Sep 14, 2024, 9:51 PM · 3 points · 0 comments · 2 min read · LW link

OpenAI o1, Llama 4, and AlphaZero of LLMs
Vladimir_Nesov · Sep 14, 2024, 9:27 PM · 83 points · 25 comments · 1 min read · LW link

Forever Leaders
Justice Howard · Sep 14, 2024, 8:55 PM · 6 points · 9 comments · 1 min read · LW link

Emergent Authorship: Creativity à la Communing
gswonk · Sep 14, 2024, 7:02 PM · 1 point · 0 comments · 3 min read · LW link

Compression Moves for Prediction
adamShimi · Sep 14, 2024, 5:51 PM · 20 points · 0 comments · 7 min read · LW link (epistemologicalfascinations.substack.com)

Pay-on-results personal growth: first success
Chipmonk · Sep 14, 2024, 3:39 AM · 63 points · 8 comments · 4 min read · LW link (chrislakin.blog)

Avoiding the Bog of Moral Hazard for AI
Nathan Helm-Burger · Sep 13, 2024, 9:24 PM · 19 points · 13 comments · 2 min read · LW link

[Question] If I ask an LLM to think step by step, how big are the steps?
ryan_b · Sep 13, 2024, 8:30 PM · 7 points · 1 comment · 1 min read · LW link

Estimating Tail Risk in Neural Networks
Mark Xu · Sep 13, 2024, 8:00 PM · 68 points · 9 comments · 23 min read · LW link (www.alignment.org)

If-Then Commitments for AI Risk Reduction [by Holden Karnofsky]
habryka · Sep 13, 2024, 7:38 PM · 28 points · 0 comments · 20 min read · LW link (carnegieendowment.org)

Can startups be impactful in AI safety?
Esben Kran and Archana Vaidheeswaran · Sep 13, 2024, 7:00 PM · 15 points · 0 comments · 6 min read · LW link

I just can’t agree with AI safety. Why am I wrong?
Ya Polkovnik · Sep 13, 2024, 5:48 PM · 0 points · 5 comments · 2 min read · LW link

Keeping it (less than) real: Against ℶ₂ possible people or worlds
quiet_NaN · Sep 13, 2024, 5:29 PM · 17 points · 3 comments · 9 min read · LW link

Why I’m bearish on mechanistic interpretability: the shards are not in the network
tailcalled · Sep 13, 2024, 5:09 PM · 22 points · 40 comments · 1 min read · LW link

Increasing the Span of the Set of Ideas
Jeffrey Heninger · Sep 13, 2024, 3:52 PM · 6 points · 1 comment · 9 min read · LW link

How difficult is AI Alignment?
Sammy Martin · Sep 13, 2024, 3:47 PM · 44 points · 6 comments · 23 min read · LW link

The Great Data Integration Schlep
sarahconstantin · Sep 13, 2024, 3:40 PM · 275 points · 19 comments · 9 min read · LW link (sarahconstantin.substack.com)

“Real AGI”
Seth Herd · Sep 13, 2024, 2:13 PM · 20 points · 20 comments · 3 min read · LW link

AI, centralization, and the One Ring
owencb · Sep 13, 2024, 2:00 PM · 80 points · 12 comments · 8 min read · LW link (strangecities.substack.com)

Evidence against Learned Search in a Chess-Playing Neural Network
p.b. · Sep 13, 2024, 11:59 AM · 57 points · 3 comments · 6 min read · LW link

My career exploration: Tools for building confidence
lynettebye · Sep 13, 2024, 11:37 AM · 20 points · 0 comments · 20 min read · LW link

Contra papers claiming superhuman AI forecasting
nikos, Peter Mühlbacher, Lawrence Phillips and dschwarz · Sep 12, 2024, 6:10 PM · 182 points · 16 comments · 7 min read · LW link

OpenAI o1
Zach Stein-Perlman · Sep 12, 2024, 5:30 PM · 147 points · 41 comments · 1 min read · LW link

How to Give in to Threats (without incentivizing them)
Mikhail Samin · Sep 12, 2024, 3:55 PM · 67 points · 31 comments · 5 min read · LW link

Open Problems in AIXI Agent Foundations
Cole Wyeth · Sep 12, 2024, 3:38 PM · 42 points · 2 comments · 10 min read · LW link

On the destruction of America’s best high school
Chris_Leong · Sep 12, 2024, 3:30 PM · −6 points · 7 comments · 1 min read · LW link (scottaaronson.blog)

Optimising under arbitrarily many constraint equations
dkl9 · Sep 12, 2024, 2:59 PM · 6 points · 0 comments · 3 min read · LW link (dkl9.net)

AI #81: Alpha Proteo
Zvi · Sep 12, 2024, 1:00 PM · 59 points · 3 comments · 35 min read · LW link (thezvi.wordpress.com)

[Question] When can I be numerate?
FinalFormal2 · Sep 12, 2024, 4:05 AM · 25 points · 4 comments · 1 min read · LW link

A Nonconstructive Existence Proof of Aligned Superintelligence
Roko · Sep 12, 2024, 3:20 AM · 0 points · 80 comments · 1 min read · LW link (transhumanaxiology.substack.com)

Collapsing the Belief/Knowledge Distinction
Jeremias · Sep 11, 2024, 9:24 PM · −7 points · 8 comments · 1 min read · LW link

Programming Refusal with Conditional Activation Steering
Bruce W. Lee · Sep 11, 2024, 8:57 PM · 41 points · 0 comments · 11 min read · LW link (brucewlee.com)

Checking public figures on whether they “answered the question”: quick analysis from Harris/Trump debate, and a proposal
david reinstein · Sep 11, 2024, 8:25 PM · 7 points · 4 comments · 1 min read · LW link (open.substack.com)
AI Safety Newsletter #41: The Next Generation of Compute Scale Plus, Ranking Models by Susceptibility to Jailbreaking, and Machine Ethics
Corin Katzke, Julius, andrewz and Dan H · Sep 11, 2024, 7:14 PM · 5 points · 1 comment · 5 min read · LW link (newsletter.safe.ai)
Refactoring cryonics as structural brain preservation
Andy_McKenzie · Sep 11, 2024, 6:36 PM · 101 points · 14 comments · 3 min read · LW link

[Question] Is this a Pivotal Weak Act? Creating bacteria that decompose metal
doomyeser · Sep 11, 2024, 6:07 PM · 9 points · 9 comments · 3 min read · LW link

How to discover the nature of sentience, and ethics
Gustavo Ramires · Sep 11, 2024, 5:22 PM · −2 points · 5 comments · 5 min read · LW link

Seeking Mechanism Designer for Research into Internalizing Catastrophic Externalities
c.trout · Sep 11, 2024, 3:09 PM · 24 points · 2 comments · 3 min read · LW link

Could Things Be Very Different?—How Historical Inertia Might Blind Us To Optimal Solutions
James Stephen Brown · Sep 11, 2024, 9:53 AM · 5 points · 0 comments · 8 min read · LW link (nonzerosum.games)

Reformative Hypocrisy, and Paying Close Enough Attention to Selectively Reward It.
Andrew_Critch · Sep 11, 2024, 4:41 AM · 53 points · 11 comments · 3 min read · LW link

A necessary Membrane formalism feature
ThomasCederborg · Sep 10, 2024, 9:33 PM UTC · 20 points · 6 comments · 11 min read · LW link

Formalizing the Informal (event invite)
abramdemski · Sep 10, 2024, 7:22 PM UTC · 42 points · 0 comments · 1 min read · LW link

AI #80: Never Have I Ever
Zvi · Sep 10, 2024, 5:50 PM UTC · 46 points · 20 comments · 39 min read · LW link (thezvi.wordpress.com)

The Best Lay Argument is not a Simple English Yud Essay
J Bostock · Sep 10, 2024, 5:34 PM UTC · 253 points · 15 comments · 5 min read · LW link

Economics Roundup #3
Zvi · Sep 10, 2024, 1:50 PM UTC · 44 points · 9 comments · 20 min read · LW link (thezvi.wordpress.com)

Amplify is hiring! Work with us to support field-building initiatives through digital marketing
gergogaspar · Sep 10, 2024, 8:56 AM UTC · 0 points · 1 comment · 4 min read · LW link

What bootstraps intelligence?
invertedpassion · Sep 10, 2024, 7:11 AM UTC · 2 points · 2 comments · 1 min read · LW link

Physical Therapy Sucks (but have you tried hiding it in some peanut butter?)
Declan Molony · Sep 10, 2024, 5:54 AM UTC · 16 points · 12 comments · 1 min read · LW link