All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 20242025

All Jan Feb MarAprMay Jun

All 1 2 3 4 5 6 7 8 91011 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

[Question] How familiar is the Lesswrong community as a whole with the concept of Reward-modelling?

OxidizeApr 9, 2025, 11:33 PM

1 point

8 comments1 min readLW link

What can we learn from expert AGI forecasts?

Benjamin_ToddApr 9, 2025, 9:34 PM

5 points

0 comments5 min readLW link

(80000hours.org)

Thoughts on AI 2027

Max HarmsApr 9, 2025, 9:26 PM

222 points

61 comments21 min readLW link

(intelligence.org)

The case for AGI by 2030

Benjamin_ToddApr 9, 2025, 8:35 PM

40 points

6 comments42 min readLW link

(80000hours.org)

Anti-automation policy as a bottleneck to economic growth

mhamptonApr 9, 2025, 8:12 PM

4 points

0 comments4 min readLW link

Reasoning models don’t always say what they think

Joe Benton, Ethan Perez, Vlad Mikulik and Fabien Roger

Apr 9, 2025, 7:48 PM

28 points

4 comments1 min readLW link

(www.anthropic.com)

Reverse engineering the memory layout of GPU inference

Paul BricmanApr 9, 2025, 3:40 PM

5 points

0 comments6 min readLW link

(noemaresearch.com)

How to defeat superintelligence, the Sta-Hi way

kilgoarApr 9, 2025, 1:58 PM

−8 points

0 comments3 min readLW link

Llama Does Not Look Good 4 Anything

ZviApr 9, 2025, 1:20 PM

31 points

1 comment16 min readLW link

(thezvi.wordpress.com)

Learned pain as a leading cause of chronic pain

SoerenMindApr 9, 2025, 11:57 AM

203 points

38 comments9 min readLW link

Does the universe’s recognition of measurement provide stronger evidence for being in a simulation than universal fine-tuning?

ameliaApr 9, 2025, 8:20 AM

0 points

2 comments4 min readLW link

Taxonomy of possibility

dkl9Apr 9, 2025, 4:24 AM

13 points

1 comment5 min readLW link

(dkl9.net)

Short Timelines Don’t Devalue Long Horizon Research

Vladimir_NesovApr 9, 2025, 12:42 AM

167 points

24 comments1 min readLW link

A Platform for Falsifiable Conjectures and Public Refutation — Would This Be Useful?

PetrusNoniusApr 8, 2025, 9:09 PM

1 point

1 comment1 min readLW link

Quantifying SAE Quality with Feature Steerability Metrics

phenomanonApr 8, 2025, 8:55 PM

2 points

0 comments4 min readLW link

MATS is hiring!

Ryan Kidd and VVN

Apr 8, 2025, 8:45 PM

8 points

0 comments6 min readLW link

birds and mammals independently evolved intelligence

bhauthApr 8, 2025, 8:00 PM

73 points

23 comments1 min readLW link

(www.quantamagazine.org)

Alignment Faking Revisited: Improved Classifiers and Open Source Extensions

John Hughes, abhayesian, Akbir Khan and Fabien Roger

Apr 8, 2025, 5:32 PM

146 points

20 comments12 min readLW link

London Working Group for Short/Medium Term AI Risks

scronkfinkleApr 8, 2025, 5:32 PM

5 points

0 comments2 min readLW link

Thinking Machines

Knight LeeApr 8, 2025, 5:27 PM

3 points

0 comments6 min readLW link

Digital Error Correction and Lock-In

alamertonApr 8, 2025, 3:46 PM

1 point

0 comments5 min readLW link

(alfielamerton.substack.com)

[Question] What faithfulness metrics should general claims about CoT faithfulness be based upon?

Rauno ArikeApr 8, 2025, 3:27 PM

24 points

0 comments4 min readLW link

AI 2027: Responses

ZviApr 8, 2025, 12:50 PM

109 points

3 comments30 min readLW link

(thezvi.wordpress.com)

The first AI war will be in your computer

ViliamApr 8, 2025, 9:28 AM

43 points

10 comments3 min readLW link

Who wants to bet me $25k at 1:7 odds that there won’t be an AI market crash in the next year?

RemmeltApr 8, 2025, 8:31 AM

32 points

19 comments1 min readLW link

A Pathway to Fully Autonomous Therapists

Declan MolonyApr 8, 2025, 4:10 AM

5 points

2 comments6 min readLW link

Rethinking Friction: Equity and Motivation Across Domains

eltimbalinoApr 8, 2025, 3:58 AM

−1 points

0 comments2 min readLW link

(www.lesswrong.com)

On different discussion traditions

Eugene ShcherbininApr 7, 2025, 11:00 PM

1 point

0 comments2 min readLW link

Misinformation is the default, and information is the government telling you your tap water is safe to drink

danielechlinApr 7, 2025, 10:28 PM

10 points

2 comments9 min readLW link

Log-linear Scaling is Worth the Cost due to Gains in Long-Horizon Tasks

shash42Apr 7, 2025, 9:50 PM

16 points

2 comments1 min readLW link

Paper Highlights, March ’25

gasteigerjoApr 7, 2025, 8:17 PM

8 points

0 comments9 min readLW link

(aisafetyfrontier.substack.com)

Factory farming intelligent minds

Odd anonApr 7, 2025, 8:05 PM

2 points

5 comments20 min readLW link

What alignment-relevant abilities might Terence Tao lack?

Towards_KeeperhoodApr 7, 2025, 7:44 PM

12 points

2 comments3 min readLW link

[Question] Are there any (semi-)detailed future scenarios where we win?

Jan BetleyApr 7, 2025, 7:13 PM

15 points

3 comments1 min readLW link

Austin Chen on Winning, Risk-Taking, and FTX

ElizabethApr 7, 2025, 7:00 PM

35 points

3 comments1 min readLW link

(acesounderglass.com)

An Unbiased Evaluation of My Debate with Thane Ruthenis—Run It Yourself

funnyfrancoApr 7, 2025, 6:56 PM

−24 points

14 comments2 min readLW link

American College Admissions Doesn’t Need to Be So Competitive

Arjun PanicksseryApr 7, 2025, 5:35 PM

48 points

20 comments6 min readLW link

(arjunpanickssery.substack.com)

Coupling for Decouplers

Jacob FalkovichApr 7, 2025, 3:40 PM

15 points

3 comments8 min readLW link

Moonlight Reflected

Jacob FalkovichApr 7, 2025, 3:35 PM

11 points

0 comments9 min readLW link

Navigation by Moonlight

Jacob FalkovichApr 7, 2025, 3:32 PM

24 points

39 comments8 min readLW link

You Are Not a Thought Experiment

Jacob FalkovichApr 7, 2025, 3:27 PM

5 points

0 comments9 min readLW link

Love is Love, Science is Fake

Jacob FalkovichApr 7, 2025, 3:19 PM

17 points

2 comments10 min readLW link

Coupling for Decouplers — Intro

Jacob FalkovichApr 7, 2025, 3:12 PM

9 points

0 comments1 min readLW link

The world according to ChatGPT

Richard_Kennaway7 Apr 2025 13:44 UTC

11 points

0 comments2 min readLW link

AI 2027: Dwarkesh’s Podcast with Daniel Kokotajlo and Scott Alexander

Zvi7 Apr 2025 13:40 UTC

67 points

2 comments26 min readLW link

(thezvi.wordpress.com)

Arguing all sides with ChatGPT 4.5

Richard_Kennaway7 Apr 2025 13:10 UTC

6 points

0 comments8 min readLW link

The Same Heaven

Lukas Petersson7 Apr 2025 12:57 UTC

3 points

1 comment5 min readLW link

(lukaspetersson.com)

Breaking down the MEAT of Alignment

JasonBrown7 Apr 2025 8:47 UTC

7 points

2 comments11 min readLW link

Well-foundedness as an organizing principle of healthy minds and societies

Richard_Ngo7 Apr 2025 0:31 UTC

35 points

7 comments6 min readLW link

(www.mindthefuture.info)

Arusha Perpetual Chicken—an unlikely iterated game

James Stephen Brown6 Apr 2025 22:56 UTC

15 points

1 comment5 min readLW link

(nonzerosum.games)