All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 272829 30

My techno-optimism [By Vitalik Buterin]

habryka27 Nov 2023 23:53 UTC

107 points

17 comments2 min readLW link

(www.lesswrong.com)

[Question] Could Germany have won World War I with high probability given the benefit of hindsight?

Roko27 Nov 2023 22:52 UTC

10 points

18 comments1 min readLW link

[Question] Could World War I have been prevented given the benefit of hindsight?

Roko27 Nov 2023 22:39 UTC

16 points

8 comments1 min readLW link

AISC 2024 - Project Summaries

NickyP27 Nov 2023 22:32 UTC

48 points

3 comments18 min readLW link

“Epistemic range of motion” and LessWrong moderation

habryka and Gabriel Alfour

27 Nov 2023 21:58 UTC

65 points

3 comments12 min readLW link

Apply to the Conceptual Boundaries Workshop for AI Safety

Chris Lakin27 Nov 2023 21:04 UTC

50 points

0 comments3 min readLW link

There is no IQ for AI

Gabriel Alfour27 Nov 2023 18:21 UTC

31 points

10 comments9 min readLW link

(cognition.cafe)

Two concepts of an “episode” (Section 2.2.1 of “Scheming AIs”)

Joe Carlsmith27 Nov 2023 18:01 UTC

19 points

1 comment13 min readLW link

[Linkpost] George Mack’s Razors

trevor27 Nov 2023 17:53 UTC

38 points

8 comments3 min readLW link

(twitter.com)

On possible cross-fertilization between AI and neuroscience [Creativity]

Bill Benzon27 Nov 2023 16:50 UTC

15 points

22 comments7 min readLW link

Ethicophysics I

MadHatter27 Nov 2023 15:44 UTC

−1 points

16 comments1 min readLW link

(open.substack.com)

Sentience Institute 2023 End of Year Summary

michael_dello27 Nov 2023 12:11 UTC

11 points

0 comments5 min readLW link

(www.sentienceinstitute.org)

[Question] A Question about Corrigibility (2015)

A.H.27 Nov 2023 12:05 UTC

4 points

2 comments1 min readLW link

Appendices to the live agendas

technicalities and Stag

27 Nov 2023 11:10 UTC

16 points

4 comments1 min readLW link

Shallow review of live agendas in alignment & safety

technicalities and Stag

27 Nov 2023 11:10 UTC

351 points

73 comments29 min readLW link 1 review

Napoleon stole the Roman Inquisition archives and investigated the Galileo case

Meow P27 Nov 2023 9:41 UTC

−3 points

0 comments1 min readLW link

(www.cricetuscricetus.co.uk)

[Question] why did OpenAI employees sign

bhauth27 Nov 2023 5:21 UTC

49 points

23 comments1 min readLW link

Unknown Probabilities

transhumanist_atom_understander27 Nov 2023 2:30 UTC

28 points

1 comment4 min readLW link

Justification for Induction

Krantz27 Nov 2023 2:05 UTC

2 points

25 comments5 min readLW link

Situational awareness (Section 2.1 of “Scheming AIs”)

Joe Carlsmith26 Nov 2023 23:00 UTC

10 points

5 comments8 min readLW link

AXRP Episode 26 - AI Governance with Elizabeth Seger

DanielFilan26 Nov 2023 23:00 UTC

14 points

0 comments66 min readLW link

Solving Two-Sided Adverse Selection with Prediction Market Matchmaking

Saul Munn26 Nov 2023 20:10 UTC

16 points

7 comments4 min readLW link

(www.brasstacks.blog)

Wikipedia is not so great, and what can be done about it.

euserx26 Nov 2023 19:13 UTC

0 points

27 comments16 min readLW link

(forum.effectivealtruism.org)

[Question] Help me solve this problem: The basilisk isn’t real, but people are

canary_itm26 Nov 2023 17:44 UTC

−19 points

4 comments1 min readLW link

Twin Cities ACX Meetup—December 2023

Timothy M.26 Nov 2023 17:32 UTC

1 point

1 comment1 min readLW link

Spaced repetition for teaching two-year olds how to read (Interview)

Chris Lakin26 Nov 2023 16:52 UTC

49 points

9 comments5 min readLW link

(chrislakin.blog)

Paper out now on creatine and cognitive performance

Fabienne26 Nov 2023 10:58 UTC

63 points

2 comments1 min readLW link

Why Q*, if real, might be a game changer

Shmi26 Nov 2023 6:12 UTC

5 points

6 comments1 min readLW link

Moral Reality Check (a short story)

jessicata26 Nov 2023 5:03 UTC

155 points

45 comments21 min readLW link 1 review

(unstableontology.com)

Accounting for Foregone Pay

jefftk26 Nov 2023 3:30 UTC

11 points

0 comments2 min readLW link

(www.jefftk.com)

Corrigibility or DWIM is an attractive primary goal for AGI

Seth Herd25 Nov 2023 19:37 UTC

19 points

4 comments1 min readLW link

On “slack” in training (Section 1.5 of “Scheming AIs”)

Joe Carlsmith25 Nov 2023 17:51 UTC

1 point

0 comments5 min readLW link

Announcing New Beginner-friendly Book on AI Safety and Risk

Darren McKee25 Nov 2023 15:57 UTC

85 points

3 comments1 min readLW link

Fertility as Metascience

Maxwell Tabarrok25 Nov 2023 15:42 UTC

22 points

1 comment3 min readLW link

(maximumprogress.substack.com)

Reaction to “Empowerment is (almost) All We Need” : an open-ended alternative

Ryo 25 Nov 2023 15:35 UTC

9 points

3 comments5 min readLW link

How Microsoft’s ruthless employee evaluation system annihilated team collaboration.

positivesum25 Nov 2023 13:25 UTC

3 points

2 comments1 min readLW link

(tryingtruly.substack.com)

What are the results of more parental supervision and less outdoor play?

juliawise25 Nov 2023 12:52 UTC

235 points

31 comments5 min readLW link

A simple treacherous turn demonstration

Nikola Jurkovic25 Nov 2023 4:51 UTC

22 points

5 comments3 min readLW link

The two paragraph argument for AI risk

CronoDAS25 Nov 2023 2:01 UTC

25 points

8 comments1 min readLW link

Goodhart’s Law Example: Training Verifiers to Solve Math Word Problems

Chris_Leong25 Nov 2023 0:53 UTC

27 points

2 comments1 min readLW link

(arxiv.org)

Some thoughts on CBDC

PixelatedPenguin25 Nov 2023 0:32 UTC

−1 points

1 comment1 min readLW link

Testing for consequence-blindness in LLMs using the HI-ADS unit test.

David Scott Krueger24 Nov 2023 23:35 UTC

25 points

2 comments2 min readLW link

Epoch is hiring an ML Distributed Systems Senior Researcher

merilalama and Jaime Sevilla Molina

24 Nov 2023 22:33 UTC

2 points

0 comments4 min readLW link

(careers.rethinkpriorities.org)

Article Discussion And Free Pizza—St Paul

25Hour24 Nov 2023 21:02 UTC

1 point

0 comments1 min readLW link

Why focus on schemers in particular (Sections 1.3 and 1.4 of “Scheming AIs”)

Joe Carlsmith24 Nov 2023 19:18 UTC

8 points

0 comments22 min readLW link

Surviving and Shaping Long-Term Competitions: Lessons from Net Assessment

Gentzel and ihavenoahidea

24 Nov 2023 18:18 UTC

6 points

0 comments13 min readLW link

Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense

So8res24 Nov 2023 17:37 UTC

213 points

85 comments5 min readLW link 1 review

The Limitations of GPT-4

p.b.24 Nov 2023 15:30 UTC

27 points

12 comments4 min readLW link

Progress links digest, 2023-11-24: Bottlenecks of aging, Starship launches, and much more

jasoncrawford24 Nov 2023 15:25 UTC

40 points

1 comment14 min readLW link

(rootsofprogress.org)

[Question] What’s the evidence that LLMs will scale up efficiently beyond GPT4? i.e. couldn’t GPT5, etc., be very inefficient?

M. Y. Zuo24 Nov 2023 15:22 UTC

11 points

6 comments1 min readLW link