All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 2829

Conspiracy Theorists Aren’t Ignorant. They’re Bad At Epistemology.

Bentham's Bulldog28 Feb 2024 23:39 UTC

19 points

10 comments5 min readLW link

Discovering alignment windfalls reduces AI risk

goodgravy and stuhlmueller

28 Feb 2024 21:23 UTC

15 points

1 comment8 min readLW link

(blog.elicit.com)

my theory of the industrial revolution

bhauth28 Feb 2024 21:07 UTC

23 points

7 comments3 min readLW link

(www.bhauth.com)

Wholesomeness and Effective Altruism

owencb28 Feb 2024 20:28 UTC

42 points

3 comments10 min readLW link

timestamping through the Singularity

throwaway91811912728 Feb 2024 19:09 UTC

−2 points

4 comments8 min readLW link

Evidential Cooperation in Large Worlds: Potential Objections & FAQ

Chi Nguyen and _will_

28 Feb 2024 18:58 UTC

46 points

5 comments18 min readLW link

Timaeus’s First Four Months

Jesse Hoogland, Daniel Murfet, Stan van Wingerden and Alexander Gietelink Oldenziel

28 Feb 2024 17:01 UTC

173 points

6 comments6 min readLW link

Notes on control evaluations for safety cases

ryan_greenblatt, Buck and Fabien Roger

28 Feb 2024 16:15 UTC

49 points

0 comments32 min readLW link

Corporate Governance for Frontier AI Labs: A Research Agenda

Matthew Wearden28 Feb 2024 11:29 UTC

5 points

0 comments16 min readLW link

(matthewwearden.co.uk)

How AI Will Change Education

robotelvis28 Feb 2024 5:30 UTC

6 points

4 comments5 min readLW link

(messyprogress.substack.com)

Band Lessons?

jefftk28 Feb 2024 3:00 UTC

13 points

3 comments1 min readLW link

(www.jefftk.com)

New LessWrong review winner UI (“The LeastWrong” section and full-art post pages)

kave28 Feb 2024 2:42 UTC

107 points

65 comments1 min readLW link

Counting arguments provide no evidence for AI doom

Nora Belrose and Quintin Pope

27 Feb 2024 23:03 UTC

98 points

200 comments14 min readLW link 2 reviews

Which animals realize which types of subjective welfare?

MichaelStJules27 Feb 2024 19:31 UTC

4 points

0 comments18 min readLW link

Biosecurity and AI: Risks and Opportunities

Steve Newman27 Feb 2024 18:45 UTC

11 points

1 comment7 min readLW link

(www.safe.ai)

The Gemini Incident Continues

Zvi27 Feb 2024 16:00 UTC

45 points

6 comments48 min readLW link

(thezvi.wordpress.com)

How I internalized my achievements to better deal with negative feelings

Raymond Koopmanschap27 Feb 2024 15:10 UTC

42 points

7 comments6 min readLW link

On Frustration and Regret

silentbob27 Feb 2024 12:19 UTC

8 points

0 comments4 min readLW link

San Francisco ACX Meetup “Third Saturday”

Nate Sternberg and guenael

27 Feb 2024 7:07 UTC

7 points

0 comments1 min readLW link

Examining Language Model Performance with Reconstructed Activations using Sparse Autoencoders

Evan Anders and Joseph Bloom

27 Feb 2024 2:43 UTC

43 points

16 comments15 min readLW link

Project idea: an iterated prisoner’s dilemma competition/game

Adam Zerner26 Feb 2024 23:06 UTC

8 points

0 comments5 min readLW link

Acting Wholesomely

owencb26 Feb 2024 21:49 UTC

58 points

69 comments16 min readLW link 3 reviews

Getting rational now or later: navigating procrastination and time-inconsistent preferences for new rationalists

milo_thoughts26 Feb 2024 19:38 UTC

1 point

0 comments8 min readLW link

[Question] Whom Do You Trust?

JackOfAllTrades26 Feb 2024 19:38 UTC

1 point

0 comments1 min readLW link

Boundary Violations vs Boundary Dissolution

Chris Lakin26 Feb 2024 18:59 UTC

8 points

4 comments1 min readLW link

[Question] Can we get an AI to “do our alignment homework for us”?

Chris_Leong26 Feb 2024 7:56 UTC

55 points

33 comments1 min readLW link

How I build and run behavioral interviews

benkuhn26 Feb 2024 5:50 UTC

32 points

6 comments4 min readLW link

(www.benkuhn.net)

Hidden Cognition Detection Methods and Benchmarks

Paul Colognese26 Feb 2024 5:31 UTC

22 points

11 comments4 min readLW link

Cellular respiration as a steam engine

dkl925 Feb 2024 20:17 UTC

24 points

1 comment1 min readLW link

(dkl9.net)

[Question] Rationalism and Dependent Origination?

Baometrus25 Feb 2024 18:16 UTC

2 points

3 comments1 min readLW link

China-AI forecasts

NathanBarnard25 Feb 2024 16:49 UTC

40 points

29 comments6 min readLW link

Ideological Bayesians

Kevin Dorst25 Feb 2024 14:17 UTC

105 points

5 comments10 min readLW link

(kevindorst.substack.com)

Deconfusing In-Context Learning

Arjun Panickssery25 Feb 2024 9:48 UTC

37 points

1 comment2 min readLW link

Everett branches, inter-light cone trade and other alien matters: Appendix to “An ECL explainer”

Chi Nguyen and _will_

24 Feb 2024 23:09 UTC

17 points

0 comments11 min readLW link

Cooperating with aliens and AGIs: An ECL explainer

Chi Nguyen, _will_ and Orpheus16

24 Feb 2024 22:58 UTC

57 points

8 comments20 min readLW link

Choosing My Quest (Part 2 of “The Sense Of Physical Necessity”)

LoganStrohl24 Feb 2024 21:31 UTC

40 points

7 comments12 min readLW link

Rationality Research Report: Towards 10x OODA Looping?

Raemon24 Feb 2024 21:06 UTC

118 points

26 comments15 min readLW link

Exercise: Planmaking, Surprise Anticipation, and “Baba is You”

Raemon24 Feb 2024 20:33 UTC

71 points

31 comments6 min readLW link

In search of God.

Spiritus Dei24 Feb 2024 18:59 UTC

−19 points

3 comments7 min readLW link

Impossibility of Anthropocentric-Alignment

False Name24 Feb 2024 18:31 UTC

−8 points

2 comments39 min readLW link

The Inner Alignment Problem

Jakub Halmeš24 Feb 2024 17:55 UTC

1 point

1 comment3 min readLW link

(jakubhalmes.substack.com)

We Need Major, But Not Radical, FDA Reform

Maxwell Tabarrok24 Feb 2024 16:54 UTC

42 points

12 comments7 min readLW link

(www.maximum-progress.com)

After Overmorrow: Scattered Musings on the Immediate Post-AGI World

Yuli_Ban24 Feb 2024 15:49 UTC

−3 points

0 comments26 min readLW link

[Question] CDT vs. EDT on Deterrence

Terence Coelho24 Feb 2024 15:41 UTC

1 point

9 comments1 min readLW link

Balancing Games

jefftk24 Feb 2024 14:40 UTC

63 points

18 comments1 min readLW link

(www.jefftk.com)

How well do truth probes generalise?

mishajw24 Feb 2024 14:12 UTC

96 points

11 comments9 min readLW link

Rawls’s Veil of Ignorance Doesn’t Make Any Sense

Arjun Panickssery24 Feb 2024 13:18 UTC

9 points

9 comments1 min readLW link

[Question] Can someone explain to me what went wrong with ChatGPT?

Valentin Baltadzhiev24 Feb 2024 11:50 UTC

9 points

1 comment1 min readLW link

The Sense Of Physical Necessity: A Naturalism Demo (Introduction)

LoganStrohl24 Feb 2024 2:56 UTC

59 points

1 comment6 min readLW link

Instrumental deception and manipulation in LLMs—a case study

Olli Järviniemi24 Feb 2024 2:07 UTC

39 points

13 comments12 min readLW link