LessWrong Archive: Page 3
[Linkpost] Remarks on the Convergence in Distribution of Random Neural Networks to Gaussian Processes in the Infinite Width Limit · carboniferous_umbraculum · Nov 30, 2023, 2:01 PM · 9 points · 0 comments · 1 min read · LW link (drive.google.com)
[Question] Buy Nothing Day is a great idea with a terrible app— why has nobody built a killer app for crowdsourced ‘effective communism’ yet? · lillybaeum · Nov 30, 2023, 1:47 PM · 8 points · 17 comments · 1 min read · LW link
[Question] Comprehensible Input is the only way people learn languages—is it the only way people *learn*? · lillybaeum · Nov 30, 2023, 1:31 PM · 8 points · 2 comments · 3 min read · LW link
Some Intuitions for the Ethicophysics · MadHatter and mishka · Nov 30, 2023, 6:47 AM · 2 points · 4 comments · 8 min read · LW link
The Alignment Agenda THEY Don’t Want You to Know About · MadHatter · Nov 30, 2023, 4:29 AM · −19 points · 16 comments · 1 min read · LW link
Cis fragility · [deactivated] · Nov 30, 2023, 4:14 AM · −51 points · 9 comments · 3 min read · LW link
Homework Answer: Glicko Ratings for War · MadHatter · Nov 30, 2023, 4:08 AM · −45 points · 1 comment · 77 min read · LW link (gist.github.com)
[Question] Feature Request for LessWrong · MadHatter · Nov 30, 2023, 3:19 AM · 11 points · 8 comments · 1 min read · LW link
My Alignment Research Agenda (“the Ethicophysics”) · MadHatter · Nov 30, 2023, 2:57 AM · −13 points · 0 comments · 1 min read · LW link
[Question] Stupid Question: Why am I getting consistently downvoted? · MadHatter · Nov 30, 2023, 12:21 AM · 31 points · 138 comments · 1 min read · LW link
Inositol Non-Results · Elizabeth · Nov 29, 2023, 9:40 PM · 20 points · 2 comments · 1 min read · LW link (acesounderglass.com)
Losing Metaphors: Zip and Paste · jefftk · Nov 29, 2023, 8:31 PM · 26 points · 6 comments · 1 min read · LW link (www.jefftk.com)
Preserving our heritage: Building a movement and a knowledge ark for current and future generations · rnk8 · Nov 29, 2023, 7:20 PM · 0 points · 5 comments · 12 min read · LW link
AGI Alignment is Absurd · Youssef Mohamed · Nov 29, 2023, 7:11 PM · −9 points · 4 comments · 3 min read · LW link
The origins of the steam engine: An essay with interactive animated diagrams · jasoncrawford · Nov 29, 2023, 6:30 PM · 30 points · 1 comment · 1 min read · LW link (rootsofprogress.org)
ChatGPT 4 solved all the gotcha problems I posed that tripped ChatGPT 3.5 · VipulNaik · Nov 29, 2023, 6:11 PM · 33 points · 16 comments · 14 min read · LW link
“Clean” vs. “messy” goal-directedness (Section 2.2.3 of “Scheming AIs”) · Joe Carlsmith · Nov 29, 2023, 4:32 PM · 29 points · 1 comment · 11 min read · LW link
Lying Alignment Chart · Zack_M_Davis · Nov 29, 2023, 4:15 PM · 78 points · 17 comments · 1 min read · LW link
Rethink Priorities: Seeking Expressions of Interest for Special Projects Next Year · kierangreig · Nov 29, 2023, 1:59 PM · 4 points · 0 comments · 5 min read · LW link
[Question] Thoughts on teletransportation with copies? · titotal · Nov 29, 2023, 12:56 PM · 15 points · 13 comments · 1 min read · LW link
Interpretability with Sparse Autoencoders (Colab exercises) · CallumMcDougall · Nov 29, 2023, 12:56 PM · 76 points · 9 comments · 4 min read · LW link
The 101 Space You Will Always Have With You · Screwtape · Nov 29, 2023, 4:56 AM · 280 points · 23 comments · 6 min read · LW link · 1 review
Trust your intuition—Kahneman’s book misses the forest for the trees · mnvr · Nov 29, 2023, 4:37 AM · −2 points · 2 comments · 2 min read · LW link
Process Substitution Without Shell? · jefftk · Nov 29, 2023, 3:20 AM · 19 points · 18 comments · 2 min read · LW link (www.jefftk.com)
Deception Chess: Game #2 · Zane · Nov 29, 2023, 2:43 AM · 29 points · 17 comments · 2 min read · LW link
Black Box Biology · GeneSmith · Nov 29, 2023, 2:27 AM · 65 points · 30 comments · 2 min read · LW link
[Question] What would be the shelf life of nuclear weapon-secrecy if nuclear weapons had not immediately been used in combat? · Gram Stone · Nov 29, 2023, 12:53 AM · 7 points · 2 comments · 1 min read · LW link
Scaling laws for dominant assurance contracts · jessicata · Nov 28, 2023, 11:11 PM · 36 points · 5 comments · 7 min read · LW link (unstableontology.com)
I’m confused about innate smell neuroanatomy · Steven Byrnes · Nov 28, 2023, 8:49 PM · 40 points · 2 comments · 9 min read · LW link
How to Control an LLM’s Behavior (why my P(DOOM) went down) · RogerDearnaley · Nov 28, 2023, 7:56 PM · 65 points · 30 comments · 11 min read · LW link
[Question] Is there a word for discrimination against A.I.? · Aaron Bohannon · Nov 28, 2023, 7:03 PM · 1 point · 4 comments · 1 min read · LW link
Update #2 to “Dominant Assurance Contract Platform”: EnsureDone · moyamo · Nov 28, 2023, 6:02 PM · 33 points · 2 comments · 1 min read · LW link
Ethicophysics II: Politics is the Mind-Savior · MadHatter · Nov 28, 2023, 4:27 PM · −9 points · 9 comments · 4 min read · LW link (bittertruths.substack.com)
Neither EA nor e/acc is what we need to build the future · jasoncrawford · Nov 28, 2023, 4:04 PM · 7 points · 22 comments · 3 min read · LW link (rootsofprogress.org)
Agentic Growth · Logan Kieller · Nov 28, 2023, 3:45 PM · 1 point · 0 comments · 3 min read · LW link (logankieller.substack.com)
AISC project: How promising is automating alignment research? (literature review) · Bogdan Ionut Cirstea · Nov 28, 2023, 2:47 PM · 4 points · 1 comment · 1 min read · LW link (docs.google.com)
A day in the life of a mechanistic interpretability researcher · Bill Benzon · Nov 28, 2023, 2:45 PM · 3 points · 3 comments · 1 min read · LW link
Two sources of beyond-episode goals (Section 2.2.2 of “Scheming AIs”) · Joe Carlsmith · Nov 28, 2023, 1:49 PM · 11 points · 1 comment · 15 min read · LW link
Self-Referential Probabilistic Logic Admits the Payor’s Lemma · Yudhister Kumar · Nov 28, 2023, 10:27 AM · 80 points · 14 comments · 6 min read · LW link
[Question] How can I use AI without increasing AI-risk? · Yoav Ravid · Nov 28, 2023, 10:05 AM · 18 points · 6 comments · 1 min read · LW link
A Reading From The Book Of Sequences · Screwtape · Nov 28, 2023, 6:45 AM · 8 points · 0 comments · 4 min read · LW link
Anthropic Fall 2023 Debate Progress Update · Ansh Radhakrishnan · Nov 28, 2023, 5:37 AM · 76 points · 9 comments · 12 min read · LW link
Apocalypse insurance, and the hardline libertarian take on AI risk · So8res · Nov 28, 2023, 2:09 AM · 135 points · 40 comments · 7 min read · LW link · 1 review
My techno-optimism [By Vitalik Buterin] · habryka · Nov 27, 2023, 11:53 PM · 107 points · 17 comments · 2 min read · LW link (www.lesswrong.com)
[Question] Could Germany have won World War I with high probability given the benefit of hindsight? · Roko · Nov 27, 2023, 10:52 PM · 10 points · 18 comments · 1 min read · LW link
[Question] Could World War I have been prevented given the benefit of hindsight? · Roko · Nov 27, 2023, 10:39 PM · 16 points · 8 comments · 1 min read · LW link
AISC 2024 - Project Summaries · NickyP · Nov 27, 2023, 10:32 PM · 48 points · 3 comments · 18 min read · LW link
“Epistemic range of motion” and LessWrong moderation · habryka and Gabriel Alfour · Nov 27, 2023, 9:58 PM · 65 points · 3 comments · 12 min read · LW link
Apply to the Conceptual Boundaries Workshop for AI Safety · Chris Lakin · Nov 27, 2023, 9:04 PM · 50 points · 0 comments · 3 min read · LW link
There is no IQ for AI · Gabriel Alfour · Nov 27, 2023, 6:21 PM · 30 points · 10 comments · 9 min read · LW link (cognition.cafe)