LessWrong Archive: Page 2
internal in nonstandard analysis
Alok Singh · Jan 11, 2023, 9:58 AM · 9 points · 1 comment · 1 min read · LW link
Compounding Resource X
Raemon · Jan 11, 2023, 3:14 AM · 77 points · 6 comments · 9 min read · LW link
Running With a Backpack
jefftk · Jan 11, 2023, 3:00 AM · 19 points · 11 comments · 1 min read · LW link · (www.jefftk.com)
A simple thought experiment showing why recessions are an unnecessary bug in our economic system
skogsnisse · Jan 11, 2023, 12:43 AM · 1 point · 1 comment · 1 min read · LW link
We don’t trade with ants
KatjaGrace · Jan 10, 2023, 11:50 PM · 272 points · 109 comments · 7 min read · LW link · 1 review · (worldspiritsockpuppet.com)
[Question] Who are the people who are currently profiting from inflation?
skogsnisse · Jan 10, 2023, 9:39 PM · 1 point · 2 comments · 1 min read · LW link
Is Progress Real?
rogersbacon · Jan 10, 2023, 5:42 PM · 5 points · 14 comments · 14 min read · LW link · (www.secretorum.life)
200 COP in MI: Interpreting Reinforcement Learning
Neel Nanda · Jan 10, 2023, 5:37 PM · 25 points · 1 comment · 10 min read · LW link
AGI and the EMH: markets are not expecting aligned or unaligned AI in the next 30 years
basil.halperin, J. Zachary Mazlish, and tmychow · Jan 10, 2023, 4:06 PM · 119 points · 44 comments · 26 min read · LW link
The Alignment Problem from a Deep Learning Perspective (major rewrite)
SoerenMind, Richard_Ngo, and LawrenceC · Jan 10, 2023, 4:06 PM · 84 points · 8 comments · 39 min read · LW link · (arxiv.org)
Against using stock prices to forecast AI timelines
basil.halperin, tmychow, and J. Zachary Mazlish · Jan 10, 2023, 4:03 PM · 23 points · 2 comments · 2 min read · LW link
Sorting Pebbles Into Correct Heaps: The Animation
Writer · Jan 10, 2023, 3:58 PM · 26 points · 2 comments · 1 min read · LW link · (youtu.be)
Escape Velocity from Bullshit Jobs
Zvi · Jan 10, 2023, 2:30 PM · 61 points · 18 comments · 5 min read · LW link · (thezvi.wordpress.com)
Scaling laws vs individual differences
beren · Jan 10, 2023, 1:22 PM · 45 points · 21 comments · 7 min read · LW link
Notes on writing
RP · Jan 10, 2023, 4:01 AM · 35 points · 11 comments · 3 min read · LW link
Idea: Learning How To Move Towards The Metagame
Algon · Jan 10, 2023, 12:58 AM · 10 points · 3 comments · 1 min read · LW link
Review AI Alignment posts to help figure out how to make a proper AI Alignment review
habryka and Raemon · Jan 10, 2023, 12:19 AM · 85 points · 31 comments · 2 min read · LW link
Against the paradox of tolerance
pchvykov · Jan 10, 2023, 12:12 AM · 1 point · 11 comments · 3 min read · LW link
Increased Scam Quality/Quantity (Hypothesis in need of data)?
Beeblebrox · Jan 9, 2023, 10:57 PM · 9 points · 6 comments · 1 min read · LW link
Wentworth and Larsen on buying time
Orpheus16, Thomas Larsen, and johnswentworth · Jan 9, 2023, 9:31 PM · 74 points · 6 comments · 12 min read · LW link
EA & LW Forum Summaries—Holiday Edition (19th Dec − 8th Jan)
Zoe Williams
Jan 9, 2023, 9:06 PM
11
points
0
comments
LW
link
GWWC Should Require Public Charity Evaluations
jefftk · Jan 9, 2023, 8:10 PM · 28 points · 0 comments · 4 min read · LW link · (www.jefftk.com)
[MLSN #7]: an example of an emergent internal optimizer
joshc and Dan H · Jan 9, 2023, 7:39 PM · 28 points · 0 comments · 6 min read · LW link
Trying to isolate objectives: approaches toward high-level interpretability
Jozdien · Jan 9, 2023, 6:33 PM · 49 points · 14 comments · 8 min read · LW link
The special nature of special relativity
adamShimi · Jan 9, 2023, 5:30 PM · 37 points · 1 comment · 3 min read · LW link · (epistemologicalvigilance.substack.com)
Pierre Menard, pixel art, and entropy
Joey Marcellino · Jan 9, 2023, 4:34 PM · 1 point · 1 comment · 6 min read · LW link
Forecasting extreme outcomes
AidanGoth · Jan 9, 2023, 4:34 PM · 4 points · 1 comment · 2 min read · LW link · (docs.google.com)
Evidence under Adversarial Conditions
PeterMcCluskey · Jan 9, 2023, 4:21 PM · 57 points · 1 comment · 3 min read · LW link · (bayesianinvestor.com)
How to Bounded Distrust
Zvi · Jan 9, 2023, 1:10 PM · 122 points · 17 comments · 4 min read · LW link · 1 review · (thezvi.wordpress.com)
Reification bias
adamShimi and Gabriel Alfour · Jan 9, 2023, 12:22 PM · 25 points · 6 comments · 2 min read · LW link
Big list of AI safety videos
JakubK · Jan 9, 2023, 6:12 AM · 11 points · 2 comments · 1 min read · LW link · (docs.google.com)
Rationality Practice: Self-Deception
Darmani · Jan 9, 2023, 4:07 AM · 6 points · 0 comments · 1 min read · LW link
Wolf Incident Postmortem
jefftk · Jan 9, 2023, 3:20 AM · 137 points · 13 comments · 1 min read · LW link · (www.jefftk.com)
You’re Not One “You”—How Decision Theories Are Talking Past Each Other
keith_wynroe · Jan 9, 2023, 1:21 AM · 28 points · 11 comments · 8 min read · LW link
On Blogging and Podcasting
DanielFilan · Jan 9, 2023, 12:40 AM · 18 points · 6 comments · 11 min read · LW link · (danielfilan.com)
ChatGPT tells stories about XP-708-DQ, Eliezer, dragons, dark sorceresses, and unaligned robots becoming aligned
Bill Benzon · Jan 8, 2023, 11:21 PM · 6 points · 2 comments · 18 min read · LW link
Simulacra are Things
janus · Jan 8, 2023, 11:03 PM · 63 points · 7 comments · 2 min read · LW link
[Question] GPT learning from smarter texts?
Viliam · Jan 8, 2023, 10:23 PM · 26 points · 7 comments · 1 min read · LW link
Latent variable prediction markets mockup + designer request
tailcalled · Jan 8, 2023, 10:18 PM · 25 points · 4 comments · 1 min read · LW link
Citability of Lesswrong and the Alignment Forum
Leon Lang · Jan 8, 2023, 10:12 PM · 48 points · 2 comments · 1 min read · LW link
I tried to learn as much Deep Learning math as I could in 24 hours
Phosphorous · Jan 8, 2023, 9:07 PM · 31 points · 2 comments · 7 min read · LW link
[Question] What specific thing would you do with AI Alignment Research Assistant GPT?
quetzal_rainbow · Jan 8, 2023, 7:24 PM · 47 points · 9 comments · 1 min read · LW link
[Question] Research ideas (AI Interpretability & Neurosciences) for a 2-months project
flux · Jan 8, 2023, 3:36 PM UTC · 3 points · 1 comment · 1 min read · LW link
200 COP in MI: Image Model Interpretability
Neel Nanda · Jan 8, 2023, 2:53 PM UTC · 18 points · 3 comments · 6 min read · LW link
Halifax Monthly Meetup: Moloch in the HRM
Ideopunk · Jan 8, 2023, 2:49 PM UTC · 10 points · 0 comments · 1 min read · LW link
Dangers of deference
TsviBT · Jan 8, 2023, 2:36 PM UTC · 62 points · 5 comments · 2 min read · LW link
Could evolution produce something truly aligned with its own optimization standards? What would an answer to this mean for AI alignment?
No77e · Jan 8, 2023, 11:04 AM UTC · 3 points · 4 comments · 1 min read · LW link
AI psychology should ground the theories of AI consciousness and inform human-AI ethical interaction design
Roman Leventov · Jan 8, 2023, 6:37 AM UTC · 20 points · 8 comments · 2 min read · LW link
Stop Talking to Each Other and Start Buying Things: Three Decades of Survival in the Desert of Social Media
the gears to ascension · Jan 8, 2023, 4:45 AM UTC · 1 point · 14 comments · 1 min read · LW link · (catvalente.substack.com)
Can Ads be GDPR Compliant?
jefftk · Jan 8, 2023, 2:50 AM UTC · 39 points · 10 comments · 7 min read · LW link · (www.jefftk.com)