All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 141516 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Alignment is not enough

Alan ChanJan 12, 2023, 12:33 AM

12 points

6 comments11 min readLW link

(coordination.substack.com)

How it feels to have your mind hacked by an AI

blakedJan 12, 2023, 12:33 AM

367 points

222 comments17 min readLW link

Categorical-measure-theoretic approach to optimal policies tending to seek power

jacekJan 12, 2023, 12:32 AM

31 points

3 comments6 min readLW link

Any person/mind should have the right to suicide

askofaJan 12, 2023, 12:32 AM

14 points

13 comments2 min readLW link

Have we really forsaken natural selection?

KatjaGraceJan 12, 2023, 12:10 AM

34 points

7 comments2 min readLW link

(worldspiritsockpuppet.com)

[Question] Using Finite Factored Sets for Causal Representation Learning?

David ReberJan 11, 2023, 10:06 PM

2 points

3 comments1 min readLW link

GWWC’s Handling of Conflicting Funding Bars

jefftkJan 11, 2023, 8:30 PM

19 points

0 comments3 min readLW link

(www.jefftk.com)

How to write a big cartesian product symbol in MathJax

Matthias G. MayerJan 11, 2023, 8:21 PM

8 points

1 comment1 min readLW link

What’s the deal with AI consciousness?

TW123Jan 11, 2023, 4:37 PM

6 points

13 comments9 min readLW link

(aiwatchtower.substack.com)

[Question] Any significant updates on long covid risk analysis?

Randomized, ControlledJan 11, 2023, 2:31 PM

23 points

11 comments1 min readLW link

internal in nonstandard analysis

Alok SinghJan 11, 2023, 9:58 AM

9 points

1 comment1 min readLW link

Compounding Resource X

RaemonJan 11, 2023, 3:14 AM

77 points

6 comments9 min readLW link

Running With a Backpack

jefftkJan 11, 2023, 3:00 AM

19 points

11 comments1 min readLW link

(www.jefftk.com)

A simple thought experiment showing why recessions are an unnecessary bug in our economic system

skogsnisseJan 11, 2023, 12:43 AM

1 point

1 comment1 min readLW link

We don’t trade with ants

KatjaGraceJan 10, 2023, 11:50 PM

272 points

109 comments7 min readLW link 1 review

(worldspiritsockpuppet.com)

[Question] Who are the people who are currently profiting from inflation?

skogsnisseJan 10, 2023, 9:39 PM

1 point

2 comments1 min readLW link

Is Progress Real?

rogersbaconJan 10, 2023, 5:42 PM

5 points

14 comments14 min readLW link

(www.secretorum.life)

200 COP in MI: Interpreting Reinforcement Learning

Neel NandaJan 10, 2023, 5:37 PM

25 points

1 comment10 min readLW link

AGI and the EMH: markets are not expecting aligned or unaligned AI in the next 30 years

basil.halperin, J. Zachary Mazlish and tmychow

Jan 10, 2023, 4:06 PM

119 points

44 comments26 min readLW link

The Alignment Problem from a Deep Learning Perspective (major rewrite)

SoerenMind, Richard_Ngo and LawrenceC

Jan 10, 2023, 4:06 PM

84 points

8 comments39 min readLW link

(arxiv.org)

Against using stock prices to forecast AI timelines

basil.halperin, tmychow and J. Zachary Mazlish

Jan 10, 2023, 4:03 PM

23 points

2 comments2 min readLW link

Sorting Pebbles Into Correct Heaps: The Animation

WriterJan 10, 2023, 3:58 PM

26 points

2 comments1 min readLW link

(youtu.be)

Escape Velocity from Bullshit Jobs

ZviJan 10, 2023, 2:30 PM

61 points

18 comments5 min readLW link

(thezvi.wordpress.com)

Scaling laws vs individual differences

berenJan 10, 2023, 1:22 PM

45 points

21 comments7 min readLW link

Notes on writing

RPJan 10, 2023, 4:01 AM

35 points

11 comments3 min readLW link

Idea: Learning How To Move Towards The Metagame

AlgonJan 10, 2023, 12:58 AM

10 points

3 comments1 min readLW link

Review AI Alignment posts to help figure out how to make a proper AI Alignment review

habryka and Raemon

Jan 10, 2023, 12:19 AM

85 points

31 comments2 min readLW link

Against the paradox of tolerance

pchvykovJan 10, 2023, 12:12 AM

1 point

11 comments3 min readLW link

Increased Scam Quality/Quantity (Hypothesis in need of data)?

BeeblebroxJan 9, 2023, 10:57 PM

9 points

6 comments1 min readLW link

Wentworth and Larsen on buying time

Orpheus16, Thomas Larsen and johnswentworth

Jan 9, 2023, 9:31 PM

74 points

6 comments12 min readLW link

EA & LW Forum Summaries—Holiday Edition (19th Dec − 8th Jan)

Zoe WilliamsJan 9, 2023, 9:06 PM

11 points

0 comments LW link

GWWC Should Require Public Charity Evaluations

jefftkJan 9, 2023, 8:10 PM

28 points

0 comments4 min readLW link

(www.jefftk.com)

[MLSN #7]: an example of an emergent internal optimizer

joshc and Dan H

Jan 9, 2023, 7:39 PM

28 points

0 comments6 min readLW link

Trying to isolate objectives: approaches toward high-level interpretability

JozdienJan 9, 2023, 6:33 PM

49 points

14 comments8 min readLW link

The special nature of special relativity

adamShimiJan 9, 2023, 5:30 PM

37 points

1 comment3 min readLW link

(epistemologicalvigilance.substack.com)

Pierre Menard, pixel art, and entropy

Joey MarcellinoJan 9, 2023, 4:34 PM

1 point

1 comment6 min readLW link

Forecasting extreme outcomes

AidanGothJan 9, 2023, 4:34 PM

4 points

1 comment2 min readLW link

(docs.google.com)

Evidence under Adversarial Conditions

PeterMcCluskeyJan 9, 2023, 4:21 PM

57 points

1 comment3 min readLW link

(bayesianinvestor.com)

How to Bounded Distrust

ZviJan 9, 2023, 1:10 PM

122 points

17 comments4 min readLW link 1 review

(thezvi.wordpress.com)

Reification bias

adamShimi and Gabriel Alfour

Jan 9, 2023, 12:22 PM

25 points

6 comments2 min readLW link

Big list of AI safety videos

JakubKJan 9, 2023, 6:12 AM

11 points

2 comments1 min readLW link

(docs.google.com)

Rationality Practice: Self-Deception

Darmani9 Jan 2023 4:07 UTC

6 points

0 comments1 min readLW link

Wolf Incident Postmortem

jefftk9 Jan 2023 3:20 UTC

137 points

13 comments1 min readLW link

(www.jefftk.com)

You’re Not One “You”—How Decision Theories Are Talking Past Each Other

keith_wynroe9 Jan 2023 1:21 UTC

28 points

11 comments8 min readLW link

On Blogging and Podcasting

DanielFilan9 Jan 2023 0:40 UTC

18 points

6 comments11 min readLW link

(danielfilan.com)

ChatGPT tells stories about XP-708-DQ, Eliezer, dragons, dark sorceresses, and unaligned robots becoming aligned

Bill Benzon8 Jan 2023 23:21 UTC

6 points

2 comments18 min readLW link

Simulacra are Things

janus8 Jan 2023 23:03 UTC

63 points

7 comments2 min readLW link

[Question] GPT learning from smarter texts?

Viliam8 Jan 2023 22:23 UTC

26 points

7 comments1 min readLW link

Latent variable prediction markets mockup + designer request

tailcalled8 Jan 2023 22:18 UTC

25 points

4 comments1 min readLW link

Citability of Lesswrong and the Alignment Forum

Leon Lang8 Jan 2023 22:12 UTC

48 points

2 comments1 min readLW link