All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28

Are short timelines actually bad?

joshcFeb 5, 2023, 9:21 PM

61 points

7 comments3 min readLW link

Stanzas On Power Calculation

DirectedEvolutionFeb 5, 2023, 7:15 PM

9 points

0 comments1 min readLW link

A List of things I might do with a Proof Oracle

Logan ZoellnerFeb 5, 2023, 6:14 PM

−14 points

13 comments3 min readLW link

Teaching Simple Boundaries

jefftkFeb 5, 2023, 5:30 PM

23 points

0 comments2 min readLW link

(www.jefftk.com)

Control

TsviBTFeb 5, 2023, 4:16 PM

21 points

14 comments9 min readLW link

Have an idea? Come to Oxford to discuss and write (20 – 24 March)

RP, Flourish Journal and Jemima

Feb 5, 2023, 3:05 PM

20 points

0 comments1 min readLW link

H5N1 - thread for information sharing, planning, and action

MathiasKBFeb 5, 2023, 12:44 PM

31 points

8 comments LW link

Second call: CFP for Rebellion and Disobedience in AI workshop

Ram RachumFeb 5, 2023, 12:18 PM

2 points

0 comments2 min readLW link

Research Direction: Be the AGI you want to see in the world

scottviteri, sudo and Lauro Langosco

Feb 5, 2023, 7:15 AM

44 points

0 comments7 min readLW link

Sex is Good, Actually

Gordon Seidoh WorleyFeb 5, 2023, 6:33 AM

41 points

8 comments4 min readLW link

Questions about AI that bother me

Eleni AngelouFeb 5, 2023, 5:04 AM

13 points

6 comments2 min readLW link

Evaluations (of new AI Safety researchers) can be noisy

LawrenceCFeb 5, 2023, 4:15 AM

132 points

11 comments16 min readLW link 1 review

Pandemic Prediction Checklist: H5N1 (6/14)

DirectedEvolutionFeb 5, 2023, 3:26 AM

50 points

10 comments7 min readLW link

Podcast with Oli Habryka on LessWrong / Lightcone Infrastructure

DanielFilanFeb 5, 2023, 2:52 AM

88 points

20 comments1 min readLW link

(thefilancabinet.com)

Misleading Fast Charging Specs

jefftkFeb 5, 2023, 2:50 AM

9 points

3 comments1 min readLW link

(www.jefftk.com)

I hired 5 people to sit behind me and make me productive for a month

Simon BerensFeb 5, 2023, 1:19 AM

252 points

83 comments10 min readLW link

(www.simonberens.com)

Modal Fixpoint Cooperation without Löb’s Theorem

Andrew_CritchFeb 5, 2023, 12:58 AM

134 points

34 comments3 min readLW link 1 review

 Who invented knitting? The plot thickens

eukaryoteFeb 5, 2023, 12:24 AM

60 points

9 comments19 min readLW link

(eukaryotewritesblog.com)

Some miscellaneous thoughts on ChatGPT, stories, and mechanical interpretability

Bill BenzonFeb 4, 2023, 7:35 PM

2 points

0 comments3 min readLW link

O(“AGI Safety”)>O(“Stop Tyrants”)

AnthonyRepettoFeb 4, 2023, 6:38 PM

−4 points

11 comments1 min readLW link

Monthly Doom Argument Threads? Doom Argument Wiki?

LVSNFeb 4, 2023, 4:59 PM

3 points

0 comments1 min readLW link

The Future of Structured Self Improvement

EvenflairFeb 4, 2023, 4:02 PM

27 points

4 comments1 min readLW link

(guildoftherose.org)

Empathy as a natural consequence of learnt reward models

berenFeb 4, 2023, 3:35 PM

48 points

27 comments13 min readLW link

Mech Interp Project Advising Call: Memorisation in GPT-2 Small

Neel NandaFeb 4, 2023, 2:17 PM

7 points

0 comments1 min readLW link

Do IQ tests measure intelligence? - A prediction market on my future beliefs about the topic

tailcalledFeb 4, 2023, 11:19 AM

1 point

10 comments1 min readLW link

(manifold.markets)

AXRP Episode 19 - Mechanistic Interpretability with Neel Nanda

DanielFilanFeb 4, 2023, 3:00 AM

45 points

0 comments117 min readLW link

The 2/3 rule for multi-factor authentication

RomanHaukssonFeb 4, 2023, 2:57 AM

4 points

0 comments1 min readLW link

(roman.computer)

Path-Dependence in ChatGPT’s Political Outputs

lsusrFeb 4, 2023, 2:02 AM

28 points

4 comments4 min readLW link

Fucking Goddamn Basics of Rationalist Discourse

LoganStrohlFeb 4, 2023, 1:47 AM

356 points

103 comments1 min readLW link 3 reviews

Small Talk is Good, Actually

Gordon Seidoh WorleyFeb 4, 2023, 12:38 AM

52 points

9 comments3 min readLW link

Update on Book Review Dominant Assurance Contract

Arjun PanicksseryFeb 3, 2023, 11:16 PM

9 points

0 comments LW link

[Question] 2+2=π√2+n

Logan ZoellnerFeb 3, 2023, 10:27 PM

16 points

15 comments1 min readLW link

[Question] If I encounter a capabilities paper that kinda spooks me, what should I do with it?

the gears to ascensionFeb 3, 2023, 9:37 PM

28 points

8 comments1 min readLW link

[Question] What Are The Preconditions/Prerequisites for Asymptotic Analysis?

DragonGodFeb 3, 2023, 9:26 PM

8 points

2 comments1 min readLW link

[Linkpost] Google invested $300M in Anthropic in late 2022

Orpheus16Feb 3, 2023, 7:13 PM

73 points

14 comments1 min readLW link

(www.ft.com)

Many AI governance proposals have a tradeoff between usefulness and feasibility

Orpheus16 and Carson Ezell

Feb 3, 2023, 6:49 PM

22 points

2 comments2 min readLW link

Reply to Duncan Sabien on Strawmanning

Zack_M_DavisFeb 3, 2023, 5:57 PM

43 points

11 comments4 min readLW link

Semi-rare plain language words that are great to remember

LVSNFeb 3, 2023, 4:33 PM

4 points

7 comments1 min readLW link

[Question] What qualities does an AGI need to have to realize the risk of false vacuum, without hardcoding physics theories into it?

RationalSieveFeb 3, 2023, 4:00 PM

1 point

4 comments1 min readLW link

Housing and Transit Roundup #3

ZviFeb 3, 2023, 3:10 PM

21 points

6 comments16 min readLW link

(thezvi.wordpress.com)

Taboo P(doom)

NathanBarnardFeb 3, 2023, 10:37 AM

14 points

10 comments1 min readLW link

ChatGPT: Tantalizing afterthoughts in search of story trajectories [induction heads]

Bill BenzonFeb 3, 2023, 10:35 AM

4 points

0 comments20 min readLW link

Jordan Peterson: Guru/Villain

Bryan Frances3 Feb 2023 9:02 UTC

−14 points

6 comments9 min readLW link

[Question] What is the risk of asking a counterfactual oracle a question that already had its answer erased?

Chris_Leong3 Feb 2023 3:13 UTC

7 points

0 comments1 min readLW link

I don’t think MIRI “gave up”

Raemon3 Feb 2023 0:26 UTC

106 points

64 comments4 min readLW link

What fact that you know is true but most people aren’t ready to accept it?

lorepieri3 Feb 2023 0:06 UTC

47 points

211 comments1 min readLW link

[Question] Monotonous Work

Gideon Bauer2 Feb 2023 21:35 UTC

1 point

0 comments1 min readLW link

Is AI risk assessment too anthropocentric?

Craig Mattson2 Feb 2023 21:34 UTC

3 points

6 comments1 min readLW link

Halifax Monthly Meetup: Introduction to Effective Altruism

Ideopunk2 Feb 2023 21:10 UTC

10 points

0 comments1 min readLW link

Conditioning Predictive Models: Outer alignment via careful conditioning

evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson and kcwoolverton

2 Feb 2023 20:28 UTC

72 points

15 comments57 min readLW link