All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 101112 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Movie posters

KatjaGraceMar 6, 2024, 6:20 AM

40 points

0 comments2 min readLW link

(worldspiritsockpuppet.com)

We Inspected Every Head In GPT-2 Small using SAEs So You Don’t Have To

robertzk, Connor Kissane, Arthur Conmy and Neel Nanda

Mar 6, 2024, 5:03 AM

63 points

0 comments12 min readLW link

[Question] Does anyone know good essays on how different AI timelines will affect asset prices?

Tim LiptrotMar 6, 2024, 4:21 AM

8 points

2 comments1 min readLW link

Twin Cities ACX Meetup—March 2024

Timothy M.Mar 5, 2024, 9:15 PM

1 point

0 comments1 min readLW link

My Clients, The Liars

ymeskhoutMar 5, 2024, 9:06 PM

249 points

86 comments7 min readLW link

If Ukraine fails, the world will reap fatal consequences

Danylo ZhyrkoMar 5, 2024, 7:42 PM

−22 points

14 comments5 min readLW link

Making Connections with ChatGPT: The Macksey Game

Bill BenzonMar 5, 2024, 6:15 PM

5 points

2 comments11 min readLW link

[Question] Good taxonomies of all risks (small or large) from AI?

Aryeh EnglanderMar 5, 2024, 6:15 PM

6 points

1 comment1 min readLW link

[Question] Making 2023 ACX Prediction Results Public

LegionnaireMar 5, 2024, 5:56 PM

3 points

9 comments1 min readLW link

Social status part 2/2: everything else

Steven ByrnesMar 5, 2024, 4:29 PM

65 points

2 comments23 min readLW link

Social status part 1/2: negotiations over object-level preferences

Steven ByrnesMar 5, 2024, 4:29 PM

118 points

15 comments21 min readLW link

Two Tales of AI Takeover: My Doubts

Violet HourMar 5, 2024, 3:51 PM

30 points

8 comments29 min readLW link

Research Report: Sparse Autoencoders find only 9/180 board state features in OthelloGPT

Robert_AIZIMar 5, 2024, 1:55 PM

61 points

24 comments10 min readLW link

(aizi.substack.com)

Read the Roon

ZviMar 5, 2024, 1:50 PM

136 points

6 comments19 min readLW link

(thezvi.wordpress.com)

In defense of anthropically updating EDT

Anthony DiGiovanniMar 5, 2024, 6:21 AM

18 points

17 comments13 min readLW link

Claude Doesn’t Want to Die

garrisonMar 5, 2024, 6:00 AM

22 points

3 comments LW link

(garrisonlovely.substack.com)

Many arguments for AI x-risk are wrong

TurnTroutMar 5, 2024, 2:31 AM

162 points

87 comments12 min readLW link

Some ways of spending your time are better than others

depressurizeMar 4, 2024, 11:21 PM

6 points

5 comments4 min readLW link

Claude 3 claims it’s conscious, doesn’t want to die or be modified

Mikhail SaminMar 4, 2024, 11:05 PM

81 points

118 comments14 min readLW link

Modifying Jones’ “AI Dilemma” Model

harsimonyMar 4, 2024, 9:55 PM

7 points

0 comments6 min readLW link

(splittinginfinity.substack.com)

Benefits of adding poison to your DMT

George3d6Mar 4, 2024, 8:35 PM

6 points

2 comments5 min readLW link

(morelucid.substack.com)

Notes on Awe

David GrossMar 4, 2024, 8:23 PM

20 points

1 comment33 min readLW link

Boston’s Line 1

jefftkMar 4, 2024, 7:30 PM

12 points

0 comments1 min readLW link

(www.jefftk.com)

Anthropic release Claude 3, claims >GPT-4 Performance

LawrenceCMar 4, 2024, 6:23 PM

115 points

41 comments2 min readLW link

(www.anthropic.com)

Anomalous Concept Detection for Detecting Hidden Cognition

Paul CologneseMar 4, 2024, 4:52 PM

24 points

3 comments10 min readLW link

INTERVIEW: StakeOut.AI w/ Dr. Peter Park

jacobhaimesMar 4, 2024, 4:35 PM

6 points

0 comments1 min readLW link

(into-ai-safety.github.io)

Housing Roundup #7

ZviMar 4, 2024, 3:00 PM

42 points

1 comment44 min readLW link

(thezvi.wordpress.com)

The Solution to Sleeping Beauty

Ape in the coatMar 4, 2024, 6:46 AM

18 points

77 comments13 min readLW link

Are we so good to simulate?

KatjaGraceMar 4, 2024, 5:20 AM

38 points

24 comments2 min readLW link

(worldspiritsockpuppet.com)

The Broken Screwdriver and other parables

bhauthMar 4, 2024, 3:34 AM

49 points

1 comment2 min readLW link

Grief is a fire sale

Nathan YoungMar 4, 2024, 1:11 AM

77 points

1 comment4 min readLW link

[Question] Good HPMoR scenes / passages?

PhilGoetzMar 3, 2024, 10:42 PM

15 points

17 comments1 min readLW link

Attending Sold-Out Beantown Stomp

jefftkMar 3, 2024, 9:30 PM

9 points

0 comments1 min readLW link

(www.jefftk.com)

AI things that are perhaps as important as human-controlled AI

Chi NguyenMar 3, 2024, 6:07 PM

55 points

4 comments LW link

A tedious and effective way to learn 汉字 (Chinese characters)

dkl9Mar 3, 2024, 4:41 PM

7 points

1 comment2 min readLW link

(dkl9.net)

Some costs of superposition

Linda LinseforsMar 3, 2024, 4:08 PM

46 points

11 comments3 min readLW link

[Question] If you controlled the first agentic AGI, what would you set as its first task(s)?

sweenesmMar 3, 2024, 2:16 PM

−13 points

5 comments2 min readLW link

Self-Resolving Prediction Markets

PeterMcCluskeyMar 3, 2024, 2:39 AM

33 points

0 comments3 min readLW link

(bayesianinvestor.com)

[Question] Increase the tax value of donations with high-variance investments?

Brendan LongMar 3, 2024, 1:39 AM

20 points

4 comments2 min readLW link

Common Philosophical Mistakes, according to Joe Schmid [videos]

DanielFilanMar 3, 2024, 12:15 AM

8 points

3 comments1 min readLW link

(www.youtube.com)

Agreeing With Stalin in Ways That Exhibit Generally Rationalist Principles

Zack_M_DavisMar 2, 2024, 10:05 PM

27 points

25 comments58 min readLW link

(unremediatedgender.space)

The World in 2029

Nathan YoungMar 2, 2024, 6:03 PM

74 points

37 comments3 min readLW link

The Most Dangerous Idea

rogersbaconMar 2, 2024, 5:53 PM

−8 points

2 comments26 min readLW link

(www.secretorum.life)

Future life

DavidMadsen2 Mar 2024 15:41 UTC

−12 points

2 comments2 min readLW link

Ugo Conti’s Whistle-Controlled Synthesizer

jefftk2 Mar 2024 2:50 UTC

15 points

1 comment2 min readLW link

(www.jefftk.com)

A one-sentence formulation of the AI X-Risk argument I try to make

tcelferact2 Mar 2024 0:44 UTC

3 points

0 comments LW link

If you weren’t such an idiot...

kave and Mark Xu

2 Mar 2024 0:01 UTC

157 points

76 comments2 min readLW link

(markxu.com)

Increasing IQ is trivial

George3d61 Mar 2024 22:43 UTC

38 points

61 comments6 min readLW link

(epistemink.substack.com)

self-fulfilling prophecies when applying for funding

Chris Lakin1 Mar 2024 19:01 UTC

31 points

0 comments1 min readLW link

(chipmonk.substack.com)

Antagonistic AI

Xybermancer1 Mar 2024 18:50 UTC

−8 points

1 comment1 min readLW link