On green

Joe Carlsmith21 Mar 2024 17:38 UTC
258 points
34 comments31 min readLW link

My PhD thesis: Algorithmic Bayesian Epistemology

Eric Neyman16 Mar 2024 22:56 UTC
248 points
14 comments7 min readLW link
(arxiv.org)

Failures in Kindness

silentbob26 Mar 2024 21:30 UTC
245 points
27 comments9 min readLW link

My Clients, The Liars

ymeskhout5 Mar 2024 21:06 UTC
231 points
85 comments7 min readLW link

ChatGPT can learn indirect control

Raymond D21 Mar 2024 21:11 UTC
212 points
23 comments1 min readLW link

Modern Transformers are AGI, and Human-Level

abramdemski26 Mar 2024 17:46 UTC
205 points
89 comments5 min readLW link

“How could I have thought that faster?”

mesaoptimizer11 Mar 2024 10:56 UTC
200 points
30 comments2 min readLW link
(twitter.com)

My Interview With Cade Metz on His Reporting About Slate Star Codex

Zack_M_Davis26 Mar 2024 17:18 UTC
188 points
186 comments6 min readLW link

Daniel Kahneman has died

DanielFilan27 Mar 2024 15:59 UTC
183 points
11 comments1 min readLW link
(www.washingtonpost.com)

Toward a Broader Conception of Adverse Selection

Ricki Heicklen14 Mar 2024 22:40 UTC
174 points
61 comments13 min readLW link
(bayesshammai.substack.com)

‘Empiricism!’ as Anti-Epistemology

Eliezer Yudkowsky14 Mar 2024 2:02 UTC
165 points
84 comments25 min readLW link

Many arguments for AI x-risk are wrong

TurnTrout5 Mar 2024 2:31 UTC
153 points
76 comments12 min readLW link

Vernor Vinge, who coined the term “Technological Singularity”, dies at 79

Kaj_Sotala21 Mar 2024 22:14 UTC
148 points
24 comments1 min readLW link
(arstechnica.com)

On Devin

Zvi18 Mar 2024 13:20 UTC
147 points
30 comments11 min readLW link
(thezvi.wordpress.com)

Some (problematic) aesthetics of what constitutes good work in academia

Steven Byrnes11 Mar 2024 17:47 UTC
146 points
12 comments12 min readLW link

Using axis lines for good or evil

dynomight6 Mar 2024 14:47 UTC
140 points
39 comments4 min readLW link
(dynomight.net)

The Worst Form Of Government (Except For Everything Else We’ve Tried)

johnswentworth17 Mar 2024 18:11 UTC
140 points
46 comments4 min readLW link

Read the Roon

Zvi5 Mar 2024 13:50 UTC
130 points
6 comments19 min readLW link
(thezvi.wordpress.com)

Community Notes by X

NicholasKees18 Mar 2024 17:13 UTC
123 points
15 comments7 min readLW link

If you weren’t such an idiot...

2 Mar 2024 0:01 UTC
119 points
60 comments2 min readLW link
(markxu.com)

Anthropic release Claude 3, claims >GPT-4 Performance

LawrenceC4 Mar 2024 18:23 UTC
114 points
40 comments2 min readLW link
(www.anthropic.com)

The Parable Of The Fallen Pendulum—Part 1

johnswentworth1 Mar 2024 0:25 UTC
111 points
32 comments2 min readLW link

Social status part 1/2: negotiations over object-level preferences

Steven Byrnes5 Mar 2024 16:29 UTC
110 points
15 comments21 min readLW link

General Thoughts on Secular Solstice

Jeffrey Heninger23 Mar 2024 18:48 UTC
98 points
60 comments8 min readLW link

Notes from a Prompt Factory

Richard_Ngo10 Mar 2024 5:13 UTC
98 points
19 comments9 min readLW link
(www.narrativeark.xyz)

Notes on Dwarkesh Patel’s Podcast with Demis Hassabis

Zvi1 Mar 2024 16:30 UTC
93 points
0 comments8 min readLW link
(thezvi.wordpress.com)

On attunement

Joe Carlsmith25 Mar 2024 12:47 UTC
92 points
8 comments22 min readLW link

OpenAI: The Board Expands

Zvi12 Mar 2024 14:00 UTC
92 points
1 comment30 min readLW link
(thezvi.wordpress.com)

“Deep Learning” Is Function Approximation

Zack_M_Davis21 Mar 2024 17:50 UTC
91 points
28 comments10 min readLW link
(zackmdavis.net)

Introducing METR’s Autonomy Evaluation Resources

15 Mar 2024 23:16 UTC
90 points
0 comments1 min readLW link
(metr.github.io)

New report: Safety Cases for AI

joshc20 Mar 2024 16:45 UTC
90 points
13 comments1 min readLW link
(twitter.com)

Announcing Neuronpedia: Platform for accelerating research into Sparse Autoencoders

25 Mar 2024 21:17 UTC
89 points
7 comments7 min readLW link

Simple versus Short: Higher-order degeneracy and error-correction

Daniel Murfet11 Mar 2024 7:52 UTC
89 points
5 comments12 min readLW link

SAE reconstruction errors are (empirically) pathological

wesg29 Mar 2024 16:37 UTC
88 points
15 comments8 min readLW link

Anxiety vs. Depression

Sable17 Mar 2024 0:15 UTC
84 points
33 comments3 min readLW link
(affablyevil.substack.com)

LessOnline (May 31—June 2, Berkeley, CA)

Ben Pace26 Mar 2024 2:34 UTC
82 points
15 comments1 min readLW link
(Less.Online)

Stagewise Development in Neural Networks

20 Mar 2024 19:54 UTC
81 points
1 comment11 min readLW link

Natural Latents: The Concepts

20 Mar 2024 18:21 UTC
80 points
16 comments19 min readLW link

[Linkpost] Practically-A-Book Review: Rootclaim $100,000 Lab Leak Debate

trevor28 Mar 2024 16:03 UTC
77 points
22 comments2 min readLW link
(www.astralcodexten.com)

On Claude 3.0

Zvi6 Mar 2024 18:50 UTC
75 points
5 comments31 min readLW link
(thezvi.wordpress.com)

Vote on Anthropic Topics to Discuss

Ben Pace6 Mar 2024 19:43 UTC
75 points
55 comments1 min readLW link

The Parable Of The Fallen Pendulum—Part 2

johnswentworth12 Mar 2024 21:41 UTC
74 points
8 comments4 min readLW link

Nick Bostrom’s new book, “Deep Utopia”, is out today

PeterH27 Mar 2024 11:24 UTC
73 points
5 comments1 min readLW link
(nickbostrom.com)

The World in 2029

Nathan Young2 Mar 2024 18:03 UTC
70 points
37 comments3 min readLW link

The Cognitive-Theoretic Model of the Universe: A Partial Summary and Review

jessicata27 Mar 2024 19:59 UTC
70 points
31 comments36 min readLW link
(unstablerontology.substack.com)

Claude 3 claims it’s conscious, doesn’t want to die or be modified

Mikhail Samin4 Mar 2024 23:05 UTC
69 points
101 comments14 min readLW link

How useful is “AI Control” as a framing on AI X-Risk?

14 Mar 2024 18:06 UTC
67 points
4 comments34 min readLW link

MATS AI Safety Strategy Curriculum

7 Mar 2024 19:59 UTC
66 points
2 comments16 min readLW link

Grief is a fire sale

Nathan Young4 Mar 2024 1:11 UTC
65 points
1 comment4 min readLW link

[Question] What could a policy banning AGI look like?

TsviBT13 Mar 2024 14:19 UTC
65 points
21 comments3 min readLW link