Beyond Blame Minimization

physicaleconomicsMar 27, 2022, 12:03 AM
137 points
47 comments4 min readLW link

An­nounc­ing the LessWrong Cu­rated Podcast

Jun 22, 2022, 10:16 PM
137 points
27 comments1 min readLW link

LessWrong Now Has Dark Mode

jimrandomhMay 10, 2022, 1:21 AM
137 points
31 comments1 min readLW link

AI-Writ­ten Cri­tiques Help Hu­mans No­tice Flaws

paulfchristianoJun 25, 2022, 5:22 PM
137 points
5 comments3 min readLW link
(openai.com)

On Bounded Distrust

ZviFeb 3, 2022, 2:50 PM
137 points
19 comments56 min readLW link1 review
(thezvi.wordpress.com)

Car­ry­ing the Torch: A Re­sponse to Anna Sala­mon by the Guild of the Rose

moridinamaelJul 6, 2022, 2:20 PM
136 points
16 comments6 min readLW link

De­con­fus­ing Direct vs Amor­tised Optimization

berenDec 2, 2022, 11:30 AM
136 points
19 comments10 min readLW link

Brain­storm of things that could force an AI team to burn their lead

So8resJul 24, 2022, 11:58 PM
136 points
8 comments13 min readLW link

The Align­ment Com­mu­nity Is Cul­turally Broken

sudoNov 13, 2022, 6:53 PM
136 points
68 comments2 min readLW link

Ex­ter­nal­ized rea­son­ing over­sight: a re­search di­rec­tion for lan­guage model alignment

tameraAug 3, 2022, 12:03 PM
136 points
23 comments6 min readLW link

Don’t leave your finger­prints on the future

So8resOct 8, 2022, 12:35 AM
136 points
48 comments5 min readLW link

Ap­ply to the Red­wood Re­search Mechanis­tic In­ter­pretabil­ity Ex­per­i­ment (REMIX), a re­search pro­gram in Berkeley

Oct 27, 2022, 1:32 AM
135 points
14 comments12 min readLW link

Mon­i­tor­ing for de­cep­tive alignment

evhubSep 8, 2022, 11:07 PM
135 points
8 comments9 min readLW link

AI will change the world, but won’t take it over by play­ing “3-di­men­sional chess”.

Nov 22, 2022, 6:57 PM
134 points
97 comments24 min readLW link

You have a place to stay in Swe­den, should you need it.

DojanFeb 27, 2022, 1:21 AM
134 points
3 comments1 min readLW link

Nice­ness is unnatural

So8resOct 13, 2022, 1:30 AM
134 points
20 comments8 min readLW link1 review

[New Fea­ture] Sup­port for Foot­notes!

RubyJan 4, 2022, 7:35 AM
134 points
31 comments1 min readLW link

The Territory

LoganStrohlFeb 15, 2022, 6:56 PM
134 points
12 comments5 min readLW link

Third Time: a bet­ter way to work

bfinnJan 7, 2022, 9:15 PM
133 points
76 comments8 min readLW link

Sadly, FTX

ZviNov 17, 2022, 2:30 PM
133 points
18 comments47 min readLW link
(thezvi.wordpress.com)

Will Ca­pa­bil­ities Gen­er­al­ise More?

Ramana KumarJun 29, 2022, 5:12 PM
133 points
39 comments4 min readLW link

Meadow Theory

Duncan Sabien (Inactive)Mar 9, 2022, 5:13 PM
132 points
16 comments16 min readLW link1 review

Su­per­in­tel­li­gent AI is nec­es­sary for an amaz­ing fu­ture, but far from sufficient

So8resOct 31, 2022, 9:16 PM
132 points
48 comments34 min readLW link

My cur­rent thoughts on the risks from SETI

Matthew BarnettMar 15, 2022, 5:18 PM
132 points
27 comments10 min readLW link

AI Fore­cast­ing: One Year In

jsteinhardtJul 4, 2022, 5:10 AM
132 points
12 comments6 min readLW link
(bounded-regret.ghost.io)

Orexin and the quest for more wak­ing hours

ChristianKlSep 24, 2022, 7:54 PM
131 points
39 comments5 min readLW link

Con­jec­ture: In­ter­nal In­fo­haz­ard Policy

Jul 29, 2022, 7:07 PM
131 points
6 comments19 min readLW link

Warn­ing Shots Prob­a­bly Wouldn’t Change The Pic­ture Much

So8resOct 6, 2022, 5:15 AM
130 points
42 comments2 min readLW link

Con­fused why a “ca­pa­bil­ities re­search is good for al­ign­ment progress” po­si­tion isn’t dis­cussed more

Kaj_SotalaJun 2, 2022, 9:41 PM
130 points
27 comments4 min readLW link

Only Ask­ing Real Questions

jefftkApr 14, 2022, 3:50 PM
130 points
45 comments2 min readLW link
(www.jefftk.com)

Pa­tient Observation

LoganStrohlFeb 23, 2022, 7:31 PM
129 points
4 comments10 min readLW link1 review

[Closed] Job Offer­ing: Help Com­mu­ni­cate Infrabayesianism

Mar 23, 2022, 6:35 PM
129 points
22 comments1 min readLW link

In­ter­gen­er­a­tional trauma im­ped­ing co­op­er­a­tive ex­is­ten­tial safety efforts

Andrew_CritchJun 3, 2022, 8:13 AM
129 points
29 comments3 min readLW link

Ex­plain­ing the Twit­ter Pos­trat Scene

Jacob FalkovichApr 5, 2022, 10:23 PM
128 points
29 comments5 min readLW link

Limer­ence Messes Up Your Ra­tion­al­ity Real Bad, Yo

RaemonJul 1, 2022, 4:53 PM
128 points
41 comments3 min readLW link2 reviews

The case against AI alignment

andrew sauerDec 24, 2022, 6:57 AM
128 points
110 comments5 min readLW link

On in­finite ethics

Joe CarlsmithJan 31, 2022, 7:04 AM
128 points
71 comments51 min readLW link1 review

I left Rus­sia on March 8

avturchinMar 10, 2022, 8:05 PM
127 points
16 comments1 min readLW link

A Longlist of The­o­ries of Im­pact for Interpretability

Neel NandaMar 11, 2022, 2:55 PM
127 points
41 comments5 min readLW link2 reviews

“Pivotal Acts” means some­thing specific

RaemonJun 7, 2022, 9:56 PM
127 points
23 comments2 min readLW link

Long covid: prob­a­bly worth avoid­ing—some considerations

KatjaGraceJan 16, 2022, 11:46 AM
127 points
88 comments14 min readLW link
(worldspiritsockpuppet.com)

Re-Ex­am­in­ing LayerNorm

Eric WinsorDec 1, 2022, 10:20 PM
127 points
12 comments5 min readLW link

On the Di­plo­macy AI

ZviNov 28, 2022, 1:20 PM
127 points
29 comments11 min readLW link
(thezvi.wordpress.com)

Clar­ify­ing AI X-risk

Nov 1, 2022, 11:03 AM
127 points
24 comments4 min readLW link1 review

The case for be­com­ing a black-box in­ves­ti­ga­tor of lan­guage models

BuckMay 6, 2022, 2:35 PM
126 points
20 comments3 min readLW link

Geo­met­ric Ex­plo­ra­tion, Arith­metic Exploitation

Scott GarrabrantNov 24, 2022, 3:36 PM
126 points
5 comments7 min readLW link

Shared re­al­ity: a key driver of hu­man behavior

kdbscottDec 24, 2022, 7:35 PM
126 points
25 comments4 min readLW link

The Wicked Prob­lem Experience

HoldenKarnofskyMar 2, 2022, 5:50 PM
125 points
6 comments9 min readLW link1 review
(www.cold-takes.com)

Gene drives: why the wait?

MetacelsusSep 19, 2022, 11:37 PM
125 points
50 comments3 min readLW link
(denovo.substack.com)

Let’s See You Write That Cor­rigi­bil­ity Tag

Eliezer YudkowskyJun 19, 2022, 9:11 PM
125 points
70 comments1 min readLW link