$20K In Boun­ties for AI Safety Public Materials

Aug 5, 2022, 2:52 AM
71 points
9 comments6 min readLW link

Paper read­ing as a Cargo Cult

jem-mosigAug 7, 2022, 7:50 AM
70 points
10 comments5 min readLW link

Build­ing a Bugs List prompts

CFAR!DuncanAug 13, 2022, 8:00 AM
69 points
9 comments2 min readLW link

Jack Clark on the re­al­ities of AI policy

Kaj_SotalaAug 7, 2022, 8:44 AM
68 points
3 comments3 min readLW link
(threadreaderapp.com)

The Ex­pand­ing Mo­ral Cine­matic Universe

RaemonAug 28, 2022, 6:42 PM
67 points
9 comments14 min readLW link

In Defense Of Mak­ing Money

George3d6Aug 18, 2022, 2:10 PM
65 points
13 comments7 min readLW link
(www.epistem.ink)

AI art isn’t “about to shake things up”. It’s already here.

Davis_KingsleyAug 22, 2022, 11:17 AM
65 points
19 comments3 min readLW link

Vingean Agency

abramdemskiAug 24, 2022, 8:08 PM
63 points
14 comments3 min readLW link

ACX Mee­tups Every­where List

Scott AlexanderAug 26, 2022, 6:12 PM
63 points
1 comment41 min readLW link

En­cul­tured AI Pre-plan­ning, Part 1: En­abling New Benchmarks

Aug 8, 2022, 10:44 PM
63 points
2 comments6 min readLW link

Steganog­ra­phy in Chain of Thought Reasoning

A RayAug 8, 2022, 3:47 AM
62 points
13 comments6 min readLW link

Oops It’s Time To Over­throw the Or­ga­nizer Day!

ScrewtapeAug 18, 2022, 4:40 PM
62 points
5 comments4 min readLW link

Seek­ing PCK (Ped­a­gog­i­cal Con­tent Knowl­edge)

CFAR!DuncanAug 12, 2022, 4:15 AM
62 points
11 comments5 min readLW link

Seek­ing In­terns/​RAs for Mechanis­tic In­ter­pretabil­ity Projects

Neel NandaAug 15, 2022, 7:11 AM
61 points
0 comments2 min readLW link

Au­ton­omy as tak­ing re­spon­si­bil­ity for refer­ence maintenance

Ramana KumarAug 17, 2022, 12:50 PM
61 points
3 comments5 min readLW link

An In­tro­duc­tion to Cur­rent The­o­ries of Consciousness

hohenheimAug 28, 2022, 5:55 PM
60 points
43 comments49 min readLW link

OpenAI’s Align­ment Plans

dkirmaniAug 24, 2022, 7:39 PM
60 points
17 comments5 min readLW link
(openai.com)

Anti-squat­ted AI x-risk do­mains index

plexAug 12, 2022, 12:01 PM
59 points
6 comments1 min readLW link

Find­ing Goals in the World Model

Aug 22, 2022, 6:06 PM
59 points
8 comments13 min readLW link

The Prag­mas­cope Idea

johnswentworthAug 4, 2022, 9:52 PM
59 points
20 comments3 min readLW link

My thoughts on di­rect work (and join­ing LessWrong)

RobertMAug 16, 2022, 6:53 PM
58 points
4 comments6 min readLW link

How to plan for a rad­i­cally un­cer­tain fu­ture?

KerryAug 30, 2022, 2:14 AM
57 points
35 comments1 min readLW link

EA & LW Fo­rums Weekly Sum­mary (21 Aug − 27 Aug 22′)

Zoe WilliamsAug 30, 2022, 1:42 AM
57 points
4 comments12 min readLW link

How and why to turn ev­ery­thing into audio

Aug 11, 2022, 8:55 AM
57 points
20 comments5 min readLW link

Refine’s First Blog Post Day

adamShimiAug 13, 2022, 10:23 AM
55 points
3 comments1 min readLW link

[Question] How to bet against civ­i­liza­tional ad­e­quacy?

Wei Dai12 Aug 2022 23:33 UTC
54 points
20 comments1 min readLW link

All the posts I will never write

Alexander Gietelink Oldenziel14 Aug 2022 18:29 UTC
54 points
8 comments8 min readLW link

Brain-like AGI pro­ject “ain­telope”

Gunnar_Zarncke14 Aug 2022 16:33 UTC
54 points
2 comments1 min readLW link

Trans­former lan­guage mod­els are do­ing some­thing more general

Numendil3 Aug 2022 21:13 UTC
53 points
6 comments2 min readLW link

I missed the crux of the al­ign­ment prob­lem the whole time

zeshen13 Aug 2022 10:11 UTC
53 points
7 comments3 min readLW link

Us­ing GPT-3 to aug­ment hu­man intelligence

Henrik Karlsson10 Aug 2022 15:54 UTC
52 points
8 comments18 min readLW link
(escapingflatland.substack.com)

Vari­a­tional Bayesian methods

Ege Erdil25 Aug 2022 20:49 UTC
52 points
2 comments9 min readLW link

A Data limited future

Donald Hobson6 Aug 2022 14:56 UTC
52 points
25 comments2 min readLW link

Turbocharging

CFAR!Duncan2 Aug 2022 0:01 UTC
52 points
5 comments9 min readLW link

An­nounc­ing Squig­gle: Early Access

ozziegooen3 Aug 2022 19:48 UTC
51 points
7 comments7 min readLW link
(forum.effectivealtruism.org)

Gen­eral al­ign­ment properties

TurnTrout8 Aug 2022 23:40 UTC
51 points
2 comments1 min readLW link

Againstness

CFAR!Duncan2 Aug 2022 19:29 UTC
50 points
8 comments9 min readLW link

Po­laris, Five-Se­cond Ver­sions, and Thought Lengths

CFAR!Duncan1 Aug 2022 7:14 UTC
50 points
12 comments8 min readLW link

On Car Seats as Contraception

Zvi22 Aug 2022 14:10 UTC
49 points
15 comments35 min readLW link
(thezvi.wordpress.com)

Six weeks doesn’t make a habit

lynettebye6 Aug 2022 8:54 UTC
48 points
1 comment3 min readLW link

AGI Timelines Are Mostly Not Strate­gi­cally Rele­vant To Alignment

johnswentworth23 Aug 2022 20:15 UTC
48 points
34 comments1 min readLW link

The Shard The­ory Align­ment Scheme

David Udell25 Aug 2022 4:52 UTC
47 points
32 comments2 min readLW link

Gra­di­ent de­scent doesn’t se­lect for in­ner search

Ivan Vendrov13 Aug 2022 4:15 UTC
47 points
23 comments4 min readLW link

Covid 8/​18/​22: CDC Ad­mits Mistakes

Zvi18 Aug 2022 14:30 UTC
46 points
9 comments17 min readLW link
(thezvi.wordpress.com)

Pro­posal: Con­sider not us­ing dis­tance-di­rec­tion-di­men­sion words in ab­stract discussions

moridinamael9 Aug 2022 20:44 UTC
46 points
18 comments5 min readLW link

The Fal­ling Drill

Screwtape5 Aug 2022 0:08 UTC
46 points
3 comments2 min readLW link

Re­view: Amus­ing Our­selves to Death

L Rudolf L20 Aug 2022 21:13 UTC
44 points
7 comments16 min readLW link1 review
(www.strataoftheworld.com)

Vol­un­teer to host a meetup!

mingyuan21 Aug 2022 22:43 UTC
44 points
1 comment1 min readLW link

The Dumbest Pos­si­ble Gets There First

Artaxerxes13 Aug 2022 10:20 UTC
44 points
7 comments2 min readLW link

The Solomonoff prior is ma­lign. It’s not a big deal.

Charlie Steiner25 Aug 2022 8:25 UTC
43 points
9 comments7 min readLW link