Generative, Episodic Objectives for Safe AI

Michael Glass, 5 Oct 2022 23:18 UTC
11 points
3 comments, 8 min read, LW link

Dependency Tree For The Development Of Plate Tectonics

Elizabeth, 5 Oct 2022 22:40 UTC
38 points
3 comments, 4 min read, LW link
(acesounderglass.com)

[Question] How does anthropic reasoning and illusionism/eliminitivism interact?

Shiroe, 5 Oct 2022 22:31 UTC
5 points
18 comments, 1 min read, LW link

[Question] Finding Great Tutors

Ulisse Mini, 5 Oct 2022 22:08 UTC
27 points
5 comments, 1 min read, LW link

Progress links and tweets, 2022-10-05

jasoncrawford, 5 Oct 2022 19:24 UTC
9 points
1 comment, 1 min read, LW link
(rootsofprogress.org)

A blog post is a very long and complex search query to find fascinating people and make them route interesting stuff to your inbox

Henrik Karlsson, 5 Oct 2022 19:07 UTC
89 points
12 comments, 11 min read, LW link
(escapingflatland.substack.com)

Neural Tangent Kernel Distillation

5 Oct 2022 18:11 UTC
76 points
20 comments, 8 min read, LW link

Tracking Compute Stocks and Flows: Case Studies?

Cullen, 5 Oct 2022 17:57 UTC
11 points
5 comments, 1 min read, LW link

[Linkpost] “Blueprint for an AI Bill of Rights”—Office of Science and Tech­nol­ogy Policy, USA (2022)

Fer32dwt34r3dfsz, 5 Oct 2022 16:42 UTC
9 points
4 comments, 2 min read, LW link
(www.whitehouse.gov)

Paper: Discovering novel algorithms with AlphaTensor [Deepmind]

LawrenceC, 5 Oct 2022 16:20 UTC
82 points
18 comments, 1 min read, LW link
(www.deepmind.com)

Reflection Mechanisms as an Alignment target: A follow-up survey

5 Oct 2022 14:03 UTC
15 points
2 comments, 7 min read, LW link

Charitable Reads of Anti-AGI-X-Risk Arguments, Part 1

sstich, 5 Oct 2022 5:03 UTC
3 points
4 comments, 3 min read, LW link

Sleep Training

jefftk, 5 Oct 2022 2:10 UTC
36 points
4 comments, 2 min read, LW link
(www.jefftk.com)

Looping

Jarred Filmer, 5 Oct 2022 1:47 UTC
53 points
6 comments, 2 min read, LW link

How are you dealing with ontology identification?

Erik Jenner, 4 Oct 2022 23:28 UTC
34 points
10 comments, 3 min read, LW link

Smoke without fire is scary

Adam Jermyn, 4 Oct 2022 21:08 UTC
51 points
22 comments, 4 min read, LW link

Deprecated: Some humans are fitness maximizers

Shoshannah Tekofsky, 4 Oct 2022 19:38 UTC
6 points
22 comments, 6 min read, LW link

Will you let your kid play football?

5hout, 4 Oct 2022 17:48 UTC
14 points
12 comments, 2 min read, LW link

Quick notes on “mirror neurons”

Steven Byrnes, 4 Oct 2022 17:39 UTC
39 points
2 comments, 2 min read, LW link

Feature request: Filter by read/ upvoted

Nathan Young, 4 Oct 2022 17:17 UTC
16 points
5 comments, 1 min read, LW link

Layers Of Mind

PeteG, 4 Oct 2022 16:52 UTC
−8 points
4 comments, 2 min read, LW link

[Question] Does Google still hire people via their foobar challenge?

Algon, 4 Oct 2022 15:39 UTC
11 points
7 comments, 1 min read, LW link

Russia will do a nuclear test

sanxiyn, 4 Oct 2022 14:59 UTC
3 points
7 comments, 1 min read, LW link

Paper+Summary: OMNIGROK: GROKKING BEYOND ALGORITHMIC DATA

Marius Hobbhahn, 4 Oct 2022 7:22 UTC
46 points
11 comments, 1 min read, LW link
(arxiv.org)

Frontline of AGI Alignment

SD Marlow, 4 Oct 2022 3:47 UTC
−10 points
0 comments, 1 min read, LW link
(robothouse.substack.com)

Adversarial vs Collaborative Contexts

jefftk, 4 Oct 2022 2:40 UTC
31 points
4 comments, 2 min read, LW link
(www.jefftk.com)

Humans aren’t fitness maximizers

So8res, 4 Oct 2022 1:31 UTC
49 points
46 comments, 5 min read, LW link

Self-defeating conspiracy theorists and their theories

M. Y. Zuo, 4 Oct 2022 0:48 UTC
5 points
12 comments, 3 min read, LW link

No free lunch theorem is irrelevant

Catnee, 4 Oct 2022 0:21 UTC
18 points
7 comments, 1 min read, LW link

The Village and the River Monsters… Or: Less Fighting, More Brainstorming

ExCeph, 3 Oct 2022 23:01 UTC
7 points
29 comments, 8 min read, LW link
(ginnungagapfoundation.wordpress.com)

my current outlook on AI risk mitigation

Tamsin Leake, 3 Oct 2022 20:06 UTC
63 points
6 comments, 11 min read, LW link
(carado.moe)

Recall and Regurgitation in GPT2

Megan Kinniment, 3 Oct 2022 19:35 UTC
43 points
1 comment, 26 min read, LW link

Ivermectin: Much Less Than You Needed To Know

George3d6, 3 Oct 2022 15:02 UTC
31 points
10 comments, 1 min read, LW link
(doyourownresearch.substack.com)

If you want to learn technical AI safety, here’s a list of AI safety courses, reading lists, and resources

KatWoods, 3 Oct 2022 12:43 UTC
12 points
3 comments, 1 min read, LW link

October Budapest Less Wrong/ACX meetup

Richard Horvath, 3 Oct 2022 10:53 UTC
2 points
0 comments, 1 min read, LW link

A review of the Bio-Anchors report

jylin04, 3 Oct 2022 10:27 UTC
45 points
4 comments, 1 min read, LW link
(docs.google.com)

Data for IRL: What is needed to learn human values?

Jan Wehner, 3 Oct 2022 9:23 UTC
18 points
6 comments, 12 min read, LW link

Statistics for objects with shared identities

Q Home, 3 Oct 2022 9:21 UTC
2 points
7 comments, 4 min read, LW link

Visualizing Learned Representations of Rice Disease

muhia_bee, 3 Oct 2022 9:09 UTC
7 points
0 comments, 4 min read, LW link
(indecisive-sand-24a.notion.site)

[Question] Is there a culture overhang?

Aleksi Liimatainen, 3 Oct 2022 7:26 UTC
18 points
4 comments, 1 min read, LW link

Baby Monitor with Delay

jefftk, 3 Oct 2022 1:40 UTC
12 points
13 comments, 1 min read, LW link
(www.jefftk.com)

Simulacra Levels in Literature

Miniman, 2 Oct 2022 21:13 UTC
1 point
0 comments, 4 min read, LW link

What makes a probability question “well-defined”? (Part I)

Noah Topper, 2 Oct 2022 21:05 UTC
14 points
4 comments, 7 min read, LW link
(naivebayes.substack.com)

Not Long Now

Alex Beyman, 2 Oct 2022 20:32 UTC
11 points
2 comments, 65 min read, LW link

Against the weirdness heuristic

Eleni Angelou, 2 Oct 2022 19:41 UTC
17 points
3 comments, 2 min read, LW link

Easy fixing Voting

Charbel-Raphaël, 2 Oct 2022 17:03 UTC
12 points
2 comments, 1 min read, LW link

Monthly Shorts 9/22, and An Essay in Defense of Technodeterminism

Celer, 2 Oct 2022 16:10 UTC
10 points
1 comment, 5 min read, LW link
(keller.substack.com)

Open & Wel­come Thread—Oct 2022

niplav, 2 Oct 2022 11:04 UTC
17 points
62 comments, 1 min read, LW link

[Question] Any further work on AI Safety Success Stories?

Krieger, 2 Oct 2022 9:53 UTC
8 points
6 comments, 1 min read, LW link

Request for feedback on sample blurbs for the EA fantasy novel I wrote

Timothy Underwood, 2 Oct 2022 9:01 UTC
8 points
0 comments, 2 min read, LW link