Auto-GPT: Open-sourced disaster?

awg · Apr 5, 2023, 10:46 PM
23 points
18 comments · 1 min read
(github.com)

The Orthogonality Thesis is Not Obviously True

omnizoid · Apr 5, 2023, 9:06 PM
3 points
80 comments · 9 min read

Williams-Beuren Syndrome: Frendly Mutations

Takk · Apr 5, 2023, 8:59 PM
−1 points
1 comment · 1 min read

OpenAI: Our approach to AI safety

Jacob G-W · Apr 5, 2023, 8:26 PM
1 point
1 comment · 1 min read
(openai.com)

Why Are Maximum Entropy Distributions So Ubiquitous?

johnswentworth · Apr 5, 2023, 8:12 PM
68 points
6 comments · 9 min read

“On Living in an Atomic Age”, by C.S. Lewis (1948)

tjaffee · Apr 5, 2023, 6:34 PM
17 points
3 comments · 8 min read
(hebrew-streams.org)

Eliezer Yudkowsky’s Letter in Time Magazine

Zvi · Apr 5, 2023, 6:00 PM
214 points
86 comments · 14 min read
(thezvi.wordpress.com)

Dark Artificial Intelligence

FrankAI · Apr 5, 2023, 5:37 PM
0 points
0 comments · 4 min read

[Question] Best arguments against instrumental convergence?

lfrymire · Apr 5, 2023, 5:06 PM
5 points
7 comments · 1 min read

Progress links and tweets, 2023-04-05

jasoncrawford · Apr 5, 2023, 4:18 PM
20 points
0 comments · 2 min read
(rootsofprogress.org)

Universality and Hidden Information in Concept Bottleneck Models

Hoagy · Apr 5, 2023, 2:00 PM
23 points
0 comments · 11 min read

AI safety and the security mindset: user interface design, red-teams, formal verification

Allison Duettmann · Apr 5, 2023, 11:33 AM
35 points
0 comments · 8 min read

ICA Simulacra

Ozyrus · Apr 5, 2023, 6:41 AM
26 points
2 comments · 7 min read

AGI deployment as an act of aggression

dr_s · Apr 5, 2023, 6:39 AM
28 points
30 comments · 13 min read

A Brief Introduction to Algorithmic Common Intelligence, ACI . 1

Akira Pyinya · Apr 5, 2023, 5:43 AM
−2 points
1 comment · 2 min read

46% of US adults at least “somewhat concerned” about AI extinction risk.

Foyle · Apr 5, 2023, 5:25 AM
1 point
0 comments · 1 min read

[Question] Has anyone thought about how to proceed now that AI notkilleveryoneism is becoming more relevant/is approaching the Overton window?

metachirality · Apr 5, 2023, 3:06 AM
11 points
8 comments · 1 min read

Empathy bandaid for immediate AI catastrophe

installgentoo · Apr 5, 2023, 2:12 AM
1 point
2 comments · 1 min read

“Corrigibility at some small length” by dath ilan

Christopher King · Apr 5, 2023, 1:47 AM
32 points
3 comments · 9 min read
(www.glowfic.com)

New survey: 46% of Americans are concerned about extinction from AI; 69% support a six-month pause in AI development

Orpheus16 · Apr 5, 2023, 1:26 AM
46 points
9 comments · 1 min read
(today.yougov.com)

Is AGI suicidality the golden ray of hope?

Alex Kirko · Apr 4, 2023, 11:29 PM
−18 points
4 comments · 1 min read

Recontextualizing the Risks of AI in More Predictable Outcomes

ignorepeter · Apr 4, 2023, 11:28 PM
−19 points
2 comments · 5 min read

LW Team is adjusting moderation policy

Raemon · Apr 4, 2023, 8:41 PM
304 points
185 comments · 3 min read

Excessive AI growth-rate yields little socio-economic benefit.

Cleo Nardo · Apr 4, 2023, 7:13 PM
27 points
22 comments · 4 min read

Penalize Model Complexity Via Self-Distillation

research_prime_space · Apr 4, 2023, 6:52 PM
15 points
7 comments · 1 min read

The One Heresy to Rule Them All

rogersbacon · Apr 4, 2023, 6:23 PM
−22 points
0 comments · 3 min read
(www.secretorum.life)

Giant (In)scrutable Matrices: (Maybe) the Best of All Possible Worlds

1a3orn · Apr 4, 2023, 5:39 PM
211 points
38 comments · 5 min read · 1 review

Play My Futarchy/Prediction Market Mafia Game

Arjun Panickssery · Apr 4, 2023, 4:12 PM
21 points
2 comments · 1 min read
(arjunpanickssery.substack.com)

[Question] Steelman / Ideological Turing Test of Yann LeCun’s AI X-Risk argument?

Aryeh Englander · Apr 4, 2023, 3:53 PM
26 points
14 comments · 1 min read

Given the Restrict Act, Don’t Ban TikTok

Zvi · Apr 4, 2023, 2:40 PM
97 points
9 comments · 4 min read
(thezvi.wordpress.com)

Running many AI variants to find correct goal generalization

avturchin · Apr 4, 2023, 2:16 PM
20 points
3 comments · 1 min read

Invocations: The Other Capabilities Overhang?

Robert_AIZI · Apr 4, 2023, 1:38 PM
29 points
4 comments · 4 min read
(aizi.substack.com)

Wanted: Mental Health Program Manager at Rethink Wellbeing

Inga G. · Apr 4, 2023, 11:49 AM
7 points
0 comments

Where Free Will and Determinism Meet

David Bravo · Apr 4, 2023, 10:59 AM
0 points
0 comments · 3 min read

Strategies to Prevent AI Annihilation

lastchanceformankind · Apr 4, 2023, 8:59 AM
−2 points
0 comments · 4 min read

ACX Meetup Madrid

Pablo Villalobos · Apr 4, 2023, 8:53 AM
5 points
2 comments · 1 min read

[Question] Best Ways to Try to Get Funding for Alignment Research?

RGRGRG · Apr 4, 2023, 6:35 AM
9 points
6 comments · 1 min read

Consider applying to a 2-week alignment project with former GitHub CEO

Bird Concept · Apr 4, 2023, 6:20 AM
42 points
0 comments · 1 min read
(twitter.com)

On how it feels generating art with DALL-E

cortrinkau · Apr 4, 2023, 4:13 AM
5 points
0 comments · 3 min read
(cortrinkau.bearblog.dev)

AI Summer Harvest

Cleo Nardo · Apr 4, 2023, 3:35 AM
130 points
10 comments · 1 min read

How to respond to the recent condemnations of the rationalist community

Christopher King · Apr 4, 2023, 1:42 AM
−2 points
7 comments · 4 min read

Steering systems

Max H · Apr 4, 2023, 12:56 AM
50 points
1 comment · 15 min read

ChatGPT Suggests Listening To Russell & Yudkowsky

JenniferRM · Apr 4, 2023, 12:30 AM
9 points
1 comment · 17 min read

Complex Systems are Hard to Control

jsteinhardt · Apr 4, 2023, 12:00 AM
42 points
5 comments · 10 min read
(bounded-regret.ghost.io)

Apply to the Cavendish Labs Fellowship (by 4/15)

Apr 3, 2023, 11:09 PM
11 points
0 comments · 1 min read
(forum.effectivealtruism.org)

Twin Cities ACX Meetup—April 2023

Timothy M. · Apr 3, 2023, 11:07 PM
5 points
3 comments · 1 min read

Communicating effectively under Knightian norms

Richard_Ngo · Apr 3, 2023, 10:39 PM
96 points
54 comments · 6 min read

If interpretability research goes well, it may get dangerous

So8res · Apr 3, 2023, 9:48 PM
201 points
11 comments · 2 min read

Towards empathy in RL agents and beyond: Insights from cognitive science for AI Alignment

Marc Carauleanu · Apr 3, 2023, 7:59 PM
15 points
6 comments · 1 min read
(clipchamp.com)

Monthly Roundup #5: April 2023

Zvi · Apr 3, 2023, 6:50 PM
26 points
12 comments · 14 min read
(thezvi.wordpress.com)