Is AGI suici­dal­ity the golden ray of hope?

Alex Kirko4 Apr 2023 23:29 UTC
−18 points
4 comments1 min readLW link

Re­con­tex­tu­al­iz­ing the Risks of AI in More Pre­dictable Outcomes

ignorepeter4 Apr 2023 23:28 UTC
−19 points
2 comments5 min readLW link

LW Team is ad­just­ing mod­er­a­tion policy

Raemon4 Apr 2023 20:41 UTC
296 points
182 comments3 min readLW link

Ex­ces­sive AI growth-rate yields lit­tle so­cio-eco­nomic benefit.

Cleo Nardo4 Apr 2023 19:13 UTC
27 points
22 comments4 min readLW link

Pe­nal­ize Model Com­plex­ity Via Self-Distillation

research_prime_space4 Apr 2023 18:52 UTC
15 points
7 comments1 min readLW link

The One Heresy to Rule Them All

rogersbacon4 Apr 2023 18:23 UTC
−22 points
0 comments3 min readLW link
(www.secretorum.life)

Gi­ant (In)scrutable Ma­tri­ces: (Maybe) the Best of All Pos­si­ble Worlds

1a3orn4 Apr 2023 17:39 UTC
191 points
35 comments5 min readLW link

Play My Futarchy/​Pre­dic­tion Mar­ket Mafia Game

Arjun Panickssery4 Apr 2023 16:12 UTC
21 points
2 comments1 min readLW link
(arjunpanickssery.substack.com)

[Question] Steel­man /​ Ide­olog­i­cal Tur­ing Test of Yann LeCun’s AI X-Risk ar­gu­ment?

Aryeh Englander4 Apr 2023 15:53 UTC
26 points
14 comments1 min readLW link

Given the Restrict Act, Don’t Ban TikTok

Zvi4 Apr 2023 14:40 UTC
97 points
9 comments4 min readLW link
(thezvi.wordpress.com)

Run­ning many AI var­i­ants to find cor­rect goal generalization

avturchin4 Apr 2023 14:16 UTC
20 points
3 comments1 min readLW link

In­vo­ca­tions: The Other Ca­pa­bil­ities Over­hang?

Robert_AIZI4 Apr 2023 13:38 UTC
29 points
4 comments4 min readLW link
(aizi.substack.com)

Wanted: Men­tal Health Pro­gram Man­ager at Re­think Wel­lbe­ing

Inga G.4 Apr 2023 11:49 UTC
7 points
0 comments1 min readLW link

Where Free Will and Deter­minism Meet

David Bravo4 Apr 2023 10:59 UTC
0 points
0 comments3 min readLW link

Strate­gies to Prevent AI Annihilation

lastchanceformankind4 Apr 2023 8:59 UTC
−2 points
0 comments4 min readLW link

ACX Meetup Madrid

Pablo Villalobos4 Apr 2023 8:53 UTC
5 points
2 comments1 min readLW link

[Question] Best Ways to Try to Get Fund­ing for Align­ment Re­search?

RGRGRG4 Apr 2023 6:35 UTC
9 points
6 comments1 min readLW link

Con­sider ap­ply­ing to a 2-week al­ign­ment pro­ject with former GitHub CEO

jacobjacob4 Apr 2023 6:20 UTC
41 points
0 comments1 min readLW link
(twitter.com)

On how it feels gen­er­at­ing art with DALL-E

cortrinkau4 Apr 2023 4:13 UTC
5 points
0 comments3 min readLW link
(cortrinkau.bearblog.dev)

AI Sum­mer Harvest

Cleo Nardo4 Apr 2023 3:35 UTC
130 points
10 comments1 min readLW link

How to re­spond to the re­cent con­dem­na­tions of the ra­tio­nal­ist community

Christopher King4 Apr 2023 1:42 UTC
−2 points
7 comments4 min readLW link

Steer­ing systems

Max H4 Apr 2023 0:56 UTC
50 points
1 comment15 min readLW link

ChatGPT Suggests Listen­ing To Rus­sell & Yudkowsky

JenniferRM4 Apr 2023 0:30 UTC
7 points
1 comment17 min readLW link

Com­plex Sys­tems are Hard to Control

jsteinhardt4 Apr 2023 0:00 UTC
42 points
5 comments10 min readLW link
(bounded-regret.ghost.io)

Ap­ply to the Cavendish Labs Fel­low­ship (by 4/​15)

3 Apr 2023 23:09 UTC
11 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Twin Cities ACX Meetup—April 2023

Timothy M.3 Apr 2023 23:07 UTC
5 points
3 comments1 min readLW link

Com­mu­ni­cat­ing effec­tively un­der Knigh­tian norms

Richard_Ngo3 Apr 2023 22:39 UTC
88 points
54 comments6 min readLW link

If in­ter­pretabil­ity re­search goes well, it may get dangerous

So8res3 Apr 2023 21:48 UTC
197 points
10 comments2 min readLW link

Towards em­pa­thy in RL agents and be­yond: In­sights from cog­ni­tive sci­ence for AI Align­ment

Marc Carauleanu3 Apr 2023 19:59 UTC
15 points
6 comments1 min readLW link
(clipchamp.com)

Monthly Roundup #5: April 2023

Zvi3 Apr 2023 18:50 UTC
26 points
12 comments14 min readLW link
(thezvi.wordpress.com)

Ex­plor­ing non-an­thro­pocen­tric as­pects of AI ex­is­ten­tial safety

mishka3 Apr 2023 18:07 UTC
8 points
0 comments3 min readLW link

[Question] GJP on AGI

Suh_Prance_Alot3 Apr 2023 17:21 UTC
2 points
0 comments1 min readLW link

Do we have a plan for the “first crit­i­cal try” prob­lem?

Christopher King3 Apr 2023 16:27 UTC
−3 points
14 comments1 min readLW link

Ex­plo­ra­tory Anal­y­sis of RLHF Trans­form­ers with TransformerLens

Curt Tigges3 Apr 2023 16:09 UTC
21 points
2 comments11 min readLW link
(blog.eleuther.ai)

AWS Has Raised Prices Before

jefftk3 Apr 2023 16:00 UTC
7 points
3 comments1 min readLW link
(www.jefftk.com)

Mati’s in­tro­duc­tion to paus­ing gi­ant AI experiments

Mati_Roy3 Apr 2023 15:56 UTC
7 points
0 comments2 min readLW link

Su­per­in­tel­li­gence will out­smart us or it isn’t superintelligence

Neil 3 Apr 2023 15:01 UTC
−4 points
4 comments1 min readLW link

AI-kills-ev­ery­one sce­nar­ios re­quire robotic in­fras­truc­ture, but not nec­es­sar­ily nanotech

avturchin3 Apr 2023 12:45 UTC
52 points
47 comments4 min readLW link

Orthog­o­nal­ity is expensive

beren3 Apr 2023 10:20 UTC
34 points
8 comments3 min readLW link

Re­peated Play of Im­perfect New­comb’s Para­dox in In­fra-Bayesian Physicalism

Sven Nilsen3 Apr 2023 10:06 UTC
2 points
0 comments2 min readLW link

Effec­tive Altru­ism Vir­tual Pro­grams Apr-May 2023

Yve Nichols-Evans3 Apr 2023 6:40 UTC
1 point
0 comments1 min readLW link

Board Game Theory

Optimization Process3 Apr 2023 6:23 UTC
8 points
0 comments3 min readLW link

Planecrash Podcast

planecrashpodcast3 Apr 2023 4:34 UTC
10 points
5 comments1 min readLW link

[Question] I’m just start­ing to grasp Shard The­ory. Is that a nor­mal feel­ing?

twkaiser3 Apr 2023 3:08 UTC
−20 points
1 comment1 min readLW link

Rules for liv­ing in a 99.9+% lizard­man world

at_the_zoo3 Apr 2023 2:39 UTC
−1 points
12 comments1 min readLW link

The Friendly Drunk Fool Align­ment Strategy

JenniferRM3 Apr 2023 1:26 UTC
23 points
18 comments11 min readLW link

Slack Group: Ra­tion­al­ist Startup Founders

Adam Zerner3 Apr 2023 0:44 UTC
31 points
0 comments3 min readLW link

Orthog­o­nal­ity is Expensive

DragonGod3 Apr 2023 0:43 UTC
21 points
3 comments1 min readLW link
(www.beren.io)

GTP4 ca­pa­ble of limited re­cur­sive im­prov­ing?

Boris Kashirin2 Apr 2023 21:38 UTC
2 points
3 comments1 min readLW link

[Question] Scared about the fu­ture of AI

eitan weiss2 Apr 2023 20:37 UTC
−1 points
0 comments1 min readLW link