Is AGI suici­dal­ity the golden ray of hope?

Alex KirkoApr 4, 2023, 11:29 PM
−18 points
4 comments1 min readLW link

Re­con­tex­tu­al­iz­ing the Risks of AI in More Pre­dictable Outcomes

ignorepeterApr 4, 2023, 11:28 PM
−19 points
2 comments5 min readLW link

LW Team is ad­just­ing mod­er­a­tion policy

RaemonApr 4, 2023, 8:41 PM
304 points
185 comments3 min readLW link

Ex­ces­sive AI growth-rate yields lit­tle so­cio-eco­nomic benefit.

Cleo NardoApr 4, 2023, 7:13 PM
27 points
22 comments4 min readLW link

Pe­nal­ize Model Com­plex­ity Via Self-Distillation

research_prime_spaceApr 4, 2023, 6:52 PM
15 points
7 comments1 min readLW link

The One Heresy to Rule Them All

rogersbaconApr 4, 2023, 6:23 PM
−22 points
0 comments3 min readLW link
(www.secretorum.life)

Gi­ant (In)scrutable Ma­tri­ces: (Maybe) the Best of All Pos­si­ble Worlds

1a3ornApr 4, 2023, 5:39 PM
211 points
38 comments5 min readLW link1 review

Play My Futarchy/​Pre­dic­tion Mar­ket Mafia Game

Arjun PanicksseryApr 4, 2023, 4:12 PM
21 points
2 comments1 min readLW link
(arjunpanickssery.substack.com)

[Question] Steel­man /​ Ide­olog­i­cal Tur­ing Test of Yann LeCun’s AI X-Risk ar­gu­ment?

Aryeh EnglanderApr 4, 2023, 3:53 PM
26 points
14 comments1 min readLW link

Given the Restrict Act, Don’t Ban TikTok

ZviApr 4, 2023, 2:40 PM
97 points
9 comments4 min readLW link
(thezvi.wordpress.com)

Run­ning many AI var­i­ants to find cor­rect goal generalization

avturchinApr 4, 2023, 2:16 PM
20 points
3 comments1 min readLW link

In­vo­ca­tions: The Other Ca­pa­bil­ities Over­hang?

Robert_AIZIApr 4, 2023, 1:38 PM
29 points
4 comments4 min readLW link
(aizi.substack.com)

Wanted: Men­tal Health Pro­gram Man­ager at Re­think Wel­lbe­ing

Inga G.Apr 4, 2023, 11:49 AM
7 points
0 commentsLW link

Where Free Will and Deter­minism Meet

David BravoApr 4, 2023, 10:59 AM
0 points
0 comments3 min readLW link

Strate­gies to Prevent AI Annihilation

lastchanceformankindApr 4, 2023, 8:59 AM
−2 points
0 comments4 min readLW link

ACX Meetup Madrid

Pablo VillalobosApr 4, 2023, 8:53 AM
5 points
2 comments1 min readLW link

[Question] Best Ways to Try to Get Fund­ing for Align­ment Re­search?

RGRGRGApr 4, 2023, 6:35 AM
9 points
6 comments1 min readLW link

Con­sider ap­ply­ing to a 2-week al­ign­ment pro­ject with former GitHub CEO

Bird ConceptApr 4, 2023, 6:20 AM
42 points
0 comments1 min readLW link
(twitter.com)

On how it feels gen­er­at­ing art with DALL-E

cortrinkauApr 4, 2023, 4:13 AM
5 points
0 comments3 min readLW link
(cortrinkau.bearblog.dev)

AI Sum­mer Harvest

Cleo NardoApr 4, 2023, 3:35 AM
130 points
10 comments1 min readLW link

How to re­spond to the re­cent con­dem­na­tions of the ra­tio­nal­ist community

Christopher KingApr 4, 2023, 1:42 AM
−2 points
7 comments4 min readLW link

Steer­ing systems

Max HApr 4, 2023, 12:56 AM
50 points
1 comment15 min readLW link

ChatGPT Suggests Listen­ing To Rus­sell & Yudkowsky

JenniferRMApr 4, 2023, 12:30 AM
9 points
1 comment17 min readLW link

Com­plex Sys­tems are Hard to Control

jsteinhardtApr 4, 2023, 12:00 AM
42 points
5 comments10 min readLW link
(bounded-regret.ghost.io)

Ap­ply to the Cavendish Labs Fel­low­ship (by 4/​15)

Apr 3, 2023, 11:09 PM
11 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Twin Cities ACX Meetup—April 2023

Timothy M.Apr 3, 2023, 11:07 PM
5 points
3 comments1 min readLW link

Com­mu­ni­cat­ing effec­tively un­der Knigh­tian norms

Richard_NgoApr 3, 2023, 10:39 PM
96 points
54 comments6 min readLW link

If in­ter­pretabil­ity re­search goes well, it may get dangerous

So8resApr 3, 2023, 9:48 PM
201 points
11 comments2 min readLW link

Towards em­pa­thy in RL agents and be­yond: In­sights from cog­ni­tive sci­ence for AI Align­ment

Marc CarauleanuApr 3, 2023, 7:59 PM
15 points
6 comments1 min readLW link
(clipchamp.com)

Monthly Roundup #5: April 2023

ZviApr 3, 2023, 6:50 PM
26 points
12 comments14 min readLW link
(thezvi.wordpress.com)

Ex­plor­ing non-an­thro­pocen­tric as­pects of AI ex­is­ten­tial safety

mishkaApr 3, 2023, 6:07 PM
8 points
0 comments3 min readLW link

[Question] GJP on AGI

Suh_Prance_AlotApr 3, 2023, 5:21 PM
2 points
0 comments1 min readLW link

Do we have a plan for the “first crit­i­cal try” prob­lem?

Christopher KingApr 3, 2023, 4:27 PM
−3 points
14 comments1 min readLW link

Ex­plo­ra­tory Anal­y­sis of RLHF Trans­form­ers with TransformerLens

Curt TiggesApr 3, 2023, 4:09 PM
21 points
2 comments11 min readLW link
(blog.eleuther.ai)

AWS Has Raised Prices Before

jefftkApr 3, 2023, 4:00 PM
7 points
3 comments1 min readLW link
(www.jefftk.com)

Mati’s in­tro­duc­tion to paus­ing gi­ant AI experiments

Mati_RoyApr 3, 2023, 3:56 PM
7 points
0 comments2 min readLW link

Su­per­in­tel­li­gence will out­smart us or it isn’t superintelligence

Neil Apr 3, 2023, 3:01 PM
−4 points
4 comments1 min readLW link

AI-kills-ev­ery­one sce­nar­ios re­quire robotic in­fras­truc­ture, but not nec­es­sar­ily nanotech

avturchinApr 3, 2023, 12:45 PM
53 points
47 comments4 min readLW link

Orthog­o­nal­ity is expensive

berenApr 3, 2023, 10:20 AM
43 points
9 comments3 min readLW link

Re­peated Play of Im­perfect New­comb’s Para­dox in In­fra-Bayesian Physicalism

Sven NilsenApr 3, 2023, 10:06 AM
2 points
0 comments2 min readLW link

Effec­tive Altru­ism Vir­tual Pro­grams Apr-May 2023

Yve Nichols-EvansApr 3, 2023, 6:40 AM
1 point
0 comments1 min readLW link

Board Game Theory

Optimization ProcessApr 3, 2023, 6:23 AM
8 points
0 comments3 min readLW link

Planecrash Podcast

planecrashpodcastApr 3, 2023, 4:34 AM
10 points
5 comments1 min readLW link

[Question] I’m just start­ing to grasp Shard The­ory. Is that a nor­mal feel­ing?

twkaiserApr 3, 2023, 3:08 AM
−20 points
1 comment1 min readLW link

Rules for liv­ing in a 99.9+% lizard­man world

at_the_zooApr 3, 2023, 2:39 AM
−1 points
12 comments1 min readLW link

The Friendly Drunk Fool Align­ment Strategy

JenniferRMApr 3, 2023, 1:26 AM
29 points
19 comments11 min readLW link

Slack Group: Ra­tion­al­ist Startup Founders

Adam ZernerApr 3, 2023, 12:44 AM
31 points
2 comments3 min readLW link

Orthog­o­nal­ity is Expensive

DragonGodApr 3, 2023, 12:43 AM
21 points
3 comments1 min readLW link
(www.beren.io)

GTP4 ca­pa­ble of limited re­cur­sive im­prov­ing?

Boris KashirinApr 2, 2023, 9:38 PM
2 points
3 comments1 min readLW link

[Question] Scared about the fu­ture of AI

eitan weissApr 2, 2023, 8:37 PM
−1 points
0 comments1 min readLW link