Some thoughts on Ge­orge Hotz vs Eliezer Yudkowsky

TristanTrimAug 15, 2023, 11:33 PM
10 points
3 comments2 min readLW link

Un­der­stand­ing the In­for­ma­tion Flow in­side Large Lan­guage Models

Aug 15, 2023, 9:13 PM
19 points
0 comments17 min readLW link

[Question] Any re­search in “probe-tun­ing” of LLMs?

Roman LeventovAug 15, 2023, 9:01 PM
20 points
3 comments1 min readLW link

Can AI Trans­form the Elec­torate into a Ci­ti­zen’s Assem­bly

RoscoHunterAug 15, 2023, 5:52 PM
−3 points
5 comments3 min readLW link

Ten Thou­sand Years of Solitude

agpAug 15, 2023, 5:45 PM
137 points
19 comments4 min readLW link
(www.discovermagazine.com)

AISN #19: US-China Com­pe­ti­tion on AI Chips, Mea­sur­ing Lan­guage Agent Devel­op­ments, Eco­nomic Anal­y­sis of Lan­guage Model Pro­pa­ganda, and White House AI Cy­ber Challenge

Dan HAug 15, 2023, 4:10 PM
21 points
0 comments5 min readLW link
(newsletter.safe.ai)

[Question] What is the most effec­tive anti-tyranny char­ity?

lcAug 15, 2023, 3:26 PM
20 points
10 comments1 min readLW link

My check­list for pub­lish­ing a blog post

Steven ByrnesAug 15, 2023, 3:04 PM
87 points
6 comments3 min readLW link

The Dun­bar Play­book: A CRM sys­tem for your friends

Severin T. SeehrichAug 15, 2023, 8:44 AM
32 points
16 comments5 min readLW link
(amoretlicentia.substack.com)

Op­ti­cal Illu­sions are Out of Distri­bu­tion Errors

vitaliyaAug 15, 2023, 2:23 AM
30 points
8 comments2 min readLW link

A short calcu­la­tion about a Twit­ter poll

Ege ErdilAug 14, 2023, 7:48 PM
64 points
64 comments11 min readLW link

De­com­pos­ing in­de­pen­dent gen­er­al­iza­tions in neu­ral net­works via Hes­sian analysis

Aug 14, 2023, 5:04 PM
84 points
4 comments1 min readLW link

Memetic Judo #2: In­cor­po­ral Switches and Lev­ers Compendium

Max TKAug 14, 2023, 4:53 PM
19 points
6 comments17 min readLW link

Ex­is­ten­tially rele­vant thought ex­per­i­ment: To kill or not to kill, a sniper, a man and a but­ton.

AlexFromSafeTransitionAug 14, 2023, 10:53 AM
−18 points
6 comments4 min readLW link

Step­ping down as mod­er­a­tor on LW

Kaj_SotalaAug 14, 2023, 10:46 AM
82 points
1 comment1 min readLW link

An­nounc­ing Man­i­fest 2023 (Sep 22-24 in Berkeley)

Aug 14, 2023, 5:13 AM
31 points
0 comments2 min readLW link

Co­her­ence Ther­apy with LLMs—quick demo

Chris LakinAug 14, 2023, 3:34 AM
19 points
11 comments1 min readLW link

Listen For What You Don’t Hear: The Case for Contrarianism

Yashvardhan SharmaAug 14, 2023, 2:53 AM
1 point
1 comment5 min readLW link

Recipe: Hes­sian eigen­vec­tor com­pu­ta­tion for PyTorch models

Nina PanicksseryAug 14, 2023, 2:48 AM
32 points
5 comments5 min readLW link

[Question] As­sum­ing LK99 or similar: how to ac­cel­er­ate com­mer­cial­iza­tion?

ryan_bAug 13, 2023, 9:34 PM
7 points
5 comments1 min readLW link

Twin Cities ACX Meetup Septem­ber 2023

Timothy M.Aug 13, 2023, 8:10 PM
1 point
4 comments1 min readLW link

Fun­da­men­tal Uncer­tainty: Chap­ter 1 - How can we know what’s true?

Gordon Seidoh WorleyAug 13, 2023, 6:55 PM
17 points
4 comments12 min readLW link

We Should Pre­pare for a Larger Rep­re­sen­ta­tion of Academia in AI Safety

Leon LangAug 13, 2023, 6:03 PM
90 points
14 comments5 min readLW link

AGI is eas­ier than robotaxis

Daniel KokotajloAug 13, 2023, 5:00 PM
41 points
30 comments4 min readLW link

[Question] If we’re al­ive in 5 years, do you think the fund­ing situ­a­tion will be much bet­ter by then? (With large amounts of gov­ern­ment fund­ing, for ex­am­ple)

kuiraAug 13, 2023, 4:32 PM
−2 points
6 comments1 min readLW link

Ab­stract The­o­ries of Everything

PhilosophistryAug 13, 2023, 6:06 AM
−17 points
0 comments1 min readLW link

[Linkpost] Per­sonal and Psy­cholog­i­cal Di­men­sions of AI Re­searchers Con­fronting AI Catas­trophic Risks

Bogdan Ionut CirsteaAug 12, 2023, 10:02 PM
42 points
0 comments1 min readLW link

The Em­pa­thy Eng­ine: A De­con­struc­tion of the So­cietal Me­ta­mor­pho­sis through Tech­nolog­i­cal Em­pa­thy Augmentation

bigdickproblemsAug 12, 2023, 6:23 PM
−30 points
3 comments2 min readLW link

The Benev­olent Ruler’s Hand­book (Part 2): Mo­ral­ity Rules

FCCCAug 12, 2023, 2:25 PM
5 points
0 comments4 min readLW link

Learn­ing as you play: an­thropic shadow in deadly games

dr_sAug 12, 2023, 7:34 AM
37 points
28 comments35 min readLW link

Biolog­i­cal An­chors: The Trick that Might or Might Not Work

Scott AlexanderAug 12, 2023, 12:53 AM
91 points
3 comments33 min readLW link
(astralcodexten.substack.com)

Si­mu­late the CEO

robotelvisAug 12, 2023, 12:09 AM
23 points
5 comments5 min readLW link
(messyprogress.substack.com)

How to de­cide un­der low-stakes uncertainty

dkl9Aug 11, 2023, 6:07 PM
11 points
4 comments1 min readLW link
(dkl9.net)

The Pan­demic is Only Begin­ning: The Long COVID Disaster

salvatore matteraAug 11, 2023, 5:36 PM
−6 points
15 comments8 min readLW link

When dis­cussing AI risks, talk about ca­pa­bil­ities, not intelligence

VikaAug 11, 2023, 1:38 PM
124 points
7 comments3 min readLW link
(vkrakovna.wordpress.com)

What are the flaws in this AGI ar­gu­ment?

William the Kiwi Aug 11, 2023, 11:31 AM
5 points
14 comments1 min readLW link

Google Deep­Mind’s RT-2

SandXboxAug 11, 2023, 11:26 AM
9 points
1 comment1 min readLW link
(robotics-transformer2.github.io)

Linkpost: We need an­other Ex­pert Sur­vey on Progress in AI, urgently

David MearsAug 11, 2023, 8:22 AM
25 points
2 comments2 min readLW link
(open.substack.com)

What Does a Marginal Grant at LTFF Look Like? Fund­ing Pri­ori­ties and Grant­mak­ing Thresh­olds at the Long-Term Fu­ture Fund

Aug 11, 2023, 3:59 AM
64 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

[Question] Will post­ing any thread on LW guaran­tee that a LLM will in­dex all my con­tent, and if ques­tions peo­ple ask to the LLM af­ter my name will sur­face up all my LW con­tent?

Alex K. Chen (parrot)Aug 11, 2023, 1:40 AM
0 points
0 comments1 min readLW link

AI Safety Con­cepts Wri­teup: WebGPT

JustisMillsAug 11, 2023, 1:35 AM
9 points
1 comment7 min readLW link

[Question] What is sci­ence?

Adam ZernerAug 11, 2023, 12:00 AM
6 points
4 comments1 min readLW link

Three con­figurable prettyprinters

philhAug 10, 2023, 11:10 PM
9 points
0 comments22 min readLW link
(reasonableapproximation.net)

Ilya Sutskever’s thoughts on AI safety (July 2023): a tran­script with my comments

mishkaAug 10, 2023, 7:07 PM
21 points
3 comments5 min readLW link

Seek­ing In­put to AI Safety Book for non-tech­ni­cal audience

Darren McKeeAug 10, 2023, 5:58 PM
10 points
4 comments1 min readLW link

Eval­u­at­ing GPT-4 The­ory of Mind Capabilities

Aug 10, 2023, 5:57 PM
15 points
2 comments14 min readLW link

Some al­ign­ment ideas

SelonNeriasAug 10, 2023, 5:51 PM
1 point
0 comments11 min readLW link

Self Su­per­vised Learn­ing (SSL)

Varshul GuptaAug 10, 2023, 5:43 PM
5 points
1 comment2 min readLW link
(dubverseblack.substack.com)

Pre­dict­ing Virus Rel­a­tive Abun­dance in Wastewater

jefftkAug 10, 2023, 3:46 PM
33 points
2 commentsLW link
(naobservatory.org)

AI #24: Week of the Podcast

ZviAug 10, 2023, 3:00 PM
49 points
5 comments44 min readLW link
(thezvi.wordpress.com)