List of requests for an AI slowdown/halt.

Cleo Nardo Apr 14, 2023, 11:55 PM
46 points
6 comments · 1 min read · LW link

[linkpost] “What Are Reasonable AI Fears?” by Robin Hanson, 2023-04-23

Arjun Panickssery Apr 14, 2023, 11:26 PM
26 points
16 comments · LW link

“Do X because decision theory” ~= “Do X because bayes theorem”

lc Apr 14, 2023, 8:57 PM
39 points
1 comment · 2 min read · LW link

LLMs and hallucination, like white on rice?

Bill Benzon Apr 14, 2023, 7:53 PM
5 points
0 comments · 3 min read · LW link

GPT-4 is easily controlled/exploited with tricky decision theoretic dilemmas.

scasper Apr 14, 2023, 7:39 PM
6 points
4 comments · 2 min read · LW link

On Caring about our AI Progeny

PeterMcCluskey Apr 14, 2023, 7:32 PM
22 points
5 comments · 1 min read · LW link
(bayesianinvestor.com)

Moderation notes re: recent Said/Duncan threads

Raemon Apr 14, 2023, 6:06 PM
50 points
560 comments · 2 min read · LW link

What we’ve learned so far from our technological temptations project

Richard Korzekwa Apr 14, 2023, 5:46 PM
15 points
4 comments · 11 min read · LW link
(aiimpacts.org)

[Question] How does consciousness interact with architecture?

FinalFormal2 Apr 14, 2023, 3:56 PM
5 points
3 comments · 1 min read · LW link

Iqisa: A Library For Handling Forecasting Datasets

niplav Apr 14, 2023, 3:16 PM
27 points
0 comments · LW link

What’s this probability you’re reporting?

EOC and SCP Apr 14, 2023, 3:07 PM
19 points
10 comments · 3 min read · LW link

Navigating AI Risks (NAIR) #1: Slowing Down AI

simeon_c Apr 14, 2023, 2:35 PM
11 points
3 comments · 1 min read · LW link
(navigatingairisks.substack.com)

[Question] What would the FLI moratorium actually do?

ChristianKl Apr 14, 2023, 1:14 PM
17 points
7 comments · 1 min read · LW link

Research Report: Incorrectness Cascades

Robert_AIZI Apr 14, 2023, 12:49 PM
19 points
0 comments · 10 min read · LW link
(aizi.substack.com)

The self-unalignment problem

Apr 14, 2023, 12:10 PM
155 points
24 comments · 10 min read · LW link

AI Safety Europe Retreat 2023 Retrospective

Magdalena Wache Apr 14, 2023, 9:05 AM
43 points
0 comments · 2 min read · LW link

[Question] What’s the difference between Wisdom and Rationality?

Yoav Ravid Apr 14, 2023, 6:22 AM
8 points
4 comments · 1 min read · LW link

Shapley Value Attribution in Chain of Thought

leogao Apr 14, 2023, 5:56 AM
106 points
7 comments · 4 min read · LW link

A freshman year during the AI midgame: my approach to the next year

Buck Apr 14, 2023, 12:38 AM
154 points
15 comments · LW link · 1 review

Against AI Understanding and Sentience: Large Language Models, Meaning, and the Patterns of Human Language Use

Jonathan Yan Apr 13, 2023, 11:29 PM
−1 points
0 comments · 1 min read · LW link
(philsci-archive.pitt.edu)

Financial Times: We must slow down the race to God-like AI

trevor Apr 13, 2023, 7:55 PM
113 points
17 comments · 16 min read · LW link
(www.ft.com)

R0 Is Not Counterfactual

jefftk Apr 13, 2023, 7:50 PM
33 points
9 comments · 2 min read · LW link
(www.jefftk.com)

Subscripts for Probabilities

niplav Apr 13, 2023, 6:32 PM
67 points
9 comments · 5 min read · LW link

The Virus—Short Story

Michael Soareverix Apr 13, 2023, 6:18 PM
4 points
0 comments · 4 min read · LW link

First ACX Brno Meetup

adekcz Apr 13, 2023, 5:42 PM
2 points
0 comments · 1 min read · LW link

Polluting the agentic commons

hamandcheese Apr 13, 2023, 5:42 PM
7 points
4 comments · 2 min read · LW link
(www.secondbest.ca)

Cambridge LW Meetup: When Science Isn’t Enough

Apr 13, 2023, 5:36 PM
2 points
0 comments · 1 min read · LW link

Even if human & AI alignment are just as easy, we are screwed

Matthew_Opitz Apr 13, 2023, 5:32 PM
35 points
5 comments · 5 min read · LW link

Intro to Ontogenetic Curriculum

Eris Apr 13, 2023, 5:15 PM
20 points
1 comment · 2 min read · LW link

Was Homer a stochastic parrot? Meaning in literary texts and LLMs

Bill Benzon Apr 13, 2023, 4:44 PM
7 points
4 comments · 3 min read · LW link

AI #7: Free Agency

Zvi Apr 13, 2023, 4:20 PM
33 points
12 comments · 47 min read · LW link
(thezvi.wordpress.com)

Navigating the Open-Source AI Landscape: Data, Funding, and Safety

Apr 13, 2023, 3:29 PM
32 points
7 comments · 11 min read · LW link
(forum.effectivealtruism.org)

On AutoGPT

Zvi Apr 13, 2023, 12:30 PM
248 points
47 comments · 20 min read · LW link
(thezvi.wordpress.com)

Identifying semantic neurons, mechanistic circuits & interpretability web apps

Apr 13, 2023, 11:59 AM
18 points
0 comments · 8 min read · LW link

Trying AgentGPT, an AutoGPT variant

Gunnar_Zarncke Apr 13, 2023, 10:13 AM
10 points
9 comments · 1 min read · LW link

Announcing Epoch’s dashboard of key trends and figures in Machine Learning

Jsevillamol Apr 13, 2023, 7:33 AM
35 points
7 comments · 1 min read · LW link
(epochai.org)

[Question] What is the best source to explain short AI timelines to a skeptical person?

trevor Apr 13, 2023, 4:29 AM
12 points
12 comments · 1 min read · LW link

“Aligned” foundation models don’t imply aligned systems

Max H Apr 13, 2023, 4:13 AM
39 points
11 comments · 5 min read · LW link

[Question] Using ChatGPT for memory reconsolidation?

warrenjordan Apr 13, 2023, 1:27 AM
3 points
2 comments · 1 min read · LW link

Independence Dividends

jefftk Apr 13, 2023, 1:20 AM
35 points
11 comments · 1 min read · LW link
(www.jefftk.com)

AI x-risk, approximately ordered by embarrassment

Alex Lawsen Apr 12, 2023, 11:01 PM
151 points
7 comments · 19 min read · LW link

AXRP Episode 20 - ‘Reform’ AI Alignment with Scott Aaronson

DanielFilan Apr 12, 2023, 9:30 PM
22 points
2 comments · 68 min read · LW link

Apply to >30 AI safety funders in one application with the Nonlinear Network

12 Apr 2023 21:23 UTC
65 points
12 comments · 2 min read · LW link

AGI goal space is big, but narrowing might not be as hard as it seems.

Jacy Reese Anthis 12 Apr 2023 19:03 UTC
15 points
0 comments · 3 min read · LW link

Natural language alignment

Jacy Reese Anthis 12 Apr 2023 19:02 UTC
31 points
2 comments · 2 min read · LW link

Repugnant levels of violins

Solenoid_Entity 12 Apr 2023 17:11 UTC
73 points
10 comments · 12 min read · LW link

Progress links and tweets, 2023-04-12

jasoncrawford 12 Apr 2023 16:52 UTC
8 points
2 comments · 1 min read · LW link
(rootsofprogress.org)

A basic mathematical structure of intelligence

Golol 12 Apr 2023 16:49 UTC
4 points
6 comments · 4 min read · LW link

[Question] Should AutoGPT update us towards researching IDA?

Michaël Trazzi 12 Apr 2023 16:41 UTC
15 points
5 comments · 1 min read · LW link

Boxing lessons

yakimoff 12 Apr 2023 16:19 UTC
1 point
0 comments · 1 min read · LW link