AI #55: Keep Clauding Along

Zvi · Mar 14, 2024, 3:40 PM
62 points
16 comments · 70 min read · LW link
(thezvi.wordpress.com)

To the average human, controlled AI is just as lethal as ‘misaligned’ AI

YonatanK · Mar 14, 2024, 2:52 PM
6 points
20 comments · 5 min read · LW link

Claude vs GPT

Maxwell Tabarrok · Mar 14, 2024, 12:41 PM
12 points
2 comments · 2 min read · LW link
(www.maximum-progress.com)

A brief review of China’s AI industry and regulations

Elliot Mckernon · Mar 14, 2024, 12:19 PM
24 points
0 comments · 16 min read · LW link

[Question] Can any LLM be represented as an Equation?

Valentin Baltadzhiev · Mar 14, 2024, 9:51 AM
1 point
2 comments · 1 min read · LW link

‘Empiricism!’ as Anti-Epistemology

Eliezer Yudkowsky · Mar 14, 2024, 2:02 AM
171 points
92 comments · 25 min read · LW link

Opportunistic Time-Management

Richard Henage · Mar 13, 2024, 9:38 PM
13 points
2 comments · 1 min read · LW link

AI governance and strategy: a list of research agendas and work that could be done.

Mar 13, 2024, 9:23 PM
7 points
1 comment · 17 min read · LW link

Highlights from Lex Fridman’s interview of Yann LeCun

Joel Burget · Mar 13, 2024, 8:58 PM
48 points
15 comments · 41 min read · LW link

On the Latest TikTok Bill

Zvi · Mar 13, 2024, 6:50 PM
58 points
7 comments · 29 min read · LW link
(thezvi.wordpress.com)

[Question] Recommended book for a balanced take and lessons learned from covid pandemic response

Martin Hare Robertson · Mar 13, 2024, 6:14 PM
4 points
0 comments · 1 min read · LW link

ACX/LW Seattle spring meetup 2024

nsokolsky · Mar 13, 2024, 5:24 PM
12 points
3 comments · 1 min read · LW link

Laying the Foundations for Vision and Multimodal Mechanistic Interpretability & Open Problems

Mar 13, 2024, 5:09 PM
44 points
13 comments · 14 min read · LW link

I was raised by devout Mormons, AMA [&|] Soliciting Advice

ErioirE · Mar 13, 2024, 4:52 PM
32 points
41 comments · 2 min read · LW link

Relational Agency: Consistently Reaching Out

Jonathan Moregård · Mar 13, 2024, 2:34 PM
16 points
0 comments · 5 min read · LW link
(open.substack.com)

[Question] What could a policy banning AGI look like?

TsviBT · Mar 13, 2024, 2:19 PM
78 points
23 comments · 3 min read · LW link

Clickbait Soapboxing

DaystarEld · Mar 13, 2024, 2:09 PM
24 points
16 comments · 3 min read · LW link
(daystareld.com)

Virtual AI Safety Unconference 2024

Mar 13, 2024, 1:54 PM
14 points
0 comments · 1 min read · LW link

Jobs, Relationships, and Other Cults

Mar 13, 2024, 5:58 AM
40 points
9 comments · 35 min read · LW link

How do you improve the quality of your drinking water?

Alex K. Chen (parrot) · Mar 13, 2024, 12:37 AM
11 points
2 comments · 1 min read · LW link

The Parable Of The Fallen Pendulum—Part 2

johnswentworth · Mar 12, 2024, 9:41 PM
78 points
8 comments · 4 min read · LW link

Open consultancy: Letting untrusted AIs choose what answer to argue for

Fabien Roger · Mar 12, 2024, 8:38 PM
35 points
5 comments · 5 min read · LW link

[Question] Is anyone working on formally verified AI toolchains?

metachirality · Mar 12, 2024, 7:36 PM
17 points
4 comments · 1 min read · LW link

Transformer Debugger

Henk Tillman · Mar 12, 2024, 7:08 PM
26 points
0 comments · 1 min read · LW link
(github.com)

Superforecasting the Origins of the Covid-19 Pandemic

DanielFilan · Mar 12, 2024, 7:01 PM
64 points
0 comments · 1 min read · LW link
(goodjudgment.substack.com)

minimum viable action

Sindhu Prasad · Mar 12, 2024, 4:06 PM
1 point
0 comments · 3 min read · LW link

Hardball questions for the Gemini Congressional Hearing

Michael Thiessen · Mar 12, 2024, 3:27 PM
−11 points
2 comments · 1 min read · LW link

OpenAI: The Board Expands

Zvi · Mar 12, 2024, 2:00 PM
92 points
1 comment · 30 min read · LW link
(thezvi.wordpress.com)

Update on Developing an Ethics Calculator to Align an AGI to

sweenesm · Mar 12, 2024, 12:33 PM
4 points
2 comments · 8 min read · LW link

[Question] How do you identify and counteract your biases in decision-making?

warrenjordan · Mar 12, 2024, 5:01 AM
2 points
1 comment · 1 min read · LW link

How Much Have I Been Playing?

jefftk · Mar 12, 2024, 2:10 AM
9 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought

Miles Turpin · Mar 11, 2024, 11:46 PM
16 points
0 comments · 1 min read · LW link
(arxiv.org)

AI Safety Action Plan—A report commissioned by the US State Department

agucova · Mar 11, 2024, 10:14 PM
22 points
1 comment · 1 min read · LW link
(www.gladstone.ai)

A discussion of AI risk and the cost/benefit calculation of stopping or pausing AI development

DuncanFowler · Mar 11, 2024, 9:41 PM
1 point
0 comments · 1 min read · LW link

Among the A.I. Doomsayers—The New Yorker

agucova · Mar 11, 2024, 9:35 PM
12 points
1 comment · 1 min read · LW link
(www.newyorker.com)

Be More Katja

Nathan Young · Mar 11, 2024, 9:12 PM
53 points
0 comments · 3 min read · LW link

AI Incident Reporting: A Regulatory Review

Mar 11, 2024, 9:03 PM
16 points
0 comments · 6 min read · LW link

Results from an Adversarial Collaboration on AI Risk (FRI)

Mar 11, 2024, 8:00 PM
61 points
3 comments · 9 min read · LW link
(forecastingresearch.org)

The Astronomical Sacrifice Dilemma

Matthew McRedmond · Mar 11, 2024, 7:58 PM
15 points
3 comments · 4 min read · LW link

Epiphenomenalism leads to eliminativism about qualia

Clément L · Mar 11, 2024, 7:53 PM
4 points
0 comments · 7 min read · LW link

The Best Essay (Paul Graham)

Chris_Leong · Mar 11, 2024, 7:25 PM
25 points
2 comments · 1 min read · LW link
(paulgraham.com)

Open Thread Spring 2024

habryka · Mar 11, 2024, 7:17 PM
22 points
160 comments · 1 min read · LW link

New social credit formalizations

KatjaGrace · Mar 11, 2024, 7:00 PM
23 points
3 comments · 2 min read · LW link
(worldspiritsockpuppet.com)

How disagreements about Evidential Correlations could be settled

Martín Soto · Mar 11, 2024, 6:28 PM
12 points
3 comments · 4 min read · LW link

“Artificial General Intelligence”: an extremely brief FAQ

Steven Byrnes · Mar 11, 2024, 5:49 PM
75 points
6 comments · 2 min read · LW link

Some (problematic) aesthetics of what constitutes good work in academia

Steven Byrnes · Mar 11, 2024, 5:47 PM
148 points
12 comments · 12 min read · LW link

Storable Votes with a Pay as you win mechanism: a contribution for institutional design

Arturo Macias · Mar 11, 2024, 3:58 PM
17 points
19 comments · 2 min read · LW link

Tend to your clarity, not your confusion

Severin T. Seehrich · Mar 11, 2024, 3:09 PM
23 points
1 comment · 6 min read · LW link

[Question] What do we know about the AI knowledge and views, especially about existential risk, of the new OpenAI board members?

Zvi · Mar 11, 2024, 2:55 PM
60 points
2 comments · 2 min read · LW link

“How could I have thought that faster?”

mesaoptimizer · Mar 11, 2024, 10:56 AM
237 points
32 comments · 2 min read · LW link
(twitter.com)