China’s position on autonomous weapons

bhauth 23 Aug 2023 22:20 UTC
17 points
2 comments 1 min read LW link
(academic.oup.com)

Diet Experiment Preregistration: Long-term water fasting + seed oil removal

lc 23 Aug 2023 22:08 UTC
56 points
17 comments 1 min read LW link

The Low-Hanging Fruit Prior and sloped valleys in the loss landscape

23 Aug 2023 21:12 UTC
79 points
1 comment 13 min read LW link

Governing, Fast and Slow

Carson 23 Aug 2023 20:01 UTC
3 points
0 comments 3 min read LW link

A problem with the most recently published version of CEV

ThomasCederborg 23 Aug 2023 18:05 UTC
9 points
5 comments 8 min read LW link

[Question] Which paths to powerful AI should be boosted?

Zach Stein-Perlman 23 Aug 2023 16:00 UTC
1 point
0 comments 1 min read LW link

A Theory of Laughter

Steven Byrnes 23 Aug 2023 15:05 UTC
101 points
13 comments 22 min read LW link

Why Is No One Trying To Align Profit Incentives With Alignment Research?

Prometheus 23 Aug 2023 13:16 UTC
51 points
11 comments 4 min read LW link

Exploring the Responsible Path to AI Research in the Philippines

MiguelDev 23 Aug 2023 8:44 UTC
6 points
0 comments 6 min read LW link

[Question] Do agents with (mutually known) identical utility functions but irreconcilable knowledge sometimes fight?

mako yass 23 Aug 2023 8:13 UTC
14 points
13 comments 1 min read LW link

South Bay ACX/SSC Fall Meetups Everywhere

allisona 23 Aug 2023 3:00 UTC
3 points
0 comments 1 min read LW link

Separate the truth from your wishes

Jacob G-W 23 Aug 2023 0:52 UTC
6 points
3 comments 1 min read LW link
(jacobgw.com)

Implications of evidential cooperation in large worlds

Lukas Finnveden 23 Aug 2023 0:43 UTC
39 points
4 comments 17 min read LW link
(lukasfinnveden.substack.com)

South Bay Casual Group Walk

allisona 22 Aug 2023 22:43 UTC
7 points
2 comments 1 min read LW link

Walk while you talk: don’t balk at “no chalk”

dkl9 22 Aug 2023 21:27 UTC
41 points
9 comments 2 min read LW link
(dkl9.net)

State of Generally Available Self-Driving

jefftk 22 Aug 2023 18:50 UTC
66 points
6 comments 2 min read LW link
(www.jefftk.com)

Seth Explains Consciousness

Jacob Falkovich 22 Aug 2023 18:06 UTC
38 points
125 comments 14 min read LW link
(putanumonit.com)

ChatGPT challenges the case for human irrationality

Kevin Dorst 22 Aug 2023 12:46 UTC
4 points
10 comments 7 min read LW link
(kevindorst.substack.com)

[Question] Does one have reason to believe the simulation hypothesis is probably true?

kuira 22 Aug 2023 8:34 UTC
1 point
20 comments 1 min read LW link

The Joan of Arc Challenge For Objective List Theory

omnizoid 22 Aug 2023 8:01 UTC
−2 points
4 comments 10 min read LW link

The Lopsided Lives Argument For Hedonism About Well-being

omnizoid 22 Aug 2023 7:59 UTC
−2 points
8 comments 22 min read LW link

Causality and a Cost Semantics for Neural Networks

scottviteri 21 Aug 2023 21:02 UTC
22 points
1 comment 9 min read LW link

Ideas for improving epistemics in AI safety outreach

mic 21 Aug 2023 19:55 UTC
64 points
6 comments 3 min read LW link

Rice’s Theorem says that AIs can’t determine much from studying AI source code

Michael Weiss-Malik 21 Aug 2023 19:05 UTC
−11 points
4 comments 1 min read LW link

Large Language Models will be Great for Censorship

Ethan Edwards 21 Aug 2023 19:03 UTC
183 points
14 comments 8 min read LW link
(ethanedwards.substack.com)

“Throwing Exceptions” Is A Strange Programming Pattern

Thoth Hermes 21 Aug 2023 18:50 UTC
−2 points
13 comments 6 min read LW link
(thothhermes.substack.com)

[Question] Which possible AI systems are relatively safe?

Zach Stein-Perlman 21 Aug 2023 17:00 UTC
42 points
20 comments 1 min read LW link

Self-shutdown AI

jan betley 21 Aug 2023 16:48 UTC
13 points
2 comments 2 min read LW link

Contextual Translations - Attempt 1

Varshul Gupta 21 Aug 2023 14:30 UTC
−1 points
0 comments 2 min read LW link
(dubverseblack.substack.com)

DIY Deliberate Practice

lynettebye 21 Aug 2023 12:22 UTC
62 points
4 comments 5 min read LW link
(lynettebye.com)

Downstairs Opening: 2br Apartment

jefftk 21 Aug 2023 0:50 UTC
8 points
2 comments 3 min read LW link
(www.jefftk.com)

Efficiency and resource use scaling parity

Ege Erdil 21 Aug 2023 0:18 UTC
47 points
0 comments 20 min read LW link

Ruining an expected-log-money maximizer

philh 20 Aug 2023 21:20 UTC
27 points
32 comments 1 min read LW link
(reasonableapproximation.net)

Steven Wolfram on AI Alignment

Bill Benzon 20 Aug 2023 19:49 UTC
65 points
15 comments 4 min read LW link

[Question] What value does personal prediction tracking have?

fx 20 Aug 2023 18:43 UTC
7 points
3 comments 1 min read LW link

Jan Kulveit’s Corrigibility Thoughts Distilled

brook 20 Aug 2023 17:52 UTC
20 points
1 comment 5 min read LW link

Memetic Judo #3: The Intelligence of Stochastic Parrots v.2

Max TK 20 Aug 2023 15:18 UTC
8 points
33 comments 6 min read LW link

ACX/SSC Boulder meetup- September 23

Josh Sacks 20 Aug 2023 14:16 UTC
1 point
4 comments 1 min read LW link

“Dirty concepts” in AI alignment discourses, and some guesses for how to deal with them

20 Aug 2023 9:13 UTC
65 points
4 comments 3 min read LW link

Call for Papers on Global AI Governance from the UN

Chris_Leong 20 Aug 2023 8:56 UTC
19 points
0 comments 1 min read LW link
(www.linkedin.com)

How do I read things on the internet

Vlad Sitalo 20 Aug 2023 5:43 UTC
16 points
2 comments 8 min read LW link
(vlad.roam.garden)

AI Forecasting: Two Years In

jsteinhardt 19 Aug 2023 23:40 UTC
65 points
15 comments 11 min read LW link
(bounded-regret.ghost.io)

Four management/leadership book summaries

nikola 19 Aug 2023 23:38 UTC
11 points
0 comments 7 min read LW link

Interpreting a dimensionality reduction of a collection of matrices as two positive semidefinite block diagonal matrices

Joseph Van Name 19 Aug 2023 19:52 UTC
15 points
2 comments 5 min read LW link

Will AI kill everyone? Here’s what the godfathers of AI have to say [RA video]

Writer 19 Aug 2023 17:29 UTC
56 points
8 comments 1 min read LW link
(youtu.be)

Ten variations on red-pill-blue-pill

Richard_Kennaway 19 Aug 2023 16:34 UTC
21 points
34 comments 3 min read LW link

Are we running out of new music/movies/art from a metaphysical perspective? (updated)

stephen_s 19 Aug 2023 16:24 UTC
4 points
23 comments 1 min read LW link

[Question] Any ideas for a prediction market observable that quantifies “culture-warisation”?

Ppau 19 Aug 2023 15:11 UTC
6 points
1 comment 1 min read LW link

[Question] Clarifying how misalignment can arise from scaling LLMs

Util 19 Aug 2023 14:16 UTC
3 points
1 comment 1 min read LW link

Chess as a case study in hidden capabilities in ChatGPT

AdamYedidia 19 Aug 2023 6:35 UTC
45 points
32 comments 6 min read LW link