Thoughts on self-in­spect­ing neu­ral net­works.

DeruwynMar 12, 2023, 11:58 PM
4 points
2 comments5 min readLW link

An AI risk ar­gu­ment that res­onates with NYTimes readers

Julian BradshawMar 12, 2023, 11:09 PM
212 points
14 comments1 min readLW link

Mu­si­ci­ans and Mouths

jefftkMar 12, 2023, 10:50 PM
13 points
7 comments2 min readLW link
(www.jefftk.com)

Are there cog­ni­tive realms?

TsviBTMar 12, 2023, 7:28 PM
34 points
3 comments10 min readLW link1 review

[Question] What hap­pened on the Ex­tropi­ans mes­sage board?

politicalpersuasionMar 12, 2023, 7:22 PM
−53 points
1 comment1 min readLW link

Creat­ing a Dis­cord server for Mechanis­tic In­ter­pretabil­ity Projects

Victor LevosoMar 12, 2023, 6:00 PM
30 points
6 comments2 min readLW link

Paper Repli­ca­tion Walk­through: Re­v­erse-Eng­ineer­ing Mo­du­lar Addition

Neel NandaMar 12, 2023, 1:25 PM
18 points
0 comments1 min readLW link
(neelnanda.io)

What prob­lems do Afri­can-Amer­i­cans face? An ini­tial in­ves­ti­ga­tion us­ing Stand­point Episte­mol­ogy and Surveys

tailcalledMar 12, 2023, 11:42 AM
34 points
26 comments15 min readLW link

“Liquidity” vs “solvency” in bank runs (and some notes on Sili­con Valley Bank)

rossryMar 12, 2023, 9:16 AM
108 points
27 comments12 min readLW link

“You’ll Never Per­suade Peo­ple Like That”

Zack_M_DavisMar 12, 2023, 5:38 AM
18 points
31 comments2 min readLW link

Par­a­sitic Lan­guage Games: main­tain­ing am­bi­guity to hide con­flict while burn­ing the commons

HazardMar 12, 2023, 5:25 AM
115 points
17 comments13 min readLW link

[Question] Is there a way to sort LW search re­sults by date posted?

zeshenMar 12, 2023, 4:56 AM
5 points
1 comment1 min readLW link

Is “Reg­u­lar­ity” an­other Phlo­gis­ton?

Cole WyethMar 12, 2023, 3:13 AM
2 points
3 comments3 min readLW link
(colewyeth.com)

Minor Life Op­ti­miza­tion: Con­sider Order­ing Your Food To-Go

sudoMar 12, 2023, 2:08 AM
9 points
20 comments1 min readLW link

A bunch of videos for in­tu­ition build­ing (2x speed, skip ones that bore you)

the gears to ascensionMar 12, 2023, 12:51 AM
72 points
5 comments4 min readLW link

The is­sue of mean­ing in large lan­guage mod­els (LLMs)

Bill BenzonMar 11, 2023, 11:00 PM
1 point
34 comments8 min readLW link

[Linkpost] Scott Alexan­der re­acts to OpenAI’s lat­est post

Orpheus16Mar 11, 2023, 10:24 PM
27 points
0 comments5 min readLW link
(astralcodexten.substack.com)

Com­po­si­tional lan­guage for hy­pothe­ses about computations

Vanessa KosoyMar 11, 2023, 7:43 PM
38 points
6 comments12 min readLW link

Un­der­stand­ing and con­trol­ling a maze-solv­ing policy network

Mar 11, 2023, 6:59 PM
334 points
28 comments23 min readLW link

[Question] How can we pro­mote AI al­ign­ment in Ja­pan?

Shoka KadoiMar 11, 2023, 6:52 PM
24 points
11 comments1 min readLW link

How to Sup­port Some­one Who is Struggling

David ZellerMar 11, 2023, 6:52 PM
76 points
13 comments5 min readLW link

[Question] Given one AI, why not more?

Frank AdkMar 11, 2023, 6:52 PM
7 points
12 comments1 min readLW link

Agents synchronization

Ben AmitayMar 11, 2023, 6:41 PM
12 points
1 comment5 min readLW link

Against Com­plete Black­out Cur­tains For Sleep

jpMar 11, 2023, 6:29 PM
19 points
11 comments1 min readLW link

[Question] Coun­ter­ar­gu­ments to Core AI X-Risk Sto­ries?

DavidWMar 11, 2023, 5:55 PM
10 points
2 comments1 min readLW link

The Power of In­tel­li­gence—The Animation

WriterMar 11, 2023, 4:15 PM
45 points
3 comments1 min readLW link
(youtu.be)

[Question] Hoard­ing Gmail-ac­counts in a post-CAPTCHA world?

Alexander Gietelink OldenzielMar 11, 2023, 4:08 PM
7 points
3 comments1 min readLW link

[Question] Will the Bit­coin fee mar­ket ac­tu­ally work?

TropicalFruitMar 11, 2023, 12:02 AM
10 points
6 comments1 min readLW link

Ra­tion­al­ism and so­cial rationalism

philosophybearMar 10, 2023, 11:20 PM
17 points
5 comments10 min readLW link
(philosophybear.substack.com)

Meetup Tip: Nametags

ScrewtapeMar 10, 2023, 9:00 PM
16 points
2 comments3 min readLW link

[Question] Is ChatGPT (or other LLMs) more ‘sen­tient’/​’con­scious/​etc. then a baby with­out a brain?

M. Y. ZuoMar 10, 2023, 7:00 PM
−5 points
2 comments1 min readLW link

The hu­man­ity’s biggest mistake

RomanSMar 10, 2023, 4:30 PM
0 points
1 comment2 min readLW link

Oper­a­tional­iz­ing timelines

Zach Stein-PerlmanMar 10, 2023, 4:30 PM
7 points
1 comment3 min readLW link

[Question] What do you think is wrong with ra­tio­nal­ist cul­ture?

tailcalledMar 10, 2023, 1:17 PM
16 points
77 comments1 min readLW link

Dice De­ci­sion Making

Bart BussmannMar 10, 2023, 1:01 PM
20 points
14 comments3 min readLW link

Stop call­ing it “jailbreak­ing” ChatGPT

TemplarrrMar 10, 2023, 11:41 AM
7 points
9 comments2 min readLW link

Long-term mem­ory for LLM via self-repli­cat­ing prompt

avturchinMar 10, 2023, 10:28 AM
20 points
3 comments2 min readLW link

Thoughts on the OpenAI al­ign­ment plan: will AI re­search as­sis­tants be net-pos­i­tive for AI ex­is­ten­tial risk?

Jeffrey LadishMar 10, 2023, 8:21 AM
58 points
3 comments9 min readLW link

Reflec­tions On The Fea­si­bil­ity Of Scal­able-Oversight

Felix HofstätterMar 10, 2023, 7:54 AM
11 points
0 comments12 min readLW link

Ja­pan AI Align­ment Conference

Mar 10, 2023, 6:56 AM
64 points
7 comments1 min readLW link
(www.conjecture.dev)

Every­thing’s nor­mal un­til it’s not

Eleni AngelouMar 10, 2023, 2:02 AM
7 points
0 comments3 min readLW link

Acolytes, re­form­ers, and atheists

lcMar 10, 2023, 12:48 AM
9 points
0 comments4 min readLW link

The hot mess the­ory of AI mis­al­ign­ment: More in­tel­li­gent agents be­have less coherently

Jonathan YanMar 10, 2023, 12:20 AM
48 points
22 comments1 min readLW link
(sohl-dickstein.github.io)

Why Not Just Out­source Align­ment Re­search To An AI?

johnswentworthMar 9, 2023, 9:49 PM
151 points
50 comments9 min readLW link1 review

What’s Not Our Problem

Jacob FalkovichMar 9, 2023, 8:07 PM
22 points
6 comments9 min readLW link

Ques­tions about Con­je­cure’s CoEm proposal

Mar 9, 2023, 7:32 PM
51 points
4 comments2 min readLW link

What Ja­son has been read­ing, March 2023

jasoncrawfordMar 9, 2023, 6:46 PM
12 points
0 comments6 min readLW link
(rootsofprogress.org)

[Question] “Provide C++ code for a func­tion that out­puts a Fibonacci se­quence of n terms, where n is pro­vided as a pa­ram­e­ter to the function

Thembeka99Mar 9, 2023, 6:37 PM
−21 points
2 comments1 min readLW link

An­thropic: Core Views on AI Safety: When, Why, What, and How

jonmenasterMar 9, 2023, 5:34 PM
17 points
1 comment22 min readLW link
(www.anthropic.com)

Why do we as­sume there is a “real” shog­goth be­hind the LLM? Why not masks all the way down?

Robert_AIZIMar 9, 2023, 5:28 PM
63 points
48 comments2 min readLW link