Nudg­ing Polarization

jefftk24 Mar 2023 23:50 UTC
41 points
14 comments3 min readLW link
(www.jefftk.com)

Why There Is No An­swer to Your Philo­soph­i­cal Question

Bryan Frances24 Mar 2023 23:22 UTC
−12 points
10 comments12 min readLW link

“Slightly Evil” AI Apps

intellectronica24 Mar 2023 22:52 UTC
1 point
2 comments2 min readLW link
(intellectronica.net)

[Question] Seek­ing Ad­vice on Rais­ing AI X-Risk Aware­ness on So­cial Media

MrThink24 Mar 2023 22:25 UTC
2 points
1 comment1 min readLW link

Hut­ter-Prize for Prompts

rokosbasilisk24 Mar 2023 21:26 UTC
5 points
10 comments1 min readLW link

How likely do you think worse-than-ex­tinc­tion type fates to be?

span124 Mar 2023 21:03 UTC
5 points
4 comments1 min readLW link

Meetup Tip: The Greeter

Screwtape24 Mar 2023 20:31 UTC
11 points
0 comments4 min readLW link

Me­tac­u­lus Pre­dicts Weak AGI in 2 Years and AGI in 10

Chris_Leong24 Mar 2023 19:43 UTC
29 points
14 comments1 min readLW link

[Question] How to model un­cer­tainty about prefer­ences?

quetzal_rainbow24 Mar 2023 19:04 UTC
10 points
2 comments1 min readLW link

Ex­plor­ing Tacit Linked Premises with GPT

romeostevensit24 Mar 2023 18:09 UTC
42 points
3 comments3 min readLW link

More ex­per­i­ments in GPT-4 agency: writ­ing memos

Christopher King24 Mar 2023 17:51 UTC
5 points
2 comments10 min readLW link

Why con­sumerism is good actually

jasoncrawford24 Mar 2023 17:42 UTC
11 points
16 comments1 min readLW link
(rootsofprogress.org)

Are ex­trap­o­la­tion-based AIs al­ignable?

cousin_it24 Mar 2023 15:55 UTC
22 points
15 comments1 min readLW link

Does GPT-4 ex­hibit agency when sum­ma­riz­ing ar­ti­cles?

Christopher King24 Mar 2023 15:49 UTC
16 points
2 comments5 min readLW link

GPT-2005: A con­ver­sa­tion with ChatGPT (fea­tur­ing semi-func­tional Wolfram Alpha plu­gin!)

Lone Pine24 Mar 2023 14:03 UTC
19 points
0 comments22 min readLW link

Microsoft Re­search Paper Claims Sparks of Ar­tifi­cial In­tel­li­gence in GPT-4

Zvi24 Mar 2023 13:20 UTC
72 points
14 comments6 min readLW link
(thezvi.wordpress.com)

So, just why do GPTs have to op­er­ate by con­tin­u­ing an ex­ist­ing string?

Bill Benzon24 Mar 2023 12:08 UTC
−4 points
0 comments3 min readLW link

Ap­ply now to ra­tio­nal­ity camps: ESPR & PAIR—new Pro­gram on AI and Rea­son­ing (ages 16-20)

Anna Gajdova24 Mar 2023 11:40 UTC
41 points
0 comments1 min readLW link

[Question] What does the econ­omy do?

tailcalled24 Mar 2023 10:49 UTC
9 points
20 comments1 min readLW link

[Question] Can in­de­pen­dent re­searchers get a spon­sored visa for the US or UK?

jacquesthibs24 Mar 2023 6:10 UTC
20 points
1 comment1 min readLW link

Wittgen­stein and ML — pa­ram­e­ters vs architecture

Cleo Nardo24 Mar 2023 4:54 UTC
37 points
8 comments5 min readLW link

Grind­ing slimes in the dun­geon of AI al­ign­ment research

Max H24 Mar 2023 4:51 UTC
10 points
2 comments4 min readLW link

A crazy hy­poth­e­sis: GPT-4 already is agen­tic and is try­ing to take over the world!

Christopher King24 Mar 2023 1:19 UTC
−2 points
11 comments9 min readLW link

Ab­stracts should be ei­ther Ac­tu­ally Short™, or bro­ken into paragraphs

Raemon24 Mar 2023 0:51 UTC
93 points
27 comments5 min readLW link

con­tinue work­ing on hard al­ign­ment! don’t give up!

Tamsin Leake24 Mar 2023 0:14 UTC
82 points
45 comments1 min readLW link
(carado.moe)

Us­ing GPT-4 to Un­der­stand Code

sid24 Mar 2023 0:09 UTC
23 points
2 comments6 min readLW link

Kingfisher Album Kickstarter

jefftk23 Mar 2023 23:20 UTC
8 points
0 comments2 min readLW link
(www.jefftk.com)

Is your job re­place­able by GPT-4? (as of March 2023)

Bezzi23 Mar 2023 22:16 UTC
18 points
6 comments1 min readLW link

ACX meetup [April]

sallatik23 Mar 2023 20:40 UTC
1 point
0 comments1 min readLW link

Fea­ture idea: ex­tra info about post au­thor’s re­sponse to com­ments.

Nathan Helm-Burger23 Mar 2023 20:14 UTC
6 points
0 comments1 min readLW link

Limit in­tel­li­gent weapons

Lucas Pfeifer23 Mar 2023 17:54 UTC
−11 points
36 comments1 min readLW link

We have to Upgrade

Jed McCaleb23 Mar 2023 17:53 UTC
126 points
35 comments2 min readLW link

The Over­ton Win­dow widens: Ex­am­ples of AI risk in the media

Akash23 Mar 2023 17:10 UTC
107 points
24 comments6 min readLW link

GPT-4 al­ign­ing with aca­sual de­ci­sion the­ory when in­structed to play games, but in­cludes a CDT ex­pla­na­tion that’s in­cor­rect if they differ

Christopher King23 Mar 2023 16:16 UTC
7 points
4 comments8 min readLW link

Is “FOXP2 speech & lan­guage di­s­or­der” re­ally “FOXP2 fore­brain fine-mo­tor crap­piness”?

Steven Byrnes23 Mar 2023 16:09 UTC
22 points
8 comments5 min readLW link

EAI Align­ment Speaker Series #1: Challenges for Safe & Benefi­cial Brain-Like Ar­tifi­cial Gen­eral In­tel­li­gence with Steve Byrnes

23 Mar 2023 14:32 UTC
28 points
0 comments27 min readLW link
(youtu.be)

[Question] Align­ment-re­lated jobs out­side of Lon­don/​SF

Ariel Kwiatkowski23 Mar 2023 13:24 UTC
26 points
14 comments1 min readLW link

Zuzalu

vincentweisser23 Mar 2023 11:24 UTC
3 points
0 comments1 min readLW link

How Do In­duc­tion Heads Ac­tu­ally Work in Trans­form­ers With Finite Ca­pac­ity?

Fabien Roger23 Mar 2023 9:09 UTC
27 points
0 comments5 min readLW link

ChatGPT’s “fuzzy al­ign­ment” isn’t ev­i­dence of AGI al­ign­ment: the ba­nana test

Michael Tontchev23 Mar 2023 7:12 UTC
23 points
6 comments4 min readLW link

Sparks of Ar­tifi­cial Gen­eral In­tel­li­gence: Early ex­per­i­ments with GPT-4 | Microsoft Research

DragonGod23 Mar 2023 5:45 UTC
68 points
23 comments1 min readLW link
(arxiv.org)

Tran­script: NBC Nightly News: AI ‘race to reck­less­ness’ w/​ Tris­tan Har­ris, Aza Raskin

WilliamKiely23 Mar 2023 1:04 UTC
63 points
4 comments3 min readLW link

Why We MUST Create an AGI that Disem­pow­ers Hu­man­ity. For Real.

twkaiser22 Mar 2023 23:01 UTC
−17 points
1 comment4 min readLW link

Progress links and tweets, 2023-03-22

jasoncrawford22 Mar 2023 22:19 UTC
13 points
0 comments2 min readLW link
(rootsofprogress.org)

[Question] How to con­vince some­one AGI is com­ing soon?

Zohar Jackson22 Mar 2023 22:16 UTC
5 points
7 comments1 min readLW link

Harry Pot­ter in The World of Path Semantics

Sven Nilsen22 Mar 2023 20:22 UTC
−3 points
17 comments1 min readLW link
(raw.githubusercontent.com)

Books: Lend, Don’t Give

jefftk22 Mar 2023 18:40 UTC
28 points
2 comments1 min readLW link
(www.jefftk.com)

[Linkpost] Shorter ver­sion of re­port on ex­is­ten­tial risk from power-seek­ing AI

Joe Carlsmith22 Mar 2023 18:09 UTC
7 points
0 comments1 min readLW link

An­nounc­ing the Euro­pean Net­work for AI Safety (ENAIS)

Esben Kran22 Mar 2023 17:57 UTC
19 points
0 comments1 min readLW link

[Question] Gen­uine ques­tion: If Eliezer is so ra­tio­nal why is he fat?

DirichletConvolution22 Mar 2023 17:41 UTC
−40 points
6 comments1 min readLW link