Kingfisher Album Kickstarter

jefftk · 23 Mar 2023 23:20 UTC
8 points
0 comments · 2 min read · LW link
(www.jefftk.com)

Is your job replaceable by GPT-4? (as of March 2023)

Bezzi · 23 Mar 2023 22:16 UTC
18 points
6 comments · 1 min read · LW link

ACX meetup [April]

sallatik · 23 Mar 2023 20:40 UTC
1 point
0 comments · 1 min read · LW link

Feature idea: extra info about post author’s response to comments.

Nathan Helm-Burger · 23 Mar 2023 20:14 UTC
6 points
0 comments · 1 min read · LW link

Limit intelligent weapons

Lucas Pfeifer · 23 Mar 2023 17:54 UTC
−11 points
36 comments · 1 min read · LW link

We have to Upgrade

Jed McCaleb · 23 Mar 2023 17:53 UTC
126 points
35 comments · 2 min read · LW link

The Overton Window widens: Examples of AI risk in the media

Akash · 23 Mar 2023 17:10 UTC
107 points
24 comments · 6 min read · LW link

GPT-4 aligning with acasual decision theory when instructed to play games, but includes a CDT explanation that’s incorrect if they differ

Christopher King · 23 Mar 2023 16:16 UTC
7 points
4 comments · 8 min read · LW link

Is “FOXP2 speech & language disorder” really “FOXP2 forebrain fine-motor crappiness”?

Steven Byrnes · 23 Mar 2023 16:09 UTC
22 points
8 comments · 5 min read · LW link

EAI Alignment Speaker Series #1: Challenges for Safe & Beneficial Brain-Like Artificial General Intelligence with Steve Byrnes

23 Mar 2023 14:32 UTC
28 points
0 comments · 27 min read · LW link
(youtu.be)

[Question] Alignment-related jobs outside of London/SF

Ariel Kwiatkowski · 23 Mar 2023 13:24 UTC
26 points
14 comments · 1 min read · LW link

Zuzalu

vincentweisser · 23 Mar 2023 11:24 UTC
3 points
0 comments · 1 min read · LW link

How Do Induction Heads Actually Work in Transformers With Finite Capacity?

Fabien Roger · 23 Mar 2023 9:09 UTC
27 points
0 comments · 5 min read · LW link

ChatGPT’s “fuzzy alignment” isn’t evidence of AGI alignment: the banana test

Michael Tontchev · 23 Mar 2023 7:12 UTC
23 points
6 comments · 4 min read · LW link

Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Microsoft Research

DragonGod · 23 Mar 2023 5:45 UTC
68 points
23 comments · 1 min read · LW link
(arxiv.org)

Transcript: NBC Nightly News: AI ‘race to recklessness’ w/ Tristan Harris, Aza Raskin

WilliamKiely · 23 Mar 2023 1:04 UTC
63 points
4 comments · 3 min read · LW link

Why We MUST Create an AGI that Disempowers Humanity. For Real.

twkaiser · 22 Mar 2023 23:01 UTC
−17 points
1 comment · 4 min read · LW link

Progress links and tweets, 2023-03-22

jasoncrawford · 22 Mar 2023 22:19 UTC
13 points
0 comments · 2 min read · LW link
(rootsofprogress.org)

[Question] How to convince someone AGI is coming soon?

Zohar Jackson · 22 Mar 2023 22:16 UTC
5 points
7 comments · 1 min read · LW link

Harry Potter in The World of Path Semantics

Sven Nilsen · 22 Mar 2023 20:22 UTC
−3 points
17 comments · 1 min read · LW link
(raw.githubusercontent.com)

Books: Lend, Don’t Give

jefftk · 22 Mar 2023 18:40 UTC
28 points
2 comments · 1 min read · LW link
(www.jefftk.com)

[Linkpost] Shorter version of report on existential risk from power-seeking AI

Joe Carlsmith · 22 Mar 2023 18:09 UTC
7 points
0 comments · 1 min read · LW link

Announcing the European Network for AI Safety (ENAIS)

Esben Kran · 22 Mar 2023 17:57 UTC
19 points
0 comments · 1 min read · LW link

[Question] Genuine question: If Eliezer is so rational why is he fat?

DirichletConvolution · 22 Mar 2023 17:41 UTC
−40 points
6 comments · 1 min read · LW link

Making better estimates with scarce information

Stan Pinsent · 22 Mar 2023 17:40 UTC
11 points
5 comments · 10 min read · LW link

Anki with Uncertainty: Turn any flashcard deck into a calibration training tool

Sage Future · 22 Mar 2023 17:26 UTC
14 points
2 comments · 1 min read · LW link

Key Questions for Digital Minds

Jacy Reese Anthis · 22 Mar 2023 17:13 UTC
22 points
0 comments · 7 min read · LW link
(www.sentienceinstitute.org)

Empirical risk minimization is fundamentally confused

Jesse Hoogland · 22 Mar 2023 16:58 UTC
32 points
5 comments · 1 min read · LW link

[Question] Challenge: Does ChatGPT ever claim that a bad outcome for humanity is actually good?

Yair Halberstadt · 22 Mar 2023 16:01 UTC
49 points
29 comments · 1 min read · LW link

The space of systems and the space of maps

22 Mar 2023 14:59 UTC
39 points
0 comments · 5 min read · LW link

Feature Request to OpenAI: Share button in ChatGPT

Taleuntum · 22 Mar 2023 14:19 UTC
14 points
4 comments · 2 min read · LW link

Why AI Safety is Hard

Simon Möller · 22 Mar 2023 10:44 UTC
3 points
0 comments · 6 min read · LW link

[Question] Was Saga of Tatiana the Funny made by Fushimi Gaku?

Eve Grey · 22 Mar 2023 9:59 UTC
−9 points
0 comments · 1 min read · LW link

The Gom Jabbar scene from Dune is essentially a short film about what Rationality is for

mako yass · 22 Mar 2023 8:33 UTC
6 points
1 comment · 1 min read · LW link

Agentic GPT simulations: a risk and an opportunity

Yair Halberstadt · 22 Mar 2023 6:24 UTC
24 points
8 comments · 1 min read · LW link

Emergent Analogical Reasoning in Large Language Models

Roman Leventov · 22 Mar 2023 5:18 UTC
13 points
2 comments · 1 min read · LW link
(arxiv.org)

[Linkpost] GatesNotes: The Age of AI has begun

WilliamKiely · 22 Mar 2023 4:20 UTC
19 points
9 comments · 1 min read · LW link

An Appeal to AI Superintelligence: Reasons Not to Preserve (most of) Humanity

Alex Beyman · 22 Mar 2023 4:09 UTC
−15 points
6 comments · 19 min read · LW link

Truth and Advantage: Response to a draft of “AI safety seems hard to measure”

So8res · 22 Mar 2023 3:36 UTC
98 points
9 comments · 5 min read · LW link

A Proposed Approach for AI Safety Movement Building: Projects, Professions, Skills, and Ideas for the Future [long post][bounty for feedback]

peterslattery · 22 Mar 2023 1:11 UTC
14 points
0 comments · 32 min read · LW link

Principles for Productive Group Meetings

jsteinhardt · 22 Mar 2023 0:50 UTC
60 points
1 comment · 13 min read · LW link
(bounded-regret.ghost.io)

God vs AI scientifically

Donatas Lučiūnas · 21 Mar 2023 23:03 UTC
−22 points
40 comments · 1 min read · LW link

A method for empirical back-testing of AI’s ability to self-improve

Michael Tontchev · 21 Mar 2023 20:24 UTC
3 points
0 comments · 2 min read · LW link

the QACI alignment plan: table of contents

Tamsin Leake · 21 Mar 2023 20:22 UTC
102 points
0 comments · 1 min read · LW link
(carado.moe)

AI Fables

Bard · 21 Mar 2023 19:19 UTC
18 points
12 comments · 4 min read · LW link

[Question] Adversarial (SEO) GPT training data?

Dagon · 21 Mar 2023 18:55 UTC
2 points
0 comments · 1 min read · LW link

[Question] Why not constrain wetlabs instead of AI?

Lone Pine · 21 Mar 2023 18:02 UTC
15 points
10 comments · 1 min read · LW link

[Question] Wouldn’t an intelligent agent keep us alive and help us align itself to our values in order to prevent risk ? by Risk I mean experimentation by trying to align potentially smarter replicas?

Terrence Rotoufle · 21 Mar 2023 17:44 UTC
−3 points
1 comment · 2 min read · LW link

[Question] Employer considering partnering with major AI labs. What to do?

GraduallyMoreAgitated · 21 Mar 2023 17:43 UTC
37 points
7 comments · 2 min read · LW link

Sun-following Garden Mirrors?

jefftk · 21 Mar 2023 16:20 UTC
15 points
5 comments · 1 min read · LW link
(www.jefftk.com)