The shape of AGI: Car­toons and back of envelope

boazbarak17 Jul 2023 20:57 UTC
27 points
18 comments6 min readLW link

Pre­dic­tive his­tory classes

dkl917 Jul 2023 20:48 UTC
67 points
17 comments2 min readLW link
(dkl9.net)

High­lights from The In­dus­trial Revolu­tion, by T. S. Ashton

jasoncrawford17 Jul 2023 19:02 UTC
17 points
0 comments10 min readLW link
(rootsofprogress.org)

Ex­is­ten­tial Risk Per­sua­sion Tournament

PeterMcCluskey17 Jul 2023 18:04 UTC
71 points
1 comment8 min readLW link
(bayesianinvestor.com)

[In­ter­view w/​ Rob Miles] The case for tak­ing AI Safety seriously

fowlertm17 Jul 2023 17:08 UTC
17 points
1 comment1 min readLW link

An­nounc­ing the Ex­is­ten­tial In­foSec Forum

calebp9917 Jul 2023 17:05 UTC
10 points
0 comments2 min readLW link

Nar­ra­tive The­ory. Part 4. Neu­ral Darwinism

Eris17 Jul 2023 16:45 UTC
3 points
0 comments2 min readLW link

Sapi­ent Algorithms

Valentine17 Jul 2023 16:30 UTC
80 points
15 comments5 min readLW link

New ca­reer re­view: AI safety tech­ni­cal research

Benjamin Hilton17 Jul 2023 15:34 UTC
14 points
0 comments1 min readLW link

[Question] Con­di­tional on liv­ing in a AI safety/​al­ign­ment by de­fault uni­verse, what are the im­pli­ca­tions of this as­sump­tion be­ing true?

Noosphere8917 Jul 2023 14:44 UTC
26 points
10 comments1 min readLW link

Thoughts on “Pro­cess-Based Su­per­vi­sion”

Steven Byrnes17 Jul 2023 14:08 UTC
74 points
4 comments23 min readLW link

Proof of pos­te­ri­or­ity: a defense against AI-gen­er­ated misinformation

jchan17 Jul 2023 12:04 UTC
32 points
3 comments5 min readLW link

An Overview of AI risks—the Flyer

17 Jul 2023 12:03 UTC
20 points
0 comments1 min readLW link
(docs.google.com)

[Question] Build knowl­edge base first, or backchain?

NicholasKross17 Jul 2023 3:44 UTC
11 points
5 comments1 min readLW link

A fic­tional AI law laced w/​ al­ign­ment theory

MiguelDev17 Jul 2023 1:42 UTC
6 points
0 comments2 min readLW link

Au­toIn­ter­pre­ta­tion Finds Sparse Cod­ing Beats Alternatives

Hoagy17 Jul 2023 1:41 UTC
54 points
1 comment7 min readLW link

An up­com­ing US Supreme Court case may im­pede AI gov­er­nance efforts

NickGabs16 Jul 2023 23:51 UTC
57 points
17 comments2 min readLW link

Weak Ev­i­dence is Common

dkl916 Jul 2023 23:37 UTC
7 points
5 comments1 min readLW link
(dkl9.net)

Even briefer sum­mary of ai-plans.com

Iknownothing16 Jul 2023 23:25 UTC
10 points
6 comments2 min readLW link
(www.ai-plans.com)

Mech In­terp Puz­zle 1: Sus­pi­ciously Similar Embed­dings in GPT-Neo

Neel Nanda16 Jul 2023 22:02 UTC
65 points
15 comments1 min readLW link

A Tech­nol­ogy of Every­thing – Part 1: A Mag­i­cal Science Experiment

aiuisensei16 Jul 2023 22:01 UTC
−3 points
0 comments7 min readLW link
(www.aiui.cloud)

Scal­ing and Sus­tain­ing Stan­dards: A Case Study on the Basel Accords

Conrad K.16 Jul 2023 22:01 UTC
8 points
1 comment7 min readLW link
(docs.google.com)

AI, Con­scious­ness, and the prob­lem of Mo­ral Considerability

stultus16 Jul 2023 19:56 UTC
1 point
0 comments2 min readLW link

Nar­ra­tive The­ory. Part 3. Sim­plest to succeed

Eris16 Jul 2023 14:41 UTC
4 points
0 comments1 min readLW link

Ru­n­away Op­ti­miz­ers in Mind Space

silentbob16 Jul 2023 14:26 UTC
16 points
0 comments12 min readLW link

[Question] Is Adam Elga’s proof for thirdism in Sleep­ing Beauty still con­sid­ered to be sound?

Ape in the coat16 Jul 2023 14:11 UTC
8 points
25 comments1 min readLW link

A sim­ple way of ex­ploit­ing AI’s com­ing eco­nomic im­pact may be highly-impactful

kuira16 Jul 2023 9:33 UTC
11 points
2 comments2 min readLW link

Ac­ti­va­tion adding ex­per­i­ments with llama-7b

Nina Rimsky16 Jul 2023 4:17 UTC
50 points
1 comment3 min readLW link

In­tro­duc­ción al Riesgo Ex­is­ten­cial de In­teligen­cia Artificial

david.friva15 Jul 2023 20:37 UTC
4 points
2 comments4 min readLW link
(youtu.be)

The hous­ing crisis, ex­plained us­ing game theory

Johnstone15 Jul 2023 20:27 UTC
4 points
2 comments8 min readLW link

Only a hack can solve the shut­down problem

dp15 Jul 2023 20:26 UTC
5 points
0 comments8 min readLW link

Ro­bust­ness of Model-Graded Eval­u­a­tions and Au­to­mated Interpretability

15 Jul 2023 19:12 UTC
44 points
5 comments9 min readLW link

[Question] How to deal with fear of failure?

TeaTieAndHat15 Jul 2023 18:57 UTC
1 point
2 comments1 min readLW link

Sim­plified bio-an­chors for up­per bounds on AI timelines

Fabien Roger15 Jul 2023 18:15 UTC
20 points
4 comments5 min readLW link

A Hill of Val­idity in Defense of Meaning

Zack_M_Davis15 Jul 2023 17:57 UTC
8 points
118 comments75 min readLW link
(unremediatedgender.space)

What is a cog­ni­tive bias?

Lionel15 Jul 2023 13:01 UTC
1 point
0 comments2 min readLW link
(lionelpage.substack.com)

[Question] When peo­ple say robots will steal jobs, what kinds of jobs are never im­plied?

Mary Chernyshenko15 Jul 2023 10:50 UTC
5 points
12 comments1 min readLW link

Nar­ra­tive The­ory. Part 2. A new way of do­ing the same thing

Eris15 Jul 2023 10:37 UTC
2 points
0 comments1 min readLW link

How to use ChatGPT to get bet­ter book & movie recommendations

KatWoods15 Jul 2023 8:55 UTC
28 points
3 comments1 min readLW link

[Question] Would you take a job mak­ing hu­manoid robots for an AGI?

Super AGI15 Jul 2023 5:26 UTC
−1 points
2 comments1 min readLW link

Ra­tion­al­ity, Ped­a­gogy, and “Vibes”: Quick Thoughts

NicholasKross15 Jul 2023 2:09 UTC
14 points
1 comment4 min readLW link

(redacted) Ano­ma­lous to­kens might dis­pro­por­tionately af­fect com­plex lan­guage tasks

nikola15 Jul 2023 0:48 UTC
4 points
0 comments7 min readLW link

Why was the AI Align­ment com­mu­nity so un­pre­pared for this mo­ment?

Ras151315 Jul 2023 0:26 UTC
119 points
65 comments2 min readLW link

Physics is Ul­ti­mately Subjective

Gordon Seidoh Worley14 Jul 2023 22:19 UTC
5 points
34 comments3 min readLW link

[Question] How should a ra­tio­nal agent con­struct their util­ity func­tion when faced with ex­is­tence?

Aman Rusia14 Jul 2023 19:48 UTC
−2 points
1 comment1 min readLW link

AI Risk and Sur­vivor­ship Bias—How An­dreessen and LeCun got it wrong

Štěpán Los14 Jul 2023 17:43 UTC
13 points
2 comments6 min readLW link

Un­safe AI as Dy­nam­i­cal Systems

Robert_AIZI14 Jul 2023 15:31 UTC
11 points
0 comments3 min readLW link
(aizi.substack.com)

A Short Sum­mary of “Fo­cus Your Uncer­tainty”

Stephen James14 Jul 2023 11:18 UTC
2 points
0 comments1 min readLW link

Do the change you want to see in the world

TeaTieAndHat14 Jul 2023 10:19 UTC
7 points
0 comments1 min readLW link

Gear­ing Up for Long Timelines in a Hard World

Dalcy14 Jul 2023 6:11 UTC
13 points
0 comments4 min readLW link