Adum­bra­tions on AGI from an outsider

nicholashaldenMay 24, 2023, 5:41 PM
57 points
44 comments8 min readLW link
(nicholashalden.home.blog)

Ex­plain­ing “Hell is Game The­ory Folk The­o­rems”

electroswingMay 5, 2023, 11:33 PM
57 points
21 comments5 min readLW link

PaLM-2 & GPT-4 in “Ex­trap­o­lat­ing GPT-N perfor­mance”

Lukas FinnvedenMay 30, 2023, 6:33 PM
57 points
6 comments6 min readLW link

«Boundaries/​Mem­branes» and AI safety compilation

ChipmonkMay 3, 2023, 9:41 PM
56 points
17 comments8 min readLW link

The pos­si­ble shared Craft of de­liber­ate Lex­i­co­ge­n­e­sis

TsviBTMay 20, 2023, 5:56 AM
56 points
5 comments5 min readLW link

The Treach­er­ous Turn is finished! (AI-takeover-themed table­top RPG)

Daniel KokotajloMay 22, 2023, 5:49 AM
55 points
5 comments2 min readLW link
(thetreacherousturn.ai)

The way AGI wins could look very stupid

Christopher KingMay 12, 2023, 4:34 PM
54 points
22 comments1 min readLW link

Towards Mea­sures of Optimisation

May 12, 2023, 3:29 PM
53 points
37 comments4 min readLW link

Are Emer­gent Abil­ities of Large Lan­guage Models a Mirage? [linkpost]

Matthew BarnettMay 2, 2023, 9:01 PM
53 points
19 comments1 min readLW link
(arxiv.org)

Yoshua Ben­gio ar­gues for tool-AI and to ban “ex­ec­u­tive-AI”

habrykaMay 9, 2023, 12:13 AM
53 points
15 comments7 min readLW link
(yoshuabengio.org)

Distil­la­tion of Neu­rotech and Align­ment Work­shop Jan­uary 2023

May 22, 2023, 7:17 AM
52 points
9 comments14 min readLW link

Job Open­ing: SWE to help build sig­na­ture vet­ting sys­tem for AI-re­lated petitions

May 20, 2023, 7:02 PM
52 points
0 comments1 min readLW link

The Ap­pren­tice Thread 2

hathMay 1, 2023, 8:09 PM
50 points
19 comments1 min readLW link

Gem­ini will bring the next big timeline update

p.b.May 29, 2023, 6:05 AM
50 points
6 comments1 min readLW link

Look At What’s In Front Of You (Con­clu­sion to The Nuts and Bolts of Nat­u­ral­ism)

LoganStrohlMay 25, 2023, 7:00 PM
50 points
1 comment2 min readLW link

Prop­er­ties of Good Textbooks

niplavMay 7, 2023, 8:38 AM
50 points
11 comments1 min readLW link

GPT as an “In­tel­li­gence Fork­lift.”

boazbarakMay 19, 2023, 9:15 PM
49 points
27 comments3 min readLW link

TED talk by Eliezer Yud­kowsky: Un­leash­ing the Power of Ar­tifi­cial Intelligence

bayesedMay 7, 2023, 5:45 AM
49 points
36 comments1 min readLW link
(www.youtube.com)

Self-ad­ministered EMDR with­out a ther­a­pist is very use­ful for a lot of things!

EternallyBlissfulMay 25, 2023, 5:54 PM
49 points
12 comments11 min readLW link

Three Iter­a­tive Processes

LoganStrohlMay 12, 2023, 2:50 AM
49 points
0 comments3 min readLW link

For­mal­iz­ing the “AI x-risk is un­likely be­cause it is ridicu­lous” argument

Christopher KingMay 3, 2023, 6:56 PM
48 points
17 comments3 min readLW link

The case for re­mov­ing al­ign­ment and ML re­search from the train­ing dataset

berenMay 30, 2023, 8:54 PM
48 points
8 comments5 min readLW link

Sup­port Struc­tures for Nat­u­ral­ist Study

LoganStrohlMay 15, 2023, 12:25 AM
47 points
6 comments10 min readLW link

New OpenAI Paper—Lan­guage mod­els can ex­plain neu­rons in lan­guage models

MrThinkMay 10, 2023, 7:46 AM
47 points
14 comments1 min readLW link

Product En­dorse­ment: Apollo Neuro

ElizabethMay 8, 2023, 7:00 PM
46 points
28 comments5 min readLW link
(acesounderglass.com)

In­finite-width MLPs as an “en­sem­ble prior”

Vivek HebbarMay 12, 2023, 11:45 AM
46 points
0 comments5 min readLW link

Un­der­stand­ing mesa-op­ti­miza­tion us­ing toy models

May 7, 2023, 5:00 PM
45 points
2 comments10 min readLW link

AI #13: Po­ten­tial Al­gorith­mic Improvements

ZviMay 25, 2023, 3:40 PM
45 points
4 comments67 min readLW link
(thezvi.wordpress.com)

Mak­ing Up Baby Signs

jefftkMay 9, 2023, 4:40 PM
44 points
6 comments2 min readLW link
(www.jefftk.com)

Do Dead­lines Make Us Less Creative?

lynettebyeMay 19, 2023, 3:41 PM
44 points
6 comments4 min readLW link

Wor­ry­ing less about acausal extortion

RaemonMay 23, 2023, 2:08 AM
43 points
11 comments13 min readLW link

LessWrong Com­mu­nity Week­end 2023 [Ap­pli­ca­tions now closed]

Henry ProwbellMay 1, 2023, 9:31 AM
43 points
0 comments6 min readLW link

Con­flicts be­tween emo­tional schemas of­ten in­volve in­ter­nal coercion

Richard_NgoMay 17, 2023, 10:02 AM
43 points
4 comments4 min readLW link

Papers, Please #1: Var­i­ous Papers on Em­ploy­ment, Wages and Productivity

ZviMay 22, 2023, 12:00 PM
42 points
2 comments8 min readLW link
(thezvi.wordpress.com)

D&D.Sci 5E: Re­turn of the League of Defenders

aphyerMay 26, 2023, 8:39 PM
42 points
11 comments3 min readLW link

Difficul­ties in mak­ing pow­er­ful al­igned AI

DanielFilanMay 14, 2023, 8:50 PM
41 points
1 comment10 min readLW link
(danielfilan.com)

Why I’m Not (Yet) A Full-Time Tech­ni­cal Align­ment Researcher

Nicholas / Heather KrossMay 25, 2023, 1:26 AM
41 points
21 comments4 min readLW link
(www.thinkingmuchbetter.com)

$500 Bounty/​Prize Prob­lem: Chan­nel Ca­pac­ity Us­ing “Insen­si­tive” Functions

johnswentworthMay 16, 2023, 9:31 PM
40 points
11 comments2 min readLW link

Should Ra­tional An­i­ma­tions in­vite view­ers to read con­tent on LessWrong?

WriterMay 27, 2023, 7:26 PM
40 points
9 comments3 min readLW link

Align­ment Re­search @ EleutherAI

Curtis HuebnerMay 3, 2023, 10:45 PM
40 points
1 comment3 min readLW link
(blog.eleuther.ai)

$300 for the best sci-fi prompt

RomanSMay 17, 2023, 4:23 AM
40 points
30 comments2 min readLW link

The Rocket Align­ment Prob­lem, Part 2

ZviMay 1, 2023, 2:30 PM
40 points
20 comments9 min readLW link
(thezvi.wordpress.com)

Robin Han­son and I talk about AI risk

KatjaGraceMay 4, 2023, 10:20 PM
39 points
8 comments1 min readLW link
(worldspiritsockpuppet.com)

How to get good at programming

Ulisse MiniMay 5, 2023, 1:14 AM
39 points
3 comments2 min readLW link

[Linkpost] In­ter­pretabil­ity Dreams

DanielFilanMay 24, 2023, 9:08 PM
39 points
2 comments2 min readLW link
(transformer-circuits.pub)

Lan­guage Agents Re­duce the Risk of Ex­is­ten­tial Catastrophe

May 28, 2023, 7:10 PM
39 points
14 comments26 min readLW link

I bet $500 on AI win­ning the IMO gold medal by 2026

azsantoskMay 11, 2023, 2:46 PM
37 points
29 comments1 min readLW link

Real­ity and re­al­ity-boxes

Jim PivarskiMay 13, 2023, 2:14 PM
37 points
11 comments21 min readLW link

Boomerang—pro­to­col to dis­solve some com­mit­ment races

Filip SondejMay 30, 2023, 4:21 PM
37 points
10 comments8 min readLW link

Mr. Meeseeks as an AI ca­pa­bil­ity tripwire

Eric ZhangMay 19, 2023, 11:33 AM
37 points
17 comments2 min readLW link