Vi­su­al­i­sa­tion of Prob­a­bil­ity Mass

brookJan 25, 2023, 3:09 PM
7 points
0 commentsLW link

When Did EA Start?

jefftkJan 25, 2023, 2:30 PM
37 points
2 comments2 min readLW link
(www.jefftk.com)

Some Thoughts on AI Art

abramdemskiJan 25, 2023, 2:18 PM
74 points
20 comments7 min readLW link

Quick thoughts on “scal­able over­sight” /​ “su­per-hu­man feed­back” research

David Scott Krueger (formerly: capybaralet)Jan 25, 2023, 12:55 PM
27 points
9 comments2 min readLW link

Sapir-Whorf for Rationalists

Duncan Sabien (Inactive)Jan 25, 2023, 7:58 AM
155 points
49 comments19 min readLW link

ChatGPT vs the 2-4-6 Task

cwilluJan 25, 2023, 6:59 AM
20 points
4 comments3 min readLW link

Pes­simistic Shard Theory

Garrett BakerJan 25, 2023, 12:59 AM
72 points
13 comments3 min readLW link

Thatcher’s Axiom

Edward P. KöningsJan 24, 2023, 10:35 PM
10 points
22 comments4 min readLW link

[Question] Some ques­tions about free will compatibilism

Asking QuestionsJan 24, 2023, 9:54 PM
3 points
21 comments6 min readLW link

Alexan­der and Yud­kowsky on AGI goals

Jan 24, 2023, 9:09 PM
179 points
53 comments26 min readLW link1 review

[Question] Is _The Age of AI: And Our Hu­man Fu­ture_ worth reading

jmhJan 24, 2023, 9:05 PM
4 points
0 comments1 min readLW link

In­verse Scal­ing Prize: Se­cond Round Winners

Jan 24, 2023, 8:12 PM
58 points
17 comments15 min readLW link

ChatGPT in­ti­mates a tan­ta­l­iz­ing fu­ture; its core LLM is or­ga­nized on mul­ti­ple lev­els; and it has bro­ken the idea of think­ing.

Bill BenzonJan 24, 2023, 7:05 PM
5 points
0 comments5 min readLW link

How-to Trans­former Mechanis­tic In­ter­pretabil­ity—in 50 lines of code or less!

StefanHexJan 24, 2023, 6:45 PM
47 points
5 comments13 min readLW link

The Cabi­net of Wikipe­dian Curiosities

Sam EnrightJan 24, 2023, 6:22 PM
36 points
5 comments6 min readLW link
(samenright.com)

Ex­plana­tory Par­si­mony, Ex­plana­tory Su­perflu­ous­ness and Use­less­ness of New­ton’s First Law

Jimdrix_HendriJan 24, 2023, 5:21 PM
−2 points
7 comments2 min readLW link

Guessti­mate: Why and how to use it

Jan 24, 2023, 4:24 PM
8 points
0 comments3 min readLW link
(forum.effectivealtruism.org)

GWWC Pledge History

jefftkJan 24, 2023, 3:50 PM
15 points
0 comments3 min readLW link
(www.jefftk.com)

Gra­di­ent hack­ing is ex­tremely difficult

berenJan 24, 2023, 3:45 PM
170 points
22 comments5 min readLW link

[Question] What sci-fi books are most rele­vant to a fu­ture with trans­for­ma­tive AI?

sidJan 24, 2023, 3:30 PM
2 points
9 comments1 min readLW link

Grant-mak­ing in EA should con­sider peer-re­view­ing grant ap­pli­ca­tions along the pub­lic-sec­tor model

Ben SmithJan 24, 2023, 3:01 PM
0 points
3 commentsLW link

“Endgame safety” for AGI

Steven ByrnesJan 24, 2023, 2:15 PM
85 points
10 comments6 min readLW link

Thoughts on hard­ware /​ com­pute re­quire­ments for AGI

Steven ByrnesJan 24, 2023, 2:03 PM
63 points
32 comments24 min readLW link

Pa­ram­e­ter Scal­ing Comes for RL, Maybe

1a3ornJan 24, 2023, 1:55 PM
100 points
3 comments14 min readLW link

How to find cool things in a new place

Sam F. BrownJan 24, 2023, 11:20 AM
12 points
0 comments1 min readLW link

[Cross­post] ACX 2022 Pre­dic­tion Con­test Results

Jan 24, 2023, 6:56 AM
48 points
6 comments8 min readLW link

The Hu­man-AI Reflec­tive Equilibrium

Allison DuettmannJan 24, 2023, 1:32 AM
22 points
1 comment24 min readLW link

“Sta­tus” can be cor­ro­sive; here’s how I han­dle it

Orpheus16Jan 24, 2023, 1:25 AM
71 points
8 comments6 min readLW link

[Question] What area of the digi­tal do­main seems safe from AI in the next 5-10 years?

Adrien ChauvetJan 24, 2023, 1:16 AM
11 points
14 comments1 min readLW link

Some of my dis­agree­ments with List of Lethalities

TurnTroutJan 24, 2023, 12:25 AM
70 points
7 comments10 min readLW link

Round­ing Some­one Off

David UdellJan 24, 2023, 12:03 AM
25 points
0 comments5 min readLW link

Life Has a Cruel Symmetry

philhJan 23, 2023, 11:40 PM
21 points
5 comments11 min readLW link
(reasonableapproximation.net)

High­lights and Prizes from the 2021 Re­view Phase

RaemonJan 23, 2023, 9:41 PM
38 points
14 comments21 min readLW link

[Question] AI safety mile­stones?

Zach Stein-PerlmanJan 23, 2023, 9:00 PM
7 points
5 comments1 min readLW link

[Question] A post-quan­tum the­ory of clas­si­cal grav­ity?

Logan ZoellnerJan 23, 2023, 8:39 PM
13 points
5 comments1 min readLW link

Meals For Un­clear Die­tary Restrictions

jefftkJan 23, 2023, 8:00 PM
17 points
3 comments2 min readLW link
(www.jefftk.com)

It’s ok

stratospherJan 23, 2023, 6:11 PM
1 point
0 comments2 min readLW link

Ex­per­i­ment­ing with beta.char­ac­ter.ai

svemirskiJan 23, 2023, 5:31 PM
−3 points
5 comments1 min readLW link

This week in fashion

JanJan 23, 2023, 5:23 PM
29 points
7 comments7 min readLW link
(universalprior.substack.com)

Movie Re­view: Megan

ZviJan 23, 2023, 12:50 PM
60 points
19 comments24 min readLW link
(thezvi.wordpress.com)

[Question] Has pri­vate AGI re­search made in­de­pen­dent safety re­search in­effec­tive already? What should we do about this?

Roman LeventovJan 23, 2023, 7:36 AM
43 points
5 comments5 min readLW link

De­con­fus­ing “Ca­pa­bil­ities vs. Align­ment”

RobertMJan 23, 2023, 4:46 AM
27 points
7 comments2 min readLW link

What a com­pute-cen­tric frame­work says about AI take­off speeds

Tom DavidsonJan 23, 2023, 4:02 AM
188 points
30 comments16 min readLW link1 review

Philly Rat Fest

LoganChipkinJan 23, 2023, 4:01 AM
9 points
0 comments1 min readLW link

EA & LW Fo­rum Weekly Sum­mary (16th − 22nd Jan ’23)

Zoe WilliamsJan 23, 2023, 3:46 AM
13 points
0 commentsLW link

Con­sider Try­ing Dictation

jefftkJan 22, 2023, 10:50 PM
23 points
10 comments2 min readLW link
(www.jefftk.com)

Emo­tional at­tach­ment to AIs opens doors to problems

Igor IvanovJan 22, 2023, 8:28 PM
20 points
10 comments4 min readLW link

What fills a vac­uum?

Logan KiellerJan 22, 2023, 7:25 PM
11 points
6 comments2 min readLW link

Gem­ini mod­el­ing

TsviBTJan 22, 2023, 2:28 PM
12 points
8 comments11 min readLW link

Large lan­guage mod­els learn to rep­re­sent the world

gjmJan 22, 2023, 1:10 PM
101 points
20 comments3 min readLW link1 review