Whole Bird Emu­la­tion re­quires Quan­tum Mechanics

Jeffrey HeningerFeb 14, 2023, 11:50 PM
25 points
9 comments3 min readLW link
(aiimpacts.org)

Qual­ities that al­ign­ment men­tors value in ju­nior researchers

Orpheus16Feb 14, 2023, 11:27 PM
88 points
14 comments3 min readLW link

Help Up­date TryContra

jefftkFeb 14, 2023, 7:10 PM
12 points
0 comments1 min readLW link
(www.jefftk.com)

Con­tent Fea­tures Aren’t Enough for De­tect­ing Tox­i­c­ity. One Needs User Fea­tures.

Zachary WittenFeb 14, 2023, 6:48 PM
11 points
0 comments3 min readLW link

EIS III: Broad Cri­tiques of In­ter­pretabil­ity Research

scasperFeb 14, 2023, 6:24 PM
20 points
2 comments11 min readLW link

[Question] What would an AI need to boot­strap re­cur­sively self im­prov­ing robots?

Yair HalberstadtFeb 14, 2023, 5:58 PM
3 points
5 comments1 min readLW link

[linkpost] Bet­ter Without AI

DanielFilanFeb 14, 2023, 5:30 PM
47 points
13 comments1 min readLW link
(betterwithout.ai)

The Cave Alle­gory Re­vis­ited: Un­der­stand­ing GPT’s Worldview

Jan_KulveitFeb 14, 2023, 4:00 PM
86 points
5 comments3 min readLW link

[Question] Why should we ex­pect AIs to co­or­di­nate well?

Jonathan PaulsonFeb 14, 2023, 3:50 PM
25 points
9 comments1 min readLW link

Ex­plain­ing SolidGoldMag­ikarp by look­ing at it from ran­dom directions

Robert_AIZIFeb 14, 2023, 2:54 PM
8 points
0 comments8 min readLW link
(aizi.substack.com)

Re­v­erse-cor­re­la­tion: how to sum­mon the ghost of your men­tal imagery

MalmesburyFeb 14, 2023, 2:15 PM
40 points
0 comments5 min readLW link

Eval­u­at­ing 2022 ACX Predictions

ZviFeb 14, 2023, 12:20 PM
20 points
3 comments23 min readLW link
(thezvi.wordpress.com)

SolidGoldMag­ikarp III: Glitch to­ken archaeology

Feb 14, 2023, 10:17 AM
91 points
35 comments16 min readLW link

The Lin­guis­tic Blind Spot of Value-Aligned Agency, Nat­u­ral and Ar­tifi­cial

Roman LeventovFeb 14, 2023, 6:57 AM
6 points
0 comments2 min readLW link
(arxiv.org)

Con­cep­tual Pathfinding

DirectedEvolutionFeb 14, 2023, 5:49 AM
18 points
6 comments3 min readLW link

Im­por­tant fact about how peo­ple eval­u­ate sets of arguments

Daniel KokotajloFeb 14, 2023, 5:27 AM
33 points
11 comments2 min readLW link

[Question] How much is death a limit on knowl­edge ac­cu­mu­la­tion?

Gordon Seidoh WorleyFeb 14, 2023, 3:54 AM
31 points
9 comments2 min readLW link

The Filan Cabi­net Pod­cast with Oliver Habryka—Transcript

Feb 14, 2023, 2:38 AM
101 points
9 comments72 min readLW link

[Question] Is In­struc­tGPT Fol­low­ing In­struc­tions in Other Lan­guages Sur­pris­ing?

DragonGodFeb 13, 2023, 11:26 PM
39 points
15 comments1 min readLW link

LLM Ba­sics: Embed­ding Spaces—Trans­former To­ken Vec­tors Are Not Points in Space

NickyPFeb 13, 2023, 6:52 PM
83 points
11 comments15 min readLW link

4 ways to think about de­moc­ra­tiz­ing AI [GovAI Linkpost]

Orpheus16Feb 13, 2023, 6:06 PM
24 points
4 comments1 min readLW link
(www.governance.ai)

Does the AGPL Work?

jefftkFeb 13, 2023, 2:20 PM
13 points
12 comments2 min readLW link
(www.jefftk.com)

H5N1

ZviFeb 13, 2023, 12:50 PM
102 points
1 comment9 min readLW link
(thezvi.wordpress.com)

En­joy LessWrong in ebook format

Bart BussmannFeb 13, 2023, 11:53 AM
54 points
3 comments1 min readLW link

Mor­pholog­i­cal in­tel­li­gence, su­per­hu­man em­pa­thy, and eth­i­cal arbitration

Roman LeventovFeb 13, 2023, 10:25 AM
1 point
0 comments2 min readLW link

South Bay ACX/​LW Meetup

ISFeb 13, 2023, 6:08 AM
3 points
0 comments1 min readLW link

Idea: Net­work mod­u­lar­ity and in­ter­pretabil­ity by sex­ual reproduction

qbolecFeb 12, 2023, 11:06 PM
3 points
3 comments1 min readLW link

The End of Anonymity Online

SpioradFeb 12, 2023, 9:23 PM
3 points
9 comments2 min readLW link

Matt Clancy AMA on the Progress Forum

jasoncrawfordFeb 12, 2023, 8:23 PM
17 points
0 comments1 min readLW link
(progressforum.org)

La­tent vari­ables for pre­dic­tion mar­kets: mo­ti­va­tion, tech­ni­cal guide, and de­sign considerations

tailcalledFeb 12, 2023, 5:54 PM
100 points
25 comments23 min readLW link2 reviews

The con­cep­tual Dop­pelgänger problem

TsviBTFeb 12, 2023, 5:23 PM
12 points
5 comments4 min readLW link

How Car­dioid Are Car­dioids?

jefftkFeb 12, 2023, 4:20 PM
9 points
0 comments2 min readLW link
(www.jefftk.com)

How many of these jobs will have a 15% or more drop in em­ploy­ment plau­si­bly at­tributable to AI by 2031?

tailcalledFeb 12, 2023, 3:40 PM
12 points
5 comments1 min readLW link
(manifold.markets)

Hu­man-AI col­lab­o­ra­tive writing

DirectedEvolutionFeb 12, 2023, 2:57 PM
20 points
2 comments5 min readLW link

RaD-AI workshop

Ram RachumFeb 12, 2023, 12:46 PM
3 points
0 comments1 min readLW link

Ele­ments of Ra­tion­al­ist Discourse

Rob BensingerFeb 12, 2023, 7:58 AM
224 points
49 comments3 min readLW link1 review

Con­flict The­ory of Bounded Distrust

Zack_M_DavisFeb 12, 2023, 5:30 AM
112 points
33 comments3 min readLW link1 review

Why al­most ev­ery RL agent does learned optimization

Lee SharkeyFeb 12, 2023, 4:58 AM
32 points
3 comments5 min readLW link

How I Learn From Textbooks

DirectedEvolutionFeb 12, 2023, 4:45 AM
26 points
3 comments8 min readLW link

Top YouTube chan­nel Ver­i­ta­sium re­leases video on Sleep­ing Beauty Problem

Alex_AltairFeb 11, 2023, 8:36 PM
25 points
22 comments1 min readLW link
(www.youtube.com)

Short­en­ing Timelines: There’s No Buffer Anymore

Jeff RoseFeb 11, 2023, 7:53 PM
10 points
5 comments1 min readLW link

We Found An Neu­ron in GPT-2

Feb 11, 2023, 6:27 PM
143 points
23 comments7 min readLW link
(clementneo.com)

The Prac­ti­tioner’s Path 2.0: the Prag­ma­tist Archetype

EvenflairFeb 11, 2023, 3:48 PM
21 points
0 comments2 min readLW link
(guildoftherose.org)

The Illu­sion of Sim­plic­ity: Mone­tary Policy as a Prob­lem of Com­plex­ity and Alignment

Edward P. KöningsFeb 11, 2023, 3:04 PM
8 points
0 comments8 min readLW link
(edwardknings.substack.com)

In Defense of Chat­bot Romance

Kaj_SotalaFeb 11, 2023, 2:30 PM
124 points
53 comments11 min readLW link
(kajsotala.fi)

Threat­en­ing to do the im­pos­si­ble: A solu­tion to spu­ri­ous coun­ter­fac­tu­als for func­tional de­ci­sion the­ory via proof theory

Christopher KingFeb 11, 2023, 7:57 AM
5 points
4 comments5 min readLW link

Ra­tion­al­ity-re­lated things I don’t know as of 2023

Adam ZernerFeb 11, 2023, 6:04 AM
64 points
59 comments3 min readLW link

A note on ‘semiotic physics’

metasemiFeb 11, 2023, 5:12 AM
11 points
13 comments6 min readLW link

Inequal­ity Penalty: Mo­ral­ity in Many Worlds

ShmiFeb 11, 2023, 4:08 AM
11 points
17 comments6 min readLW link

The Im­por­tance of AI Align­ment, ex­plained in 5 points

Daniel_EthFeb 11, 2023, 2:56 AM
33 points
2 commentsLW link