​Some Ad­ven­tures of a Cu­ri­ous Richard Feynman

Dalton Mabery6 Jul 2022 23:11 UTC
10 points
0 comments3 min readLW link

Cog­ni­tive Dis­so­nance on Cog­ni­tive Capability

niederman6 Jul 2022 22:53 UTC
6 points
0 comments1 min readLW link
(maxniederman.com)

Outer vs in­ner mis­al­ign­ment: three framings

Richard_Ngo6 Jul 2022 19:46 UTC
49 points
5 comments9 min readLW link

Tar­nished Guy who Puts a Num on it

Jacob Falkovich6 Jul 2022 18:05 UTC
44 points
11 comments4 min readLW link

Deep neu­ral net­works are not opaque.

jem-mosig6 Jul 2022 18:03 UTC
22 points
14 comments3 min readLW link

How hu­man­ity would re­spond to slow take­off, with take­aways from the en­tire COVID-19 pan­demic

Noosphere896 Jul 2022 17:52 UTC
4 points
1 comment2 min readLW link

[Question] Should you write un­der a blog or your own name?

Dalton Mabery6 Jul 2022 15:26 UTC
2 points
2 comments1 min readLW link

Car­ry­ing the Torch: A Re­sponse to Anna Sala­mon by the Guild of the Rose

moridinamael6 Jul 2022 14:20 UTC
133 points
16 comments6 min readLW link

Pre­dict­ing Parental Emo­tional Changes?

jefftk6 Jul 2022 13:50 UTC
39 points
11 comments2 min readLW link
(www.jefftk.com)

Ber­lin AI Safety Open Meetup July 2022

pranomostro6 Jul 2022 12:41 UTC
6 points
0 comments1 min readLW link

Fore­cast­ing Through Fiction

Yitz6 Jul 2022 5:03 UTC
5 points
2 comments8 min readLW link

In­tro­duc­ing the Fund for Align­ment Re­search (We’re Hiring!)

6 Jul 2022 2:07 UTC
62 points
0 comments4 min readLW link

My vi­sion of a good fu­ture, part I

Jeffrey Ladish6 Jul 2022 1:23 UTC
66 points
18 comments9 min readLW link

Im­pe­rial Rus­sia was do­ing fine with­out the Soviets

Davis Kedrosky5 Jul 2022 22:24 UTC
6 points
3 comments14 min readLW link
(daviskedrosky.substack.com)

A Pat­tern Lan­guage For Rationality

Vaniver5 Jul 2022 19:08 UTC
75 points
14 comments15 min readLW link

How to de­stroy the uni­verse with a hypercomputer

Trevor Cappallo5 Jul 2022 19:05 UTC
2 points
3 comments1 min readLW link

The cu­ri­ous case of Pretty Good hu­man in­ner/​outer alignment

PavleMiha5 Jul 2022 19:04 UTC
41 points
45 comments4 min readLW link

When is it ap­pro­pri­ate to use statis­ti­cal mod­els and prob­a­bil­ities for de­ci­sion mak­ing ?

Younes Kamel5 Jul 2022 12:34 UTC
10 points
7 comments4 min readLW link
(youneskamel.substack.com)

Goal Factoring

CFAR!Duncan5 Jul 2022 7:10 UTC
80 points
2 comments8 min readLW link

As­sorted thoughts about ab­strac­tion

Adam Zerner5 Jul 2022 6:40 UTC
16 points
9 comments7 min readLW link

[AN #172] Sorry for the long hi­a­tus!

Rohin Shah5 Jul 2022 6:20 UTC
54 points
0 comments3 min readLW link
(mailchi.mp)

Out­line: The Rec­tify­ing of Maps

hamnox5 Jul 2022 5:14 UTC
7 points
0 comments2 min readLW link

[Question] Seek­ing opinions on the cur­rent and for­ward state of cryp­tocur­ren­cies.

jmh5 Jul 2022 5:01 UTC
7 points
6 comments1 min readLW link

ITT-pass­ing and ci­vil­ity are good; “char­ity” is bad; steel­man­ning is niche

Rob Bensinger5 Jul 2022 0:15 UTC
161 points
36 comments6 min readLW link1 review

Please help us com­mu­ni­cate AI xrisk. It could save the world.

otto.barten4 Jul 2022 21:47 UTC
4 points
7 comments2 min readLW link

Bench­mark for suc­cess­ful con­cept ex­trap­o­la­tion/​avoid­ing goal misgeneralization

Stuart_Armstrong4 Jul 2022 20:48 UTC
82 points
12 comments4 min readLW link

Pro­ce­du­ral Ex­ec­u­tive Func­tion, Part 1

DaystarEld4 Jul 2022 18:51 UTC
33 points
2 comments13 min readLW link
(daystareld.com)

An­thropic’s SoLU (Soft­max Lin­ear Unit)

Joel Burget4 Jul 2022 18:38 UTC
21 points
1 comment4 min readLW link
(transformer-circuits.pub)

Book Re­view: The Righ­teous Mind

ErnestScribbler4 Jul 2022 17:45 UTC
33 points
8 comments35 min readLW link

My Most Likely Rea­son to Die Young is AI X-Risk

AISafetyIsNotLongtermist4 Jul 2022 17:08 UTC
61 points
24 comments4 min readLW link
(forum.effectivealtruism.org)

Is Gen­eral In­tel­li­gence “Com­pact”?

DragonGod4 Jul 2022 13:27 UTC
27 points
6 comments22 min readLW link

Re­mak­ing Effi­cien­tZero (as best I can)

Hoagy4 Jul 2022 11:03 UTC
36 points
9 comments22 min readLW link

We Need a Con­soli­dated List of Bad AI Align­ment Solutions

Double4 Jul 2022 6:54 UTC
9 points
14 comments1 min readLW link

AI Fore­cast­ing: One Year In

jsteinhardt4 Jul 2022 5:10 UTC
132 points
12 comments6 min readLW link
(bounded-regret.ghost.io)

A com­pressed take on re­cent disagreements

kman4 Jul 2022 4:39 UTC
33 points
9 comments1 min readLW link

New US Se­nate Bill on X-Risk Miti­ga­tion [Linkpost]

Evan R. Murphy4 Jul 2022 1:25 UTC
35 points
12 comments1 min readLW link
(www.hsgac.senate.gov)

Monthly Shorts 6/​22

Celer3 Jul 2022 23:40 UTC
5 points
2 comments5 min readLW link
(keller.substack.com)

De­ci­sion the­ory and dy­namic inconsistency

paulfchristiano3 Jul 2022 22:20 UTC
79 points
33 comments10 min readLW link
(sideways-view.com)

Five routes of ac­cess to sci­en­tific literature

DirectedEvolution3 Jul 2022 20:53 UTC
13 points
4 comments6 min readLW link

Toni Kurz and the In­san­ity of Climb­ing Mountains

GeneSmith3 Jul 2022 20:51 UTC
268 points
67 comments11 min readLW link2 reviews

Won­der and The Golden AI Rule

JeffreyK3 Jul 2022 18:21 UTC
0 points
4 comments6 min readLW link

Evolu­tion Doesn’t Have Feelings

UtilityMonster3 Jul 2022 17:13 UTC
−1 points
0 comments1 min readLW link

Na­ture ab­hors an im­mutable repli­ca­tor… usually

MSRayne3 Jul 2022 15:08 UTC
28 points
10 comments3 min readLW link

Post hoc jus­tifi­ca­tions as Com­pres­sion Algorithm

Johannes C. Mayer3 Jul 2022 5:02 UTC
8 points
0 comments1 min readLW link

SOMA—A story about Consciousness

Johannes C. Mayer3 Jul 2022 4:46 UTC
10 points
0 comments1 min readLW link
(www.youtube.com)

Sex­ual self-acceptance

Johannes C. Mayer3 Jul 2022 4:26 UTC
11 points
6 comments1 min readLW link

Dono­hue, Le­vitt, Roe, and Wade: T-minus 20 years to a mas­sive crime wave?

Paul Logan3 Jul 2022 3:03 UTC
−24 points
6 comments3 min readLW link
(laulpogan.substack.com)

Can we achieve AGI Align­ment by bal­anc­ing mul­ti­ple hu­man ob­jec­tives?

Ben Smith3 Jul 2022 2:51 UTC
11 points
1 comment4 min readLW link

Trig­ger-Ac­tion Planning

CFAR!Duncan3 Jul 2022 1:42 UTC
81 points
14 comments13 min readLW link2 reviews

[Question] Which one of these two aca­demic routes should I take to end up in AI Safety?

Martín Soto3 Jul 2022 1:05 UTC
5 points
2 comments1 min readLW link