Gradient hacking: definitions and examples

Richard_Ngo 29 Jun 2022 21:35 UTC
38 points
2 comments 5 min read LW link

Progress links and tweets, 2022-06-29

jasoncrawford 29 Jun 2022 21:33 UTC
9 points
0 comments 1 min read LW link
(rootsofprogress.org)

[Question] Correcting human error vs doing exactly what you’re told—is there literature on this in context of general system design?

Jan Czechowski 29 Jun 2022 21:30 UTC
6 points
0 comments 1 min read LW link

Latent Adversarial Training

Adam Jermyn 29 Jun 2022 20:04 UTC
42 points
12 comments 5 min read LW link

Game Review: This Merchant Life

Zvi 29 Jun 2022 18:30 UTC
20 points
0 comments 13 min read LW link
(thezvi.wordpress.com)

Limits to Legibility

Jan_Kulveit 29 Jun 2022 17:42 UTC
138 points
11 comments 5 min read LW link 1 review

Will Capabilities Generalise More?

Ramana Kumar 29 Jun 2022 17:12 UTC
132 points
39 comments 4 min read LW link

Kevin Kelly’s “103 Bits of Advice,” Expanded

Dalton Mabery 29 Jun 2022 13:36 UTC
19 points
0 comments 5 min read LW link

The table of different sampling assumptions in anthropics

avturchin 29 Jun 2022 10:41 UTC
38 points
5 comments 12 min read LW link

Can We Align AI by Having It Learn Human Preferences? I’m Scared (summary of last third of Human Compatible)

apollonianblues 29 Jun 2022 4:09 UTC
19 points
3 comments 6 min read LW link

Kurzgesagt – The Last Human (Youtube)

habryka 29 Jun 2022 3:28 UTC
54 points
7 comments 1 min read LW link
(www.youtube.com)

[Question] Literature on How to Maximize Preferences

josh 28 Jun 2022 22:41 UTC
1 point
0 comments 1 min read LW link

Challenge: A Much More Alien Message

kman 28 Jun 2022 21:50 UTC
24 points
7 comments 1 min read LW link

It’s Probably Not Lithium

Natália 28 Jun 2022 21:24 UTC
442 points
186 comments 28 min read LW link 1 review

Reflections on Living in “Guess Culture”

Dalton Mabery 28 Jun 2022 21:00 UTC
13 points
1 comment 3 min read LW link

[Question] What is the LessWrong Logo(?) Supposed to Represent?

DragonGod 28 Jun 2022 20:20 UTC
8 points
6 comments 1 min read LW link

What Are You Tracking In Your Head?

johnswentworth 28 Jun 2022 19:30 UTC
276 points
81 comments 4 min read LW link 1 review

Why is so much political commentary misleading?

contrarianbrit 28 Jun 2022 17:10 UTC
−2 points
5 comments 6 min read LW link
(thomasprosser.substack.com)

CFAR Handbook: Introduction

CFAR!Duncan 28 Jun 2022 16:53 UTC
109 points
12 comments 1 min read LW link

Units of Exchange

CFAR!Duncan 28 Jun 2022 16:53 UTC
95 points
28 comments 11 min read LW link

Scott Aaronson and Steven Pinker Debate AI Scaling

Liron 28 Jun 2022 16:04 UTC
37 points
7 comments 1 min read LW link
(scottaaronson.blog)

A physicist’s approach to Origins of Life

pchvykov 28 Jun 2022 15:23 UTC
12 points
6 comments 16 min read LW link

What success looks like

28 Jun 2022 14:38 UTC
19 points
4 comments 1 min read LW link
(forum.effectivealtruism.org)

Four reasons I find AI safety emotionally compelling

28 Jun 2022 14:10 UTC
39 points
3 comments 4 min read LW link

Some alternative AI safety research projects

Michele Campolo 28 Jun 2022 14:09 UTC
9 points
0 comments 3 min read LW link

Doom doubts—is inner alignment a likely problem?

Crissman 28 Jun 2022 12:42 UTC
6 points
7 comments 1 min read LW link

Low-Friction MBTA Predictions

jefftk 28 Jun 2022 12:30 UTC
15 points
0 comments 1 min read LW link
(www.jefftk.com)

What Diet Books Don’t Teach: A book review and a request for more reading

Lone Pine 28 Jun 2022 12:27 UTC
22 points
34 comments 4 min read LW link

Assessing AlephAlphas Multimodal Model

p.b. 28 Jun 2022 9:28 UTC
30 points
5 comments 3 min read LW link

[Question] Is there any way someone could post about public policy relating to abortion access (or another sensitive subject) on LessWrong without getting super downvoted?

Evan_Gaensbauer 28 Jun 2022 5:45 UTC
18 points
20 comments 1 min read LW link

[Test Post Please Ignore] Testing polling features

Lone Pine 28 Jun 2022 4:35 UTC
7 points
5 comments 1 min read LW link

Yann LeCun, A Path Towards Autonomous Machine Intelligence [link]

Bill Benzon 27 Jun 2022 23:29 UTC
5 points
1 comment 1 min read LW link

Limits of Bodily Autonomy

jefftk 27 Jun 2022 19:50 UTC
28 points
18 comments 1 min read LW link
(www.jefftk.com)

[Question] Systems Biology for self study

Ulisse Mini 27 Jun 2022 19:36 UTC
5 points
2 comments 1 min read LW link

[Yann Lecun] A Path Towards Autonomous Machine Intelligence

DragonGod 27 Jun 2022 19:24 UTC
38 points
13 comments 1 min read LW link
(openreview.net)

Exploring Mild Behaviour in Embedded Agents

Megan Kinniment 27 Jun 2022 18:56 UTC
21 points
4 comments 18 min read LW link

Epistemic modesty and how I think about AI risk

Aryeh Englander 27 Jun 2022 18:47 UTC
22 points
4 comments 4 min read LW link

Deliberation Everywhere: Simple Examples

Oliver Sourbut 27 Jun 2022 17:26 UTC
27 points
3 comments 15 min read LW link

Deliberation, Reactions, and Control: Tentative Definitions and a Restatement of Instrumental Convergence

Oliver Sourbut 27 Jun 2022 17:25 UTC
11 points
0 comments 11 min read LW link

[Question] Are long-form dating profiles productive?

AABoyles 27 Jun 2022 17:03 UTC
34 points
32 comments 1 min read LW link

Custom iPhone Widget to Encourage Less Wrong Use

Will Payne 27 Jun 2022 16:14 UTC
10 points
2 comments 2 min read LW link
(forum.effectivealtruism.org)

Announcing the Inverse Scaling Prize ($250k Prize Pool)

27 Jun 2022 15:58 UTC
169 points
14 comments 7 min read LW link

Announcing Epoch: A research organization investigating the road to Transformative AI

27 Jun 2022 13:55 UTC
97 points
2 comments 2 min read LW link
(epochai.org)

Air Conditioner Repair

Zvi 27 Jun 2022 12:40 UTC
81 points
34 comments 4 min read LW link
(thezvi.wordpress.com)

[Question] Why Are Posts in the Sequences Tagged [Personal Blog] Instead of [Frontpage]?

DragonGod 27 Jun 2022 9:35 UTC
5 points
2 comments 1 min read LW link

Contest: An Alien Message

DaemonicSigil 27 Jun 2022 5:54 UTC
95 points
100 comments 1 min read LW link

Robin Hanson asks “Why Not Wait On AI Risk?”

Gunnar_Zarncke 26 Jun 2022 23:32 UTC
22 points
4 comments 1 min read LW link
(www.overcomingbias.com)

Sex Fairy Lore

pchvykov 26 Jun 2022 20:42 UTC
−26 points
10 comments 6 min read LW link

King David’s %: Establishing a new symbol for Bayesian probability.

Paul Logan 26 Jun 2022 19:47 UTC
−11 points
1 comment 5 min read LW link
(laulpogan.substack.com)

Do You Care Whether There Are “Successful” Rationalists?

UtilityMonster 26 Jun 2022 18:53 UTC
12 points
8 comments 7 min read LW link