Pos­i­tive val­ues seem more ro­bust and last­ing than prohibitions

TurnTrout17 Dec 2022 21:43 UTC
51 points
13 comments2 min readLW link

What we owe the microbiome

weverka17 Dec 2022 19:40 UTC
2 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Why write more: im­prove your epistemics, self-care, & 28 other reasons

KatWoods17 Dec 2022 19:25 UTC
22 points
1 comment6 min readLW link

Look­ing for an al­ign­ment tutor

JanB17 Dec 2022 19:08 UTC
15 points
2 comments1 min readLW link

[Question] How to Con­vince my Son that Drugs are Bad

concerned_dad17 Dec 2022 18:47 UTC
139 points
84 comments2 min readLW link

Or­di­nary hu­man life

David Hugh-Jones17 Dec 2022 16:46 UTC
24 points
1 comment14 min readLW link
(wyclif.substack.com)

Pre­dic­tive Pro­cess­ing, Hetero­sex­u­al­ity and Delu­sions of Grandeur

lsusr17 Dec 2022 7:37 UTC
36 points
12 comments5 min readLW link

[Link] Es­cape the Echo Cham­ber (2018)

CronoDAS17 Dec 2022 6:14 UTC
13 points
0 comments2 min readLW link
(aeon.co)

“Starry Night” Sols­tice Cookies

maia17 Dec 2022 5:31 UTC
17 points
0 comments1 min readLW link

There have been 3 planes (billion­aire donors) and 2 have crashed

trevor17 Dec 2022 3:58 UTC
16 points
10 comments2 min readLW link

[Question] What about non-de­gree seek­ing?

Lao Mein17 Dec 2022 2:22 UTC
5 points
5 comments1 min readLW link

Us­ing In­for­ma­tion The­ory to tackle AI Align­ment: A Prac­ti­cal Approach

Daniel Salami17 Dec 2022 1:37 UTC
10 points
4 comments7 min readLW link

Paper: Con­sti­tu­tional AI: Harm­less­ness from AI Feed­back (An­thropic)

LawrenceC16 Dec 2022 22:12 UTC
68 points
11 comments1 min readLW link
(www.anthropic.com)

Vaguely in­ter­ested in Effec­tive Altru­ism? Please Take the Offi­cial 2022 EA Survey

Peter Wildeford16 Dec 2022 21:07 UTC
22 points
4 comments1 min readLW link
(rethinkpriorities.qualtrics.com)

Ab­stract con­cepts and met­al­in­gual defi­ni­tion: Does ChatGPT un­der­stand jus­tice and char­ity?

Bill Benzon16 Dec 2022 21:01 UTC
2 points
0 comments13 min readLW link

Beyond the mo­ment of invention

jasoncrawford16 Dec 2022 20:18 UTC
35 points
0 comments2 min readLW link
(rootsofprogress.org)

[Question] What’s the best time-effi­cient al­ter­na­tive to the Se­quences?

trevor16 Dec 2022 20:17 UTC
6 points
7 comments1 min readLW link

Can we effi­ciently ex­plain model be­hav­iors?

paulfchristiano16 Dec 2022 19:40 UTC
64 points
3 comments9 min readLW link
(ai-alignment.com)

Proper scor­ing rules don’t guaran­tee pre­dict­ing fixed points

16 Dec 2022 18:22 UTC
68 points
8 comments21 min readLW link

A learned agent is not the same as a learn­ing agent

Ben Amitay16 Dec 2022 17:27 UTC
4 points
5 comments4 min readLW link

[Question] Col­lege Selec­tion Ad­vice for Tech­ni­cal Alignment

TempCollegeAsk16 Dec 2022 17:11 UTC
11 points
8 comments1 min readLW link

How im­por­tant are ac­cu­rate AI timelines for the op­ti­mal spend­ing sched­ule on AI risk in­ter­ven­tions?

Tristan Cook16 Dec 2022 16:05 UTC
27 points
2 comments1 min readLW link

In­tro­duc­ing Shrubgrazer

jefftk16 Dec 2022 14:50 UTC
22 points
0 comments2 min readLW link
(www.jefftk.com)

Paper: Trans­form­ers learn in-con­text by gra­di­ent descent

LawrenceC16 Dec 2022 11:10 UTC
28 points
11 comments2 min readLW link
(arxiv.org)

Will Machines Ever Rule the World? MLAISU W50

Esben Kran16 Dec 2022 11:03 UTC
12 points
7 comments4 min readLW link
(newsletter.apartresearch.com)

AI over­hangs de­pend on whether al­gorithms, com­pute and data are sub­sti­tutes or complements

NathanBarnard16 Dec 2022 2:23 UTC
2 points
0 comments3 min readLW link

AI Safety Move­ment Builders should help the com­mu­nity to op­ti­mise three fac­tors: con­trib­u­tors, con­tri­bu­tions and coordination

peterslattery15 Dec 2022 22:50 UTC
4 points
0 comments6 min readLW link

Mask­ing to Avoid Miss­ing Things

jefftk15 Dec 2022 21:00 UTC
17 points
2 comments1 min readLW link
(www.jefftk.com)

Con­sider work­ing more hours and tak­ing more stimulants

Arjun Panickssery15 Dec 2022 20:38 UTC
36 points
11 comments1 min readLW link

We’ve stepped over the thresh­old into the Fourth Arena, but don’t rec­og­nize it

Bill Benzon15 Dec 2022 20:22 UTC
2 points
0 comments7 min readLW link

[Question] How is ARC plan­ning to use ELK?

jacquesthibs15 Dec 2022 20:11 UTC
24 points
5 comments1 min readLW link

How “Dis­cov­er­ing La­tent Knowl­edge in Lan­guage Models Without Su­per­vi­sion” Fits Into a Broader Align­ment Scheme

Collin15 Dec 2022 18:22 UTC
243 points
39 comments16 min readLW link1 review

High-level hopes for AI alignment

HoldenKarnofsky15 Dec 2022 18:00 UTC
58 points
3 comments19 min readLW link
(www.cold-takes.com)

Two Dog­mas of LessWrong

omnizoid15 Dec 2022 17:56 UTC
−6 points
155 comments69 min readLW link

Covid 12/​15/​22: China’s Wave Begins

Zvi15 Dec 2022 16:20 UTC
32 points
7 comments10 min readLW link
(thezvi.wordpress.com)

The next decades might be wild

Marius Hobbhahn15 Dec 2022 16:10 UTC
175 points
42 comments41 min readLW link1 review

Ba­sic build­ing blocks of de­pen­dent type theory

Thomas Kehrenberg15 Dec 2022 14:54 UTC
47 points
8 comments13 min readLW link

AI Ne­o­re­al­ism: a threat model & suc­cess crite­rion for ex­is­ten­tial safety

davidad15 Dec 2022 13:42 UTC
64 points
1 comment3 min readLW link

Who should write the defini­tive post on Ziz?

NicholasKross15 Dec 2022 6:37 UTC
3 points
45 comments3 min readLW link

[Question] Is Paul Chris­ti­ano still as op­ti­mistic about Ap­proval-Directed Agents as he was in 2018?

Chris_Leong14 Dec 2022 23:28 UTC
8 points
0 comments1 min readLW link

«Boundaries», Part 3b: Align­ment prob­lems in terms of bound­aries

Andrew_Critch14 Dec 2022 22:34 UTC
72 points
7 comments13 min readLW link

Align­ing al­ign­ment with performance

Marv K14 Dec 2022 22:19 UTC
2 points
0 comments2 min readLW link

Con­trary to List of Lethal­ity’s point 22, al­ign­ment’s door num­ber 2

False Name14 Dec 2022 22:01 UTC
−2 points
5 comments22 min readLW link

Kol­mogorov Com­plex­ity and Si­mu­la­tion Hypothesis

False Name14 Dec 2022 22:01 UTC
−3 points
0 comments7 min readLW link

[Question] Stan­ley Meyer’s wa­ter fuel cell

mikbp14 Dec 2022 21:19 UTC
2 points
6 comments1 min readLW link

all claw, no world — and other thoughts on the uni­ver­sal distribution

Tamsin Leake14 Dec 2022 18:55 UTC
15 points
0 comments7 min readLW link
(carado.moe)

[Question] Is the AI timeline too short to have chil­dren?

Yoreth14 Dec 2022 18:32 UTC
38 points
20 comments1 min readLW link

Pre­dict­ing GPU performance

14 Dec 2022 16:27 UTC
60 points
26 comments1 min readLW link
(epochai.org)

[In­com­plete] What is Com­pu­ta­tion Any­way?

DragonGod14 Dec 2022 16:17 UTC
16 points
1 comment13 min readLW link
(arxiv.org)

Chair Hang­ing Peg

jefftk14 Dec 2022 15:30 UTC
11 points
0 comments1 min readLW link
(www.jefftk.com)