Embed­ded Agency via Abstraction

johnswentworth26 Aug 2019 23:03 UTC
40 points
20 comments11 min readLW link

Rev­ersible changes: con­sider a bucket of water

Stuart_Armstrong26 Aug 2019 22:55 UTC
25 points
18 comments2 min readLW link

Toy model piece #3: close and dis­tant situations

Stuart_Armstrong26 Aug 2019 22:41 UTC
10 points
0 comments1 min readLW link

[Question] How do you learn for­eign lan­guage vo­cab­u­lary, be­yond Anki?

Elizabeth26 Aug 2019 21:00 UTC
9 points
21 comments1 min readLW link

[Question] How Can Peo­ple Eval­u­ate Com­plex Ques­tions Con­sis­tently?

Elizabeth26 Aug 2019 20:33 UTC
46 points
12 comments1 min readLW link

Prob­lems with AI debate

Stuart_Armstrong26 Aug 2019 19:21 UTC
21 points
3 comments5 min readLW link

Schel­ling Cat­e­gories, and Sim­ple Mem­ber­ship Tests

Zack_M_Davis26 Aug 2019 2:43 UTC
58 points
10 comments7 min readLW link

Limits of and to (ar­tifi­cial) Intelligence

MoritzG25 Aug 2019 22:16 UTC
1 point
3 comments7 min readLW link

Grat­ifi­ca­tion: a use­ful con­cept, maybe new

Stuart_Armstrong25 Aug 2019 18:58 UTC
17 points
7 comments3 min readLW link

Un­der a week left to win $1,000! By ques­tion­ing Or­a­cle AIs.

Stuart_Armstrong25 Aug 2019 17:02 UTC
12 points
2 comments1 min readLW link

[Question] I’m in­ter­ested in a sub-field of AI but don’t know what to call it.

fowlertm25 Aug 2019 14:55 UTC
9 points
4 comments1 min readLW link

[Question] Am I go­ing for a job in­ter­view with a woo pusher?

CronoDAS25 Aug 2019 14:39 UTC
6 points
7 comments1 min readLW link

OpenPhil on “GiveWell’s Top Char­i­ties Are (In­creas­ingly) Hard to Beat”

Raemon24 Aug 2019 23:28 UTC
17 points
0 comments6 min readLW link
(www.openphilanthropy.org)

Epistemic Spot Check: The Fate of Rome (Kyle Harper)

Elizabeth24 Aug 2019 21:40 UTC
39 points
3 comments5 min readLW link
(acesounderglass.com)

[Question] Perfor­mance IQ and higher mathematics

c5pi24 Aug 2019 17:31 UTC
4 points
5 comments1 min readLW link

[Question] how should a sec­ond ver­sion of “ra­tio­nal­ity: A to Z” look like?

Yoav Ravid24 Aug 2019 7:01 UTC
6 points
4 comments1 min readLW link

Petrov Day Cel­e­bra­tion 2019 - Oxford Campsite

jbeshir24 Aug 2019 3:42 UTC
8 points
1 comment1 min readLW link

[Question] How has ra­tio­nal­ism helped you?

Sunny from QAD24 Aug 2019 1:31 UTC
9 points
11 comments1 min readLW link

[Question] Is LW mak­ing progress?

zulupineapple24 Aug 2019 0:32 UTC
21 points
11 comments1 min readLW link

LessLong Launch Party

Raemon23 Aug 2019 22:18 UTC
12 points
1 comment1 min readLW link

[Question] Is there a sim­ple pa­ram­e­ter that con­trols hu­man work­ing mem­ory ca­pac­ity, which has been set trag­i­cally low?

Liron23 Aug 2019 22:10 UTC
17 points
8 comments1 min readLW link

Op­ti­miza­tion Provenance

Adele Lopez23 Aug 2019 20:08 UTC
38 points
5 comments5 min readLW link

Troll Bridge

abramdemski23 Aug 2019 18:36 UTC
79 points
58 comments12 min readLW link

Un­der­stand­ing understanding

mthq23 Aug 2019 18:10 UTC
24 points
1 comment2 min readLW link

Ac­tu­ally updating

SaraHax23 Aug 2019 17:46 UTC
54 points
10 comments4 min readLW link

When do util­ity func­tions con­strain?

Hoagy23 Aug 2019 17:19 UTC
29 points
7 comments7 min readLW link

Parables of Con­straint and Ac­tu­al­iza­tion

Spencer Wyman23 Aug 2019 16:56 UTC
13 points
0 comments6 min readLW link

Thoughts on Retriev­ing Knowl­edge from Neu­ral Networks

Jaime Ruiz23 Aug 2019 16:41 UTC
11 points
2 comments5 min readLW link

Al­gorith­mic Similarity

LukasM23 Aug 2019 16:39 UTC
27 points
10 comments11 min readLW link

Soft take­off can still lead to de­ci­sive strate­gic advantage

Daniel Kokotajlo23 Aug 2019 16:39 UTC
122 points
47 comments8 min readLW link4 reviews

Moscow LW meetup in “Nauchka” library

Alexander23023 Aug 2019 12:40 UTC
3 points
0 comments1 min readLW link

OpenGPT-2: We Repli­cated GPT-2 Be­cause You Can Too

avturchin23 Aug 2019 11:32 UTC
18 points
0 comments1 min readLW link
(medium.com)

Tor­ture and Dust Specks and Joy—Oh my! or: Non-Archimedean Utility Func­tions as Pseu­do­graded Vec­tor Spaces

Louis_Brown23 Aug 2019 11:11 UTC
19 points
29 comments8 min readLW link

Me­tal­ign­ment: De­con­fus­ing metaethics for AI al­ign­ment.

Guillaume Corlouer23 Aug 2019 10:25 UTC
13 points
7 comments3 min readLW link

[Question] A ba­sic prob­a­bil­ity question

shminux23 Aug 2019 7:13 UTC
11 points
3 comments1 min readLW link

Towards an In­ten­tional Re­search Agenda

romeostevensit23 Aug 2019 5:27 UTC
20 points
8 comments3 min readLW link

[Question] Why are peo­ple so op­ti­mistic about su­per­in­tel­li­gence?

bipolo23 Aug 2019 4:25 UTC
6 points
3 comments1 min readLW link

Vague Thoughts and Ques­tions about Agent Structures

loriphos23 Aug 2019 4:01 UTC
9 points
3 comments2 min readLW link

For­mal­is­ing de­ci­sion the­ory is hard

Lukas Finnveden23 Aug 2019 3:27 UTC
17 points
19 comments2 min readLW link

Creat­ing En­vi­ron­ments to De­sign and Test Embed­ded Agents

lukehmiles23 Aug 2019 3:17 UTC
13 points
5 comments8 min readLW link

Ta­boo­ing ‘Agent’ for Pro­saic Alignment

Hjalmar_Wijk23 Aug 2019 2:55 UTC
57 points
10 comments6 min readLW link

Vaniver’s View on Fac­tored Cognition

Vaniver23 Aug 2019 2:54 UTC
48 points
4 comments8 min readLW link

Redefin­ing Fast Takeoff

VojtaKovarik23 Aug 2019 2:15 UTC
10 points
1 comment1 min readLW link

[Question] Does Agent-like Be­hav­ior Im­ply Agent-like Ar­chi­tec­ture?

Scott Garrabrant23 Aug 2019 2:01 UTC
57 points
8 comments1 min readLW link

The Com­mit­ment Races problem

Daniel Kokotajlo23 Aug 2019 1:58 UTC
151 points
56 comments5 min readLW link

Anal­y­sis of a Se­cret Hitler Scenario

jaek23 Aug 2019 1:24 UTC
16 points
6 comments4 min readLW link

Thoughts from a Two Boxer

jaek23 Aug 2019 0:24 UTC
18 points
11 comments5 min readLW link

De­con­fuse Your­self about Agency

VojtaKovarik23 Aug 2019 0:21 UTC
15 points
9 comments5 min readLW link

Log­i­cal Op­ti­miz­ers

Donald Hobson22 Aug 2019 23:54 UTC
11 points
4 comments3 min readLW link

Towards a mechanis­tic un­der­stand­ing of corrigibility

evhub22 Aug 2019 23:20 UTC
47 points
26 comments6 min readLW link