RSS

Re­search Agendas

TagLast edit: 13 Apr 2021 13:48 UTC by [Error communicating with LW2 server]

The Learn­ing-The­o­retic AI Align­ment Re­search Agenda

Vanessa Kosoy4 Jul 2018 9:53 UTC
47 points
37 comments32 min readLW link

New safety re­search agenda: scal­able agent al­ign­ment via re­ward modeling

Vika20 Nov 2018 17:29 UTC
34 points
13 comments1 min readLW link
(medium.com)

Re­search Agenda v0.9: Syn­the­sis­ing a hu­man’s prefer­ences into a util­ity function

Stuart_Armstrong17 Jun 2019 17:46 UTC
63 points
20 comments33 min readLW link

Paul’s re­search agenda FAQ

zhukeepa1 Jul 2018 6:25 UTC
108 points
69 comments19 min readLW link

AI Gover­nance: A Re­search Agenda

habryka5 Sep 2018 18:00 UTC
25 points
3 comments1 min readLW link
(www.fhi.ox.ac.uk)

Embed­ded Agents

29 Oct 2018 19:53 UTC
185 points
41 comments1 min readLW link

Our take on CHAI’s re­search agenda in un­der 1500 words

alexflint17 Jun 2020 12:24 UTC
95 points
19 comments5 min readLW link

De­con­fus­ing Hu­man Values Re­search Agenda v1

G Gordon Worley III23 Mar 2020 16:25 UTC
18 points
12 comments4 min readLW link

Thoughts on Hu­man Models

21 Feb 2019 9:10 UTC
111 points
31 comments10 min readLW link2 nominations1 review

MIRI’s tech­ni­cal re­search agenda

So8res23 Dec 2014 18:45 UTC
54 points
52 comments3 min readLW link

Pre­face to CLR’s Re­search Agenda on Co­op­er­a­tion, Con­flict, and TAI

JesseClifton13 Dec 2019 21:02 UTC
54 points
8 comments2 min readLW link

Us­ing GPT-N to Solve In­ter­pretabil­ity of Neu­ral Net­works: A Re­search Agenda

3 Sep 2020 18:27 UTC
60 points
11 comments2 min readLW link

Ul­tra-sim­plified re­search agenda

Stuart_Armstrong22 Nov 2019 14:29 UTC
34 points
4 comments1 min readLW link

Embed­ded Agency (full-text ver­sion)

15 Nov 2018 19:49 UTC
114 points
11 comments54 min readLW link

Embed­ded Curiosities

8 Nov 2018 14:19 UTC
85 points
1 comment2 min readLW link

Sub­sys­tem Alignment

6 Nov 2018 16:16 UTC
99 points
12 comments1 min readLW link

Ro­bust Delegation

4 Nov 2018 16:38 UTC
108 points
10 comments1 min readLW link

Embed­ded World-Models

2 Nov 2018 16:07 UTC
85 points
16 comments1 min readLW link

De­ci­sion Theory

31 Oct 2018 18:41 UTC
105 points
38 comments1 min readLW link

Sec­tions 1 & 2: In­tro­duc­tion, Strat­egy and Governance

JesseClifton17 Dec 2019 21:27 UTC
33 points
5 comments14 min readLW link

Sec­tions 3 & 4: Cred­i­bil­ity, Peace­ful Bar­gain­ing Mechanisms

JesseClifton17 Dec 2019 21:46 UTC
19 points
2 comments12 min readLW link

Sec­tions 5 & 6: Con­tem­po­rary Ar­chi­tec­tures, Hu­mans in the Loop

JesseClifton20 Dec 2019 3:52 UTC
27 points
4 comments10 min readLW link

Sec­tion 7: Foun­da­tions of Ra­tional Agency

JesseClifton22 Dec 2019 2:05 UTC
14 points
3 comments8 min readLW link

Ac­knowl­edge­ments & References

JesseClifton14 Dec 2019 7:04 UTC
6 points
0 comments14 min readLW link

Align­ment pro­pos­als and com­plex­ity classes

evhub16 Jul 2020 0:27 UTC
31 points
26 comments13 min readLW link

The Good­hart Game

John_Maxwell18 Nov 2019 23:22 UTC
13 points
5 comments5 min readLW link

Re­sources for AI Align­ment Cartography

Gyrodiot4 Apr 2020 14:20 UTC
40 points
8 comments9 min readLW link

In­tro­duc­ing the Longevity Re­search Institute

sarahconstantin8 May 2018 3:30 UTC
53 points
20 comments1 min readLW link
(srconstantin.wordpress.com)

An­nounce­ment: AI al­ign­ment prize round 3 win­ners and next round

cousin_it15 Jul 2018 7:40 UTC
93 points
7 comments1 min readLW link

Ma­chine Learn­ing Pro­jects on IDA

24 Jun 2019 18:38 UTC
49 points
3 comments2 min readLW link

AI Align­ment Re­search Overview (by Ja­cob Stein­hardt)

Ben Pace6 Nov 2019 19:24 UTC
42 points
0 comments7 min readLW link
(docs.google.com)

Creat­ing Welfare Biol­ogy: A Re­search Proposal

ozymandias16 Nov 2017 19:06 UTC
20 points
5 comments4 min readLW link

Re­search Agenda in re­verse: what *would* a solu­tion look like?

Stuart_Armstrong25 Jun 2019 13:52 UTC
34 points
25 comments1 min readLW link

Fore­cast­ing AI Progress: A Re­search Agenda

10 Aug 2020 1:04 UTC
39 points
4 comments1 min readLW link

Tech­ni­cal AGI safety re­search out­side AI

Richard_Ngo18 Oct 2019 15:00 UTC
42 points
3 comments3 min readLW link

Why I am not cur­rently work­ing on the AAMLS agenda

jessicata1 Jun 2017 17:57 UTC
27 points
0 comments5 min readLW link

Which of these five AI al­ign­ment re­search pro­jects ideas are no good?

rmoehn8 Aug 2019 7:17 UTC
25 points
13 comments1 min readLW link

Re­search is polyg­a­mous! The im­por­tance of what you do needn’t be pro­por­tional to your awe­some­ness

diegocaleiro26 May 2013 22:29 UTC
35 points
43 comments2 min readLW link

Fund­ing Good Research

lukeprog27 May 2012 6:41 UTC
38 points
44 comments2 min readLW link

Please voice your sup­port for stem cell research

zaph22 May 2009 18:45 UTC
−5 points
4 comments1 min readLW link

Notes on effec­tive-al­tru­ism-re­lated re­search, writ­ing, test­ing fit, learn­ing, and the EA Forum

MichaelA28 Mar 2021 23:43 UTC
14 points
0 comments4 min readLW link
No comments.