RSS

Apart Research

TagLast edit: 22 Oct 2022 1:49 UTC by habryka

Apart Research is a research organization working on AI Alignment. This tag includes posts written by Apart researchers, and content about Apart Research.

Re­sults from the in­ter­pretabil­ity hackathon

17 Nov 2022 14:51 UTC
81 points
0 comments6 min readLW link
(alignmentjam.com)

Black Box In­ves­ti­ga­tion Re­search Hackathon

12 Sep 2022 7:20 UTC
9 points
4 comments2 min readLW link

Newslet­ter for Align­ment Re­search: The ML Safety Updates

Esben Kran22 Oct 2022 16:17 UTC
16 points
0 comments1 min readLW link

AI & ML Safety Up­dates W43

28 Oct 2022 13:18 UTC
9 points
3 comments3 min readLW link

AI Safety Ideas: An Open AI Safety Re­search Platform

Esben Kran17 Oct 2022 17:01 UTC
24 points
0 comments1 min readLW link

Safety timelines: How long will it take to solve al­ign­ment?

19 Sep 2022 12:53 UTC
36 points
7 comments6 min readLW link
(forum.effectivealtruism.org)

Re­sults from the lan­guage model hackathon

Esben Kran10 Oct 2022 8:29 UTC
22 points
1 comment4 min readLW link

Join the in­ter­pretabil­ity re­search hackathon

Esben Kran28 Oct 2022 16:26 UTC
15 points
0 comments1 min readLW link

[Book] In­ter­pretable Ma­chine Learn­ing: A Guide for Mak­ing Black Box Models Explainable

Esben Kran31 Oct 2022 11:38 UTC
19 points
1 comment1 min readLW link
(christophm.github.io)

Can we pre­dict the abil­ities of fu­ture AI? MLAISU W44

4 Nov 2022 15:19 UTC
10 points
0 comments3 min readLW link
(newsletter.apartresearch.com)

Are fund­ing op­tions for AI Safety threat­ened? W45

11 Nov 2022 13:00 UTC
7 points
0 comments3 min readLW link
(newsletter.apartresearch.com)

How Should AIS Re­late To Its Fun­ders? W46

21 Nov 2022 15:58 UTC
6 points
1 comment3 min readLW link
(newsletter.apartresearch.com)

NeurIPS Safety & ChatGPT. MLAISU W48

2 Dec 2022 15:50 UTC
3 points
0 comments4 min readLW link
(newsletter.apartresearch.com)

ML Safety at NeurIPS & Paradig­matic AI Safety? MLAISU W49

9 Dec 2022 10:38 UTC
19 points
0 comments4 min readLW link
(newsletter.apartresearch.com)

Join the AI Test­ing Hackathon this Friday

Esben Kran12 Dec 2022 14:24 UTC
10 points
0 comments1 min readLW link

Will Machines Ever Rule the World? MLAISU W50

Esben Kran16 Dec 2022 11:03 UTC
12 points
7 comments4 min readLW link
(newsletter.apartresearch.com)

AI im­prov­ing AI [MLAISU W01!]

Esben Kran6 Jan 2023 11:13 UTC
5 points
0 comments4 min readLW link
(newsletter.apartresearch.com)

Ro­bust­ness & Evolu­tion [MLAISU W02]

Esben Kran13 Jan 2023 15:47 UTC
10 points
0 comments3 min readLW link
(newsletter.apartresearch.com)

Gen­er­al­iz­abil­ity & Hope for AI [MLAISU W03]

Esben Kran20 Jan 2023 10:06 UTC
5 points
2 comments2 min readLW link
(newsletter.apartresearch.com)

De­cen­tral­ized Re­search & ChatGPT [MLAISU W04]

Esben Kran27 Jan 2023 13:55 UTC
3 points
0 comments3 min readLW link
(newsletter.apartresearch.com)

Over­sight and AI risk [MLAISU W05]

6 Feb 2023 12:26 UTC
4 points
0 comments2 min readLW link
(newsletter.apartresearch.com)

Arms Race Dy­nam­ics [MLAISU W06]

10 Feb 2023 13:20 UTC
5 points
0 comments3 min readLW link
(newsletter.apartresearch.com)

We Found An Neu­ron in GPT-2

11 Feb 2023 18:27 UTC
136 points
21 comments7 min readLW link
(clementneo.com)

Bing mis­al­ign­ment [MLAISU W07]

21 Feb 2023 9:02 UTC
5 points
0 comments3 min readLW link
(newsletter.apartresearch.com)

Au­to­mated Sand­wich­ing & Quan­tify­ing Hu­man-LLM Co­op­er­a­tion: ScaleOver­sight hackathon results

23 Feb 2023 10:48 UTC
8 points
0 comments6 min readLW link

RL In­ter­pretabil­ity & Per­spec­tives on AI Safety [MLAISU W08-09]

6 Mar 2023 23:52 UTC
5 points
0 comments4 min readLW link

GPT-4 & Ja­panese Align­ment [MLAISU W10]

15 Mar 2023 18:07 UTC
7 points
0 comments4 min readLW link
(newsletter.apartresearch.com)
No comments.