RSS

Fac­tored Cognition

TagLast edit: 16 Apr 2020 0:11 UTC by Ruby

Factored cognition is an approach to artificial intelligence where sophisticated learning and reasoning is broken down (or factored) into many small and mostly independent tasks [1].

Factored Cognition is related to Iterated Distillation and Amplification (IDA).

Fac­tored Cognition

stuhlmueller5 Dec 2018 1:01 UTC
45 points
6 comments17 min readLW link

Align­ment Newslet­ter #36

Rohin Shah12 Dec 2018 1:10 UTC
21 points
0 comments11 min readLW link
(mailchi.mp)

Vaniver’s View on Fac­tored Cognition

Vaniver23 Aug 2019 2:54 UTC
48 points
4 comments8 min readLW link

[Question] What are the differ­ences be­tween all the iter­a­tive/​re­cur­sive ap­proaches to AI al­ign­ment?

riceissa21 Sep 2019 2:09 UTC
30 points
14 comments2 min readLW link

[Part 2] Am­plify­ing gen­er­al­ist re­search via fore­cast­ing – re­sults from a pre­limi­nary exploration

19 Dec 2019 15:49 UTC
62 points
10 comments14 min readLW link1 review

[Part 1] Am­plify­ing gen­er­al­ist re­search via fore­cast­ing – Models of im­pact and challenges

19 Dec 2019 15:50 UTC
55 points
29 comments17 min readLW link

Up­date on Ought’s ex­per­i­ments on fac­tored eval­u­a­tion of arguments

Owain_Evans12 Jan 2020 21:20 UTC
29 points
1 comment1 min readLW link
(ought.org)

Wri­teup: Progress on AI Safety via Debate

5 Feb 2020 21:04 UTC
100 points
18 comments33 min readLW link

Re­search ideas to study hu­mans with AI Safety in mind

Riccardo Volpato3 Jul 2020 16:01 UTC
23 points
2 comments5 min readLW link

Solv­ing Math Prob­lems by Relay

17 Jul 2020 15:32 UTC
103 points
26 comments7 min readLW link

A guide to Iter­ated Am­plifi­ca­tion & Debate

Rafael Harth15 Nov 2020 17:14 UTC
75 points
12 comments15 min readLW link

Hid­ing Complexity

Rafael Harth20 Nov 2020 16:35 UTC
29 points
14 comments7 min readLW link

Pre­face to the Se­quence on Fac­tored Cognition

Rafael Harth30 Nov 2020 18:49 UTC
35 points
6 comments2 min readLW link

Ideal­ized Fac­tored Cognition

Rafael Harth30 Nov 2020 18:49 UTC
34 points
6 comments11 min readLW link

Travers­ing a Cog­ni­tion Space

Rafael Harth7 Dec 2020 18:32 UTC
17 points
5 comments12 min readLW link

Clar­ify­ing Fac­tored Cognition

Rafael Harth13 Dec 2020 20:02 UTC
23 points
2 comments3 min readLW link

Intuition

Rafael Harth20 Dec 2020 21:49 UTC
26 points
1 comment6 min readLW link

De­bate Minus Fac­tored Cognition

abramdemski29 Dec 2020 22:59 UTC
37 points
42 comments11 min readLW link

FC fi­nal: Can Fac­tored Cog­ni­tion schemes scale?

Rafael Harth24 Jan 2021 22:18 UTC
17 points
0 comments17 min readLW link

Garrabrant and Shah on hu­man mod­el­ing in AGI

Rob Bensinger4 Aug 2021 4:35 UTC
60 points
10 comments47 min readLW link

Su­per­vise Pro­cess, not Outcomes

5 Apr 2022 22:18 UTC
134 points
9 comments10 min readLW link

Rant on Prob­lem Fac­tor­iza­tion for Alignment

johnswentworth5 Aug 2022 19:23 UTC
90 points
51 comments6 min readLW link

A De­cep­tively Sim­ple Ar­gu­ment in fa­vor of Prob­lem Factorization

Logan Zoellner6 Aug 2022 17:32 UTC
3 points
4 comments1 min readLW link

Ought will host a fac­tored cog­ni­tion “Lab Meet­ing”

9 Sep 2022 23:46 UTC
35 points
1 comment1 min readLW link

A Library and Tu­to­rial for Fac­tored Cog­ni­tion with Lan­guage Models

28 Sep 2022 18:15 UTC
47 points
0 comments1 min readLW link

Align­ment with ar­gu­ment-net­works and as­sess­ment-predictions

Tor Økland Barstad13 Dec 2022 2:17 UTC
10 points
5 comments45 min readLW link

Sta­tus conscious

avantika.mehra16 Jan 2023 17:44 UTC
2 points
0 comments5 min readLW link

Thoughts on “Pro­cess-Based Su­per­vi­sion”

Steven Byrnes17 Jul 2023 14:08 UTC
74 points
4 comments23 min readLW link

Mea­sur­ing and Im­prov­ing the Faith­ful­ness of Model-Gen­er­ated Rea­son­ing

18 Jul 2023 16:36 UTC
109 points
13 comments6 min readLW link

AIS 101: Task de­com­po­si­tion for scal­able oversight

Charbel-Raphaël25 Jul 2023 13:34 UTC
27 points
0 comments19 min readLW link
(docs.google.com)

Crit­i­cal re­view of Chris­ti­ano’s dis­agree­ments with Yudkowsky

Vanessa Kosoy27 Dec 2023 16:02 UTC
172 points
40 comments15 min readLW link
No comments.