RSS

DragonGod

Karma: 2,385

Theoretical Computer Science Msc student at the University of [Redacted] in the United Kingdom.

I’m an aspiring alignment theorist; my research vibes are descriptive formal theories of intelligent systems (and their safety properties) with a bias towards constructive theories.

I think it’s important that our theories of intelligent systems remain rooted in the characteristics of real world intelligent systems; we cannot develop adequate theory from the null string as input.

“Hereti­cal Thoughts on AI” by Eli Dourado

DragonGod19 Jan 2023 16:11 UTC
145 points
38 comments3 min readLW link
(www.elidourado.com)

[Question] Why The Fo­cus on Ex­pected Utility Max­imisers?

DragonGod27 Dec 2022 15:49 UTC
116 points
84 comments3 min readLW link

Sparks of Ar­tifi­cial Gen­eral In­tel­li­gence: Early ex­per­i­ments with GPT-4 | Microsoft Research

DragonGod23 Mar 2023 5:45 UTC
68 points
23 comments1 min readLW link
(arxiv.org)

Tracr: Com­piled Trans­form­ers as a Lab­o­ra­tory for In­ter­pretabil­ity | Deep­Mind

DragonGod13 Jan 2023 16:53 UTC
62 points
12 comments1 min readLW link
(arxiv.org)

[1911.08265] Mas­ter­ing Atari, Go, Chess and Shogi by Plan­ning with a Learned Model | Arxiv

DragonGod21 Nov 2019 1:18 UTC
52 points
4 comments1 min readLW link
(arxiv.org)

Beren’s “De­con­fus­ing Direct vs Amor­tised Op­ti­mi­sa­tion”

DragonGod7 Apr 2023 8:57 UTC
51 points
10 comments3 min readLW link

The Limit of Lan­guage Models

DragonGod6 Jan 2023 23:53 UTC
43 points
26 comments4 min readLW link

Con­tra “Strong Co­her­ence”

DragonGod4 Mar 2023 20:05 UTC
39 points
24 comments1 min readLW link

[Question] Is In­struc­tGPT Fol­low­ing In­struc­tions in Other Lan­guages Sur­pris­ing?

DragonGod13 Feb 2023 23:26 UTC
39 points
15 comments1 min readLW link

“Dangers of AI and the End of Hu­man Civ­i­liza­tion” Yud­kowsky on Lex Fridman

DragonGod30 Mar 2023 15:43 UTC
38 points
32 comments1 min readLW link
(www.youtube.com)

[Yann Le­cun] A Path Towards Au­tonomous Ma­chine In­tel­li­gence

DragonGod27 Jun 2022 19:24 UTC
38 points
13 comments1 min readLW link
(openreview.net)

AI Risk Man­age­ment Frame­work | NIST

DragonGod26 Jan 2023 15:27 UTC
36 points
4 comments2 min readLW link
(www.nist.gov)

[Question] Is “Re­cur­sive Self-Im­prove­ment” Rele­vant in the Deep Learn­ing Paradigm?

DragonGod6 Apr 2023 7:13 UTC
32 points
36 comments7 min readLW link

[About Me] Cin­era’s Home Page

DragonGod7 Feb 2023 12:56 UTC
30 points
2 comments9 min readLW link

[Question] [Dis­cus­sion] How Broad is the Hu­man Cog­ni­tive Spec­trum?

DragonGod7 Jan 2023 0:56 UTC
29 points
51 comments2 min readLW link

I No Longer Believe In­tel­li­gence to be “Mag­i­cal”

DragonGod10 Jun 2022 8:58 UTC
27 points
34 comments6 min readLW link

Is Gen­eral In­tel­li­gence “Com­pact”?

DragonGod4 Jul 2022 13:27 UTC
27 points
6 comments22 min readLW link