
DragonGod

Karma: 2,484

Theoretical Computer Science MSc student at the University of [Redacted] in the United Kingdom.

I’m an aspiring alignment theorist; my research vibes are descriptive formal theories of intelligent systems (and their safety properties), with a bias towards constructive theories.

I think it’s important that our theories of intelligent systems remain rooted in the characteristics of real-world intelligent systems; we cannot develop adequate theory from the null string as input.

[Question] Change My Mind: Thirders in “Sleeping Beauty” are Just Doing Epistemology Wrong

DragonGod · Oct 16, 2024, 10:20 AM
8 points
67 comments · 6 min read · LW link

Consequentialism is in the Stars not Ourselves

DragonGod · Apr 24, 2023, 12:02 AM
7 points
19 comments · 5 min read · LW link

[Question] Is “Strong Coherence” Anti-Natural?

DragonGod · Apr 11, 2023, 6:22 AM
23 points
25 comments · 2 min read · LW link

Feature Request: Right Click to Copy LaTeX

DragonGod · Apr 8, 2023, 11:27 PM
18 points
4 comments · 1 min read · LW link

Beren’s “Deconfusing Direct vs Amortised Optimisation”

DragonGod · Apr 7, 2023, 8:57 AM
52 points
10 comments · 3 min read · LW link

[Question] Is “Recursive Self-Improvement” Relevant in the Deep Learning Paradigm?

DragonGod · Apr 6, 2023, 7:13 AM
32 points
36 comments · 7 min read · LW link

Orthogonality is Expensive

DragonGod · Apr 3, 2023, 12:43 AM
21 points
3 comments · 1 min read · LW link
(www.beren.io)

“Dangers of AI and the End of Human Civilization” Yudkowsky on Lex Fridman

DragonGod · Mar 30, 2023, 3:43 PM
38 points
33 comments · 1 min read · LW link
(www.youtube.com)

Sparks of Artificial General Intelligence: Early experiments with GPT-4 | Microsoft Research

DragonGod · Mar 23, 2023, 5:45 AM
68 points
23 comments · 1 min read · LW link
(arxiv.org)

Contra “Strong Coherence”

DragonGod · Mar 4, 2023, 8:05 PM
39 points
24 comments · 1 min read · LW link

Incentives and Selection: A Missing Frame From AI Threat Discussions?

DragonGod · Feb 26, 2023, 1:18 AM
11 points
16 comments · 2 min read · LW link

[Question] Is InstructGPT Following Instructions in Other Languages Surprising?

DragonGod · Feb 13, 2023, 11:26 PM
39 points
15 comments · 1 min read · LW link

[Question] Do the Safety Properties of Powerful AI Systems Need to be Adversarially Robust? Why?

DragonGod · Feb 9, 2023, 1:36 PM
22 points
42 comments · 2 min read · LW link

[About Me] Cinera’s Home Page

DragonGod · Feb 7, 2023, 12:56 PM
30 points
2 comments · 9 min read · LW link

[Question] What Are The Preconditions/Prerequisites for Asymptotic Analysis?

DragonGod · Feb 3, 2023, 9:26 PM
8 points
2 comments · 1 min read · LW link

AI Risk Management Framework | NIST

DragonGod · Jan 26, 2023, 3:27 PM
36 points
4 comments · 2 min read · LW link
(www.nist.gov)

“Heretical Thoughts on AI” by Eli Dourado

DragonGod · Jan 19, 2023, 4:11 PM
146 points
38 comments · 3 min read · LW link
(www.elidourado.com)

[Question] How Does the Human Brain Compare to Deep Learning on Sample Efficiency?

DragonGod · Jan 15, 2023, 7:49 PM
11 points
6 comments · 1 min read · LW link

Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind

DragonGod · Jan 13, 2023, 4:53 PM
62 points
12 comments · 1 min read · LW link
(arxiv.org)

Microsoft Plans to Invest $10B in OpenAI; $3B Invested to Date | Fortune

DragonGod · Jan 12, 2023, 3:55 AM
23 points
10 comments · 2 min read · LW link
(fortune.com)