RSS

Andrew_Critch(Andrew Critch)

Karma: 3,988

This is Dr. Andrew Critch’s professional LessWrong account. Andrew is the CEO of Encultured AI, and works for ~1 day/​week as a Research Scientist at the Center for Human-Compatible AI (CHAI) at UC Berkeley. He also spends around a ½ day per week volunteering for other projects like the Berkeley Existential Risk initiative and the Survival and Flourishing Fund. Andrew earned his Ph.D. in mathematics at UC Berkeley studying applications of algebraic geometry to machine learning models. During that time, he cofounded the Center for Applied Rationality and SPARC. Dr. Critch has been offered university faculty and research positions in mathematics, mathematical biosciences, and philosophy, worked as an algorithmic stock trader at Jane Street Capital’s New York City office, and as a Research Fellow at the Machine Intelligence Research Institute. His current research interests include logical uncertainty, open source game theory, and mitigating race dynamics between companies and nations in AI development.

New con­trac­tor role: Web se­cu­rity task force con­trac­tor for AI safety announcements

9 Oct 2023 18:36 UTC
11 points
0 comments2 min readLW link
(survivalandflourishing.com)

Con­scious­ness as a con­fla­tion­ary al­li­ance term for in­trin­si­cally val­ued in­ter­nal experiences

Andrew_Critch10 Jul 2023 8:09 UTC
190 points
46 comments11 min readLW link

TASRA: A Tax­on­omy and Anal­y­sis of So­cietal-Scale Risks from AI

Andrew_Critch13 Jun 2023 5:04 UTC
63 points
1 comment1 min readLW link

My May 2023 pri­ori­ties for AI x-safety: more em­pa­thy, more unifi­ca­tion of con­cerns, and less vil­ifi­ca­tion of OpenAI

Andrew_Critch24 May 2023 0:02 UTC
272 points
39 comments8 min readLW link

Job Open­ing: SWE to help build sig­na­ture vet­ting sys­tem for AI-re­lated petitions

20 May 2023 19:02 UTC
52 points
0 comments1 min readLW link

GPT can write Quines now (GPT-4)

Andrew_Critch14 Mar 2023 19:18 UTC
111 points
30 comments1 min readLW link

Acausal normalcy

Andrew_Critch3 Mar 2023 23:34 UTC
169 points
30 comments8 min readLW link

Payor’s Lemma in Nat­u­ral Language

Andrew_Critch2 Mar 2023 12:22 UTC
60 points
0 comments2 min readLW link

Mo­dal Fix­point Co­op­er­a­tion with­out Löb’s Theorem

Andrew_Critch5 Feb 2023 0:58 UTC
133 points
32 comments3 min readLW link

Löbian emo­tional pro­cess­ing of emer­gent co­op­er­a­tion: an example

Andrew_Critch17 Jan 2023 5:59 UTC
23 points
0 comments8 min readLW link

A Löbian ar­gu­ment pat­tern for im­plicit rea­son­ing in nat­u­ral lan­guage: Löbian party invitations

Andrew_Critch1 Jan 2023 17:39 UTC
23 points
8 comments7 min readLW link

Löb’s Lemma: an eas­ier ap­proach to Löb’s Theorem

Andrew_Critch24 Dec 2022 2:02 UTC
29 points
16 comments3 min readLW link

«Boundaries», Part 3b: Align­ment prob­lems in terms of bound­aries

Andrew_Critch14 Dec 2022 22:34 UTC
72 points
7 comments13 min readLW link

Open tech­ni­cal prob­lem: A Quinean proof of Löb’s the­o­rem, for an eas­ier car­toon guide

Andrew_Critch24 Nov 2022 21:16 UTC
58 points
35 comments3 min readLW link1 review

«Boundaries», Part 3a: Defin­ing bound­aries as di­rected Markov blankets

Andrew_Critch30 Oct 2022 6:31 UTC
86 points
20 comments15 min readLW link

The Onion Test for Per­sonal and In­sti­tu­tional Honesty

27 Sep 2022 15:26 UTC
154 points
31 comments3 min readLW link3 reviews

An­nounc­ing En­cul­tured AI: Build­ing a Video Game

18 Aug 2022 2:16 UTC
103 points
26 comments4 min readLW link

En­cul­tured AI Pre-plan­ning, Part 2: Pro­vid­ing a Service

11 Aug 2022 20:11 UTC
33 points
4 comments3 min readLW link

En­cul­tured AI, Part 1 Ap­pendix: Rele­vant Re­search Examples

8 Aug 2022 22:44 UTC
11 points
1 comment7 min readLW link

En­cul­tured AI Pre-plan­ning, Part 1: En­abling New Benchmarks

8 Aug 2022 22:44 UTC
63 points
2 comments6 min readLW link