Robert Kralisch

Karma: 66

Hey, I am Robert Kralisch, an independent conceptual/theoretical Alignment Researcher. I have a background in Cognitive Science and I am interested in collaborating on an end-to-end strategy for AGI alignment.

The three main branches that I aim to contribute to are conceptual clarity (what we should mean by agency, intelligence, embodiment, etc.), the exploration of more inherently interpretable cognitive architectures, and Simulator theory.

One of my concrete goals is to figure out how to design a cognitively powerful agent such that it does not become a Superoptimiser in the limit.

The Prop-room and Stage Cognitive Architecture

Robert Kralisch, 29 Apr 2024 0:48 UTC
8 points
4 comments, 14 min read

How are Simulators and Agents related?

Robert Kralisch, 29 Apr 2024 0:22 UTC
6 points
0 comments, 7 min read

Extended Embodiment

Robert Kralisch, 29 Apr 2024 0:18 UTC
7 points
1 comment, 3 min read

Referential Containment

Robert Kralisch, 29 Apr 2024 0:16 UTC
2 points
3 comments, 3 min read

Disentangling Competence and Intelligence

Robert Kralisch, 29 Apr 2024 0:12 UTC
23 points
5 comments, 6 min read

Introduction

30 Jun 2023 20:45 UTC
7 points
0 comments, 2 min read

Inherently Interpretable Architectures

30 Jun 2023 20:43 UTC
4 points
0 comments, 7 min read

Positive Attractors

30 Jun 2023 20:43 UTC
6 points
0 comments, 13 min read

AISC 2023, Progress Report for March: Team Interpretable Architectures

2 Apr 2023 16:19 UTC
14 points
0 comments, 14 min read

Commentary on “AGI Safety From First Principles by Richard Ngo, September 2020”

Robert Kralisch, 14 Oct 2021 15:11 UTC
3 points
0 comments, 19 min read