jan betley
Karma: 76
Self-shutdown AI
jan betley, 21 Aug 2023 16:48 UTC, 13 points, 2 comments, 2 min read
Localizing goal misgeneralization in a maze-solving policy network
jan betley, 6 Jul 2023 16:21 UTC, 37 points, 2 comments, 7 min read
[Question] Reverse engineering of the simulation
jan betley, 7 Feb 2022 21:36 UTC, 1 point, 2 comments, 1 min read
[Question] What do we *really* expect from a well-aligned AI?
jan betley, 4 Jan 2021 20:57 UTC, 13 points, 10 comments, 1 min read