Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
dx26
(Dylan Xu)
Karma:
65
All
Posts
Comments
New
Top
Old
Measuring Coherence and Goal-Directedness in RL Policies
dx26
22 Apr 2024 18:26 UTC
2
points
0
comments
7
min read
LW
link
Measuring Coherence of Policies in Toy Environments
dx26
and
Richard_Ngo
18 Mar 2024 17:59 UTC
59
points
9
comments
14
min read
LW
link
Supervised Program for Alignment Research (SPAR) at UC Berkeley: Spring 2023 summary
mic
,
dx26
,
adamk
and
Carolyn Qian
19 Aug 2023 2:27 UTC
20
points
2
comments
6
min read
LW
link
Back to top