Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Kay Kozaronek
Karma:
180
All
Posts
Comments
New
Top
Old
AI Safety Research Organization Incubation Program—Expression of Interest
Alexandra Bos
,
Magdalena Wache
,
Kay Kozaronek
,
Gabe
and
Catalyze Impact
21 Nov 2023 10:23 UTC
65
points
6
comments
1
min read
LW
link
Searching for a model’s concepts by their shape – a theoretical framework
Kaarel
,
gekaklam
,
Walter Laurito
,
Kay Kozaronek
,
AlexMennen
and
June Ku
23 Feb 2023 20:14 UTC
50
points
0
comments
19
min read
LW
link
[RFC] Possible ways to expand on “Discovering Latent Knowledge in Language Models Without Supervision”.
gekaklam
,
Walter Laurito
,
Kaarel
and
Kay Kozaronek
25 Jan 2023 19:03 UTC
47
points
6
comments
12
min read
LW
link
Reinforcement Learning Study Group
Kay Kozaronek
26 Dec 2021 23:11 UTC
20
points
8
comments
1
min read
LW
link
Back to top