Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Josh Engels
Karma:
294
All
Posts
Comments
New
Top
Old
Negative Results on Group SAEs
Josh Engels
6 May 2025 21:49 UTC
70
points
3
comments
8
min read
LW
link
Interim Research Report: Mechanisms of Awareness
Josh Engels
,
Neel Nanda
and
Senthooran Rajamanoharan
2 May 2025 20:29 UTC
43
points
6
comments
8
min read
LW
link
Scaling Laws for Scalable Oversight
Subhash Kantamneni
,
Josh Engels
,
David Baek
and
Max Tegmark
30 Apr 2025 12:13 UTC
37
points
1
comment
9
min read
LW
link
Josh Engels’s Shortform
Josh Engels
30 Apr 2025 10:58 UTC
4
points
4
comments
1
min read
LW
link
Takeaways From Our Recent Work on SAE Probing
Josh Engels
,
Subhash Kantamneni
,
Senthooran Rajamanoharan
and
Neel Nanda
3 Mar 2025 19:50 UTC
30
points
4
comments
5
min read
LW
link
SAE Probing: What is it good for?
Subhash Kantamneni
,
Josh Engels
,
Senthooran Rajamanoharan
and
Neel Nanda
1 Nov 2024 19:23 UTC
34
points
0
comments
11
min read
LW
link
Back to top