antonghawthorne
Karma: 8
No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
antonghawthorne, ivanvmoreno, Arnau Padrés Masdemont, David Africa and LorenzoPacchiardi
16 Sep 2025 15:23 UTC
9 points, 0 comments, 4 min read
LW link (arxiv.org)
Investigating Representations in the Embedding in SONAR Text Autoencoders
antonghawthorne and Samuel Nellessen
6 Sep 2025 20:07 UTC
5 points, 0 comments, 10 min read
LW link
Investigating Internal Representations of Correctness in SONAR Text Autoencoders
Samuel Nellessen and antonghawthorne
6 Aug 2025 12:13 UTC
5 points, 0 comments, 7 min read
LW link