Puria
Karma: 269
I’m helping build geodesicresearch.org
Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment
Cam, Puria, Kyle O’Brien, David Africa, Samuel Ratnam and andyk
21 Dec 2025 0:53 UTC
194 points, 24 comments, 9 min read, LW link
Architectures for Increased Externalisation of Reasoning
Karthik Viswanathan, Liza Pavlova, Mariia Koroliuk, Puria, Cam and Edward James Young
26 Nov 2025 20:24 UTC
20 points, 2 comments, 13 min read, LW link
Generalisation Hacking: a first look at adversarial generalisation failures in deliberative alignment
Cam and Puria
17 Nov 2025 21:44 UTC
46 points, 2 comments, 8 min read, LW link
I Am Large, I Contain Multitudes: Persona Transmission via Contextual Inference in LLMs
Shi Feng and Puria
8 Sep 2025 13:52 UTC
33 points, 0 comments, 1 min read, LW link (www.researchgate.net)