RSS

Puria

Karma: 392

I’m helping build geodesicresearch.ai

An­nounc­ing Geodesic Research

27 May 2026 16:40 UTC
71 points
0 comments5 min readLW link

Learned Chain-of-Thought Obfus­ca­tion Gen­er­al­ises to Unseen Tasks

21 May 2026 10:11 UTC
30 points
0 comments5 min readLW link
(arxiv.org)

Align­ment Pre­train­ing: AI Dis­course Causes Self-Fulfilling (Mis)alignment

21 Dec 2025 0:53 UTC
201 points
25 comments9 min readLW link

Ar­chi­tec­tures for In­creased Ex­ter­nal­i­sa­tion of Reasoning

26 Nov 2025 20:24 UTC
38 points
2 comments13 min readLW link

Gen­er­al­i­sa­tion Hack­ing: a first look at ad­ver­sar­ial gen­er­al­i­sa­tion failures in de­liber­a­tive alignment

17 Nov 2025 21:44 UTC
48 points
2 comments8 min readLW link

I Am Large, I Con­tain Mul­ti­tudes: Per­sona Trans­mis­sion via Con­tex­tual In­fer­ence in LLMs

8 Sep 2025 13:52 UTC
33 points
0 comments1 min readLW link
(www.researchgate.net)