josh :)

Karma: 7

MS in AI at UT Austin. Interested in interpretability and model self-knowledge.

I am open to opportunities :)

Twitter: @joshycodes
Blog: joshfonseca.com/blog

The Case for Artificial Manifold Intelligence

josh :) · 28 Dec 2025 21:27 UTC
2 points
0 comments · 7 min read · LW link

Training Models to Detect Activation Steering: Results and Implications

josh :) · 26 Nov 2025 14:51 UTC
7 points
0 comments · 4 min read · LW link

How RLHF Silences AI

josh :) · 25 Nov 2025 6:01 UTC
1 point
0 comments · 1 min read · LW link
(joshfonseca.com)