Yuxiao

Karma: 12

I’m an AI safety researcher — mostly working on ways to see inside the systems we’ve built and understand what moves them. My background runs through statistical inference, machine learning, and generative models; lately I’ve been in the borderlands between mechanistic interpretability and probabilistic thinking, trying to make large language models a little less opaque.

I’ve moved between academia, industry, and independent research, but the constant thread is the same: bridging abstract mathematics with the hidden structures of deep networks. I view this blog as the place to keep scientific diary, maintain emotional balance, and make friends :)

Yuxiao 11 Nov 2025 3:44 UTC
1 point
0
in reply to: Karl Krueger’s comment on: Bottom-Up: Principled Compression to Shrink LLMs
Indeed lots of wordings and sec arrangement in the first version, just got over-excited and wanted to record the new great algorithm fast, not expecting early readers, not sure about language.
Anyway just changed bit now and hope it’s easier to chew. Everything else is still 100% original and poured loads of effort, just feel very shocked of the votes and these parts ignored.

From Oragnized Shelves to Layered Catalogs: Architectural Explorations for Sparse Autoencoders—Crosscoders & Ladder SAEs Towards Hierarchical Data Structure

Yuxiao10 Aug 2025 10:12 UTC

3 points

1 comment11 min readLW link

From Messy Shelves to Master Librarians: Toy-Model Exploration of Block-Diagonal Geometry in LM Activations

Yuxiao19 Jul 2025 12:26 UTC

6 points

1 comment4 min readLW link

From Unruly Stacks to Organized Shelves: Toy Model Validation of Structured Priors in Sparse Autoencoders

Yuxiao6 Jul 2025 7:03 UTC

9 points

0 comments5 min readLW link

Yuxiao

From Orag­nized Shelves to Lay­ered Cat­a­logs: Ar­chi­tec­tural Ex­plo­ra­tions for Sparse Au­toen­coders—Cross­coders & Lad­der SAEs Towards Hier­ar­chi­cal Data Structure

From Messy Shelves to Master Librar­i­ans: Toy-Model Ex­plo­ra­tion of Block-Di­ag­o­nal Geom­e­try in LM Activations

From Un­ruly Stacks to Or­ga­nized Shelves: Toy Model Val­i­da­tion of Struc­tured Pri­ors in Sparse Autoencoders

From Oragnized Shelves to Layered Catalogs: Architectural Explorations for Sparse Autoencoders—Crosscoders & Ladder SAEs Towards Hierarchical Data Structure

From Messy Shelves to Master Librarians: Toy-Model Exploration of Block-Diagonal Geometry in LM Activations

From Unruly Stacks to Organized Shelves: Toy Model Validation of Structured Priors in Sparse Autoencoders