wassname

Karma: 481

Adapters as Representational Hypotheses: What Adapter Methods Tell Us About Transformer Geometry

wassname, 22 Feb 2026 22:12 UTC
18 points
0 comments, 5 min read

Do LLMs Learn Our Preferences or Just Our Behaviors?

wassname, 1 Feb 2026 11:28 UTC
13 points
0 comments, 1 min read

AntiPaSTO: Self-Supervised Value Steering for Debugging Alignment

wassname, 13 Jan 2026 12:55 UTC
6 points
0 comments, 16 min read