RSS

Narmeen

Karma: 22

Steer­ing Lan­guage Models in Mul­ti­ple Direc­tions Simultaneously

May 2, 2025, 3:27 PM
18 points
0 comments7 min readLW link

Back­doors have uni­ver­sal rep­re­sen­ta­tions across large lan­guage models

Dec 6, 2024, 10:56 PM
16 points
0 comments16 min readLW link