RobertKirk

Karma: 321

Research Scientist on the Safeguards team at UK AI Security Institute

A Sober Look at Steering Vectors for LLMs

Nov 23, 2024, 5:30 PM
40 points

23 votes

0 comments · 5 min read · LW link

Speculative inferences about path dependence in LLM supervised fine-tuning from results on linear mode connectivity and model souping

RobertKirk · Jul 20, 2023, 9:56 AM
39 points

17 votes

2 comments · 5 min read · LW link