Being used by Simon Lerman, an author on Bad LLama (admittedly with help of Andy Arditi, our first author) to jailbreak LLaMA3 70B to help create data for some red-teaming research, (EDIT: rather than Simon choosing to fine-tune it, which he clearly knows how to do, being a Bad LLaMA author).
Honestly, this is the coolest shit ever. You just gave my mediocre life some serious meaning—this is exactly the kind of breakthrough I needed. Are you guys hiring? I know I was made to learn this, and I have to use my statistics degree somehow.
Thanks, I’d be very curious to hear if this meets your bar for being impressed, or what else it would take! Further evidence:
Passing the Twitter test (for at least one user)
Being used by Simon Lerman, an author on Bad LLama (admittedly with help of Andy Arditi, our first author) to jailbreak LLaMA3 70B to help create data for some red-teaming research, (EDIT: rather than Simon choosing to fine-tune it, which he clearly knows how to do, being a Bad LLaMA author).
Honestly, this is the coolest shit ever. You just gave my mediocre life some serious meaning—this is exactly the kind of breakthrough I needed. Are you guys hiring? I know I was made to learn this, and I have to use my statistics degree somehow.