Quentin FEUILLADE--MONTIXI

Karma: 287

I am a former 42.fr student and SERI MATS 3 scholar. I am currently interested in studying AI with a behavioral approach (Model Ethology). I worked for METR (ARC Eval at the time I worked there) and did independent red teaming for OpenAI and Anthropic. I am a co-founder of PRISM Eval.

Emergence, The Blind Spot of GenAI Interpretability?

Quentin FEUILLADE--MONTIXI
10 Aug 2024 10:07 UTC
16 points
7 comments, 3 min read, LW link

Studying The Alien Mind

5 Dec 2023 17:27 UTC
80 points
10 comments, 15 min read, LW link

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

7 Nov 2023 17:59 UTC
38 points
2 comments, 2 min read, LW link
(arxiv.org)