Quentin FEUILLADE--MONTIXI

Karma: 287

I am a former 42.fr student and SERI MATS 3 scholar. I am currently interested in studying AI with a behavioral approach (Model Ethology). I worked for METR (called ARC Evals when I worked there) and did independent red teaming for OpenAI and Anthropic. I am a co-founder of PRISM Eval.

Thinking Partners: Building AI-Powered Knowledge Management Systems

Quentin FEUILLADE--MONTIXI, 14 Oct 2025 17:42 UTC
18 points
3 comments, 10 min read, LW link

Investing in Robust Safety Mechanisms is critical for reducing Systemic Risks

11 Dec 2024 13:37 UTC
8 points
3 comments, 2 min read, LW link

Emergence, The Blind Spot of GenAI Interpretability?

Quentin FEUILLADE--MONTIXI, 10 Aug 2024 10:07 UTC
16 points
7 comments, 3 min read, LW link

Studying The Alien Mind

5 Dec 2023 17:27 UTC
80 points
10 comments, 15 min read, LW link

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

7 Nov 2023 17:59 UTC
38 points
2 comments, 2 min read, LW link
(arxiv.org)

The Stochastic Parrot Hypothesis is debatable for the last generation of LLMs

7 Nov 2023 16:12 UTC
52 points
21 comments, 6 min read, LW link

Preface to the Sequence on LLM Psychology

Quentin FEUILLADE--MONTIXI, 7 Nov 2023 16:12 UTC
33 points
0 comments, 2 min read, LW link

PICT: A Zero-Shot Prompt Template to Automate Evaluation

Quentin FEUILLADE--MONTIXI, 17 Feb 2023 23:16 UTC
17 points
1 comment, 11 min read, LW link

Using PICT against PastaGPT Jailbreaking

Quentin FEUILLADE--MONTIXI, 9 Feb 2023 4:30 UTC
26 points
0 comments, 9 min read, LW link