RSS

Maheep Chaudhary

Karma: 30

Aware­ness Jailbreak­ing: Re­veal­ing True Align­ment in Eval­u­a­tion-Aware Models

Maheep Chaudhary29 Dec 2025 21:29 UTC
10 points
0 comments4 min readLW link

Eval­u­a­tion Aware­ness Scales Pre­dictably in Open-Weights Large Lan­guage Models

Maheep Chaudhary19 Dec 2025 2:47 UTC
21 points
0 comments6 min readLW link