Jingyuan FENG

Karma: 2

Jingyuan FENG 5 Jun 2026 4:54 UTC
2 points
0
on: Logits as a new monitor for evaluation awareness
Great to see work on unverbalized eval awaraness! It’s straght forward and insightful.
I have 2 questions regarding to the experiment settings:
1. How did you decide the classification threshold for logit monitor and LLM-judge?
2. In Figure2(right), what is the VEA pass@100?