Brandon Riggs comments on Subliminal Learning Across Models

Brandon Riggs 27 Nov 2025 13:32 UTC
3 points
0
Very interesting finding that this transfers across models!

Would be keen to see people pick up from here on defences/mitigations with whitebox methods given the LLM judge performance.