Daniel Kokotajlo comments on Backdoor awareness and misaligned personas in reasoning models