gwern comments on OpenAI: GPT-based LLMs show ability to discriminate between its own wrong answers, but inability to explain how/​why it makes that discrimination, even as model scales