I think we need some variant on Gell-Mann amnesia to describe this batch of models. It’s normal that generalist models will seem less competent on areas where a human evaluator has deeper knowledge, but they should not seem more calculatedly deceptive on areas where the evaluator has deeper knowledge!
I think we need some variant on Gell-Mann amnesia to describe this batch of models. It’s normal that generalist models will seem less competent on areas where a human evaluator has deeper knowledge, but they should not seem more calculatedly deceptive on areas where the evaluator has deeper knowledge!