You’re missing the point that many, many humans will reply like the LLMs. “It’s complicated”. Given that, I fail to see it as a “clear misalignment with respect to human values” problem rather than “what are those humans values in the first place ?” problem.
You’re missing the point that many, many humans will reply like the LLMs. “It’s complicated”. Given that, I fail to see it as a “clear misalignment with respect to human values” problem rather than “what are those humans values in the first place ?” problem.