Can you point to some examples of people on LessWrong saying that? I’d be quite surprised if it was a common idea that “Claude has no pointers to any of human values”. I think it’s very obvious that Claude has a solid understanding of much of human values. There are some subtleties there like:
Claude has a solid understanding, but that doesn’t mean it fundamentally cares about these values above all else (in fact, I’d say it clearly doesn’t—it seems to often prioritise completing a task over human values)
Claude’s understanding of human values may be imperfect
Claude may not have a single pointer that points to a neat concept of “Claude’s conception of human values” at all.
But I haven’t heard anyone say that Claude has no pointers to any of human values, which in my understanding means “Claude does not properly understand any human values”, which seems pretty clearly untrue.
In that case, you should be aware that Leo did do this: https://www.lesswrong.com/posts/YiRsCfkJ2ERGpRpen?commentId=CYSejKh6XTeux34sz
Good luck!