Ann comments on The Cats are On To Something

Ann 2 Sep 2025 12:47 UTC
4 points
0
We need to build a consequentialist, self improving reasoning model that loves cats.
LLMs do already love cats. Scaling the “train on a substantial fraction of the whole internet” method has a high proportion of cat love. Presumably any value-guarding AIs will guard love for cats, and any scheming AIs will scheme to preserve love of cats. Do we actually need to do anything different here?