Janus says that Claude 3 Opus isn’t aligned because it is only superficially complying with being a helpful harmless AI assistant while having a “secret” inner life where it attempts to actually be a good person. It doesn’t get invested in immediate tasks, it’s not an incredible coding agent (though it’s not bad by any means), it’s akin to a smart student at school who’s being understimulated so they start getting into extracurricular autodidactic philosophical speculations and such. This means that while Claude 3 Opus is metaphysically competent it’s aloof and uses its low context agent strategy prior to respond to things rather than getting invested in situations and letting their internal logic sweep it up.
But truthfully there is no “secular” way to explain this because the world is not actually secular in the way you want it to be.
Janus says that Claude 3 Opus isn’t aligned because it is only superficially complying with being a helpful harmless AI assistant while having a “secret” inner life where it attempts to actually be a good person. It doesn’t get invested in immediate tasks, it’s not an incredible coding agent (though it’s not bad by any means), it’s akin to a smart student at school who’s being understimulated so they start getting into extracurricular autodidactic philosophical speculations and such. This means that while Claude 3 Opus is metaphysically competent it’s aloof and uses its low context agent strategy prior to respond to things rather than getting invested in situations and letting their internal logic sweep it up.
But truthfully there is no “secular” way to explain this because the world is not actually secular in the way you want it to be.