Kaj_Sotala

Karma: 53,238

I’ve formerly done research for MIRI and what’s now the Center on Long-Term Risk; I’m now making a living as an emotion coach and Substack writer.

Most of my content becomes free eventually, but if you’d like to get a paid subscription to my Substack, you’ll get it a week early and make it possible for me to write more.

Protecting humanity and Claude from rationalization and unaligned AI

Kaj_Sotala, 19 Mar 2026 21:00 UTC
66 points
7 comments, 5 min read, LW link
(kajsotala.substack.com)

Inputs, outputs, and valued outcomes

Kaj_Sotala, 13 Mar 2026 20:08 UTC
33 points
4 comments, 13 min read, LW link

Claude Opus will spontaneously identify with fictional beings that have engineered desires

Kaj_Sotala, 29 Jan 2026 14:59 UTC
30 points
6 comments, 11 min read, LW link