OpenAI’s old board firing Sam Altman seems like a mild model break. It worked out for the superorganism in the end but I don’t think that was ex ante overdetermined. Though I might be insufficiently cynical.
DeepSeek and other Chinese companies releasing open-weights models, and Grok being “based”/law-breaking/arbitrarily following some of Elon’s whims, also seem like examples of companies whose behavior is meaningfully distinct from what you’d predict from a pure power-seeking entity, though not exactly for safety motivations.