Generally haunting the Boston area. Primarily interested in the intersection of philosophy, ethics, and system dynamics.
Current focus is on the topology of evaluation, including white-box analysis of MLPs and other neural nets.
Open to work/research opportunities, particularly in alignment, safety, and/or mech interp. DMs are always welcome.
Largely I don’t think our positions are very far apart on what materially matters here because the crux of what I’m saying doesn’t depend on who was pursuing any particular strategy before this.
I’ll grant that there was nothing preventing this strategy prior to current events; given Anthropic’s commitment to Glasswing secrecy doesn’t seem to be the primary strategy they were employing. To be honest, I don’t know that it matters all that much for the forward-facing implications.
Regardless of what strategy any given actor was pursuing prior, the point remains: we now know more about the state of play than we did before these events. We now know we’re in the world in which an active authority has unilaterally and arbitrarily imposed restrictions on a disfavored actor for actions unrelated to safety.
Daddy just came home drunk, probably a good idea to hide in the bedroom until he sobers up.