Positive update on the value of Janus and its crowd.
Does anyone have an idea of why those insights don’t move to the AI Safety mainstream usually? It feels like Janus could have written this post years ago, but somehow did not. Do you know of other models of LLM behaviour like this one, that still did not get their “notalgebraist writes a post about it” moment?
The insights maybe don’t move into “AI Safety mainstream” or don’t match “average LessWrong taste” but they are familiar to the smart and curious parts of the extended AI safety community.
Yeah last post was two years ago. The Cyborgism and Simulators posts improved my thinking and AI strategy. The void may become one of those key posts for me, and it seems it could have been written much earlier by Janus himself.
IMO Janus mentoring during MATS 3.0 was quite impactful, as it led @Quentin FEUILLADE—MONTIXI to start his LLM ethology agenda and to cofound PRISM Eval.
I expect that there’s still a lot of potential value in Janus work that can only be realized through making it more legible to the rest of the AI safety community, be it mentoring, posting on LW.
I wish someone in the cyborgism community would pick up the ball of explaining the insights to outsiders. I’d gladly pay for a subscription to their Substack, and help them find money for this work.
Positive update on the value of Janus and its crowd.
Does anyone have an idea of why those insights don’t move to the AI Safety mainstream usually? It feels like Janus could have written this post years ago, but somehow did not. Do you know of other models of LLM behaviour like this one, that still did not get their “notalgebraist writes a post about it” moment?
The insights maybe don’t move into “AI Safety mainstream” or don’t match “average LessWrong taste” but they are familiar to the smart and curious parts of the extended AI safety community.
I think Janus is closer to “AI safety mainstream” than nostalgebraist?
AFAIK Janus does not publish posts on LessWrong to detail what he discovered and what it implies for AI Safety strategy.
https://www.lesswrong.com/users/janus-1 ?
Yeah last post was two years ago. The Cyborgism and Simulators posts improved my thinking and AI strategy. The void may become one of those key posts for me, and it seems it could have been written much earlier by Janus himself.
I note that Janus was a MATS mentor for at least one iteration, whereas I do not believe that nostalgebraist has been.
IMO Janus mentoring during MATS 3.0 was quite impactful, as it led @Quentin FEUILLADE—MONTIXI to start his LLM ethology agenda and to cofound PRISM Eval.
I expect that there’s still a lot of potential value in Janus work that can only be realized through making it more legible to the rest of the AI safety community, be it mentoring, posting on LW.
I wish someone in the cyborgism community would pick up the ball of explaining the insights to outsiders. I’d gladly pay for a subscription to their Substack, and help them find money for this work.
The post mentions Janus’s “Simulators” LessWrong blog post which was very popular in 2022 and received hundreds of upvotes.