One last post from me for a bit:
Does anyone want to join me in exploring a simulation of the safety and explorative nature of many copies of the LLM with the same activation Vs many copies with different activations . Known good activations and points on the line between them too.
I think this might have important implications for how we design and deploy AI. I’m hoping to answer the questions.
- Do we want a few well tested models or an ecosystem of models/agents that rely on each other.
- What are the trade offs.
Just a quick question: is anyone tracking or writing about the window we have where young people aren’t dependent on AI for their thinking and it’s impact on the future of alignment work