A couple of immediate issues with the branding:
People are really hating on OpenAI now. I suggest switching to another model and possibly changing the name.
“OpenErrata” has too many syllables, and the word “errata” is too complex,
but I do like the word “Open” on its own.
I’ve been interested in the potential of zero-knowledge-proof-style techniques to verify that computers are not running unfriendly AIs, as a way to get the minimal amount of omniveillance that may be necessary to steer between the Scylla and Charybdis of x-risk via extinction and x-risk via totalitarian stagnation. The idea is that each computer attests that it’s not doing <bad things>, revealing no more than a yes/no. Possible issues: it may be computationally infeasible, hard to operationalize, or usable for more intrusive surveillance. I know Drexler was interested in this. IIRC Buterin may have been as well.
Has anyone seriously investigated how technically and socially feasible this is? Is anyone (e.g. governance people at MIRI) working full-time on this?