https://alignment.openai.com/prod-evals/
I’m glad OpenAI did this research and published it! I hope this sort of thing becomes industry standard.
It also seems like it might be tractable to turn something like this, along with reporting requirements, into legislation that could meaningfully reduce x-risks.