WebGPT seemed like one of the most in-expectation harmful projects that OpenAI has worked on
That seems weirdly strong. Why do you think that?
For people viewing on the Alignment Forum, there is a separate thread on this question here. (Edit: my link to LessWrong is automatically converted to an Alignment Forum link, you will have to navigate there yourself.)
I moved that thread over the AIAF as well!
That seems weirdly strong. Why do you think that?
For people viewing on the Alignment Forum, there is a separate thread on this question here. (Edit: my link to LessWrong is automatically converted to an Alignment Forum link, you will have to navigate there yourself.)
I moved that thread over the AIAF as well!