RogerDearnaley comments on AI safety advocates should consider providing gentle pushback following the events at OpenAI

RogerDearnaley 23 Dec 2023 10:44 UTC
6 points
1
As far as is known publicly, the OpenAI disaster began with Sam Altman attempting to purge the board of serious AI safety advocates and it ended with him successfully purging the board of serious AI safety advocates.
This safety-vs-less-safety viewpoint on what happened does not match the account that, for example Zvi has been posting about of what happened recently at OpenAI. His theory, as I understand it, is that this was more of a start-up-mindset-vs-academic-research-mindset disagreement that turned into a power struggle. I’m not in a position to know, but Zvi appears to have put a lot of effort into trying to find this out, several parts of what he posted has since been corroborated by other sources, and he is not the only source from which I’ve heard evidence suggesting that this wasn’t simply a safety-vs-less-safety disagreement.
It’s also, to me, rather unclear who “won”: Sam Altman is back, but lost his seat on the board and is now under an investigation (along with the former board), Ilya also lost his seat on the board, while also all-but-one of the other board members were replaced, presumably with candidates acceptable both to Sam and the board members who then agreed to resign. All of the new board are more from the startup world than the academic researcher world, But some of them are, from what I’ve read, reputed to have quite safety-conscious views on AI risk. I’m concerned that people may be assuming that this was about what many people are arguing about outside OpenAI, whereas it may actually be about something which is partially or even almost completely orthogonal to that. Helen Toner, for example, is by all accounts I have heard both safety conscious, and an academic researcher.
OpenAI’s mission statement is to create safe AGI. It’s pretty clear why people with a $P (D O O M)$ anywhere from ~1% to ~90% might want to work there (for the higher percentages, if they feel that what OpenAI’s trying to do to make things safer isn’t misguided or even is our best shot, and that moving slower isn’t going to make things better, or might actually make it worse; for the lower percentages, if they agree that the Alignment/Preparedness/Superalignment problems do need to be worked on). I think it’s safe to assume that relatively few people with $P (D O O M)$ >= $O (99 %)$ are working at OpenAI. But the company is almost certainly a coalition between people with very different $P (D O O M)$ figures across the range ~1%–~90%, and even those with a $P (D O O M)$ of only, say, $O (10 %)$ presumably think this is an incredibly important problem representing a bigger risk to humanity than anything else we’re currently facing. Anyone smart trying to manage OpenAI it is going to understand that it’s vital to maintain a balance and culture that people across that full range can all feel comfortable with, and not feel like they’re being asked to come to work each morning to devote themselves to attempting to destroy the human race. So I strongly suspect that starting anything that the employees interpreted as a safety-vs-less-safety fight between the CEO and the board would be a great way to split the company, and that Sam Altman knows this. Which doesn’t appear to have happened. So my guess is that the employees don’t see it that way.
Also bear in mind that Sam Altman frequently, fluently, and persuasively expresses safety concerns in interviews, and spent a lot of time earlier this year successfully explaining to important people in major governments that the field his company is pioneering is not safe, and could in fact kill us all (not generally something you hear from titans of industry). I don’t buy the cynical claim (which I interpret as knee-jerk cynicism about capitalism, especially big tech, likely echoed by A19z and e/acc) that this is just an attempt at preemptive regulatory capture: there wasn’t any regulatory environment at all for LLMs, or any realistic prospect of one happening soon (even in the EU), until a bunch of people in the field, including from all the major labs, made it start happening. It looks to me a lot more like genuine fear that someone less careful and skilled might kill us all, and a desire to reduce/slow that risk. I think OpenAI and the other leading labs believe they’re more likely to get this right than someone else, where it’s unclear whether the “someone else” on their minds might be Meta, the Chinese, Mistral, A19z and the open-source community, the North Koreans, some lone nutcase, or some or all of the above. I doubt people inside OpenAI see themselves as primarily responsible for a race dynamic, so it seems more likely they feel pressure from other parties. And to be frank, they’re roughly 18 months ahead of the open-source community as financed by A19z (who are openly pirating what safety they have off OpenAI, in flagrant violation of their terms of service, something that OpenAI has never made a peep about until a Chinese company did that plus a whole lot more, so presumably they must regard as a good thing), and a little less than that ahead of Mistral (who so far appear to be doing approximately zero about safety). So it’s entirely possible to be very concerned about safety, and still want to move fast, if you believe the race dynamic is (currently) unstoppable and that the few leaders in the race are more capable of getting this right than the crowd behind them. Which seems rather plausible just from basic competence selection effects.
As Eliezer Yudkowski (eminence grise of the $P (D O O M)$ >= 99% camp) has observed, the amount of intelligence and resources required to kill us all by doing something stupid with AI is steadily decreasing, quite rapidly. So I suspect OpenAI may be in what one could call the “move fast, but don’t break things” camp.
- RogerDearnaley 27 Dec 2023 1:20 UTC
  3 points
  0
  Parent
  See also this extract from an interview with Emmet Shear, the temporary interim-CEO of OpenAI who arranged the negotiations between the board and Sam Altman that resulted in Sam returning as CEO, so who should be in a good position to know:
  [Context: Emmet earlier in the interview described four AI factions: a 2 ✕ 2 of safety/pause vs. acceleration ✕ expecting moderate vs. world-shattering impact. Click for helpful diagram]
  [Interviewer:] Was the dispute between Sam Altman and the board just a dispute between AI factions?
  [Emmet Shear:] I said this publicly on Twitter. I don’t think there’s any significant difference between what Sam believes and what the board believes in terms of timelines, danger, or anything like that. I think it had nothing to do with that, personally.
  I don’t have access to what’s going on in anyone’s brain directly, but I saw no evidence that that was the case.