The AI safety ecosystem is so well resourced that it has been correctly identified by many as one of the best paths into high prestige AI research jobs.
This person on twitter has written a popular article about getting into frontier ai labs and a Field Guide to AI Fellowships. The “AI Fellowships” are mostly AI safety programs funded by CG/OpenPhil. I have also noticed that people in ML research are quite likely to have heard of MATS and be interested in participating, even when they have very little interest in AI safety.
Idk how good or bad this is. It definitely causes a lot of ML researchers to engage with the AI safety literature, where they otherwise would not have. But it’s worth noting that while the primary driver of applications to programs like MATS used be concern about AI safety, now it is increasingly a desire to work at a frontier lab.
I used to select participants for LASR labs, one of programs listed, and we actively tried to choose people who cared about AI safety, but I think we often did not succeed and indeed some people on the program now work at frontier labs in roles that I think have little to do with safety.
Safety-washing in practice mostly looks like people rationalizing research or jobs that they would have done anyway as necessary for safety. It’s easy to do because it is in fact genuinely unclear in most cases what is helpful and harmful for safety. Only by looking at the larger pattern can we notice the suspicious abundance of conveniently overlapping opportunities that further both safety and a person’s short-term interests. However, I think it will be less common in people whose primary motivation is to prevent AI catastrophe.
Maybe you could make an argument that they don’t care enough, but I’m pretty confident that MATS does care about this. When I did it, they had a whole reading group program whose primary aim, as I understand it, was to get people to understand and care more about the fundamental safety issues. And I also believe they try to select for people who care about safety, at least to some extent.