I think the main thing I want to convey is that I think you’re saying that LWers (of which I am one) have a very low opinion of the integrity of people at Anthropic, but what I’m actually saying is that their integrity is no match for the forces they are being tested with.
I don’t need to be able to predict a lot of fine details about individuals’ decision-making in order to have good estimates of these two quantities, and comparing them is the second-most important question relating to whether it’s good to work on capabilities at Anthropic. (The first is a basic ethical question about working on a potentially extinction-causing technology, which is not much related to the details of which capabilities company you’re working at.)
I think you’re saying that LWers (of which I am one) have a very low opinion of the integrity of people at Anthropic
This is related to what I was saying, but it wasn’t what I was saying. I was saying “tend to be overly pessimistic about Anthropic leadership (in terms of how good the decisions Anthropic leadership makes will be, by the LessWrong person’s own views and values)”. I wasn’t making a claim about the perceived absolute level of integrity.
Probably not worth hashing this out further; I think I get what you’re saying.
How good/bad is it to work on capabilities at Anthropic?
That’s the most clear-cut case, but lots of other stuff trades off Anthropic’s power against other considerations.