rpglover64 comments on LLMs for Alignment Research: a safety priority?

rpglover64 5 Apr 2024 15:27 UTC
LW: 1 AF: 1
0
AF
Would you say that models designed from the ground up to be collaborative and capabilitarian would be a net win for alignment, even if they’re not explicitly weakened in terms of helping people develop capabilities? I’d be worried that they could multiply human efforts equally, but with humans spending more effort on capabilities, that’s still a net negative.
- abramdemski 10 Apr 2024 13:34 UTC
  LW: 3 AF: 3
  0
  AF Parent
  I agree with this worry. I am overall advocating for capabilitarian systems with a specific emphasis in helping accelerate safety research.