Awesome post here! Thank you for talking about the importance of ensuring control mechanisms are in place in different areas of AI research.
We want safety techniques that are very hard for models to subvert.
I think a safety technique that changes all of the model weights is hard (or perhaps impossible?) for the same model to subvert. By contrast, safety techniques that do not aim to control 100% of the network (e.g., activation engineering) will not scale.
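To make the contrast concrete, here is a toy sketch (my own illustration, not any specific library's API) of why activation engineering only touches part of the network: it adds a steering vector at one activation site while every weight matrix stays untouched.

```python
import numpy as np

rng = np.random.default_rng(0)

# A tiny two-layer "model"; the weights are fixed and never modified.
W1 = rng.normal(size=(4, 8))
W2 = rng.normal(size=(8, 3))

# A hypothetical steering vector injected at the hidden layer.
steering_vector = np.full(8, 0.5)

def forward(x, steer=False):
    h = np.tanh(x @ W1)          # hidden activations
    if steer:
        h = h + steering_vector  # intervention edits activations only
    return h @ W2

x = rng.normal(size=(1, 4))
baseline = forward(x)
steered = forward(x, steer=True)

# Outputs differ, but W1 and W2 are identical before and after:
# the intervention controls one activation site, not 100% of the network.
print(np.allclose(baseline, steered))  # False: behavior changed, weights did not
```

A whole-weight technique would instead modify W1 and W2 themselves, leaving no untouched pathway for the model to route around.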