ryan_greenblatt comments on Preventing model exfiltration with upload limits

ryan_greenblatt 6 Feb 2024 17:23 UTC
4 points
See this section of the post for commentary.

TLDR: we actually think it’s reasonably likely that the total data outflow is of comparable scale to model weights (for an AI lab’s most capable model) under normal commercial operation.

That these are of comparable scale is a core assumption of the upload limiting method: without it, the method doesn't make sense unless some more exotic approaches are introduced.
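To make the "comparable scale" comparison concrete, here is a rough sketch of the arithmetic. Everything in it is an assumption for illustration: the function names, the one-order-of-magnitude reading of "comparable", and the placeholder inputs are not from the post and are not estimates for any real lab.

```python
import math


def outflow_to_weights_ratio(weights_bytes: float,
                             outflow_bytes_per_day: float,
                             days: float) -> float:
    """Total data leaving the datacenter over `days`, divided by the weight size."""
    if weights_bytes <= 0 or outflow_bytes_per_day < 0 or days < 0:
        raise ValueError("sizes and durations must be non-negative, weights positive")
    return (outflow_bytes_per_day * days) / weights_bytes


def comparable_scale(ratio: float, orders_of_magnitude: float = 1.0) -> bool:
    """Assumed reading of 'comparable': within N orders of magnitude of 1."""
    if ratio <= 0:
        return False
    return abs(math.log10(ratio)) <= orders_of_magnitude


# Placeholder inputs chosen only to exercise the code (hypothetical, not estimates).
ratio = outflow_to_weights_ratio(
    weights_bytes=2e12,          # hypothetical weight size
    outflow_bytes_per_day=5e9,   # hypothetical daily outflow
    days=365,                    # hypothetical deployment window
)
print(f"outflow / weights = {ratio:.4f}, comparable: {comparable_scale(ratio)}")
```

If the ratio came out many orders of magnitude above 1, a cap on total upload would have to sit far above the weight size, which is the case where the plain version of the method stops making sense.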