I think I’d prefer “within a month after external deployment” over “by the time of external deployment” because I expect the latter will lead to (1) evals being rushed and (2) safety people being forced to prioritize poorly.
To clarify, the primary complaint from my perspective is not that they published the report a month after external deployment per se, but that the timing of the report indicates that they did not perform thorough pre-deployment testing (and zero external testing).
And the focus on pre-deployment testing is not really due to any opinion about the relative benefits of pre- vs. post- deployment testing, but because they committed to doing pre-deployment testing, so it’s important that they in fact do pre-deployment testing.
I think I’d prefer “within a month after external deployment” over “by the time of external deployment” because I expect the latter will lead to (1) evals being rushed and (2) safety people being forced to prioritize poorly.
To clarify, the primary complaint from my perspective is not that they published the report a month after external deployment per se, but that the timing of the report indicates that they did not perform thorough pre-deployment testing (and zero external testing).
And the focus on pre-deployment testing is not really due to any opinion about the relative benefits of pre- vs. post- deployment testing, but because they committed to doing pre-deployment testing, so it’s important that they in fact do pre-deployment testing.