For those organizations that do choose to compete… I think it is highly likely that they will attempt to build competing systems in basically the exact same way as the first organization did
...
It’s unlikely for there to exist both aligned and misaligned AI systems at the same time
If the first group sunk some cost into aligning their system, but that wasn’t integral to its everyday task performance, wouldn’t a second competing group be somewhat likely to skimp on the alignment part?
It seems like this calls into the question the claim that we wouldn’t get a mix of aligned and misaligned systems.
Do you expect it to be difficult to disentangle the alignment from the training, such that the path of least resistance for the second group will necessarily include doing a similar amount of alignment?
If the first group sunk some cost into aligning their system, but that wasn’t integral to its everyday task performance, wouldn’t a second competing group be somewhat likely to skimp on the alignment part?
I think that alignment will be a pretty important desideratum for anybody building an AI system—and I think that copying whatever alignment strategy was used previously is likely to be the easiest, most conservative, most risk-averse option for other organizations trying to fulfill that desideratum.
If the first group sunk some cost into aligning their system, but that wasn’t integral to its everyday task performance, wouldn’t a second competing group be somewhat likely to skimp on the alignment part?
It seems like this calls into the question the claim that we wouldn’t get a mix of aligned and misaligned systems.
Do you expect it to be difficult to disentangle the alignment from the training, such that the path of least resistance for the second group will necessarily include doing a similar amount of alignment?
I think that alignment will be a pretty important desideratum for anybody building an AI system—and I think that copying whatever alignment strategy was used previously is likely to be the easiest, most conservative, most risk-averse option for other organizations trying to fulfill that desideratum.