How expensive is the finetuning step relative to the pretraining (in terms of compute, data, labor, or anything else)?
I gather it’d be ~$1000 to “uncensor” a finetuned model, but as mentioned, this might be the first significant model released before finetuning, so I have no intuition for this. Two orders of magnitude more? Three?
How expensive is the finetuning step relative to the pretraining (in terms of compute, data, labor, or anything else)?
I gather it’d be ~$1000 to “uncensor” a finetuned model, but as mentioned, this might be the first significant model released before finetuning, so I have no intuition for this. Two orders of magnitude more? Three?