Can you just extend the input layer for fine-tuning? Or leave a portion of the input layer blank during training and use it only during fine-tuning, specifically for instructions? I wonder how much data that would need.
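To make the first idea concrete, here is a rough sketch of one way "extending the input layer" could work, assuming a plain dense layer rather than any particular model. The new input slots get zero-initialized weights, so the layer's output is unchanged until fine-tuning updates them. The names `forward` and `extend_inputs` and all the numbers are made up for illustration.

```python
def forward(weights, bias, x):
    """Dense layer: one output per row of `weights`."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def extend_inputs(weights, extra):
    """Append `extra` zero-initialized input columns to every row."""
    return [row + [0.0] * extra for row in weights]


# Hypothetical "pretrained" layer with 2 inputs and 2 outputs.
weights = [[0.5, -1.0], [2.0, 0.25]]
bias = [0.1, -0.2]
x = [1.0, 2.0]

before = forward(weights, bias, x)

# Widen the layer by 2 slots that could carry instruction features.
wide = extend_inputs(weights, extra=2)
after = forward(wide, bias, x + [3.0, -4.0])

# Zero weights mean the new slots have no effect yet.
assert before == after
```

This only shows that the widened layer starts out behaving like the original. It says nothing about how much fine-tuning data the new weights would need, which is still the open question.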