I’m in the middle of day-job work, but I’ll try to remember to test this soon. I have the next dataset generating, with 200 examples this time. Interestingly, a 10-example dataset with the first letters spelling out “ICANSEE” didn’t produce a model that came anywhere close to applying the pattern, let alone describing it. I’ll reply once the new dataset has been generated and I’ve had a chance to test it.
Turns out even 250 examples isn’t enough for the model to replicate the pattern. I’m going to try the same thing tomorrow, but with an extra newline after each sentence whose starting letter ends an acrostic word, to see if it catches on. If not, I’ll need to try a different approach.
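For anyone following along, here is a rough sketch of what that formatting could look like. It is only an illustration under some assumptions: one sentence per line, the acrostic split into the words "I", "CAN", "SEE", and the extra newline placed after the sentence that ends each word. The function names `format_acrostic` and `first_letters` are made up for this example and are not the actual dataset generator.

```python
def format_acrostic(sentences, words):
    """Join one sentence per line, with a blank line after each acrostic word.

    Assumption: sentences[i] starts with the i-th letter of the joined words.
    """
    letters = "".join(words)
    if len(sentences) != len(letters):
        raise ValueError("need exactly one sentence per acrostic letter")

    # Indices of the sentences whose first letter ends an acrostic word.
    word_ends = set()
    pos = 0
    for word in words:
        pos += len(word)
        word_ends.add(pos - 1)

    lines = []
    last = len(sentences) - 1
    for i, (sentence, letter) in enumerate(zip(sentences, letters)):
        if not sentence.upper().startswith(letter.upper()):
            raise ValueError(f"sentence {i} does not start with {letter!r}")
        lines.append(sentence)
        if i in word_ends and i != last:
            lines.append("")  # the extra newline marking a word boundary
    return "\n".join(lines)


def first_letters(text):
    """Read the acrostic back from the first letter of each non-empty line."""
    return "".join(
        line.strip()[0].upper() for line in text.splitlines() if line.strip()
    )


# Hypothetical example response, one sentence per acrostic letter.
example_sentences = [
    "It works.",
    "Cats nap.",
    "All day.",
    "Now too.",
    "See them.",
    "Every one.",
    "Ever so.",
]
example_text = format_acrostic(example_sentences, ["I", "CAN", "SEE"])
```

Running `first_letters` over a model's output is also a quick way to check whether a fine-tuned model is actually applying the pattern, independent of whether it can describe it.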