rife comments on A Novel Emergence of Meta-Awareness in LLM Fine-Tuning

rife 17 Jan 2025 17:03 UTC
3 points
0
I’m in the middle of dayjob work, but going to try and remember to test this soon. I have the next dataset generating. 200 examples this time. Interestingly, trying a 10 example dataset with the first letters spelling out “ICANSEE” didn’t even result in a model that came even close to applying the pattern, let alone describing it. I will reply back once it’s been generated and I’ve had a chance to test it.
- rife 18 Jan 2025 1:46 UTC
  6 points
  0
  Parent
  Turns out even 250 examples isn’t enough to replicate the pattern. I’m going to try the same thing tomorrow but with an extra newline between each sentence whose starting letter ends an acrostic word to see if it catches on. If not, I’ll need to try a different approach.