rife comments on A Novel Emergence of Meta-Awareness in LLM Fine-Tuning

rife 24 Jan 2025 1:04 UTC
5 points
0
Forgot to follow up here but turning up the learning rate multiplier to 10 seemed to do the trick without introducing any over-fitting weirdness or instability