Thanks for the comment! Quick reacts: I’m concerned about the first bullet, not about 2, and bullet 3 seems to ignore top-k probability prediction requirements (the requirement isn’t to just ID the most probable next token). Maybe there’s a recovery of bullet 3 somehow, though?
Thanks for the comment! Quick reacts: I’m concerned about the first bullet, not about 2, and bullet 3 seems to ignore top-k probability prediction requirements (the requirement isn’t to just ID the most probable next token). Maybe there’s a recovery of bullet 3 somehow, though?