Trivial, but do token-based LMs follow instructions like “only output tokens ‘1’, ‘2’, ‘3’” where they’d output 123 as one token without that instruction?
Trivial, but do token-based LMs follow instructions like “only output tokens ‘1’, ‘2’, ‘3’” where they’d output 123 as one token without that instruction?