Josh You comments on ryan_greenblatt’s Shortform

Josh You 3 Jun 2025 16:18 UTC
1 point
One possibility I’ve wondered about is whether AI can automate this learning work: start from a transcript of someone trying to do things with AI, including their mistakes and the subsequent feedback, and then curate data from it that works well for RL fine-tuning. Or even distill it into examples for in-context learning (which probably works somewhat well, sometimes, today).
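
To make the in-context variant concrete, here is a minimal sketch, not a tested pipeline. Everything in it is hypothetical: the `Turn` and `Correction` types, the `is_feedback` flag (in practice a model or a human would have to label which user turns are feedback), and the prompt layout. It only looks for the simplest pattern, where a task is followed by an attempt, a piece of feedback, and a retry.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    role: str                  # "user" or "assistant" (hypothetical labels)
    text: str
    is_feedback: bool = False  # assumed to be labeled upstream, e.g. by a classifier


@dataclass
class Correction:
    task: str
    mistake: str
    feedback: str
    fixed: str


def extract_corrections(transcript):
    """Collect task -> attempt -> feedback -> retry windows from a transcript."""
    found = []
    for i in range(len(transcript) - 3):
        task, attempt, feedback, retry = transcript[i:i + 4]
        roles = [task.role, attempt.role, feedback.role, retry.role]
        if roles != ["user", "assistant", "user", "assistant"]:
            continue
        # The first user turn must be a fresh task, the second must be feedback.
        if task.is_feedback or not feedback.is_feedback:
            continue
        found.append(Correction(task.text, attempt.text, feedback.text, retry.text))
    return found


def format_few_shot(corrections):
    """Render corrections as plain-text examples to prepend to a later prompt."""
    blocks = [
        f"Task: {c.task}\nPitfall: {c.feedback}\nAnswer: {c.fixed}"
        for c in corrections
    ]
    return "\n\n".join(blocks)
```

The same `Correction` records could in principle be turned into preference pairs (`mistake` vs. `fixed`) for fine-tuning, but whether that data "works well" is exactly the open question.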