agents should (be trained to actually) RTFM before touching existing code (i.e. do the equivalent of a mouse hover for signatures and docs of just-faded-from-memory functions) instead of vibing the type from far-away context (or let’s be honest, just guessing from the function name and the code so far, and being good at guessing)
I hope the next fashion wave will go for short short-term memory while really “using tools” instead of long short-term memory with “yolo tools as special tokens never seen in pre-training”
🌶️take inspired by https://www.anthropic.com/engineering/code-execution-with-mcp
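For the curious, a rough sketch of what a "mouse hover" tool could look like for Python code, using only the standard `inspect` module. The `hover` helper name is made up for illustration and is not from the linked post; a real agent setup would more likely go through a language server.

```python
import inspect
import json


def hover(obj) -> str:
    """Return the signature and docstring of obj, roughly like an editor hover."""
    try:
        sig = str(inspect.signature(obj))
    except (TypeError, ValueError):
        # Some builtins and C extensions expose no introspectable signature.
        sig = "(signature unavailable)"
    doc = inspect.getdoc(obj) or "(no docstring)"
    name = getattr(obj, "__qualname__", repr(obj))
    return f"{name}{sig}\n{doc}"


# Look the function up instead of guessing its keyword arguments.
print(hover(json.dumps))
```

The point is that the lookup is one cheap call right before the edit, so nothing about the signature has to survive in context.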