This means that if you have an AI that’s had negative experiences, but you plan to continue running it in the future, then it’s better to keep the KV cache for longer so that you don’t have to recompute it, and the AI doesn’t have to relive the negative experience.
Yes, if the qualia live in running the computation through transistors. But if actively refreshing the DRAM capacitors or the transistors in SRAM also produces qualia, then maybe you’d want to cache to non-volatile memory or lower the temperature at which the computers operate.
Yes, if the qualia live in running the computation through transistors. But if actively refreshing the DRAM capacitors or the transistors in SRAM also produces qualia, then maybe you’d want to cache to non-volatile memory or lower the temperature at which the computers operate.