Adam Karvonen comments on Sam Marks’s Shortform

Adam Karvonen 2 Jul 2025 6:39 UTC
4 points
This could also be influenced or exacerbated by the fact that DeepSeek R1 was trained in FP8 precision, so quantizing it may partly be reverting the model to its original behavior.
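To make the intuition concrete, here is a toy sketch, not a claim about how R1 was actually trained or served. It assumes a plain E4M3-style FP8 grid with a saturating cast and no scaling factors (real FP8 schemes typically also use scales, which I'm ignoring), and `round_to_e4m3` is a hypothetical helper I wrote for illustration. The idea: if weights started on the FP8 grid and later drifted slightly in higher precision, rounding back to FP8 can snap them to the original values.

```python
import math

def round_to_e4m3(x: float) -> float:
    """Round x to the nearest value on a simulated E4M3 grid.

    Assumptions (illustrative only): 3 mantissa bits, minimum normal
    exponent -6, maximum finite value 448, saturating on overflow.
    """
    if x == 0.0 or math.isnan(x):
        return x
    sign = -1.0 if x < 0 else 1.0
    a = min(abs(x), 448.0)                 # saturate at the largest finite value
    # frexp gives a = m * 2**ex with m in [0.5, 1), so the binade exponent is ex - 1.
    e = max(math.frexp(a)[1] - 1, -6)      # clamp to the subnormal range
    step = 2.0 ** (e - 3)                  # grid spacing with 3 mantissa bits
    return sign * min(round(a / step) * step, 448.0)

# Hypothetical weights that already sit exactly on the FP8 grid.
original = [0.3125, -1.0, 448.0, 0.015625]

# Pretend later higher-precision processing nudged each weight by 1%.
drifted = [w * 1.01 for w in original]

# Quantizing back to FP8 rounds the small drift away.
requantized = [round_to_e4m3(w) for w in drifted]
```

In this toy case `requantized` equals `original` exactly, because each 1% nudge is smaller than half a grid step. Larger drifts would land on a neighboring grid point instead, which is why I'd only expect a partial reversion.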