Gunnar_Zarncke comments on What’s the short timeline plan?

Gunnar_Zarncke 15 Jan 2025 20:49 UTC
2 points
0
I think the single most important point is: Keep a paradigm with human-legible CoT. Most other points are downstream of that. If it is legible, it is possible and more likely to notice that it is not faithful and to build monitoring on top. It might be the single simple You Get About Five Words thing that might make it into regulation.
- Gunnar_Zarncke 21 Jan 2025 9:47 UTC
  2 points
  0
  Parent
  I just saw a method to make more parts of the model human-legible, addressing the main concern.