Wheeee… Feel the trendline… The winds of progress blowing through our hair…
I’ve been comparing r1 to r1-zero and v3. r1 is just way more creative feeling. Like, something really gelled there.
@janus has been exploring hypotheses around steganography in the reasoning traces. I think we should work on ways to actively mitigate it. For example: https://www.lesswrong.com/posts/uPi2YppTEnzKG3nXD/nathan-helm-burger-s-shortform?commentId=Epa9fduKA3DHCPbx7