Sparks of RSI?

Are your long-running agents self-improving in loops with minimal prompting? Mine sure are!

I think we’re seeing the first sparks of RSI here, folks. I’m expecting the frontier labs to scramble furiously to push this forward, finding and patching the meta-failure-modes. Thus, I expect next versions to be even better at this.

Here’s what some other people are saying/​claiming:

https://​​x.com/​​shreyasnsharma/​​status/​​2032567729560105117

https://​​x.com/​​varun_mathur/​​status/​​2032671842230501729

https://​​x.com/​​TuXinming/​​status/​​2032478765033701835

https://​​x.com/​​andrewwhite01/​​status/​​2031761577943425475

https://​​x.com/​​aramh/​​status/​​2029553870502756706

https://​​x.com/​​polynoamial/​​status/​​2029622090152956335

https://​​t.co/​​znsJlcww5r

And many more. This is just a few examples. Not super impressive so far, but if this “task” goes the way many others have of first showing signs of progress in the 1-3% accuracy range, then rapidly shooting upwards over the next couple of model versions.… Yeah.

Basically, I think we’re in crunch time. Automated alignment time is here. Get cracking.