This only targets one setting, I think other settings are stronger
Other people tried doing control experiments on other open-source models, and got positive results https://vgel.me/posts/qwen-introspection/
This only targets one setting, I think other settings are stronger
Other people tried doing control experiments on other open-source models, and got positive results https://vgel.me/posts/qwen-introspection/