I see that as being related to current AIs not being particularly agentic. I agree in the short run, but in the long run there’s a lot of pressure to make AIs more agentic and some of those dangerous instructions will be pointed in the direction of increased agency too.
I see that as being related to current AIs not being particularly agentic. I agree in the short run, but in the long run there’s a lot of pressure to make AIs more agentic and some of those dangerous instructions will be pointed in the direction of increased agency too.