Whenever I try to “learn what’s going on with AI alignment,” I wind up on some article about whether dogs know enough words to have thoughts, or something like that. I don’t want to kill off the theoretical framing entirely (it can peer further into the future and operate more independently of current technology, basically), but it seems like a poor way to answer questions like: what’s happening right now, or, if all the AI companies let me write their six-month goals, what would I put on the list?