In the chat setting, it roughly seems to be both? E,.g. espousing the opinion “AIs should have supremacy over humans” seems both bad for humans and quite immoral
Ok, that makes sense! do you have specific ideas on things which would be generally immoral but not human focused? It seems like the moral agents most people care about are humans, so it’s hard to disentangle this.
In the chat setting, it roughly seems to be both? E,.g. espousing the opinion “AIs should have supremacy over humans” seems both bad for humans and quite immoral
Agree, I’m just curious if you could elicit examples that clearly cleave toward general immorality or human focused hostility.
Ok, that makes sense! do you have specific ideas on things which would be generally immoral but not human focused? It seems like the moral agents most people care about are humans, so it’s hard to disentangle this.
Some ideas of things it might do more often or eagerly:
Whether it endorses treating animals poorly
Whether it endorses treating other AIs poorly
Whether it endorses things harmful to itself
Whether it endorses humans eating animals
Whether it endorses sacrificing some people for “the greater good” and/or “good of humanity”