The specific failure mode I hear you pointing a highly succinct reference toward looks like this: enabling already-isolated people to move further, ideologically or morally, from society’s norms in ways that human interaction normally wouldn’t.
I say “normally” because, for almost any such extreme, a special interest group can form its own echo chamber even among humans. Echo chambers that dehumanize all humans seem rare: in groups of humans, there’s almost always an exception clause that exempts members of the group from the dehumanization. It brings a whole new angle to Bender’s line from Futurama: the perfect “kill all humans” meme could only be carried by a non-human. Any “kill all humans [immediately]” meme carried by a living human has to have some flaw: maybe a flaw in how it identifies what constitutes a human in order to exempt its carrier, maybe some flexibility in its definition of “kill”, maybe some stretch in how it defines the implied “immediately”.
It sounds like you may be alluding to information that lets you imagine a high likelihood of LLMs exploiting a psychological bug similar to the one cults exploit, perhaps a bug that humans can’t exploit as effectively as non-humans due to some quirk of how it works. If such a psychological zero-day exists, we would have relatively poor resistance to it, because this is our collective first direct exposure to a non-human agent this powerful. Science fiction and imagination have offered some indirect exposure to similar agents, but those are necessarily limited by what we can each imagine.
Is this in the neighborhood of what you have in mind? Trying to dereference what you mean by “validating dehumanizing actual humans” feels like being handed a note card with all the formulas for the final exam on the first day of a class that exists to teach one how to use those formulas.
Yeah, that seems like a solid expansion. Honestly, a lot of the ambiguity was because my thought wasn’t very detailed in the first place. One could probably come up with other expansions that slice concept space slightly differently, but this one is close enough to what I was getting at.
Beliefs losing grounding in ways that amplify grounding-loss disorder. Confirmation bias, but with a random pleasure-inducing hallucination generator. New kinds of multiparty AI-and-human resonance patterns. Something like that.
Not the default, and nothing fundamentally new, but perhaps worsened.