What does it matter? We’d ignore whatever the AI says, just like anosognosics ignore “your arm is paralyzed”.
This is a little tricky, I’ll admit, but if we could just ignore whatever the AI says (which is something in a different modality from whatever it is we’re ignoring), then doesn’t that defeat the whole thought experiment? Because you could just as well ignore the anosognosic module that you, in a fit of absence of mind, wrote into your AI and subsequently ignored in all your reviews.
(Yes, a module full of code like that would look absolutely nothing like what was being censored, but it’s not like the statement ‘90% of SIDS cases are actually irritated mothers murdering their kids’ looks anything like an irritated mother murdering her child either.)