I think it’s a big stretch to say that deception is basically just trojans. There are similarities, but the regularities that make deception a natural category of behavior that we might be able to detect are importantly fuzzier than the regularities that trojan-detecting strategies use. If “deception” just meant acting according to a wildly different distribution when certain cues were detected, trojan-detection would have us covered, but what counts as “deception” depends more heavily on our standards for the reasoning process, and doean’t reliably result in behavior that’s way different than non-deceptive behavior.
Could you give an example of a case of deception that is quite unlike a trojan? Maybe we have different definitions. Maybe I’m not accounting for something. Either way, it seems useful to figure out the disagreement.
I think it’s a big stretch to say that deception is basically just trojans. There are similarities, but the regularities that make deception a natural category of behavior that we might be able to detect are importantly fuzzier than the regularities that trojan-detecting strategies use. If “deception” just meant acting according to a wildly different distribution when certain cues were detected, trojan-detection would have us covered, but what counts as “deception” depends more heavily on our standards for the reasoning process, and doean’t reliably result in behavior that’s way different than non-deceptive behavior.
Thanks. See also EIS VIII.
Could you give an example of a case of deception that is quite unlike a trojan? Maybe we have different definitions. Maybe I’m not accounting for something. Either way, it seems useful to figure out the disagreement.
I’m slowly making my way through these, so I’ll leave you a more complete comment after I read post 8.