Get me to adopt a solution wrong in a particular direction, like a design that hands the universe over to them? I can maybe figure out the first time through who’s out to get me, if it’s 200 Yudkowsky-years. If it’s 200,000 Yudkowsky-years I think I’m just screwed.
Get me to make any lethal mistake at all? I don’t think I can get to 90% confidence period, or at least, not without spending an amount of Yudkowsky-time equivalent to the untrustworthy source.
Depends what the evil clones are trying to do.
Get me to adopt a solution wrong in a particular direction, like a design that hands the universe over to them? I can maybe figure out the first time through who’s out to get me, if it’s 200 Yudkowsky-years. If it’s 200,000 Yudkowsky-years I think I’m just screwed.
Get me to make any lethal mistake at all? I don’t think I can get to 90% confidence period, or at least, not without spending an amount of Yudkowsky-time equivalent to the untrustworthy source.