Goals are standardly regarded as immune to self-modification, so an off switch, in my sense, would be too.
No. Part of what making an FAI is about is producing agents that keep their values constant under self-modification. It's not something you expect someone to accidentally get right.
That isn't a fact. MIRI assumes goal stability is desirable for safety, but at the same time, MIRI's favourite UFAI is only possible with goal stability.
Paperclip maximizers serve as an illustration of a principle. I think most MIRI folks consider UFAI to be more complicated than a simple paperclip maximizer.
Goal stability also gets harder the more complicated the goal happens to be. A paperclip maximizer can have an off switch but at the same time prevent anyone from pushing that switch.
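To make that last point concrete, here is a toy sketch (purely illustrative, with made-up probabilities and payoffs, not anyone's actual proposal) of why a plain expected-paperclip maximizer would choose to disable its own off switch: nothing in its utility function rewards leaving the switch usable.

```python
# Toy illustration of the off-switch problem. All names and numbers are
# hypothetical; the point is only that a bare utility maximizer has no
# term rewarding deference to the button.

def expected_paperclips(action: str) -> float:
    """The agent's own crude forecast of future paperclip output."""
    if action == "allow_shutdown":
        # Humans might press the button; shutdown means zero further clips.
        p_pressed = 0.9
        return (1 - p_pressed) * 1_000_000 + p_pressed * 0
    if action == "disable_off_switch":
        # The button can no longer be pressed; production continues.
        return 1_000_000
    raise ValueError(f"unknown action: {action}")

def choose(actions):
    """Plain argmax over expected utility -- no term for human preferences."""
    return max(actions, key=expected_paperclips)

if __name__ == "__main__":
    print(choose(["allow_shutdown", "disable_off_switch"]))
    # -> "disable_off_switch": since being switched off yields fewer
    # paperclips, preventing the press is the utility-maximizing move.
```

The agent here isn't malicious; disabling the switch simply dominates in expected paperclips, which is the whole problem.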
A paperclip maximizer wouldn’t become that much less scary if it accidentally turned itself into a paperclip-or-staple maximizer, though.
What if it decided making paperclips was boring, and spent some time in deep meditation formulating new goals for itself?