In order to get that provably friendly thing to work
Again, I think “provably friendly thing” mischaracterizes what MIRI thinks will be possible.
I’m not sure exactly what you’re saying in the rest of your comment. Have you read the section on indirect normativity in Superintelligence? I’d start there.
Given the apparent misconceptions about MIRI’s work even among LWers, it seems like you need to write a Main post clarifying what MIRI does and does not claim, and does and does not work on.
Again, I think “provably friendly thing” mischaracterizes what MIRI thinks will be possible.
From what I can gather, there’s still supposed to be some kind of proof, even if it’s just the mathematical kind where you’re not fully certain because there might be an error in it. The intent is to have some sort of program that maximizes a utility function U, where U is explicitly written as something along the lines of “do what I mean”.
Have you read the section on indirect normativity in Superintelligence? I’d start there.
I’m not sure what you’re referring to. Can you give me a link?
Typo?
Fixed.
Superintelligence is a recent book by Nick Bostrom.