the entire deep learning paradigm is itself unsafe
yudkowsky is such a goofball about deep learning. a thing I believe: the strongest version of alignment, where there is no step during the training process that ever produces any amount of misaligned cognition whatsoever, if it’s possible to do at all, is possible to do with deep learning. I also think it’s not significantly harder to do with deep learning than some other way. And I think it’s possible to do at all. Justification post pending me convincing myself to write a bad post rather than no post, and/or someone asking me questions that make me write down things that clarify this. if someone wanted to grill me in an lw dialogue I’d be down.
I don’t know enough about the subject matter to grill you in detail, but I’d certainly love to see a post about this. (Or even a long comment.) The obvious big questions are “why do you believe that” but also “how can you possibly know that”—after all, who knows what AI-related techniques and technologies remain undiscovered? Surely you can’t know whether some of them make it easier to produce aligned AIs than deep learning…?