Question from a total layman: have you tried to change activations of some neurons to prevent them from reaching certain region of activation space and see what happens? I don’t fully understand the underlying math but it seems to me like it should “censor” some possible outputs of NN.
Question from a total layman: have you tried to change activations of some neurons to prevent them from reaching certain region of activation space and see what happens? I don’t fully understand the underlying math but it seems to me like it should “censor” some possible outputs of NN.