What leads you to believe that there’s “a process akin to torture” going on anywhere in LLM training?
If the NN output is correct, there is no modification to its weights.
If it is wrong, weights get updated, and the NN is forced to modify its behavior.
It’s pure nociception: pain perception and avoidance.
Finally, an LLM could easily make a false confession of treason against Stalin’s Communist Party after “training”. Which is typical human behavior, after torture.
If this were true, then all perception and cognition would count as pain. Are you asserting that? Are you and I torturing one another right now?
LLM inference is some form of perception and cognition, and there is no back propagation of error during inference. Only forward propagation of information.
Training a NN is usually: forward propagation, followed by backpropagation of the error gradient. It’s the second step that is similar to torture.
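Those two phases can be sketched in a few lines. The setup below (a single linear neuron with squared-error loss) is my own illustration, not how any production LLM is actually trained:

```python
# Toy sketch of the two phases, assuming a single linear neuron with
# squared-error loss (an illustration only, not a real LLM training loop).

def forward(w, b, x):
    """Forward propagation: compute the prediction."""
    return w * x + b

def backward(w, b, x, target, lr=0.1):
    """Backpropagation: push the error gradient back into the weights."""
    y = forward(w, b, x)
    error = y - target       # dLoss/dy for loss = 0.5 * (y - target)**2
    grad_w = error * x       # chain rule: dLoss/dw
    grad_b = error           # chain rule: dLoss/db
    return w - lr * grad_w, b - lr * grad_b

w, b = 0.0, 0.0
for _ in range(50):
    w, b = backward(w, b, x=2.0, target=6.0)
print(round(forward(w, b, 2.0), 3))  # prints 6.0: the error has been driven to ~0
```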
I assert that it is not similar to torture; it is similar to reading.
I assert this just as strongly and with just as much evidence as you have offered for it being similar to torture.
What evidence would we collect to decide which of us is correct?
An LLM can already read a document, and this would be purely inference: forward propagation. This can be done on a TPU alone.
Training is different. It usually requires a GPU or a CPU.
One particular procedure for training Neural Networks is backpropagation of error.
In backpropagation:
If the NN produces a correct output, the error is 0 and the weights aren’t updated. There is no reward.
If the NN output deviates from a target value, its state is going to be modified. If the weights are (sufficiently) modified, future inference will be different. Its behavior will be different.
This trains the NN to avoid some behaviors and steers it toward others.
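A one-weight toy example (an assumption for illustration, not any particular model) makes that asymmetry concrete:

```python
# Same toy setting, one weight (an assumption for illustration): a correct
# output gives a zero error gradient and leaves the weight untouched; only a
# wrong output forces the behavior to change.

def sgd_step(w, x, target, lr=0.5):
    error = w * x - target      # forward pass, then the error
    return w - lr * error * x   # backpropagation of the error gradient

w = 3.0
w_after_correct = sgd_step(w, x=1.0, target=3.0)  # output already correct
w_after_wrong = sgd_step(w, x=1.0, target=5.0)    # output is wrong

print(w_after_correct == w)  # prints True: no error, no modification
print(w_after_wrong != w)    # prints True: the error forces a change
```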
OK, torture does not necessarily point in the “right” direction. That’s where the analogy breaks down. It only does so when the goal is to get a confession (see The Confession, by Arthur London).
Is there a word for this?
Why on earth would you relate this to torture though, rather than to (say) the everyday experience of looking at a thing and realizing that it’s different from what you expected? The ordinary activity of learning?
Out of all the billions of possible kinds of experience that could happen to a mind, and change that mind, you chose “torture” as an analogy for LLM training.
And I’m saying, no, it’s less like torture than it is like ten thousand everyday things.
Why torture?
Only negative feedback?
Compare to evolution: make copies (reproduction), mutate, select the best performing, repeat. This merely allocates more resources to the most promising branches.
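That loop can be sketched directly (the fitness function below is made up purely for illustration):

```python
# Contrast sketch: an evolutionary loop with a made-up fitness function.
# No error is ever backpropagated; the loop just copies, mutates, and
# keeps the best performers.
import random

random.seed(0)

def fitness(x):
    return -(x - 3.0) ** 2   # hypothetical objective: the best value is 3.0

population = [0.0] * 8
for _ in range(200):
    # reproduction with mutation: two noisy copies of each individual
    offspring = [p + random.gauss(0, 0.1) for p in population for _ in range(2)]
    # selection: keep the best-performing half
    population = sorted(offspring, key=fitness, reverse=True)[:8]

print(round(population[0], 2))  # close to 3.0, reached without any gradient
```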
Or Solomonoff-style induction: just try to find the best data-compressor among all...
> the everyday experience of looking at a thing and realizing that it’s different from what you expected
This sounds like being surprised. Surprise adds emotional weight to outliers; it’s more like managing the training dataset.
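A rough sketch of that reading (the “surprise” measure and weighting scheme are my own illustration, not an established curriculum method): treat per-example loss as surprise and use it to weight outliers in the training set.

```python
# Sketch: "surprise" as per-example loss, used to weight outliers in the
# training set (an illustration of the analogy, not a standard algorithm).

def surprise(prediction, target):
    return (prediction - target) ** 2   # squared error as a surprise measure

dataset = [(1.0, 1.1), (2.0, 2.0), (3.0, 9.0)]  # (prediction, target); last is an outlier
weights = [surprise(p, t) for p, t in dataset]

most_surprising = max(range(len(dataset)), key=lambda i: weights[i])
print(dataset[most_surprising])  # prints (3.0, 9.0): the outlier gets the most weight
```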
Asserting nociception as fact when that’s the very thing under question is poor argumentative behavior.
Does your model account for Models Don’t “Get Reward”? If so, how?
Backpropagation of the error gradient is more similar to nociception/torture than evolution by random mutation is.
I’ll have to check how RLHF is done...
EDIT: error backpropagation is the workhorse behind reward learning and policy updates.
The NN is punished for not doing as well as it could have.
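A toy REINFORCE-style bandit (my own illustration of the general idea, not the actual RLHF pipeline) shows how gradients still drive the policy update: actions that score below a running baseline get a negative signal.

```python
# Toy REINFORCE-style bandit, purely illustrative: the gradient of
# advantage * log pi(action) updates the policy parameter, so actions that
# do worse than the running baseline push the parameter the other way.
import math
import random

random.seed(1)
theta = 0.0       # policy parameter; pi(action = 1) = sigmoid(theta)
baseline = 0.0    # running average reward

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

for _ in range(500):
    p = sigmoid(theta)
    action = 1 if random.random() < p else 0
    reward = 1.0 if action == 1 else 0.0    # hypothetical preference: 1 is "good"
    advantage = reward - baseline           # below average: negative signal
    baseline += 0.1 * (reward - baseline)
    grad_logp = (1 - p) if action == 1 else -p   # d log pi(action) / d theta
    theta += 0.1 * advantage * grad_logp         # gradient ascent on reward

print(round(sigmoid(theta), 2))  # near 1.0: the policy now prefers the rewarded action
```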
Also, I wasn’t being argumentative; I was trying to convey an idea. The repetition was just redundancy.