Hastings comments on testingthewaters’s Shortform

Hastings 31 Oct 2025 13:13 UTC
3 points
−2
There has been high-quality research finding ways that some models are biased against white people, and high-quality research finding ways that models are biased against non-white people. Generally, the pattern is that base models and early post-trained models like GPT-3.5 are traditionally racist, and post-trained models are often woke, sometimes in spectacular “only pictures of black Nazis” ways. I’ve personally validated that some of these replicate, from how davinci-002 would always pick the white-sounding resume to how Claude 4.5 would, if prodded, save 1 Muslim over 10 Christians.

LessWrong is very white and very human, and so it’s not that surprising, but a little sad, that it has pivoted hard from sarcastically dismissive to very interested in model bias as the second dynamic emerged.
- testingthewaters 31 Oct 2025 13:39 UTC
  3 points
  1
  For what it’s worth, I’m not white and I come primarily from an AI ethics background; my formal training is in the humanities. I do think it’s sad that people only fret about bias the moment it affects them, however, and I would rather the issue be taken seriously from the start.
  - Hastings 31 Oct 2025 18:30 UTC
    3 points
    0
    Thanks for the reply! Sorry that my original comment was a little too bitter.
    - testingthewaters 31 Oct 2025 18:41 UTC
      3 points
      0
      No worries at all, I know I’ve had my fair share of bitter moments around AI as well. I hope you have a nice rest of your day :)