johnswentworth comments on johnswentworth’s Shortform

johnswentworth 21 Apr 2026 1:13 UTC
32 points
2
But they were, all of them, deceived, for another csv file was made. In the subfolder of a subfolder, the LLM hardcoded a bunch of data, and never wrote code to pull the actual values. And into this csv file the LLM poured vaguely reasonable-sounding numbers, which unfortunately did not match the real world. One subtle csv file of made-up data which invalidated all of the complicated calculations and projections.
- Roman Malov 21 Apr 2026 16:33 UTC
  1 point
  0
  This is a paragraph from the description of a future where AI companies try to solve alignment by automating it with LLM agents. Did I guess correctly?
  - johnswentworth 21 Apr 2026 16:48 UTC
    9 points
    0
    I mean, it easily could be; that would not be a huge surprise. But it was originally generated as a stylized description of actual experiences I’ve had with Claude. For instance, a week ago I asked it for data on the biggest ETFs by trade volume, and it just hardcoded some numbers into a file without actually looking anything up.
    - Linch 21 Apr 2026 20:35 UTC
      3 points
      0
      I’ve had similar experiences.