When you say ‘that weakness’, you mean the inability to identify a subtask as alignment-related?
Mainly the “bad actors can split their work...” point with current LLMs, but yeah, also identifying/guessing the overall intentions of the humans giving the subtasks.