Kaj_Sotala comments on Surprising LLM reasoning failures make me think we still need qualitative breakthroughs for AGI

Kaj_Sotala 16 Apr 2025 6:25 UTC
13 points
0
Note that Claude and o1 preview weren’t multimodal, so were weak at spatial puzzles. If this was full o1, I’m surprised.
I just tried the sliding puzzle with o1 and it got it right! Though multimodality may not have been relevant, since it solved it by writing a breadth-first search algorithm and running it.
- Seth Herd 16 Apr 2025 6:32 UTC
  5 points
  1
  Parent
  Interesting! Nonetheless, I agree with your opening statement that LLMs learning to do any of these things individually doesn’t address the larger point that the have important cognitive gaps and fail.to generalize in ways that humans can.