Lukas Finnveden comments on ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks

Lukas Finnveden 23 Aug 2023 6:12 UTC
2 points
0
This is interesting — would it be easy to share the transcript of the conversation? (If it’s too long for a lesswrong comment, you could e.g. copy-paste it into a google doc and link-share it.)
- patrickleask 1 Sep 2023 11:54 UTC
  5 points
  0
  Parent
  Here you go: https://chat.openai.com/share/c5df0119-13de-43f9-8d4e-1c437bafa8ec
  - Lukas Finnveden 2 Sep 2023 1:32 UTC
    2 points
    0
    Parent
    Thanks!