Dentosal comments on Dentosal’s Shortform

Dentosal 28 Apr 2026 10:10 UTC
2 points
−2
AI eval idea: metabench. Have each LLM autonomously design and build a benchmark. Then run every benchmark against all participants and sum each model's scores. Compare the resulting ranking with external benchmarks too.
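A minimal sketch of the scoring loop, to make the idea concrete. Everything here is hypothetical: `run_metabench`, `build_benchmark`, the model names, and the toy score table are illustration only, and the hard part (an LLM actually designing and building a benchmark) is replaced by a lookup stub.

```python
from typing import Callable, Dict, List, Tuple

# A benchmark takes a model name and returns that model's score on it.
Benchmark = Callable[[str], float]


def run_metabench(
    models: List[str],
    build_benchmark: Callable[[str], Benchmark],
) -> Tuple[Dict[str, Dict[str, float]], Dict[str, float]]:
    """Cross-evaluate: each model authors a benchmark, every model takes all of them."""
    # Step 1: each participant authors one benchmark.
    benchmarks = {author: build_benchmark(author) for author in models}
    # Step 2: run every benchmark on every participant (including its own author).
    scores = {
        model: {author: bench(model) for author, bench in benchmarks.items()}
        for model in models
    }
    # Step 3: sum each model's scores across all benchmarks.
    totals = {model: sum(per_author.values()) for model, per_author in scores.items()}
    return scores, totals


# Made-up scores keyed by (benchmark author, model under test).
# ASSUMPTION: a stand-in for real LLM-built benchmarks, not real data.
TOY_SCORES = {
    ("model_a", "model_a"): 3,
    ("model_a", "model_b"): 1,
    ("model_b", "model_a"): 2,
    ("model_b", "model_b"): 5,
}


def toy_build_benchmark(author: str) -> Benchmark:
    """Hypothetical stub: returns a benchmark that looks scores up in TOY_SCORES."""
    def benchmark(model: str) -> float:
        return TOY_SCORES[(author, model)]
    return benchmark


if __name__ == "__main__":
    scores, totals = run_metabench(["model_a", "model_b"], toy_build_benchmark)
    print(totals)
```

This sketch counts a model's score on its own benchmark, since the idea says "all participants". Dropping the diagonal would be one way to reduce self-favoring, but that is a design choice the post does not specify. It also sums raw scores, so benchmarks on different scales would need normalizing first.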
- David Africa 28 Apr 2026 16:50 UTC
  3 points
  0
  The name metabench is already taken!