I wonder if Google is optimizing harder for benchmarks to try to prop up its stock price against a possible deflation of an AI bubble.
It occurs to me that an AI alignment organization should create comprehensive private alignment benchmarks and start releasing the scores. They would have to be constructed in a non-traditional way so they're less vulnerable to standard Goodharting. If these benchmarks became popular with AI users and AI investors, they could be a powerful way to steer AI development in a more responsible direction. By keeping them private, you could make it harder for AI companies to optimize against the benchmarks, and nudge them toward actually solving deeper alignment issues. It would also be a powerful illustration of the point that advanced AI will need to solve unforeseen/out-of-distribution alignment challenges. @Eliezer Yudkowsky
This seems like a great idea. I strongly suggest you write it up as a shortform to get feedback, and then perhaps as a full post.