Mikhail Samin comments on Mikhail Samin’s Shortform

Mikhail Samin 20 Apr 2026 16:32 UTC
1 point
−1
It does not inspire confidence in the AI safety field that the only problem (among many that would appear at the superintelligent level) that was deliberately materialized earlier is alignment-faking, explicitly mentioned by Yudkowsky in AGI Ruin.
(Am I missing other problems that would appear at the superintelligent level that have already been demonstrated? Do we know people who can predict those problems on their own (without requiring Eliezer Yudkowsky to point them out) and then materialize them now?)
- Mateusz Bagiński 20 Apr 2026 17:09 UTC
  3 points
  0
  Parent
  Are you asking about problems that would by default only appear at the SI level that have been demonstrated at a sub-SI level via some sort of elicitation?