The entire argument for avoiding frontier labs falls apart if you admit even a 20% likelihood that frontier labs will create aligned superintelligence.
Sure, but given that none of the frontier labs seem remotely on track to align anything, superintelligence or otherwise, that’s an extraordinary claim which requires extraordinary levels of evidence.
The frontier labs have certainly succeeded at aligning their models. LLMs have achieved a level of alignment people wouldn’t have dreamed of 10 years ago.
Now labs are running into issues with the reasoning models, but those issues hardly seem insurmountable.
Contemporary AI models are not “aligned” in any sense that would help in the slightest against a superintelligence. Stronger AI capabilities demand stronger guardrails, and current “alignment” doesn’t even prevent failures like ChatGPT’s recent sycophancy or jailbreaking.