Cervera comments on Alignment remains a hard, unsolved problem

Cervera 28 Nov 2025 16:20 UTC
2 points
−2
Could we not devise AlphaFold but for LLM alignment?

Your P/NP remark reminded me of the scepticism around Protein folder before the Alpha fold days.
- TsviBT 28 Nov 2025 20:07 UTC
  6 points
  10
  Parent
  I think the skepticism about the protein folder was “we can’t make something effective because we can’t optimize enough / search hard enough”, where my skepticism about alignment is “we can’t make something aligned because we can’t aim optimization processes well enough”. Part of how we can’t aim search processes is that we don’t have easily testable proxy measurements that are bound up with alignment strongly enough. What would be the evaluation function for AlignmentFold?