harfe comments on quila’s Shortform

harfe 6 Dec 2024 13:52 UTC
1 point
0
For a provably aligned (or probably aligned) system you need a formal specification of alignment. Do you have something in mind for that? This could be a major difficulty. But maybe you only want to “prove” inner alignment and assume that you already have an outer-alignment-goal-function, in which case defining alignment is probably easier.
- quila 6 Dec 2024 14:01 UTC
  1 point
  0
  Parent
  But maybe you only want to “prove” inner alignment and assume that you already have an outer-alignment-goal-function
  correct, i’m imagining these being solved separately