leogao comments on leogao’s Shortform

leogao 4 May 2025 15:59 UTC
4 points
0
it is kind of funny that caring a lot about reflective stability of alignment proposals and paradoxes arising from self modelling (e.g in action counterfactuals) is most common in the people who are the worst at modelling themselves