berns

Karma: 17

Independent alignment researcher trying to build some philosophy in public

berns 17 Apr 2026 20:15 UTC
1 point
0
in reply to: cubefox’s comment on: Verify, but Trust
Good suggestion, thanks

Verify, but Trust

berns17 Apr 2026 3:25 UTC

8 points

berns 10 Apr 2026 5:36 UTC
1 point
0
in reply to: papetoast’s comment on: Precommitments
I’ve updated the footnote to specify that I mean commitments about our future behavior which typically, as a consequence, narrow our future options. I think that’s basically just the same as saying a commitment about future behavior that carries any weight at all. Yes, I would say that the reason FDT solves Parfit’s hitchhiker is exactly because of its greater power to precommit than e.g. CDT (which is basically unable to precommit at all.) Thinking about a value like honesty with a quantifiable dollar figure doesn’t really reflect our behavior or desires about it. Treating honesty like a numerical preference basically forces it to be nothing more than a scalar input to a calculation you’re always running. A big part of the point of the Mary/Jane example is that that’s a bad idea when it comes to accounting for values you actually care about.

berns6 Apr 2026 22:54 UTC

11 points