If anyone wants to have a voice chat with me about a topic that I’m interested in (see my recent post/comment history to get a sense), please contact me via PM.
My main “claims to fame”:
Created the first general-purpose open-source cryptography programming library (Crypto++, 1995), motivated by AI risk and what’s now called “defensive acceleration”.
Published one of the first descriptions of a cryptocurrency based on a distributed public ledger (b-money, 1998), predating Bitcoin.
Proposed UDT, combining the ideas of updatelessness, policy selection, and evaluating consequences using logical conditionals.
First to argue for pausing AI development based on the technical difficulty of ensuring AI x-safety (SL4 2004, LW 2011).
Identified current and future philosophical difficulties as core AI x-safety bottlenecks, potentially insurmountable by human researchers, and advocated for research into metaphilosophy and AI philosophical competence as possible solutions.
There’s an extensive literature in economics on optimal punishment. Does that count as utilitarians working on justice as an instrumental good?
I think we just need our terminal values to not change too much over time, so that if I ever feel like I need to rethink my plans, I’ll come up with a similar or even better plan. Is your thinking that this is impossible or infeasible for most humans, due to things like “power corrupts”? If so, I think consequentialism is still good, as it lets us manage or mitigate such value drift, e.g., if I can foresee power (or other circumstances) corrupting my values, I can take precautions like avoiding those situations?
Linking this to your other recent shortform, how could Paul have avoided other people misusing his work, except by doing better consequentialism (i.e., foreseeing this consequence and doing something ahead of time to mitigate it)? Are you not applying consequentialism in predicting the possible downside of one research/communications approach and adopting a different approach based on this prediction?