How would you describe the proto-solution to value loading that we have? We manifestly have conversational AI that can act in accordance with a value system, but how would we characterize the mechanism? Is it something like “create a general mind simulator via superintelligent autocomplete, then system-prompt it to be nice”?
How would you describe the proto-solution to value loading that we have? We manifestly have conversational AI that can act in accordance with a value system, but how would we characterize the mechanism? Is it something like “create a general mind simulator via superintelligent autocomplete, then system-prompt it to be nice”?