These days, it’s relatively easy to create a digital replica of a person.
You give the person’s writings to a top LLM and, with a clever prompt, the LLM starts thinking like the person. See, for example, our experiments on the topic.
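A minimal sketch of that step, under stated assumptions: the names `WritingSample`, `build_persona_prompt`, and `max_chars` are made up for illustration and are not from our experiments. It only assembles a prompt grounded in the person’s own texts; sending it to an actual LLM is left out, since that depends on which model you use.

```python
from dataclasses import dataclass


@dataclass
class WritingSample:
    """One piece of the person's writing (hypothetical container)."""
    title: str
    text: str


def build_persona_prompt(name: str, samples: list[WritingSample],
                         max_chars: int = 20_000) -> str:
    """Assemble a system prompt that grounds a persona in real writing samples.

    The character budget (an assumed stand-in for a context limit) applies
    to the header plus the samples; the closing instruction is added on top.
    """
    header = (
        f"You are simulating {name}. Base your reasoning style, priorities, "
        "and vocabulary on the writing samples below, not on general impressions.\n"
    )
    parts = [header]
    used = len(header)
    for sample in samples:
        block = f"\n### {sample.title}\n{sample.text.strip()}\n"
        if used + len(block) > max_chars:
            break  # stop at the first sample that would exceed the budget
        parts.append(block)
        used += len(block)
    parts.append("\nAnswer the next message as this person would.")
    return "".join(parts)


if __name__ == "__main__":
    demo = [WritingSample("Essay on forecasting", "Predictions should be checkable.")]
    print(build_persona_prompt("Alice Example", demo))
```

The point of the sketch is the contrast with a one-line prompt: the persona is defined mostly by the source texts, and the instruction around them stays short.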
Of course, it’s a long way from proper mind uploading. But even in this limited form, it could be highly useful for AI alignment research:
accelerate the research by building digital teams of hundreds of virtual alignment researchers
run smarter alignment benchmarks (e.g. a digital Yudkowsky running millions of clever tests against your new model)
explore human values and inner and outer alignment with the help of digital humans.
Why is no one doing this?
Given the short timelines and the low likelihood of an AI slowdown, massively accelerating alignment research, by orders of magnitude, may be the only way to get alignment before AGI.
I think many people experiment with creating different digital personas but with low effort, like “You are Elon Musk”.
I often ask an LLM to comment on my drafts as Yudkowsky and other well-known LWers. What such answers lack is the extreme, unique insight that the real EY often has.
The essence of human genius is missing, and this is exactly why we still don’t have AGI.
Also, for a really good EY model we may need more data about his internal thought stream and biographical details, which only he can collect. It seems that he is not interested, and even if he were, it would be time-consuming (though he writes quickly). One thousand pages of unedited thought stream might significantly improve the model.
I tried this four years ago.