Then it has a truly terrible title and I will gladly take bets at 5:1 odds that the primary context other people, including Forethought people, will link to this will be in discussion about things like the Anthropic Constitution, or other approaches centrally determined around choosing the literary character that AIs are aiming to imitate.
Yes, your reaction and the reaction of other commenters here certainly show the title is very confusing/misleading for at least some readers. I don’t know how common that reaction is. It’s not something I saw in others who commented on the draft, but we didn’t seek out “classic LW doomer” types and in hindsight we should have.
Maybe a better name for this type of work than “AI character” would be “the alignment target”, that’s a useful update
Why do you think the Anthropic Constitution and OAI model spec are just about choosing AI’s literary character? They are describing the whole alignment target, not just the “literary character” parts of it. So yes, I agree other ppl will associate this post with things like Anthropic Constitution, but disagree that means they’ve misunderstood. And I totally think LWers who think about the alignment target should be weighing in about what these docs say, not treating them as unimportant.
On the positive side, it seems like you correctly understood my most recent attempt to explain the content of this post.
I disagree—I think we just use it for the broader thing. What sentences do we use it to refer to literary character?
I’m particularly surprised that you say the post equivocates, given that you seem to understand me when I reframe the post in terms of the alignment target.