For what it’s worth, as one of the people who believes “ChatGPT is not evidence of alignment-of-the-type-that-matters”, I don’t believe “Bing Chat is evidence of misalignment-of-the-type-that-matters”.
I believe the alignment of the outward behavior of simulacra is only very tenuously related to the alignment of the underlying AI, so both things provide ~no data on that (in a similar way to how our ability or inability to control the weather is entirely unrelated to alignment).
For what it’s worth, as one of the people who believes “ChatGPT is not evidence of alignment-of-the-type-that-matters”, I don’t believe “Bing Chat is evidence of misalignment-of-the-type-that-matters”.
I believe the alignment of the outward behavior of simulacra is only very tenuously related to the alignment of the underlying AI, so both things provide ~no data on that (in a similar way to how our ability or inability to control the weather is entirely unrelated to alignment).