Could potentially be up for playing red team against you, in exchange for you playing red team against me (but if I think I could have something to contribute as red team would depend on specifics of what is proposed/discussed—e.g., I’m not familiar with technical specifics of deep learning beyond vague descriptions).
I don’t have anything prepared for red teaming at the moment—I appreciate the offer though! Can I take advantage of it in the future? (Anyone who wants to give me critical feedback on my drafts should send me a personal message!)
And yes, do feel free to send me drafts in the future if you want me to look over them. I don’t give guaranties regarding amount or speed of feedback, but it would be my intention to try to be helpful :)
Interesting comment. I feel like I recently have experienced this phenomena myself (that it’s hard to find people who can play “red team”).
Do you have any “blue team” ideas for alignment where you in particular would want someone to play “red team”?
I would be interested in having someone play “red team” here, but if someone were to do so in a non-trivial manner then it would probably be best to wait at least until I’ve completed Part 3 (which will take at least weeks, partly since I’m busy with my main job): https://www.lesswrong.com/posts/ZmZBataeY58anJRBb/agi-assisted-alignment-part-1-introduction
Could potentially be up for playing red team against you, in exchange for you playing red team against me (but if I think I could have something to contribute as red team would depend on specifics of what is proposed/discussed—e.g., I’m not familiar with technical specifics of deep learning beyond vague descriptions).
I wrote a comment on your post with feedback.
I don’t have anything prepared for red teaming at the moment—I appreciate the offer though! Can I take advantage of it in the future? (Anyone who wants to give me critical feedback on my drafts should send me a personal message!)
Thanks for the feedback!
And yes, do feel free to send me drafts in the future if you want me to look over them. I don’t give guaranties regarding amount or speed of feedback, but it would be my intention to try to be helpful :)