AI Misuse

AI misuse is the use of AI by humans in ways that harm humanity.

Distinguishing misuse is difficult and uncomfortable

lukehmiles 1 May 2023
17 points
3 comments

Managing catastrophic misuse without robust AIs

16 Jan 2024
58 points
17 comments

Adversarial Robustness Could Help Prevent Catastrophic Misuse

aogara 11 Dec 2023
30 points
18 comments

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

7 Nov 2023
36 points
2 comments

On excluding dangerous information from training

ShayBenMoshe 17 Nov 2023
23 points
5 comments

Proposal: Align Systems Earlier In Training

OneManyNone 16 May 2023
18 points
0 comments

Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*

Christopher King 16 May 2023
22 points
6 comments
