RSS

AI Misuse

TagLast edit: 1 May 2023 17:42 UTC by Raemon

AI misuse. Humans using AI in a way that harms humanity.

Ad­ver­sar­ial Ro­bust­ness Could Help Prevent Catas­trophic Misuse

aog11 Dec 2023 19:12 UTC
30 points
18 comments9 min readLW link

Manag­ing catas­trophic mi­suse with­out ro­bust AIs

16 Jan 2024 17:27 UTC
63 points
17 comments11 min readLW link

Dist­in­guish­ing mi­suse is difficult and uncomfortable

lemonhope1 May 2023 16:23 UTC
17 points
3 comments1 min readLW link

Pro­posal: Align Sys­tems Ear­lier In Training

OneManyNone16 May 2023 16:24 UTC
18 points
0 comments11 min readLW link

Misal­ign­ment or mi­suse? The AGI al­ign­ment tradeoff

Max_He-Ho20 Jun 2025 10:43 UTC
3 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Hu­man study on AI spear phish­ing campaigns

3 Jan 2025 15:11 UTC
81 points
8 comments5 min readLW link

Pro­posal: we should start refer­ring to the risk from un­al­igned AI as a type of *ac­ci­dent risk*

Christopher King16 May 2023 15:18 UTC
22 points
6 comments2 min readLW link

On ex­clud­ing dan­ger­ous in­for­ma­tion from training

ShayBenMoshe17 Nov 2023 11:14 UTC
23 points
5 comments3 min readLW link

Vi­sual Prompt In­jec­tions: Re­sults on test­ing AI spam-defense and AI vuln­er­a­bil­ity to de­cep­tive web ads.

Seon Gunness3 Jun 2025 20:10 UTC
4 points
0 comments12 min readLW link

Scal­able And Trans­fer­able Black-Box Jailbreaks For Lan­guage Models Via Per­sona Modulation

7 Nov 2023 17:59 UTC
38 points
2 comments2 min readLW link
(arxiv.org)

Tech­ni­cal Risks of (Lethal) Au­tonomous Weapons Systems

Heramb23 Oct 2024 20:41 UTC
2 points
0 comments1 min readLW link
(encodejustice.org)

How to solve the mi­suse prob­lem as­sum­ing that in 10 years the de­fault sce­nario is that AGI agents are ca­pa­ble of syn­thetiz­ing pathogens

jeremtti27 Nov 2024 21:17 UTC
6 points
0 comments9 min readLW link

Covert Mal­i­cious Finetuning

2 Jul 2024 2:41 UTC
94 points
4 comments3 min readLW link
No comments.