AI Misuse

TagLast edit: 1 May 2023 17:42 UTC by Raemon

AI misuse. Humans using AI in a way that harms humanity.

Managing catastrophic misuse without robust AIs

ryan_greenblatt and Buck

16 Jan 2024 17:27 UTC

58 points

16 comments11 min readLW link

Adversarial Robustness Could Help Prevent Catastrophic Misuse

aogara11 Dec 2023 19:12 UTC

30 points

18 comments9 min readLW link

On excluding dangerous information from training

ShayBenMoshe17 Nov 2023 11:14 UTC

23 points

5 comments3 min readLW link

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

Soroush Pour, rusheb, Quentin FEUILLADE--MONTIXI, Arush and scasper

7 Nov 2023 17:59 UTC

36 points

2 comments2 min readLW link

(arxiv.org)

Proposal: Align Systems Earlier In Training

OneManyNone16 May 2023 16:24 UTC

18 points

0 comments11 min readLW link

Proposal: we should start referring to the risk from unaligned AI as a type of accident risk

Christopher King16 May 2023 15:18 UTC

22 points

6 comments2 min readLW link

Distinguishing misuse is difficult and uncomfortable

lukehmiles1 May 2023 16:23 UTC

17 points

3 comments1 min readLW link

No comments.