Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
AI Misuse
Tag
Last edit:
1 May 2023 17:42 UTC
by
Raemon
AI misuse.
Humans using AI in a way that harms humanity.
Relevant
New
Old
Managing catastrophic misuse without robust AIs
ryan_greenblatt
and
Buck
16 Jan 2024 17:27 UTC
58
points
16
comments
11
min read
LW
link
Adversarial Robustness Could Help Prevent Catastrophic Misuse
aogara
11 Dec 2023 19:12 UTC
30
points
18
comments
9
min read
LW
link
On excluding dangerous information from training
ShayBenMoshe
17 Nov 2023 11:14 UTC
23
points
5
comments
3
min read
LW
link
Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation
Soroush Pour
,
rusheb
,
Quentin FEUILLADE--MONTIXI
,
Arush
and
scasper
7 Nov 2023 17:59 UTC
36
points
2
comments
2
min read
LW
link
(arxiv.org)
Proposal: Align Systems Earlier In Training
OneManyNone
16 May 2023 16:24 UTC
18
points
0
comments
11
min read
LW
link
Proposal: we should start referring to the risk from unaligned AI as a type of *accident risk*
Christopher King
16 May 2023 15:18 UTC
22
points
6
comments
2
min read
LW
link
Distinguishing misuse is difficult and uncomfortable
lukehmiles
1 May 2023 16:23 UTC
17
points
3
comments
1
min read
LW
link
No comments.
Back to top