RSS

AI Persuasion

TagLast edit: 30 Dec 2024 10:08 UTC by Dakara

AI Persuasion is the ability of artificial intelligence systems to influence human beliefs, decisions, and behaviors.

Risks from AI persuasion

Beth Barnes24 Dec 2021 1:48 UTC
76 points
15 comments31 min readLW link

Per­sua­sion Tools: AI takeover with­out AGI or agency?

Daniel Kokotajlo20 Nov 2020 16:54 UTC
85 points
25 comments11 min readLW link1 review

How it feels to have your mind hacked by an AI

blaked12 Jan 2023 0:33 UTC
372 points
222 comments17 min readLW link

On The Cur­rent Sta­tus Of AI Dating

Nikita Brancatisano7 Feb 2023 20:00 UTC
53 points
8 comments6 min readLW link

What 2026 looks like

Daniel Kokotajlo6 Aug 2021 16:14 UTC
588 points
163 comments16 min readLW link1 review

One True Love

Zvi9 Feb 2024 15:10 UTC
34 points
7 comments10 min readLW link
(thezvi.wordpress.com)

[Question] Bayesian Per­sua­sion?

Karthik Tadepalli28 May 2022 17:52 UTC
8 points
2 comments1 min readLW link

I at­tempted the AI Box Ex­per­i­ment (and lost)

Tuxedage21 Jan 2013 2:59 UTC
79 points
246 comments5 min readLW link

I played the AI Box Ex­per­i­ment again! (and lost both games)

Tuxedage27 Sep 2013 2:32 UTC
62 points
123 comments11 min readLW link

How To Win The AI Box Ex­per­i­ment (Some­times)

pinkgothic12 Sep 2015 12:34 UTC
56 points
21 comments22 min readLW link

AGI isn’t just a technology

Seth Herd1 Sep 2023 14:35 UTC
18 points
12 comments2 min readLW link

Take Pre­cau­tion­ary Mea­sures Against Su­per­hu­man AI Persuasion

Yitz12 Jul 2025 5:34 UTC
12 points
9 comments2 min readLW link

Anti-Su­per­per­sua­sion Interventions

23 Jul 2025 15:18 UTC
21 points
1 comment5 min readLW link

I Am Scared of Post­ing Nega­tive Takes About Bing’s AI

Yitz17 Feb 2023 20:50 UTC
63 points
28 comments1 min readLW link

Meth­ods of defense against AGI manipulation

MarkelKori28 Apr 2025 21:03 UTC
3 points
0 comments2 min readLW link

A sim­ple pre­sen­ta­tion of AI risk arguments

Seth Herd26 Apr 2023 2:19 UTC
19 points
0 comments2 min readLW link

Can Per­sua­sion Break AI Safety? Ex­plor­ing the In­ter­play Between Fine-Tun­ing, At­tacks, and Guardrails

Devina Jain4 Feb 2025 19:10 UTC
9 points
0 comments10 min readLW link

An Alter­nate His­tory of the Fu­ture, 2025-2040

Mr Beastly24 Feb 2025 5:53 UTC
5 points
5 comments10 min readLW link

Did ChatGPT just gaslight me?

TW1231 Dec 2022 5:41 UTC
124 points
45 comments9 min readLW link
(aiwatchtower.substack.com)

AI scares and chang­ing pub­lic beliefs

Seth Herd6 Apr 2023 18:51 UTC
48 points
23 comments6 min readLW link

The re­ward func­tion is already how well you ma­nipu­late humans

Kerry19 Oct 2022 1:52 UTC
20 points
9 comments2 min readLW link

Let’s use AI to harden hu­man defenses against AI manipulation

Tom Davidson17 May 2023 23:33 UTC
35 points
7 comments24 min readLW link

What is LMArena ac­tu­ally mea­sur­ing?

Baybar16 Sep 2025 21:44 UTC
11 points
0 comments5 min readLW link

What Par­a­sitic AI might tell us about LLMs Per­sua­sion Capabilities

Baybar13 Sep 2025 20:39 UTC
11 points
5 comments4 min readLW link

[Question] Or­a­cle AGI—How can it es­cape, other than se­cu­rity is­sues? (Steganog­ra­phy?)

RationalSieve25 Dec 2022 20:14 UTC
3 points
6 comments1 min readLW link
No comments.