AI Persuasion

Tag · Last edit: 26 Apr 2022 13:49 UTC by plex

AI that is highly capable of persuading people could have significant effects on humanity.

Risks from AI persuasion

Beth Barnes · 24 Dec 2021 1:48 UTC
69 points
15 comments · 31 min read · LW link

Persuasion Tools: AI takeover without AGI or agency?

Daniel Kokotajlo · 20 Nov 2020 16:54 UTC
75 points
25 comments · 11 min read · LW link · 1 review

On The Current Status Of AI Dating

Nikita Brancatisano · 7 Feb 2023 20:00 UTC
52 points
8 comments · 6 min read · LW link

How it feels to have your mind hacked by an AI

blaked · 12 Jan 2023 0:33 UTC
354 points
219 comments · 17 min read · LW link

What 2026 looks like

Daniel Kokotajlo · 6 Aug 2021 16:14 UTC
473 points
150 comments · 16 min read · LW link · 1 review

I played the AI Box Experiment again! (and lost both games)

Tuxedage · 27 Sep 2013 2:32 UTC
59 points
123 comments · 11 min read · LW link

How To Win The AI Box Experiment (Sometimes)

pinkgothic · 12 Sep 2015 12:34 UTC
55 points
21 comments · 22 min read · LW link

One True Love

Zvi · 9 Feb 2024 15:10 UTC
32 points
7 comments · 10 min read · LW link
(thezvi.wordpress.com)

AGI isn’t just a technology

Seth Herd · 1 Sep 2023 14:35 UTC
18 points
12 comments · 2 min read · LW link

[Question] Bayesian Persuasion?

Karthik Tadepalli · 28 May 2022 17:52 UTC
8 points
2 comments · 1 min read · LW link

I attempted the AI Box Experiment (and lost)

Tuxedage · 21 Jan 2013 2:59 UTC
78 points
245 comments · 5 min read · LW link

I Am Scared of Posting Negative Takes About Bing’s AI

Yitz · 17 Feb 2023 20:50 UTC
63 points
27 comments · 1 min read · LW link

Did ChatGPT just gaslight me?

ThomasW · 1 Dec 2022 5:41 UTC
123 points
45 comments · 9 min read · LW link
(aiwatchtower.substack.com)

[Question] Oracle AGI—How can it escape, other than security issues? (Steganography?)

RationalSieve · 25 Dec 2022 20:14 UTC
3 points
6 comments · 1 min read · LW link

A simple presentation of AI risk arguments

Seth Herd · 26 Apr 2023 2:19 UTC
16 points
0 comments · 2 min read · LW link

AI scares and changing public beliefs

Seth Herd · 6 Apr 2023 18:51 UTC
45 points
21 comments · 6 min read · LW link

The reward function is already how well you manipulate humans

Kerry · 19 Oct 2022 1:52 UTC
20 points
9 comments · 2 min read · LW link