RSS

Luke Bailey

Karma: 96

Research intern at CHAI

Image Hi­jacks: Ad­ver­sar­ial Images can Con­trol Gen­er­a­tive Models at Runtime

20 Sep 2023 15:23 UTC
58 points
9 comments1 min readLW link
(arxiv.org)

Ten­sor Trust: An on­line game to un­cover prompt in­jec­tion vulnerabilities

1 Sep 2023 19:31 UTC
30 points
0 comments5 min readLW link
(tensortrust.ai)

Ex­am­ples of Prompts that Make GPT-4 Out­put Falsehoods

22 Jul 2023 20:21 UTC
21 points
5 comments6 min readLW link