Computer Security & Cryptography

Tag

Security Mindset and Ordinary Paranoia

Eliezer Yudkowsky25 Nov 2017 17:53 UTC

132 points

25 comments29 min readLW link

On Destroying the World

Chris_Leong28 Sep 2020 7:38 UTC

82 points

87 comments5 min readLW link

Better Password Peppering

Yoav Ravid24 Aug 2021 10:19 UTC

6 points

9 comments3 min readLW link

Work on Security Instead of Friendliness?

Wei Dai21 Jul 2012 18:28 UTC

71 points

107 comments2 min readLW link

Access to powerful AI might make computer security radically easier

Buck8 Jun 2024 6:00 UTC

108 points

14 comments6 min readLW link

POC || GTFO culture as partial antidote to alignment wordcelism

lc15 Mar 2023 10:21 UTC

162 points

17 comments7 min readLW link 2 reviews

Entropy isn’t sufficient to measure password strength

benwr17 Jan 2022 6:41 UTC

36 points

30 comments2 min readLW link

(www.benwr.net)

Upskilling, bridge-building, research on security/cryptography and AI safety

Allison Duettmann20 Apr 2023 22:32 UTC

14 points

0 comments4 min readLW link

Donutting is bad

Jarrah16 Jun 2025 4:12 UTC

20 points

4 comments1 min readLW link

Privacy Tradeoffs

jefftk29 Dec 2022 3:40 UTC

13 points

1 comment2 min readLW link

(www.jefftk.com)

Not Getting Hacked

jefftk21 Dec 2022 21:40 UTC

40 points

14 comments7 min readLW link

(www.jefftk.com)

[Question] How much does cybersecurity reduce AI risk?

Darmani12 Jun 2022 22:13 UTC

34 points

23 comments1 min readLW link

The danger of error descriptions

gistya25 Mar 2021 20:05 UTC

−1 points

2 comments3 min readLW link

Paper review: A Cryptographic Solution to a Game Theoretic Problem

victorsintnicolaas24 Apr 2021 11:54 UTC

23 points

4 comments6 min readLW link

How Likely is Losing a Google Account?

jefftk30 Jan 2023 0:20 UTC

52 points

12 comments3 min readLW link

(www.jefftk.com)

The rise of AI in cybercrime

BobyResearcher30 Jul 2023 20:19 UTC

−15 points

1 comment2 min readLW link

(riseofAIincybercryme)

Homomorphic encryption and Bitcoin

jimrandomh19 May 2011 1:07 UTC

10 points

9 comments1 min readLW link

Frontier Model Security

Vaniver26 Jul 2023 4:48 UTC

32 points

1 comment3 min readLW link

(www.anthropic.com)

11 diceware words is enough

DanielFilan and benwr

15 Feb 2024 0:13 UTC

23 points

6 comments1 min readLW link

(threadreaderapp.com)

[Question] How good is security for LessWrong and the Alignment Forum?

Quintin Pope4 Oct 2021 22:27 UTC

20 points

4 comments1 min readLW link

gamers beware: modded Minecraft has new malware

the gears to ascension7 Jun 2023 13:49 UTC

14 points

5 comments1 min readLW link

(github.com)

Cryptographic Boxes for Unfriendly AI

paulfchristiano18 Dec 2010 8:28 UTC

78 points

162 comments5 min readLW link

Secure homes for digital people

paulfchristiano10 Oct 2021 15:50 UTC

161 points

37 comments9 min readLW link 1 review

(sideways-view.com)

Preventing model exfiltration with upload limits

ryan_greenblatt6 Feb 2024 16:29 UTC

71 points

22 comments14 min readLW link

[Question] In software engineering, what are the upper limits of Language-Based Security?

mako yass27 Dec 2020 5:50 UTC

13 points

6 comments1 min readLW link

What is Cryptographically Possible

paulfchristiano24 Dec 2010 4:58 UTC

27 points

19 comments5 min readLW link

[Question] What is good Cyber Security Advice?

Gunnar_Zarncke24 Oct 2022 23:27 UTC

30 points

12 comments2 min readLW link

Implications of the AI Security Gap

Dan Braun8 Jan 2025 8:31 UTC

46 points

0 comments9 min readLW link

How much novel security-critical infrastructure do you need during the singularity?

Buck4 Jul 2025 16:54 UTC

56 points

7 comments5 min readLW link

Reliability, Security, and AI risk: Notes from infosec textbook chapter 1

Orpheus167 Apr 2023 15:47 UTC

34 points

1 comment4 min readLW link

AXRP Episode 30 - AI Security with Jeffrey Ladish

DanielFilan1 May 2024 2:50 UTC

25 points

0 comments79 min readLW link

AI Safety is Dropping the Ball on Clown Attacks

trevor22 Oct 2023 20:09 UTC

74 points

83 comments34 min readLW link

n of m ring signatures

DanielFilan4 Dec 2023 20:00 UTC

51 points

7 comments1 min readLW link

(danielfilan.com)

Crypto-fed Computation

aaguirre13 Jun 2022 21:20 UTC

24 points

7 comments7 min readLW link

Cryptanalysis as Epistemology? (paging cryptonerds)

SilasBarta6 Apr 2011 19:06 UTC

17 points

42 comments1 min readLW link

Unikernels: No Longer an Academic Exercise

Martin Sustrik23 Oct 2018 11:40 UTC

27 points

1 comment3 min readLW link

(250bpm.com)

[Question] Hoarding Gmail-accounts in a post-CAPTCHA world?

Alexander Gietelink Oldenziel11 Mar 2023 16:08 UTC

7 points

3 comments1 min readLW link

AI Governance & Strategy: Priorities, talent gaps, & opportunities

Orpheus163 Mar 2023 18:09 UTC

56 points

2 comments4 min readLW link

Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment

elspood21 Jun 2022 23:55 UTC

369 points

42 comments7 min readLW link 1 review

5 Reasons Why Governments/Militaries Already Want AI for Information Warfare

trevor30 Oct 2023 16:30 UTC

32 points

0 comments10 min readLW link

[Question] Is there a convenient way to make “sealed” predictions?

Daniel Kokotajlo6 May 2022 23:00 UTC

32 points

20 comments1 min readLW link

Freedom and Privacy of Thought Architectures

SebastianG 20 Jul 2024 21:43 UTC

5 points

2 comments1 min readLW link

Hacker-AI – Does it already exist?

Erland Wittkotter7 Nov 2022 14:01 UTC

3 points

12 comments11 min readLW link

AI security might be helpful for AI alignment

Igor Ivanov6 Jan 2023 20:16 UTC

36 points

1 comment2 min readLW link

Biosecurity Culture, Computer Security Culture

jefftk30 Aug 2023 16:40 UTC

103 points

11 comments2 min readLW link

(www.jefftk.com)

The Unbearable Lightness of Web Vulnerabilities

aiiixiii29 May 2022 21:13 UTC

29 points

2 comments1 min readLW link

(www.theoreticalstructures.io)

On Seeing Through ‘On Seeing Through: A Unified Theory’: A Unified Theory

gwern15 Jun 2019 18:57 UTC

26 points

0 comments1 min readLW link

(www.gwern.net)

Cryptographic and auxiliary approaches relevant for AI safety

Allison Duettmann18 Apr 2023 14:18 UTC

7 points

0 comments6 min readLW link

Protecting agent boundaries

Chris Lakin25 Jan 2024 4:13 UTC

11 points

6 comments2 min readLW link

Planning to build a cryptographic box with perfect secrecy

Lysandre Terrisse31 Dec 2023 9:31 UTC

40 points

6 comments11 min readLW link

PoMP and Circumstance: Introduction

benatkin9 Dec 2024 5:54 UTC

1 point

1 comment1 min readLW link

Sealed Computation: Towards Low-Friction Proof of Locality

Paul Bricman29 Apr 2025 15:26 UTC

4 points

0 comments10 min readLW link

(noemaresearch.com)

Estimating the consequences of device detection tech

Jsevillamol8 Jul 2018 18:25 UTC

27 points

4 comments7 min readLW link

Non-Technical Preparation for Hacker-AI and Cyberwar 2.0+

Erland Wittkotter19 Dec 2022 11:42 UTC

2 points

0 comments25 min readLW link

How important is AI hacking as LLMs advance?

artkpv29 Jan 2024 18:41 UTC

1 point

0 comments6 min readLW link

Improved Security to Prevent Hacker-AI and Digital Ghosts

Erland Wittkotter21 Oct 2022 10:11 UTC

4 points

3 comments12 min readLW link

[Question] Transferring credence without transferring evidence?

Kaarel4 Feb 2022 8:11 UTC

11 points

6 comments3 min readLW link

Latacora might be of interest to some AI Safety organizations

NunoSempere25 Nov 2021 23:57 UTC

14 points

10 comments1 min readLW link

(www.latacora.com)

Security Mindset and the Logistic Success Curve

Eliezer Yudkowsky26 Nov 2017 15:58 UTC

106 points

49 comments20 min readLW link

Using an LLM perplexity filter to detect weight exfiltration

Adam Karvonen21 Jul 2024 18:18 UTC

25 points

11 comments2 min readLW link

Navigating the Attackspace

Jonas Kgomo12 Dec 2023 13:59 UTC

1 point

0 comments2 min readLW link

The 2/3 rule for multi-factor authentication

RomanHauksson4 Feb 2023 2:57 UTC

4 points

0 comments1 min readLW link

(roman.computer)

Can startups be impactful in AI safety?

Esben Kran and Archana Vaidheeswaran

13 Sep 2024 19:00 UTC

15 points

0 comments6 min readLW link

Proof of posteriority: a defense against AI-generated misinformation

jchan17 Jul 2023 12:04 UTC

33 points

3 comments5 min readLW link

The Pragmatic Side of Cryptographically Boxing AI

Bart Jaworski6 Aug 2024 17:46 UTC

6 points

0 comments9 min readLW link

Using GPT-Eliezer against ChatGPT Jailbreaking

Stuart_Armstrong and rgorman

6 Dec 2022 19:54 UTC

170 points

85 comments9 min readLW link

Can Large Language Models effectively identify cybersecurity risks?

emile delcourt30 Aug 2024 20:20 UTC

18 points

0 comments11 min readLW link

GPT-4o Guardrails Gone: Data Poisoning & Jailbreak-Tuning

ChengCheng, Brendan Murphy, AdamGleave and Kellin Pelrine

1 Nov 2024 0:10 UTC

18 points

0 comments6 min readLW link

(far.ai)

Why do we post our AI safety plans on the Internet?

Peter S. Park3 Nov 2022 16:02 UTC

4 points

4 comments11 min readLW link

Defense Against The Super-Worms

viemccoy20 Mar 2025 7:24 UTC

24 points

1 comment2 min readLW link

Apply to the Conceptual Boundaries Workshop for AI Safety

Chris Lakin27 Nov 2023 21:04 UTC

50 points

0 comments3 min readLW link

Optimal User-End Internet Security (Or, Rational Internet Browsing)

Arkanj3l9 Sep 2011 18:23 UTC

2 points

23 comments1 min readLW link

How security and cryptography can aid AI safety [sequence]

Allison Duettmann1 Apr 2023 16:28 UTC

24 points

0 comments1 min readLW link

Secret Collusion: Will We Know When to Unplug AI?

schroederdewitt, srm, MikhailB, Lewis Hammond, chansmi and sofmonk

16 Sep 2024 16:07 UTC

65 points

8 comments31 min readLW link

Disproving and partially fixing a fully homomorphic encryption scheme with perfect secrecy

Lysandre Terrisse26 May 2024 14:56 UTC

16 points

1 comment18 min readLW link

Computer security story

Perplexed17 Feb 2011 0:26 UTC

5 points

30 comments1 min readLW link

Scrying for outcomes where the problem of deepfakes has been solved

mako yass15 Apr 2019 4:45 UTC

26 points

11 comments4 min readLW link

Hacker-AI and Digital Ghosts – Pre-AGI

Erland Wittkotter19 Oct 2022 15:33 UTC

9 points

7 comments8 min readLW link

Microsoft and Google using LLMs for Cybersecurity

Phosphorous18 May 2023 17:42 UTC

6 points

0 comments5 min readLW link

Hacker-AI and Cyberwar 2.0+

Erland Wittkotter19 Dec 2022 11:46 UTC

2 points

0 comments15 min readLW link

Safe Development of Hacker-AI Countermeasures – What if we are too late?

Erland Wittkotter1 Dec 2022 7:59 UTC

3 points

0 comments14 min readLW link

Transformer Architecture Choice for Resisting Prompt Injection and Jail-Breaking Attacks

RogerDearnaley21 May 2023 8:29 UTC

9 points

1 comment4 min readLW link

Consider attending the AI Security Forum ’24, a 1-day pre-DEFCON event

Charlie Rogers-Smith12 Jul 2024 23:01 UTC

21 points

0 comments1 min readLW link

[Question] How much should e-signatures have to cost a country?

FlorianH21 Nov 2023 22:45 UTC

5 points

5 comments1 min readLW link

Incorporating Mechanism Design Into Decision Theory

StrivingForLegibility26 Jan 2024 18:25 UTC

17 points

4 comments4 min readLW link

Hunting for AI Hackers: LLM Agent Honeypot

Reworr R and jacobhaimes

12 Feb 2025 20:29 UTC

35 points

0 comments5 min readLW link

(www.apartresearch.com)

Fixing Insider Threats in the AI Supply Chain

Madhav Malhotra7 Oct 2023 13:19 UTC

20 points

2 comments5 min readLW link

For the Greatest Minds in AI, Cryptography, Medicine, Engineering, Physics and Most of All—New Post-Quantum Mathematical Technology

Thomas Wolf14 Jul 2025 20:22 UTC

1 point

0 comments2 min readLW link

Boundaries-based security and AI safety approaches

Allison Duettmann12 Apr 2023 12:36 UTC

43 points

2 comments6 min readLW link

Neural program synthesis is a dangerous technology

syllogism12 Jan 2018 16:19 UTC

10 points

6 comments2 min readLW link

Predicting AI Releases Through Side Channels

Reworr R7 Jan 2025 19:06 UTC

16 points

1 comment1 min readLW link

Human study on AI spear phishing campaigns

Simon Lermen, Fred Heiding and Andrew Kao

3 Jan 2025 15:11 UTC

81 points

8 comments5 min readLW link

Cybersecurity of Frontier AI Models: A Regulatory Review

Deric Cheng and Elliot Mckernon

25 Apr 2024 14:51 UTC

8 points

0 comments8 min readLW link

End-to-end hacking with language models

tchauvin5 Apr 2024 15:06 UTC

29 points

0 comments8 min readLW link

Securing AI Models Against Data Leakage While Preserving Cultural and National Integrity

majith666dam@gmail.com13 Aug 2025 15:31 UTC

1 point

0 comments2 min readLW link

Preventing AI Misuse: State of the Art Research and its Flaws

Madhav Malhotra23 Apr 2023 17:37 UTC

15 points

0 comments11 min readLW link

(forum.effectivealtruism.org)

Agent membranes and causal distance

Chris Lakin2 Jan 2024 22:43 UTC

20 points

3 comments3 min readLW link

The Virus—Short Story

Michael Soareverix13 Apr 2023 18:18 UTC

4 points

0 comments4 min readLW link

How to Fake Decryption

ohmurphy5 Sep 2024 9:18 UTC

12 points

0 comments4 min readLW link

(ohmurphy.substack.com)

Join the $10K AutoHack 2024 Tournament

Paul Bricman25 Sep 2024 11:54 UTC

5 points

0 comments1 min readLW link

(noemaresearch.com)

Apply to the AI Security Bootcamp [Aug 4 - Aug 29]

Pranav Gade, Jan Michelfeit and Jinglin Li

1 Jun 2025 19:47 UTC

27 points

1 comment4 min readLW link

Revisiting the Manifold Hypothesis

Aidan Rocke1 Oct 2023 23:55 UTC

13 points

19 comments4 min readLW link

[Question] AI interpretability could be harmful?

Roman Leventov10 May 2023 20:43 UTC

13 points

2 comments1 min readLW link

Privacy in a Digital World

Faustify27 Jul 2023 10:46 UTC

2 points

0 comments5 min readLW link

AI infosec: first strikes, zero-day markets, hardware supply chains, adoption barriers

Allison Duettmann1 Apr 2023 16:44 UTC

41 points

0 comments9 min readLW link

Deepfake(?) Phishing

jefftk21 Oct 2022 14:30 UTC

37 points

9 comments1 min readLW link

(www.jefftk.com)

No comments.

Com­puter Se­cu­rity & Cryptography

Computer Security & Cryptography