Artem Karpov

Karma: 161

NEST: Nascent Encoded Steganographic Thoughts

Artem Karpov17 Feb 2026 7:55 UTC

20 points

8 comments13 min readLW link

Steganographic Chains of Thought Are Low-Probability but High-Stakes: Evidence and Arguments

Artem Karpov11 Dec 2025 7:40 UTC

20 points

1 comment6 min readLW link

The Illegible Chain-of-Thought Menagerie

Artem Karpov18 Nov 2025 12:01 UTC

3 points

0 comments8 min readLW link

artkpv’s Shortform

Artem Karpov12 Oct 2025 9:52 UTC

2 points

15 comments1 min readLW link

How dangerous is encoded reasoning?

Artem Karpov30 Jun 2025 11:54 UTC

17 points

0 comments10 min readLW link

Philosophical Jailbreaks: Demo of LLM Nihilism

Artem Karpov4 Jun 2025 12:03 UTC

3 points

0 comments5 min readLW link

The Steganographic Potentials of Language Models

Artem Karpov, Tinuade and SCho

8 May 2025 11:23 UTC

9 points

0 comments1 min readLW link

CCS on compound sentences

Artem Karpov4 May 2024 12:23 UTC

6 points

0 comments9 min readLW link

Inducing human-like biases in moral reasoning LMs

Artem Karpov, Austin Meek, Bogdan Ionut Cirstea and SCho

20 Feb 2024 16:28 UTC

23 points

3 comments14 min readLW link

How important is AI hacking as LLMs advance?

Artem Karpov29 Jan 2024 18:41 UTC

1 point

0 comments6 min readLW link

My (naive) take on Risks from Learned Optimization

Artem Karpov31 Oct 2022 10:59 UTC

7 points

0 comments5 min readLW link