artkpv

Karma: 89

Artem Karpov

The Illegible Chain-of-Thought Menagerie

artkpv18 Nov 2025 12:01 UTC

2 points

0 comments8 min readLW link

artkpv’s Shortform

artkpv12 Oct 2025 9:52 UTC

2 points

10 comments1 min readLW link

How dangerous is encoded reasoning?

artkpv30 Jun 2025 11:54 UTC

17 points

0 comments10 min readLW link

Philosophical Jailbreaks: Demo of LLM Nihilism

artkpv4 Jun 2025 12:03 UTC

3 points

0 comments5 min readLW link

The Steganographic Potentials of Language Models

artkpv, Tinuade and SCho

8 May 2025 11:23 UTC

9 points

0 comments1 min readLW link

CCS on compound sentences

artkpv4 May 2024 12:23 UTC

6 points

0 comments9 min readLW link

Inducing human-like biases in moral reasoning LMs

artkpv, Austin Meek, Bogdan Ionut Cirstea and SCho

20 Feb 2024 16:28 UTC

23 points

3 comments14 min readLW link

How important is AI hacking as LLMs advance?

artkpv29 Jan 2024 18:41 UTC

1 point

0 comments6 min readLW link

My (naive) take on Risks from Learned Optimization

artkpv31 Oct 2022 10:59 UTC

7 points

0 comments5 min readLW link