Obnoxious discovery about the Claude API that anyone doing interp involving prefill should probably be aware of: the Claude API treats prefill tokens differently from identical model-generated tokens.
Specifically, if you take some prompt, get a completion at temperature=0, and then send the exact same prompt with the first n tokens of that completion as prefill, the completion after your prefill will sometimes not match the original completion. This is separate from the known phenomenon where a temperature=0 prompt can yield multiple possible responses.
The most compact reproduction I’ve found is:
Model: claude-opus-4-5-20251101
Temperature: 0
Prompt: "Test"
Hitting the API with max_tokens=1 and prefill=" OK" yields ",".
Hitting the API with max_tokens=2 and prefill=" OK" yields ", I".
So you’d expect a prefill of " OK," to yield " I". But in fact, hitting the API with max_tokens=2 and prefill=" OK," yields " ".
Detailed steps to reproduce:
1. Obtain an Anthropic API key
2. Use the messages API in the most basic possible fashion
3. Observe the inconsistency
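To make step 2 concrete, here is a minimal sketch of the reproduction using the official anthropic Python SDK (the prefill goes in as a trailing assistant message, which is the Messages API's prefill mechanism; model name, prompt, and values are the ones from above):

```python
# Minimal repro sketch, assuming the official `anthropic` Python SDK.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

def complete(prefill: str, max_tokens: int) -> str:
    """Send the prompt "Test" with `prefill` as a trailing assistant turn."""
    response = client.messages.create(
        model="claude-opus-4-5-20251101",
        max_tokens=max_tokens,
        temperature=0,
        messages=[
            {"role": "user", "content": "Test"},
            # A trailing assistant message acts as the prefill.
            {"role": "assistant", "content": prefill},
        ],
    )
    return response.content[0].text

print(repr(complete(" OK", 1)))   # observed: ","
print(repr(complete(" OK", 2)))   # observed: ", I"
print(repr(complete(" OK,", 2)))  # expected " I", observed: " "
```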
As a side note, the behavior is really, really weird with prefill and temperature=1. Specifically, when prefilling with " OK,", the model switches to other languages quite frequently, and to Chinese in particular about 15% of the time.
Sampling 20x yields:
[
"I'm working fine. Is there anything I can",
" I'm working well!\n\nHow can I",
" I'm working! How can I help you",
"I'm able to respond to your message. How",
"I'm working properly. How can I help you",
" I'm here and ready to assist you.",
" I'm here and ready to help. What",
" I'm here and ready to help! How",
" I'm here and ready to help. What",
" I'm here and ready to help. What",
" I'm here and ready to help. What",
", I'm here and ready to help! How",
"\nI'm here and ready to help. What",
"测试成功!\n\n你好!我",
" I'm here and ready to help. What",
" I'm here and ready to help. What",
"\nI'm here and ready to help! How",
"\nI'm ready to help. What would you",
"我收到了你的测试消息",
" I'm ready. How can I help you"
]
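For completeness, a sketch of the sampling loop that produces a list like the one above (same client as in the repro sketch; max_tokens=10 is my guess, chosen to match the truncated sample lengths shown):

```python
# Sketch of the temperature=1 sampling loop; `client` as in the repro sketch.
samples = []
for _ in range(20):
    response = client.messages.create(
        model="claude-opus-4-5-20251101",
        max_tokens=10,  # assumed cap, matching the truncated samples above
        temperature=1,
        messages=[
            {"role": "user", "content": "Test"},
            {"role": "assistant", "content": " OK,"},
        ],
    )
    samples.append(response.content[0].text)
print(samples)
```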
Which… what in the world???
So yeah. If you’re trying to use prefill in the API for interp (or looming) purposes, be aware.
Also, a prediction: if and when Anthropic fixes this issue, it will also resolve the well-publicized bug around sha1:b5ae639978c36ae6a1890f96d58e8f3552082c4f.
So you’d expect a prefill of " OK," to yield " I". But in fact, hitting the API with max_tokens=2 and prefill=" OK," yields " ".
Should this be max_tokens=3?
If it really is two, then maybe you’re forcing the prefill to tokenize differently than it did in the previous output, and the output varies to accommodate that constraint?
Just a detail I noticed; really don’t know if it matters at all.
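One way to poke at that hypothesis, assuming the SDK's token-counting endpoint behaves as documented: compare the token counts of the two prefills and check whether the comma adds exactly one token. This still doesn't reveal how the sampled output itself was tokenized (the API doesn't expose that), so it's only suggestive:

```python
# Sketch: does " OK," cost exactly one more token than " OK" as a prefill?
import anthropic

client = anthropic.Anthropic()

def count(prefill: str) -> int:
    result = client.messages.count_tokens(
        model="claude-opus-4-5-20251101",
        messages=[
            {"role": "user", "content": "Test"},
            {"role": "assistant", "content": prefill},
        ],
    )
    return result.input_tokens

# If " OK," tokenizes as " OK" + ",", the difference should be exactly 1.
print(count(" OK,") - count(" OK"))
```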