For what it’s worth, as a crude heuristic, I did a little Python programming to compute the Brotli compression ratio of every conversations.json file. Of the conversations whose compression ratios were above 14x, Trinity Large was number one (at 57x!), followed by all the other OLMo models, plus Google Gemini 2.5, Qwen3, and Llama 3.3. I think this fits pretty well with the qualitative picture in the post above (both the Arcee models and the OLMo models degenerated into verbatim repetition a lot more than the other models did).
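For anyone who wants to reproduce this, here is a minimal sketch of what such a ratio computation might look like. It assumes the `brotli` PyPI package and a `runs/<model>/conversations.json` directory layout; the layout and the 14x cutoff are my assumptions for illustration, not the exact script that was run.

```python
# Minimal sketch: Brotli compression ratio per conversations.json file.
# Assumes the `brotli` PyPI package and a runs/<model>/conversations.json
# layout; both are illustrative assumptions.
from pathlib import Path

import brotli


def brotli_ratio(path: Path) -> float:
    """Return original size / Brotli-compressed size for a file."""
    raw = path.read_bytes()
    compressed = brotli.compress(raw)
    return len(raw) / len(compressed)


if __name__ == "__main__":
    ratios = {
        p.parent.name: brotli_ratio(p)
        for p in Path("runs").glob("*/conversations.json")
    }
    # Highly repetitive transcripts compress much better, so a large
    # ratio is a rough proxy for degeneration into verbatim repetition.
    for model, ratio in sorted(ratios.items(), key=lambda kv: -kv[1]):
        flag = "  <-- above 14x" if ratio > 14 else ""
        print(f"{model:30s} {ratio:6.1f}x{flag}")
```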
Nice! I didn’t think to run this. Generally it seems like models with less post-training devolve much more into repetition, which makes some sense.