viemccoy
Karma: 64
The Weighted Perplexity Benchmark: Tokenizer-Normalized Evaluation for Language Model Comparison
jessicata and viemccoy, 7 Jul 2025 21:43 UTC, 21 points, 0 comments, 7 min read (www.morpheus.systems)
Schizobench: Documenting Magical-Thinking Behavior in Claude 4 Opus
viemccoy, 23 May 2025 1:31 UTC, 23 points, 0 comments, 1 min read (metanomicon.ink)
Defense Against The Super-Worms
viemccoy, 20 Mar 2025 7:24 UTC, 24 points, 1 comment, 2 min read