see also my eaforum at https://forum.effectivealtruism.org/users/dirk and my tumblr at https://d-i-r-k-s-t-r-i-d-e-r.tumblr.com/ .
One possible contributor: posttraining involves chat transcripts in the desired style (often, nowadays, generated by an older LLM), and I suspect that in learning to imitate the format, models also learn to imitate the tone (and to overfit at that; perhaps because there are only a few examples relative to the size of the corpus, but this is merely idle speculation). (The consensus on twitter seemed to be that “delve” in particular was a consequence of human writing: it’s used far more commonly in African English than in American, and OpenAI outsourced data labeling to save on costs.) I haven’t noticed nearly as much of a consistent flavor in my limited experimentation with base models, so I think posttraining must make it worse even if it’s not the cause.
https://www.greaterwrong.com/index?view=alignment-forum would seem to include LW comments.
Making status calculations at all times is a choice you have the right to make, but in my opinion it’s a bad one.
What’s your motivation to spend a lot of effort to write up your arguments? If you’re right, both the post and your efforts to debunk it are quickly forgotten, but if you’re wrong, then the post remains standing/popular/upvoted and your embarrassing comment is left for everyone to see.
If you’re right, the author and those who read the comments gain a better understanding; if you’re wrong, you do. I think framing criticism as a status contest hurts your motivation to comment more than it helps, here.
I suspect the models’ output tokens become input tokens when the conversation proceeds to the next turn; certainly my API statistics show several times as many input tokens as output despite the fact that my responses are invariably shorter than the models’.
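The arithmetic behind this can be sketched out. Assuming the usual stateless chat-API pattern, where the client resends the full conversation history on every turn, each past output token gets re-billed as an input token on all later turns. A minimal sketch with made-up illustrative token counts (not real API data):

```python
def simulate_conversation(turns):
    """Each turn resends the whole history as input, so every past
    output token is re-billed as an input token on later turns.
    `turns` is a list of (user_tokens, model_tokens) pairs."""
    history_tokens = 0
    total_input = 0
    total_output = 0
    for user_tokens, model_tokens in turns:
        # Input for this turn = all prior history + the new user message.
        total_input += history_tokens + user_tokens
        total_output += model_tokens
        # Both sides of the exchange join the history for future turns.
        history_tokens += user_tokens + model_tokens
    return total_input, total_output

# Short user messages, long model replies, five turns:
inp, out = simulate_conversation([(20, 300)] * 5)
print(inp, out)  # → 3300 1500
```

Even though the user contributes only 100 tokens total, resending history makes billed input more than double billed output, matching the pattern in the API statistics.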
I just saw How to use hypnagogic hallucinations as biofeedback to relieve insomnia in the feed the other day, and it seems like quite a convenient option if it works; could be worth a try, though I haven’t tested it myself.
No, actually; the mindset implied by repeating that text as a meme is quite different from the mindset implied by unironically generating it.
The bio is an edited meme, not an original; it mostly communicates that they’re a heavy user of the internet. Example from a year ago
Personally I can run for one (1) minute before I’m too out of breath to continue; a quarter-mile is short enough that walking for a majority of the time would still finish it in under ten minutes, but I’d certainly struggle to run it.
In-book it’s explicitly partly about inherited wealth; the passage wherein Vimes formulates his theory is preceded by a section about how the very richest people, like Lady Sybil, can afford to live as though poor in some ways (wearing her mother’s hand-me-downs, etc.) and is immediately followed by this:
The point was that Sybil Ramkin hardly ever had to buy anything. The mansion was full of this big, solid furniture, bought by her ancestors. It never wore out. She had whole boxes full of jewellery which just seemed to have accumulated over the centuries. Vimes had seen a wine cellar that a regiment of speleologists could get so happily drunk in that they wouldn’t mind that they’d got lost without trace.
Lady Sybil Ramkin lived quite comfortably from day to day by spending, Vimes estimated, about half as much as he did.
Just wanted to follow up on this; I’ve read more Gemini COTs since then, and I currently have the strong impression that they’re summarized.
Because it’s been experimentally verified that what they’re internally doing doesn’t match their verbal descriptions (not that there was really any reason to believe it would); see the section in this post (or in the associated paper for slightly more detail) about mental math, where Claude claims to perform addition in the same fashion humans do despite interpretability revealing otherwise.
That doesn’t sound like empathy; it sounds more like you go through life viewing other people as lacking agency, and remembering that they have agency disgusts you. There’s a step beyond that where you run a sandboxed emulation of their mindset, which is IMO what’s typically meant by empathy.
No need to ever get chickenpox; there’s actually a vaccine for that one.
I think you’re perceiving threats where there are none, and should probably turn the aggro meter way down.
The appearance of your teeth has class implications; cosmetic tooth treatments are expensive, but common among rich Americans, so the wealthy often have whiter, straighter teeth than do the poor. (AIUI this is not so much a thing in Britain, where cosmetic dentistry is rarer in general).
See, when you put it like that, I think the reason rationalists don’t win as much as was expected is quite obvious: claims about the power of rationality were significant overpromises from the start.
I vaguely remember looking at one of those studies and finding that the amount of alcohol used was significantly less than a standard drink, though I don’t have a link now.