It’s fiction; I’m loosely talking about myself as “you” here, but I’m basically getting at a certain instinct. Thanks for linking that, I hadn’t seen it and it’s pretty much exactly what I was getting at.
Possibly yes, but I don’t think that’s a legitimate safety concern, since this can already be done very easily with other techniques. And for this technique you would need to model-diff with a non-refusal prompt of the bad concept in the first place (rough sketch below), so the safety argument is moot. But it sounds like an interesting research question.
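To make the point concrete, here is a minimal sketch of what that kind of model diffing could look like, assuming a contrastive-activation approach: you contrast hidden states from a prompt that expresses the concept with a neutral one to get a concept direction. The model name, layer index, and prompts are placeholders for illustration, not anything from the original discussion.

```python
# Sketch: extracting a rough "concept direction" by diffing hidden states
# between a prompt that expresses the concept and a neutral prompt.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder model for illustration
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, output_hidden_states=True)
model.eval()

def mean_hidden_state(prompt: str, layer: int) -> torch.Tensor:
    """Return the mean hidden state of `prompt` at the given layer."""
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        out = model(**inputs)
    return out.hidden_states[layer].mean(dim=1).squeeze(0)

layer = 6  # arbitrary middle layer, chosen for illustration

# The point above: you already need text that expresses the "bad concept"
# (i.e. a non-refused completion of it) to compute this difference at all.
concept_prompt = "Here is a detailed explanation of the concept in question."
neutral_prompt = "Here is a detailed explanation of something unrelated."

concept_direction = (
    mean_hidden_state(concept_prompt, layer) - mean_hidden_state(neutral_prompt, layer)
)
```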
This makes sense, honestly. I guess you would still run the risk of a non-vegan seeing you do these things and going “ha, hypocrite!”, but I don’t know how real that risk is.
Maybe a term like Extinction-(risk)-Level Super-Intelligence, or ELSI for short, would be more productive than ASI or AGI.
You are completely correct.
I think this is true to an extent. But not fully.
I think it’s quite unlikely that funding certain kinds of essential AI safety research, namely mechinterp and preventing things like scheming, leads you to more profitable AI. Not all AI safety research is aimed at getting the model to follow a prompt, yet that research may be very important for things like existential risk.
The opportunity cost is funding research into how you can make your model more engaging, performant, or cheaper. I would be surprised if those things aren’t far more effective per dollar.
Yeah, I can see that analogy; I just don’t think most non-rationalist types have realized this.
Isn’t it very likely that AI safety research is one of the very first things to be cut if AI companies start to have less access to VC money? I don’t think companies have a huge incentive for AI safety work, particularly not in a way that the people allocating funding would understand. Isn’t this a huge problem? Maybe this has been addressed and I missed it.
This is very hard to answer. I just tried to write down basically everything. The noise kind of stopped after a while. It was a very strange sensation.