Thanks for your detailed and nuanced answer. I really appreciate how you distinguish between different forms of misalignment and how s-risks fit within that picture. Your comment helped clarify a lot.
If you have time, I’d love to hear you expand a bit more on the likelihood of s-risks relative to other AGI outcomes. You mentioned that s-risks seem much less likely than extinction or successful alignment, but could you give a rough probability estimate (even if it’s just an intuitive order-of-magnitude guess, like “1 in a thousand” or “1 in a million”)?
It would also be interesting to hear your thoughts on what factors most strongly influence that probability: for example, how much governance or alignment progress would need to fail for s-risks to become plausible; whether you think "instrumental torture" (as opposed to large-scale indifferent suffering) deserves separate consideration; and how much you think the risk depends on who ends up in control of early AGIs (e.g., sociopathic or sadistic actors).
Basically, I’m trying to understand not just whether s-risks are neglected, but how much weight they deserve compared to extinction in our overall AGI-risk prioritization.
Thanks again for engaging with these hard questions.
In a way, you need computers to wirehead everyone, but you don't necessarily need AI to do it. I think we as humans can figure out how the reward system works on our own.