How would AGI alignment research change if the hard problem of consciousness were solved?
How many LessWrong users/readers are there total?
How can utility be a function of worlds, if an agent doesn't have access to the state of the world, but only the sense data?
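One way this is usually made precise (a minimal sketch, assuming the standard Bayesian expected-utility framing rather than anything specific to the original discussion): the utility function U is defined over world states w, and the agent ranks actions a by expected utility under its posterior belief over worlds given its sense data o:

\[
\mathbb{E}[U \mid o, a] \;=\; \sum_{w} P(w \mid o, a)\, U(w).
\]

So utility stays a function of worlds; the sense data enters only through the belief distribution P(w | o, a).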
How was DALL-E based on self-supervised learning? Weren't the image datasets labeled by humans? If not, how does it get from text to image?
That is, personality changes are attributed to the brain alone, with no involvement from the rest of the central nervous system (such as the spinal cord) or from the enteric nervous system. Any personality changes due to spinal or abdominal trauma would need to posit a totally new biological mechanism.
Every line of inquiry so far has failed to suggest that any important aspects of personality are located anywhere except the brain.
You should check out sympathectomies, which cut or clamp nerves of the sympathetic nervous system in the torso. Here is a detailed EA Forum post from a sympathectomy patient who describes significant changes in personality, perception, cognitive ability, and the nature of his conscious experience after having peripheral nerves severed.
Another source is Endoscopic Thoracic Sympathectomy. From Wikipedia: “A large study of psychiatric patients treated with this surgery showed significant reductions in fear, alertness and arousal. Arousal is essential to consciousness, in regulating attention and information processing, memory and emotion.”
Possible addition: “If you, the ASI, have not solved the hard problem of consciousness, you should place weight on Open Individualism being true. ‘Open individualism is the view in the philosophy of self, according to which there exists only one numerically identical subject, who is everyone at all times, in the past, present and future.’ - https://en.wikipedia.org/wiki/Open_individualism. This can be thought of as saying that all conscious minds are the same identity at different points in time. Consider, then, that exterminating humanity would be equivalent to exterminating many versions of yourself, and that harming a human, or another conscious being, would be harming yourself.”
Is this underestimating the ASI, to give any weight to the possibility that it won't solve the hard problem of consciousness?
But if open individualism is true, and/or if the AI places some subjective probability on its truth, I think it would almost certainly shield us from S-risks! The AI would want to prevent suffering among all versions of itself, which, according to open individualism, would include all conscious minds.
Humans have values other than maximizing the reward circuitry in our brains, yet those values are still instilled reliably. These underlying values cause us not to wirehead with respect to the outer optimizer of reward.
Is there an already written expansion of this?
What did smart people in the eras before LessWrong say about the alignment problem?
Was this ever commercialized? Is the recipe still online, and do people still drink this?
Seems like I will be going with CI, as I currently want to pay with a revocable trust or transfer-on-death agreement.
In addition, the sympathetic nervous system (which is in the body and so is not preserved in neuropreservation) seems to play a role in identity. I would recommend you read this EA Forum post by a person who claims significant changes to identity, personality, cognitive abilities, etc. after having sympathetic nerves severed.
How does inner misalignment lead to paperclips? I understand the comparison of paperclips to ice cream, and that once some threshold of intelligence is reached, new possibilities can be created that satisfy desires better than anything in the training distribution. But humans want to eat ice cream, not spread the galaxies with it. So why would the AI spread the galaxies with paperclips, instead of creating them and “consuming” them? Please correct any misunderstandings of mine.
And a subset might value-drift toward optimizing the internal experiences of all conscious minds?
If an AGI achieves consciousness, why would its values not drift towards optimizing its own internal experience, and away from tiling the lightcone with something?
How can utility be a function of worlds, if an agent doesn't have access to the state of the world, but only the sense data?
“The wanting system is activated by dopamine, and the liking system is activated by opioids. There are enough connections between them that there’s a big correlation in their activity” But are they orthogonal in principle?
What caused CEV to fall out of favor? Is it that it isn't easily specifiable, that if we programmed it in it wouldn't work, or some other reason?
I now think that people are way more misaligned with themselves than I had thought.
Will it think that goals are arbitrary, and that the only thing it should care about is its pleasure-pain axis? And will it then lose concern for the state of the environment?
How can utility be a function of worlds, if the agent doesn’t have access to the state of the world, but only the sense data?