elbow921

Karma: −27

elbow921 15 Mar 2024 2:51 UTC
1 point
0
on: Hammertime Day 6: Mantras
I like the mantra, “If we choose to give more effort today, then we are sure to go beyond our past mistakes.” This mantra is on my desktop screen.

elbow921 5 Aug 2023 18:38 UTC
1 point
on: Time and Effort Discounting
For the examples in this article, take only the last monetary value listed for each option. The indifference curve is then approximately log(amount after a year) ≈ 0.79 * log(amount now) + 0.79. So if U(now) = log(amount now), then U(year) = (log(amount after a year) - 0.79) / 0.79.
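A minimal sketch of the fitted curve above, in case it helps to plug in numbers. The comment does not say which logarithm base was used, so the `base` parameter (defaulting to 10) is an assumption, as are all the function names; only the two 0.79 constants come from the comment.

```python
import math

SLOPE = 0.79      # coefficient on log(amount now), from the comment
INTERCEPT = 0.79  # additive constant, from the comment


def utility_now(amount_now, base=10.0):
    """U(now) = log(amount now). The base is an assumption."""
    return math.log(amount_now, base)


def utility_after_year(amount_after_year, base=10.0):
    """U(year) = (log(amount after a year) - 0.79) / 0.79."""
    return (math.log(amount_after_year, base) - INTERCEPT) / SLOPE


def indifferent_amount_after_year(amount_now, base=10.0):
    """Amount after a year that lies on the indifference curve for amount_now."""
    return base ** (SLOPE * math.log(amount_now, base) + INTERCEPT)


if __name__ == "__main__":
    later = indifferent_amount_after_year(100.0)
    # On the curve, both utilities agree up to floating-point error.
    print(later, utility_now(100.0), utility_after_year(later))
```

By construction, any amount produced by `indifferent_amount_after_year` has the same utility under `utility_after_year` as the original amount has under `utility_now`, whichever base is chosen.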

elbow921’s Shortform

elbow921 8 Jul 2023 1:34 UTC

1 point

elbow921 12 Jun 2023 21:03 UTC
1 point
0
in reply to: Rafael Harth’s comment on: Bing chat is the AI fire alarm
Consider the hypothetical of simulating a ridiculous number of humans typing text and, among those who type out the current text, seeing what fraction type out each possible next token. In the limit, this approaches the best possible text predictor. It would also simulate a lot of consciousness.
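The counting step in that hypothetical can be sketched on a toy scale. This is only an illustration: the function name and the representation of each simulated person's text as a list of tokens are assumptions, and a handful of hard-coded lists stand in for the simulated humans.

```python
from collections import Counter


def next_token_fractions(simulated_texts, current_text):
    """Among texts that begin with current_text, return the fraction
    that continue with each possible next token."""
    prefix = list(current_text)
    n = len(prefix)
    counts = Counter()
    for tokens in simulated_texts:
        tokens = list(tokens)
        # Only count typists who produced the current text and kept going.
        if len(tokens) > n and tokens[:n] == prefix:
            counts[tokens[n]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {token: count / total for token, count in counts.items()}


if __name__ == "__main__":
    texts = [
        ["the", "cat", "sat"],
        ["the", "cat", "ran"],
        ["the", "cat", "sat"],
        ["a", "dog", "barked"],
    ]
    print(next_token_fractions(texts, ["the", "cat"]))
```

With more simulated typists, these fractions would settle toward the true distribution over next tokens, which is the sense in which the limit gives the best possible predictor.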

elbow921 11 May 2023 1:04 UTC
1 point
on: The Strangest Thing An AI Could Tell You
What if most people would develop superhuman intelligences in their brains without school but, because they have to write essays in school, these superhuman intelligences become aligned with writing essays fast? And no doomsday scenario has happened because they mostly cancel out each other’s attempted manipulations and they couldn’t program nanobots with their complicated utility functions. ChatGPT writes faster than us and has 20B parameters where humans have 100T parameters, but our neural activations are noisier than floating-point arithmetic.

elbow921 17 Apr 2023 20:06 UTC
1 point
0
in reply to: ZT5’s comment on: Does this algorithm experience pleasure and suffering when run?
This is what I am wondering: Does this algorithm, when run, instantiate a subjective experience with the same moral relevance as the subjective experience that happens when mu opioids are released in biological brains?

elbow921 17 Apr 2023 15:03 UTC
1 point
0
in reply to: Richard_Kennaway’s comment on: Does this algorithm experience pleasure and suffering when run?
By ‘obvious to the algorithm’ I mean that, to the algorithm, A is referenced with no intermediate computation. This is how pleasure and pain feel to me. I do not believe all reinforcement learning algorithms feel pleasure/pain. A simple example that does not suffer is the Simpleton iterated prisoner’s dilemma strategy. I believe pain and pleasure are effective ways to implement reinforcement learning. In animals, reinforcement learning is called operant conditioning. See Reinforcement learning on a chicken for a chicken that has experienced it. I do not know any algorithms to determine whether there is anything it is like to be a given program. I suspected this program experienced pleasure/pain because of its parallels to the neuroscience of pleasure and pain.

elbow921 17 Apr 2023 11:45 UTC
1 point
0
in reply to: Richard_Kennaway’s comment on: Does this algorithm experience pleasure and suffering when run?
As this algorithm executes, the last and 2last variables become the program’s last 2 outputs. L1’s even indexes become the average input (reward?) given the number of ones the program output the last 2 times. I called L1’s odd indexes ‘confidence’ because, as they get higher, the corresponding average reward changes less in response to new evidence. When L1 becomes entangled with the input generation process, the algorithm chooses the outputs that make the inputs higher on average. That is why I called the input ‘reward’. L2 reads off the average reward given the last 2 outputs. The algorithm chooses outputs that bring the number of ones output closer to the number that has yielded the highest inputs in the past. This makes L2 analogous to ‘wanting’.
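To make the description above concrete, here is a rough sketch of a program with that general shape. It is not the original program from the post: the class name, the method names, the running-average update, the tie-breaking, and the layout of `L1` as three (average, confidence) pairs are all assumptions made for illustration.

```python
class TwoOutputLearner:
    """Hypothetical learner that tracks average reward by number of ones
    in its last two binary outputs. Not the original algorithm."""

    def __init__(self):
        self.last = 0         # most recent output (0 or 1)
        self.second_last = 0  # output before that (called 2last in the comment)
        # Even indexes: average reward for 0, 1, 2 ones in the last two outputs.
        # Odd indexes: 'confidence', here simply the number of samples seen.
        self.L1 = [0.0, 0, 0.0, 0, 0.0, 0]

    def _ones(self):
        return self.last + self.second_last

    def observe(self, reward):
        """Fold a new input into the running average for the current state."""
        k = self._ones()
        self.L1[2 * k + 1] += 1
        # Higher confidence means each new reward moves the average less.
        self.L1[2 * k] += (reward - self.L1[2 * k]) / self.L1[2 * k + 1]

    def L2(self):
        """Read off the average reward given the last two outputs."""
        return self.L1[2 * self._ones()]

    def choose(self):
        """Pick the output that moves the count of ones toward the count
        with the highest average reward so far (ties go to the lower value)."""
        averages = self.L1[0::2]
        target = max(range(3), key=lambda k: averages[k])
        output = min((0, 1), key=lambda o: abs(o + self.last - target))
        self.second_last, self.last = self.last, output
        return output
```

In this sketch the odd indexes play the 'confidence' role only in the narrow sense that a larger sample count shrinks each update; the original program may have handled confidence differently.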

elbow921 3 Mar 2023 17:44 UTC

−3 points