Joanna Morningstar(Jonathan Lee)

Karma: 295

Joanna Morningstar 10 Sep 2009 3:34 UTC
5 points
on: Outlawing Anthropics: An Updateless Dilemma
I’ve been watching for a while, but have never commented, so this may be horribly flawed, opaque or otherwise unhelpful.

I think the problem is entirely caused by the use of the wrong sets of belief, and that anything holding to Eliezer’s 1-line summary of TDT or alternatively UDT should get this right.

Suppose that you’re a rational agent. Since you are instantiated in multiple identical circumstances (green rooms) and asked identical questions, your answers should be identical. Hence if you wake up in a green room and you’re asked to steal from the red rooms and give to the green rooms, you either commit a group of 2 of you to a loss of 52 or commit a group of 18 of you to a gain of 12.

This committal is what you wish to optimise over from TDT/UDT, and clearly this requires knowledge about the likelyhood of different decision making groups. The distribution of sizes of random groups is not the same as the distribution of sizes of groups that a random individual is in. The probabilities of being in a group are upweighted by the size of the group and normalised. This is why Bostrom’s suggested 1/n split of responsibility works; it reverses the belief about where a random individual is in a set of decision making groups to a belief about the size of a random decision making group.

By the construction of the problem the probability that a random (group of all the people in green rooms) has size 18 is 0.5, and similarly for 2 the probability is 0.5. Hence the expected utility is (0.512)+(0.5-52)=-20.

If you’re asked to accept a bet on there being 18 people in green rooms, and you’re told that only you’re being offered it, then the decision commits exactly one instance of you to a specific loss or gain, regardless of the group you’re in. Hence you can’t do better than the 0.9 and 0.1 beliefs.

If you’re told that the bet is being offered to everyone in a green room, then you are committing to n times the outcome in any group of n people. In this case gains are conditional on group size, and so you have to use the 0.5-0.5 belief about the distribution of groups. It doesn’t matter because the larger groups have the larger multiplier and thus shutting up and multiplying yields the same answers as a single-shot bet.

ETA: At some level this is just choosing an optimal output for your calculation of what to do, given that the result is used variably widely.

Joanna Morningstar 10 Sep 2009 10:38 UTC
0 points
in reply to: CarlShulman’s comment on: Outlawing Anthropics: An Updateless Dilemma
Hence if there are 2 instances of your decision algorithm in Green rooms, there are 2 runs of your decision algorithm, and if they vote to steal there is a loss of 3 from each red and gain 1 for each green, for a total gain of 12-318 = − 52.

If there are 18 instances in Green rooms, there are 18 runs of your decision algorithm, and if they vote to steal there is a loss of 3 from each red and a gain of 1for each green, for a total gain of 118-23 = 12

The “committal of a group of” is noting that there are 2 or 18 runs of your decision algorithm that are logically forced by the decision made this specific instance of the decision algorithm in a green room.

Joanna Morningstar 10 Sep 2009 11:52 UTC
1 point
in reply to: Christian_Szegedy’s comment on: Outlawing Anthropics: An Updateless Dilemma
Between non communicating copies of your decision algorithm, it’s forced that every instance comes to the same answers/distributions to all questions, as otherwise Eliezer can make money betting between different instances of the algorithm. It’s not really a categorical imperative, beyond demanding consistency.

The crux of the OP is asking for a probability assessment of the world, not whether the DT functions.

I’m not postulating 1/n allocation of responsibility; I’m stating that the source of the confusion is over: P(A random individual is in a world of class A_i | Data) with P(A random world is of class A_i | Data) And that these are not equal if the number of individuals with access to Data are different in distinct classes of world.

Hence in this case, there are 2 classes of world, A_1 with 18 Green rooms and 2 Reds, and A_2 with 2 Green rooms and 18 Reds.

P(Random individual is in the A_1 class | Woke up in a green room) = 0.9 by anthropic update. P(Random world is in the A_1 class | Some individual woke up in a green room) = 0.5

Why? Because in A_1 there ¹⁸⁄₂₀ individuals fit the description “Woke up in a green room”, but in A_2 only ²⁄₂₀ do.

The crux of the OP is that neither a ⁹⁰⁄₁₀ nor ⁵⁰⁄₅₀ split seem acceptable, if betting on “Which world-class an individual in a Green room is in” and “Which world-class the (set of all individuals in Green rooms which contains this individual) is in” are identical. I assert that they are not. The first case is 0.9/0.1 A_1/A_2, the second is 0.5/0.5 A_1/A_2.

Consider a similar question where a random Green room will be asked. If you’re in that room, you update both on (Green walls) and (I’m being asked) and recover the 0.5/0.5, correctly. This is close to the OP as if we wildly assert that you and only you have free will and force the others, then you are special. Equally in cases where everyone is asked and plays separately, you have 18 or 2 times the benefits depending on whether you’re in A_1 or A_2.

If each individual Green room played separately, then you update on (Green walls), but P(I’m being asked|Green) = 1 in either case. This is betting on whether there are 18 people in green rooms or 2, and you get the correct 0.9/0.1 split. To reproduce the OP the offers would need to be +1/18 to Greens and −3/18 from Reds in A_1, and +1/2 to Greens and −3/2 from Reds in A_2, and then you’d refuse to play, correctly.

Joanna Morningstar 10 Sep 2009 22:14 UTC
0 points
in reply to: byrnema’s comment on: Outlawing Anthropics: An Updateless Dilemma
Your first claim needs qualifications: You should only bet if you’re being drawn randomly from everyone. If it is known that one random person in a green room will be asked to bet, then if you wake up in a green room and are asked to bet you should refuse.

P(Heads | you are in a green room) = 0.9 P(Being asked | Heads and Green) = ¹⁄₁₈, P(Being asked | Tails and Green) = ¹⁄₂ Hence P(Heads | you are asked in a green room) = 0.5

Of course the OP doesn’t choose a random individual to ask, or even a random individual in a green room. The OP asks all people in green rooms in this world.

If there is confusion about when your decision algorithm “chooses”, then TDT/UDT can try to make the latter two cases equivalent, by thinking about the “other choices I force”. Of course the fact that this asserts some variety of choice for a special individual and not for others, when the situation is symmetric, suggests something is being missed.

What is being missed, to my mind, is a distinction between the distribution of (random individuals | data is observed), and the distribution of (random worlds | data is observed).

In the OP, the latter distribution isn’t altered by the update as the observed data occurs somewhere with probability 1 in both cases. The former is because it cares about the number of copies in the two cases.

Joanna Morningstar 10 Sep 2009 23:54 UTC
0 points
in reply to: RolfAndreassen’s comment on: The Lifespan Dilemma
In the payout is computational resources with unlimited storage, then patching utility doesn’t work well. If utility is sublinear in experienced time, then forking yourself increases utility.

This makes it difficult to avoid taking Omega up on the offer every time. For clarity, suppose Omega makes the offer to a group of 1.25M forked copies of you. If you turn it down, then on the average 10^6 of you live for 10^(10^10) years. If you all accept and fork a copy, then on the average 2.(10^6 − 1) of you live for 10^(10^(10^10))/2 years each. Clearly this is better; there are more of you living for longer.

The only thing that changes on the shift to 1 initial copy of you is that the (std. dev. of utilons)/(mean utilons) increases by a factor of 10^6. Unless you place a special cost on risk, this doesn’t matter. If you do place such a cost on risk, then you fail to take profitable bets.

ETA: The only reason to not take the offer immediately is if you think some other Omega-esque agent is going to arrive with an even better offer, and you’d better be very sure of that before you risk loosing so much.

Joanna Morningstar 11 Sep 2009 0:53 UTC
1 point
in reply to: RolfAndreassen’s comment on: The Lifespan Dilemma
Fair-rephrasing.

On the other hand, your patching of the utility function requires it to be bounded above as subjective time tends to infinity, or I can find a function that grows quickly enough to get you to accept 1/3^^^^3 chances. If altruistic utility from the existence of others also is bounded above by some number of subjective-you equivalents, then you are asserting that total utility is bounded above.

On a related point you do need to care equally about the utility of other copies of you; otherwise you’ll maximise utility if you gain 1 utilon at an overall cost of 1+epsilon to all other copies of you. You’d defect in PD played against yourself...

Joanna Morningstar 11 Sep 2009 22:30 UTC
0 points
in reply to: RolfAndreassen’s comment on: The Lifespan Dilemma
That bullet has hidden issues for reflective consistency. You’re asserting that any future you would not have wished to take Omega up on the offer again.

This seems unlikely: If you’re self-modifying or continually improving, then it’s likely that new things will become accessible and “fun” to do, if only in terms of new deep problems to solve. It seems very likely that your conception of the bounds of utility shift up as you become more capable. The bounds that you think are on utility probably will alter given 10^^10 years to think.

You shouldn’t defect because you will regret it; in retrospect you’d choose to self-modify to be an agent that cooperates with copies of you. Actually, you’d choose to self-modify to cooperate with anything that implements such a no-later-regrets decision algorithm.

Joanna Morningstar 15 Sep 2009 13:37 UTC
0 points
in reply to: RolfAndreassen’s comment on: The Lifespan Dilemma
Sorry; it’s apparent that what I wrote confused two issues.

The assertion is necessary if you are reflectively consistent and you don’t take Omega up on offer n. If a future copy of you is likely to regret a decision not to take Omega up again, then the decision was the very definition of reflectively inconsistent.

Now we try to derive a utility function from a DT. The problem for bounded utility is that bounds on conceivable and achievable utility will only increase with time. Hence a future you will likely regret any decision you make on the basis that utility is bounded above, because your future bound on achievable utility almost certainly exceeds your current bound on conceivable utility. Hence asserting that utility is bounded above is probably reflectively inconsistent. (The “almost certainly” is, to my mind, justified by EY’s posts on Fun Space)

Your example suggests that you don’t consider reflective consistency to be a good idea; the peasants would promptly regret the decision not to self-modify to move from a CDT (as the aristocrat is using) to a TDT/UDT/other DT which prevents defection.

Joanna Morningstar 16 Sep 2009 9:52 UTC
0 points
in reply to: RolfAndreassen’s comment on: The Lifespan Dilemma
The fact that Omega is offering unbounded lifespans implies that the universe isn’t going to crunch or rip in any finite time. Excluding them leaves you with a universe where the Hubble radius tends to infinity, which thus makes negentropy (information) unbounded above.

Self-modification is just an optimisation process over the design space for agents and run by some agent, with the constraint that only one agent can be instantiated at any time.

But I invite you to consider the other scenario where I did accept Omega’s next offer, the randomness did not go my way, and I have an hour left to live, and regret not stopping one offer earlier.

And regardless of what n is, only a 10^-6 portion of the (n-1)-survivors regret taking decision n. If you’re in the block that’s killed off by decision 1, then decisions 2,3,4,… are all irrelevant to you. Clearly attempting to apply both and applying neither consistently leads to money pumping.

Joanna Morningstar 16 Sep 2009 18:46 UTC
0 points
in reply to: RolfAndreassen’s comment on: The Lifespan Dilemma
Omega’s offers are unbounded; 10^^n exceeds any finite bound with a finite n. If the Hubble distance (edge of the observable universe) recedes, then even with a fixed quantity of mass-energy the quantity of storable data increases. You have more potential configurations.

Yes, in the hypothetical situation given; I can’t consistently assert anything else. In any “real” analogue there are many issues with the premises I’d take, and would likely merely take omega up a few times with the intend of gaining Omega-style ability.

Joanna Morningstar 17 Sep 2009 10:25 UTC
0 points
in reply to: RolfAndreassen’s comment on: The Lifespan Dilemma
No, I mean quite simply that there is no finite bound that holds for all n; if the universe were to collapse/rip in a finite time t, then Omega could only offer you the deal some fixed number of times. We seem to disagree about the how many times Omega would offer this deal—I read the OP as Omega being willing to offer it as many times as desired.

AFAIK (I’m only a mathematician), your example only holds if the total energy of the system is negative. In a more complicated universe, having a subset of the universe with positive total energy is not unreasonable, at which point it could be distributed arbitrarily over any flat spacetime. Consider a photon moving away from a black hole; if the universe gets larger the set of possible distances increases.

Joanna Morningstar 17 Sep 2009 17:37 UTC
1 point
in reply to: RolfAndreassen’s comment on: The Lifespan Dilemma
I think we’re talking on slightly different terms. I was thinking of the Hubble radius, which in the limit equates to Open/Flat/Closed iff there is no cosmological constant (Dark energy). This does not seem to be the case. With a cosmological constant, the Hubble radius is relevant because of results on black hole entropy, which would limit the entropy content of a patch of the universe which had a finitely bounded Hubble radius. I was referring to the regression of the boundary as the “expansion of the universe”. The two work roughly similarly in cases where there is a cosmological constant.

I have no formal training in cosmology. In a flat spacetime as you suggest, the number of potential states seems infinite; you have an infinite maximum distance and can have any multiple of the plank distance as a separation. In a flat universe, your causal boundary recedes at a constant c, and thus peak entropy in the patch containing your past light cone goes as t^2. It is not clear that there is a finite bound on the whole of a flat spacetime. I agree entirely on your closed/open comments.

Omega could alternatively assert that the majority of the universe is open with a negative cosmological constant, which would be both stable and have the energy in your cosmological horizon unbounded by any constant.

As to attacking the premises; I entirely agree.

Joanna Morningstar 9 Jan 2010 11:10 UTC
2 points
on: Hypotheses For Dualism
Interesting post. Have you read Being No One? He derives a similar system based on studies of interesting neurological phenomena.

I would offer an additional hypothesis for why humans like dualism: We implement dualism neurologically to compactly model cognitive agents. Hence we perceive mind to be ontologically fundamental. This would have had utility in minimising cognitive resource needed to figure out that the growling cat with extended claws “intends” to eat “you”.

So that’s my thesis: consciousness is the simulation of reality run on the hardware of our brains, and qualia is the Level3+ observation that the reality we perceive is simulated.

One critique: Your thesis puts qualia as higher level than conciousness. As I see it qualia are the neurologically basic distinguishable stimuli; they allow reality to be compressed and conversely error-corrected. Hence we see in colour in spite of being unable to perceive colour across most of our visual field, and can’t reproduce stimuli as well as we can distinguish them (disregarding savantism).

I agree in broad terms with your assessment of conciousness as self-simulation. That puts the two things as largely orthogonal; neither requires the other. In practice, I’d assert qualia came earlier—simulating other minds (and your own) being far easier once you’re already compressing reality; as these simulations just temporally compress actions.

Joanna Morningstar 9 Jan 2010 16:54 UTC
23 points
on: Consciousness
Projecting the ontology of your (flawed) internal representations onto reality is a bad idea. “Doing a Dennet” is also not dealt with, except by incredulity.

It’s a fact that the individual shades of color exist, however it is that we group them—and your ontology must contain them, if it pretends to completeness.

This is simply not the case. The fact that we can compare two stimuli more accurately than we can identify a stimuli merely means that internally we represent reality with lesser fidelity than our senses theoretically can achieve. On a reductionist view at most you’ve established “greater than” and “round to nearest” are implemented in neurons. You do not need to have colour.

Let’s unpack “blueness”. It’s a property we ascribe to objects, yet it’s trivial to “concieve of” blueness independent of an object. Neurologically, we process colour, motion, edge finding and so in in parallel; the linking of them together occurs at a higher level. Furthermore the brain fakes much of the data, giving the perception of colour vision, for example, in regions of the visual field where no ability to discriminate colour exists, and cases of blindness with continued concious perception of colour.

Brains compress input extensively; it would be crass to worry about the motion of every spot on a leopard separately—block them up as a single leopard. Asserting that the world must fit with our hallucination of reality lets you see things that are marginally visible, and get by with far worse sensory apparatus than needed. Cue optical illusions: this, this and this, for example. Individual shades don’t exist as you want them to.

It is absurdly clear that the map your brain makes does not correspond to either the territory of your direct sense perception (at the retina) or reality. On precisely what basis do you assert to project from the ontology of a bad map to the territory?

“Blue” is a referent to properties of internal representations, which is translatable across multiple instances of primate brains. You say “X is blue”, and I can check my internal representation of X to see whether I would categorise it as “blue”. This does not require “blue” to be fundamental in ontology. There isn’t a “blue thing” in physics, nor should there be. “Blue” existing means simply that there are things which this block of wetware puts in some equivalence class.

Lets move on to computation:

But if the “computational state” of a physical object is an observer-dependent attribution rather than an intrinsic property, then how can my thoughts be brain states?

Again, you seem project from an internal map of your own brain to the territory. Simply because I can look at a computer at multiple levels, say: Starting Excel, API calls, Machine instructions, microcode, functional units on the CPU, adders/multipliers/whatever on the CPU, logic gates, transistors, current flows or probability masses in the field of electrons, does not in principle invalidate any of the above views as correct views of an operating computer. The observer dependence isn’t an issue if (modulo translation/equivalence classes for abstraction between languages) they all give the same function or behaviour. You can block things up as many low level behaviours or a smaller number of high level ones; this doesn’t invalidate a computational view. What is the computation implemented by starting Excel? What details do you care about? It doesn’t matter to a functionalist, as the computations are equivalent, albeit in different languages or formalisms.

The critique of aboutness is similar to your issues over colour. You percieve “X is about Y” and thus assume it to be ontologically fundamental. Semantic content is a compressed and inaccurate rendition of low level states: Useful for communicating and processing if you don’t care about the details. Indeed the only reason we care about this kind of semantics is that our own wetware implements theory-of-mind directly. Good idea for predicting cognitive agents; not neccessarily a true statement about the world. The “Y” that “X” is “about” is another contraction—an infered property of a model.

“Time” is as flexible as your neural architecture wants it to be. Causality is a good idea, for Darwinian reasons, but people’s perception of the flow of time is adjustable. I will point out that your senses imply strongly that the world is a 2D surface. Have you ever been able to see behind an object without moving your head? I haven’t either, therefor clearly this 3D stuff is bunkum—the world is a flat plane and I directly percieve part of one side of it. Ditto time. Causality limits the state of a cognitive thing to be dependent on its previous states and its light cone at this point in space-time, and you percieve time to flow because you can remember previous brain states, and depending on them (compressed somewhat) is good for survival.

And now for unity of conciousness. It isn’t unitary. Multiple personality, dissociative disorders, blindsight, sleepwalking, alien hand, need I go on? I percieve my own representation of reality to be unitary; I know for a fact that it’s half made up. You claim that the individual issues “just can’t” be the whole story. Why? Personal incredulity isn’t an argument. The brain in the skull you call yours isn’t just running a single cognitive entity. You move before even realising “you” were going to; you are unconcious of breathing until you decide to be. Why is a unitary conciousness fundamental? Why isn’t it just a shortcut to approximate “you” and others in planning the future and figuring out the present?

Joanna Morningstar 9 Jan 2010 17:42 UTC
1 point
in reply to: byrnema’s comment on: Hypotheses For Dualism
You’re lucky I’m not one of those without abstract imagined mental imagery, which would weaken your point somewhat. The fact that you can imagine a given stimulus, for example pure blue or a circle, does not imply that conciousness precedes the stimuli or the compact representation of it.

What it implies is that you can plan, reason counterfactually, conceive of what is not. Call it what you will. I’m not suggesting that neurological compression (my qualia) are required for conciousness, only that they likely came earlier. Your internal processes are not a hierarchy of cognitive processes; it’s parallel processing run riot. Being able to hacking the more basic sensory compressors and reasoning systems doesn’t make conciousness prior to them, any more than the ability for concious control of breathing makes conciousness prior to that.

Joanna Morningstar 9 Jan 2010 23:27 UTC
0 points
in reply to: byrnema’s comment on: Hypotheses For Dualism
The visualisation (of abstract things) seems to be the important point; inability to interact with simulations of reality would preclude planning or memory, and would be pathological.

As a monist I think I understand the words uttered by dualists, and even the phenomena being described. What I do not know is why these things are perceived to be fundamental things. It does not bother me overmuch to recognise that my senses need not project out into the world. I will note that mathematics deals in the properties of unseen abstract things, which may make it easier to conceive of representations that aren’t fundamental in themselves.

Joanna Morningstar 10 Jan 2010 10:17 UTC
4 points
on: Dennett’s “Consciousness Explained”: Prelude
The suggestion that the integration of new sense-data into a model is at least partly driven by the state of the model is further supported by images with multiple interpretations (classically the Necker cube or shadows of rotating objects). Data consistent with multiple models is integrated into the currently held one. Inattentional blindness is a similar phenomena.

… consciousness is a big, strange problem. Not intelligence, not even assigning meaning to representations, but consciousness.

Why?

Mitchell Porter hasn’t explained this either. What do you deem conciousness to be? Is this typical minds at another level? To me, at least, the argument seems analogous to prime mover arguments, in that it is asserted that no finite regress of physical causes could account for (consciousness/the universe), and thus we must extend ontology.

Joanna Morningstar 10 Jan 2010 10:24 UTC
3 points
in reply to: cousin_it’s comment on: Dennett’s “Consciousness Explained”: Prelude
Being No One, Metzinger. Review and overview here. Precis here.

It’s heavy cognitive neurology, but it does attempt to find minimal sets of properties needed for subjectivity and conciousness. It also suggests that the fundamental problem in monist/dualist debates is that the processes of cognition are invisible to self-inspection.

Joanna Morningstar 10 Jan 2010 11:09 UTC
−1 points
in reply to: cousin_it’s comment on: Dennett’s “Consciousness Explained”: Prelude
So you demand AGI-level projects be completed before admitting even in principle that conciousness might be a solvable problem?

Do you apply similar standards to evolution by means of natural selection?

Metzinger identifies a plausible set of minimal properties, and justifies that selection on the basis of neurological work and thinking. It’s as much philosophising as reverse engineering “mind” based on failure modes.

Joanna Morningstar 10 Jan 2010 11:24 UTC
−1 points
in reply to: PhilGoetz’s comment on: Hypotheses For Dualism
Gravity is the macro-scale effect of non-euclidean space Balls rolling in curves on rubber are the macro effect of the rubber not being flat.

Space has a tensor field of the locally correct lorentz transform. Rubber has a vector field of the local gradient. Both are derivatives; the fact they aren’t constant implies non-eulcidean geometry

The laplacian (second derivative) of space appears made discontinuous only by mass-energy Ditto rubber.

If it isn’t mysterious why rubber sheets get distorted, then it shouldn’t be mysterious why space is distorted. Both are minimising the deviation of second derivative from a specified forcing, and have dynamics for the forcing over time. They are identical processes.
What links here?
- byrnema's comment on Hypotheses For Dualism by byrnema (10 Jan 2010 22:52 UTC; 5 points)