justinpombrio

Karma: 655

justinpombrio 30 Apr 2024 17:02 UTC
3 points
0
in reply to: avturchin’s comment on: Magic by forgetting
Exactly.

justinpombrio 29 Apr 2024 15:33 UTC
1 point
0
in reply to: avturchin’s comment on: Magic by forgetting

However, If I already know that I have the disease, and I am not altruistic to my copies, playing such game is a wining move to me?

Correct. But if you don’t have the disease, you’re probably also not altruistic to your copies, so you would choose not to participate. Leaving the copies of you with the disease isolated and unable to “trade”.

justinpombrio 28 Apr 2024 13:41 UTC
5 points
4
in reply to: avturchin’s comment on: Magic by forgetting
Not “almost no gain”. My point is that it can be quantified, and it is exactly zero expected gain under all circumstances. You can verify this by drawing out any finite set of worlds containing “mediators”, and computing the expected number of disease losses minus disease gains as:

num(people with disease)*P(person with disease meditates)*P(person with disease who meditates loses the disease) - num(people without disease)*P(person without disease meditates)*P(person without disease who meditates gains the disease)

My point is that this number is always exactly zero. If you doubt this, you should try to construct a counterexample with a finite number of worlds.

justinpombrio 28 Apr 2024 3:54 UTC
1 point
0
in reply to: avturchin’s comment on: Magic by forgetting
My point still stands. Try drawing out a specific finite set of worlds and computing the probabilities. (I don’t think anything changes when the set of worlds becomes infinite, but the math becomes much harder to get right.)

justinpombrio 24 Apr 2024 21:34 UTC
17 points
13
on: Magic by forgetting

There is a 0.001 chance that someone who did not have the disease will get it. But he can repeat the procedure.

No, that doesn’t work. It invalidates the implicit assumption you’re making that the probability that a person chooses to “forget” is independent of whether they have the disease. Ultimately, you’re “mixing” the various people who “forgot”, and a “mixing” procedure can’t change the proportion of people who have the disease.

When you take this into account, the conclusion becomes rather mundane. Some copies of you can gain the disease, while a proportional number of copies can lose it. (You might think you could get some respite by repeatedly trading off “who” has the disease, but the forgetting procedure ensures that no copy ever feels respite, as that would require remembering having the disease.)

justinpombrio 3 Apr 2024 20:46 UTC
4 points
0
in reply to: zhukeepa’s comment on: The Cognitive-Theoretic Model of the Universe: A Partial Summary and Review

I think formalizing it in full will be a pretty nontrivial undertaking, but formalizing isolated components feels tractable, and is in fact where I’m currently directing a lot of my time and funding.

Great. Yes, I think that’s the thing to do. Start small! I (and presumably others) would update a lot from a new piece of actual formal mathematics from Chris’s work. Even if that work was, by itself, not very impressive.

(I would also want to check that that math had something to do with his earlier writings.)

My current understanding is that he believes that his current written work should be sufficient for modern mathematicians and scientists to understand his core ideas

Uh oh. The “formal grammar” that I checked used formal language, but was not even close to giving a precise definition. So Chris either (i) doesn’t realize that you need to be precise to communicate with mathematicians, or (ii) doesn’t understand how to be precise.

Please be prepared for the possibility that Chris is very smart and creative, and that he’s had some interesting ideas (e.g. Syndiffeonesis), but that his framework is more of a interlocked collection of ideas than anything mathematical (despite using terms from mathematics). Litany of Tarsky and all that.

justinpombrio 3 Apr 2024 14:48 UTC
1 point
1
in reply to: zhukeepa’s comment on: The Cognitive-Theoretic Model of the Universe: A Partial Summary and Review
“gesture at something formal”—not in the way of the “grammar” it isn’t. I’ve seen rough mathematics and proof sketches, especially around formal grammars. This isn’t that, and it isn’t trying to be. There isn’t even an attempt at a rough definition for which things the grammar derives.

I think Chris’s work is most valuable to engage with for people who have independently explored philosophical directions similar to the ones Chris has explored

A big part of Chris’s preliminary setup is around how to sidestep the issues around making the sets well-ordered.

Nonsense! If Chris has an alternative to well-ordering, that’s of general mathematical interest! He would make a splash simply writing that up formally on its own, without dragging the rest of his framework along with it.

Except, I can already predict you’re going to say that no piece of his framework can be understood without the whole. Not even by making a different smaller framework that exists just to showcase the well-ordering alternative. It’s a little suspicious.

because someone else I’d funded to review Chris’s work

If you’re going to fund someone to do something, it should be to formalize Chris’s work. That would not only serve as a BS check, it would make it vastly more approachable.

I’m confused why you’re asking about specific insights people have gotten when Jessica has included a number of insights she’s gotten in her post

I was hoping people other than Jessica would share some specific curated insights they got. Syndiffeonesis is in fact a good insight.
What links here?
- CTMU insight: maybe consciousness *can* affect quantum outcomes? by zhukeepa (19 Apr 2024 15:23 UTC; 9 points)

justinpombrio 29 Mar 2024 14:46 UTC
23 points
2
on: The Cognitive-Theoretic Model of the Universe: A Partial Summary and Review
tldr; a spot check calls bullshit on this.

I know a bunch about formal languages (PhD in programming languages), so I did a spot check on the “grammar” described on page 45. It’s described as a “generative grammar”, though instead of words (sequences of symbols) it produces “L_O spacial relationships”. Since he uses these phrases to describe his “grammar”, and they have their standard meaning because he listed their standard definition earlier in the section, he is pretty clearly claiming to be making something akin to a formal grammar.

My spot check is then: is the thing defined here more-or-less a grammar, in the following sense?
1. There’s a clearly defined thing called a grammar, and there can be more than one of them.
2. Each grammar can be used to generate something (apparently an L_O) according to clearly defined derivation rules that depend only on the grammar itself.
If you don’t have a thing plus a way to derive stuff from that thing, you don’t have anything resembling a grammar.

My spot check says:
1. There’s certainly a thing called a grammar. It’s a four-tuple, whose parts closely mimic that of a standard grammar, but using his constructs for all the basic parts.
2. There’s no definition of how to derive an “L_O spacial relationship” given a grammar. Just some vague references to using “telic recursion”.
I’d categorize this section as “not even wrong”; it isn’t doing anything formal enough to have a mistake in it.

Another fishy aspect of this section is how he makes a point of various things coinciding, and how that’s very different from the standard definitions. But it’s compatible with the standard definitions! E.g. the alphabet of a language is typically a finite set of symbols that have no additional structure, but there’s no reason you couldn’t define a language whose symbols were e.g. grammars over that very language. The definition of a language just says that its symbols form a set. (Perhaps you’d run into issues with making the sets well-ordered, but if so he’s running headlong into the same issues.)

I’m really not seeing any value in this guy’s writing. Could someone who got something out of it share a couple specific insights that got from it?

justinpombrio 2 Mar 2024 17:47 UTC
5 points
0
in reply to: Adam Zerner’s comment on: The Parable Of The Fallen Pendulum—Part 1
How did you find me? How do they always find me? No matter...

Have you tried applying your models to predict the day’s weather, or what your teacher will be wearing that day? I bet not: they wouldn’t work very well. Models have domains in which they’re meant to be applied. More precise models tend to have more specific domains.

Making real predictions about something, like what the result of a classroom experiment will be even if the pendulum falls over, is usually outside the domain of any precise model. That’s why your successful models are compound models, using Newtonian mechanics as a sub-model, and that’s why they’re so unsatisfyingly vague and cobbled together.

There is a skill to assembling models that make good predictions in messy domains, and it is a valuable skill. But it’s not the goal of your physics class. That class is trying to teach you about precise models like Newtonian mechanics. Figuring out exactly how to apply Newtonian mechanics to a real physical experiment is often harder than solving the Newtonian math! But surely you’ve noticed by now that, in the domains where Newtonian mechanics seems to actually apply, it applies very accurately?

This civilization we live in tends to have two modes of thinking. The first is ‘precise’ thinking, where people use precise models but don’t think about the mismatch between their domain and reality. The model’s domain is irrelevant in the real world, so people will either inappropriately apply the model outside its domain or carefully only make statements within the model’s domain and hope that others will make that incorrect leap on their own. The other mode of thinking is ‘imprecise’ thinking, where people ignore all models and rely on their gut feelings. We are extremely bad, at the moment, of the missing middle path of making and recognizing models for messy domains.

justinpombrio 1 Mar 2024 15:49 UTC
28 points
13
on: The Parable Of The Fallen Pendulum—Part 1
“There’s no such thing as ‘a Bayesian update against the Newtonian mechanics model’!” says a hooded figure from the back of the room. “Updates are relative: if one model loses, it must be because others have won. If all your models lose, it may hint that there’s another model you haven’t thought of that does better than all of them, or it may simply be that predicting things is hard.”

“Try adding a couple more models to compare against. Here’s one: pendulums never swing. And here’s another: Newtonian mechanics is correct but experiments are hard to perform correctly, so there’s a 80% probability that Newtonian mechanics gives the right answer and 20% probability spread over all possibilities including 5% on ‘the pendulum fails to swing’. Continue to compare these models during your course, and see which one wins. I think you can predict it already, despite your feigned ignorance.”

The hooded figure opens a window in the back of the room and awkwardly climbs through and walks off.

justinpombrio 23 Feb 2024 15:48 UTC
1 point
0
in reply to: numpyNaN’s comment on: The Pareto Best and the Curse of Doom
Are we assuming things are fair or something?

I would have modeled this as von Neumann getting 300 points and putting 260 of them into the maths and sciences and the remaining 40 into living life and being well adjusted.

justinpombrio 22 Nov 2023 19:26 UTC
1 point
0
in reply to: dyokomizo’s comment on: Implementing Decision Theory
Oh, excellent!

It’s a little hard to tell from the lack of docs, but you’re modelling dilemmas with Bayesian networks? I considered that, but wasn’t sure how to express Sleeping Beauty nicely, whereas it’s easy to express (and gives the right answers) in my tree-shaped dilemmas. Have you tried to express Sleeping Beauty?

And have you tried to express a dilemma like smoking lesion where the action that an agent takes is not the action their decision theory tells them to take? My guess is that this would be as easy as having a chain of two probabilistic events, where the first one is what the decision theory says to do and the second one is what the agent actually does, but I don’t see any of this kind of dilemma in your test cases.

justinpombrio 16 Nov 2023 18:54 UTC
1 point
in reply to: Waller-Hefkó János’s comment on: My attitude towards death
I have a healthy fear of death; it’s just that none of it stems from an “unobserved endless void”. Some of the specific things I fear are:
- Being stabbed is painful and scary (it’s scary even if you know you’re going to live)
- Most forms of dying are painful, and often very slow
- The people I love mourning my loss
- My partner not having my support
- Future life experiences, not happening
- All of the things I want to accomplish, not happening
The point I was making in this thread was that “unobserved endless void” is not on this list, I don’t know how to picture it, and I’m surprised that other people think it’s a big deal.

Who knows, maybe if I come close to dying some time I’ll suddenly gain a new ontological category of thing to be scared of.

justinpombrio 12 Nov 2023 0:22 UTC
2 points
0
in reply to: Nicolas Macé’s comment on: Implementing Decision Theory
What’s the utility function of the predictor? Is there necessarily a utility function for the predictor such that the predictor’s behavior (which is arbitrary) corresponds to maximizing its own utility? (Perhaps this is mentioned in the paper, which I’ll look at.)

EDIT: do you mean to reduce a 2-player game to a single-agent decision problem, instead of vice-versa?

justinpombrio 10 Nov 2023 23:12 UTC
2 points
0
in reply to: Nicolas Macé’s comment on: Implementing Decision Theory
I was not aware of Everitt, Leike & Hutter 2015, thank you for the reference! I only delved into decision theory a few weeks ago, so I haven’t read that much yet.

Would you say that this is similar to the connection that exists between fixed points and Nash equilibria?

Nash equilibria come from the fact that your action depends on your opponent’s action, which depends on your action. When you assume that each player will greedily change their action if it improves their utility, the Nash equilibria are the fixpoints at which no player changes their action.

In single-agent decision theory problems, your (best) action depends on the situation you’re in, which depends on what someone predicted your action would be, which (effectively) depends on your action.

If there’s a deeper connection than this, I don’t know it. There’s a fundamental difference between the two cases, I think, because a Nash equilibrium involves multiple agents that don’t know each others’ decision process (problem statement: maximize the outputs of two functions independently), while single-agent decision theory involves just one agent (problem statement: maximize the output of one function).

justinpombrio 7 Nov 2023 21:39 UTC
1 point
0
in reply to: ACrackedPot’s comment on: Implementing Decision Theory

My solution, which assumes computation is expensive

Ah, so I’m interested in normative decision theory: how one should ideally behave to maximize their own utility. This is what e.g. UDT&FDT are aiming for. (Keep in mind that “your own utility” can, and should, often include other people’s utility too.)

Minimizing runtime is not at all a goal. I think the runtime of the decision theories I implemented is something like doubly exponential in the number of steps of the simulation (the number of events in the simulation is exponential in its duration; each decision typically involves running the simulation using a trivial decision theory).

reason about other agents based on their behavior towards a simplified-model third agent

That’s an interesting approach I hadn’t considered. While I don’t care about efficiency in the “how fast does it run” sense, I do care about efficiency in the “does it terminate” sense, and that approach has the advantage of terminating.

Defect against bots who defect against cooperate-bot, otherwise cooperate

You’re doing to defect against UDT/FDT then. They defect against cooperate-bot. You’re thinking it’s bad to defect against cooperate-bot, because you have empathy for the other person. But I suspect you didn’t account for that empathy in your utility function in the payoff matrix, and that if you do, you’ll find that you’re not actually in a prisoner’s dilemma in the game-theory sense. There was a good SlateStarCodex post about this that I can’t find.

Implementing Decision Theory

justinpombrio7 Nov 2023 17:55 UTC

21 points

12 comments3 min readLW link

justinpombrio 7 Nov 2023 17:07 UTC
4 points
2
in reply to: Sune’s comment on: Are language models good at making predictions?
Yeah, exactly. For example, if humans had a convention of rounding probabilities to the nearest 10% when writing them, then baseline GPT-4 would follow that convention and it would put a cap on the maximum calibration it could achieve. Humans are badly calibrated (right?) and baseline GPT-4 is mimicking humans, so why is it well calibrated? It doesn’t follow from its token stream being well calibrated relative to text.

justinpombrio 29 Sep 2023 15:48 UTC
9 points
2
on: Peacewagers so Far
I like the idea of Peacemakers. I even had the same idea myself—to make an explicitly semi-cooperative game with a goal of maximizing your own score but every player having a different scoring mechanism—but haven’t done anything with it.

That said, I think you’re underestimating how much cooperation there is in a zero-sum game.

If you offer a deal, you must be doing it because it increases your chance of winning, but only one person can win under the MostPointsWins rule, so that deal couldn’t be very good for me, and I’ll always suspect your deal of being a trick, so in most cases no detailed deals will be offered.

Three examples of cooperation that occur in three-player Settlers of Catan (between, say, Alice, Bob, and Carol), even if all players are trying only to maximize their own chance of winning:
- Trading. Trading increases the chances that the two trading players win, to the detriment of the third. As long as there’s sufficient uncertainty about who’s winning, you want to trade. (There’s a world Catan competition. I bet that these truly competitive games involve less trading than you would do with your friends, but still a lot. Not sure how to find out.)
- Refusing to trade with the winning player, once it’s clear who that is. If Alice is ahead then Bob and Carol are in a prisoner’s dilemma, where trading with Alice is defecting.
- Alice says at the beginning of the game: “Hey Bob, it sure looks like Carol has the strongest starting position, doesn’t it? Wouldn’t be very fair if she won just because of that. How about we team up against her by agreeing now to never trade with her for the entire game?” If Bob agrees, than the winning probabilities of Alice, Bob, Carol go from (say) 20%,20%,60% to 45%,45%,10%. Cooperation!
So it’s not that zero-sum games lack opportunities for cooperation, it’s just that every opportunity for cooperation with another player is at the detriment to a third. Which is why there isn’t any cooperation at all in a two player zero-sum game.

Realize that even in a positive-sum game, players are going to be choosing between doing things for the betterment of everyone, and doing things for the betterment of themselves, and maximizing your own score involves doing more of the latter than the former, ideally while convincing everyone else that you’re being more than fair.

Suggestion for the game: don’t say the goal is to maximize your score. Instead say you’re roleplaying a character who’s goal is to maximize [whatever]. For a few reasons:
- It makes every game (more) independent of every other game. This reduces the possibility that Alice sabotages Bob in their second game together because Bob was a dick in their first game together. The goal is to have interesting negotiations, not to ruin friendships.
- It encourages exploration. You can try certain negotiating tactics in one game, and then abandon them in the next, and the fact that you were “roleplaying” will hopefully reduce how much people associate those tactics with you instead of that one time you played.
- It could lighten the mood. You should try really hard to lighten the mood. Because you know what else is a semi-cooperative game that’s heavy on negotiation? Diplomacy.

justinpombrio 22 Aug 2023 3:28 UTC
8 points
2
in reply to: interstice’s comment on: Rice’s Theorem says that AIs can’t determine much from studying AI source code
Expanding on this, there are several programming languages (Idris, Coq, etc.) whose type system ensures that every program that type checks will halt when it’s run. One way to view a type system is as an automated search for a proof that your program is well-typed (and a type error is a counter-example). In a language like Idris or Coq, a program being well-typed implies that it halts. So machine generated proofs that programs halt aren’t just theoretically possible, they’re used extensively by some languages.

justinpombrio

Im­ple­ment­ing De­ci­sion Theory

Implementing Decision Theory