At least not when they’re already in the bucket.
janos
Bayes’ Theorem never returns “undefined”. In the absence of any evidence it returns the prior.
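To illustrate (a minimal sketch with made-up numbers, not anything from the discussion): when the evidence is equally likely under every hypothesis, the update is a no-op and the posterior is exactly the prior.

```python
# Minimal sketch: with a flat likelihood (evidence equally likely under
# every hypothesis), Bayes' theorem returns the prior unchanged.
# Hypothesis names and numbers are made up for illustration.

def posterior(prior, likelihood):
    """prior: dict hypothesis -> P(h); likelihood: dict hypothesis -> P(e|h)."""
    unnorm = {h: prior[h] * likelihood[h] for h in prior}
    z = sum(unnorm.values())
    return {h: p / z for h, p in unnorm.items()}

prior = {"H1": 0.3, "H2": 0.7}
flat = {"H1": 1.0, "H2": 1.0}  # "no evidence": same likelihood everywhere
print(posterior(prior, flat))  # identical to the prior
```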
Do you have some good examples of abuse of Bayes’ theorem?
Interesting. My internal experience of programming is quite different; I don’t see boxes and lines. Data structures for me are more like people who answer questions, although of course with no personality or voice; the voice is mine as I ask them a question, and they respond in a “written” form, i.e. with a silent indication. So the diagrams people like to draw for databases and such don’t make direct sense to me per se; they’re just a way of organizing written information.
I am finding it quite difficult to coherently and correctly describe such things; no part of this do I have any certainty of, except that I know I don’t imagine black-and-white box diagrams.
I think you’re confusing the act of receiving information/understanding about an experience with the experience itself.
Re: the joke example, I think that one would get tired of hearing a joke too many times, and that’s what the dissection is equivalent to, because you keep hearing it in your head; but if you already get the joke, the dissection is not really adding to your understanding. If you didn’t get the joke, you will probably receive a twinge of enjoyment at the moment when you finally do understand. If you don’t understand a joke, I don’t think you can get warm fuzzies from it.
With hormones, again I think that being explicitly reminded of the role of hormones in physical attraction while experiencing physical attraction reduces warm fuzzies only because it’s distracting you from the source of the warm fuzzies and making you feel self-conscious. On the other hand, knowing more about the role of hormones should not generally distract you from your physical attraction; instead you could use it to, ta-da, get more warm fuzzies.
I am trying to understand the examples on that page, but they seem strange; shouldn’t there be a model with parameters, and a prior distribution for those parameters? I don’t understand the inferences. Can someone explain?
Updated, eh? Where did your prior come from? :)
Since we’re discussing (among other things) noninformative priors, I’d like to ask: does anyone know of a decent (noninformative) prior for the space of stationary, bidirectionally infinite sequences of 0s and 1s?
Of course in any practical inference problem it would be pointless to consider the infinite joint distribution, and you’d only need to consider what happens for a finite chunk of bits, i.e. a higher-order Markov process, described by a bunch of parameters (probabilities) which would need to satisfy some linear inequalities. So it’s easy to find a prior for the space of mth-order Markov processes on {0,1}; but these obvious (uniform) priors aren’t coherent with each other: the uniform prior on (m+1)th-order processes doesn’t marginalize to the uniform prior on mth-order processes.
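For concreteness, here is a sketch of drawing from the obvious uniform prior on mth-order Markov processes: one next-bit probability per length-m context, each drawn uniformly on [0,1]. The order, the seed, and the starting context are all arbitrary choices of mine; in particular the initial context is not drawn from the stationary distribution, which is one symptom of the coherence problem mentioned.

```python
import random

# Sketch of the "obvious" uniform prior on mth-order Markov processes over
# {0,1}: for each length-m context, P(next bit = 1) is drawn uniformly on
# [0,1]. Order m = 2 and the seed are arbitrary illustrative choices.

def sample_markov_model(m, rng):
    # one parameter per context: P(x_{i+1} = 1 | last m bits = context)
    return {ctx: rng.random() for ctx in range(2 ** m)}

def generate(model, m, n, rng):
    bits, ctx = [], 0  # arbitrary starting context (not stationary)
    for _ in range(n):
        b = 1 if rng.random() < model[ctx] else 0
        bits.append(b)
        ctx = ((ctx << 1) | b) & ((1 << m) - 1)  # slide the context window
    return bits

rng = random.Random(0)
model = sample_markov_model(2, rng)
seq = generate(model, 2, 20, rng)
```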
I suppose it’s possible to normalize these priors so that they’re coherent, but that seems to result in much ugliness. I just wonder if there’s a more elegant solution.
The purpose would be to predict regularities in a “language”, e.g. to try to achieve decent data compression in a way similar to other Markov-chain-based approaches. In terms of properties, I can’t think of any nontrivial ones, except the usual important one that the prior assign nonzero probability to every open set; mainly I’m just trying to find something that I can imagine computing with.
It’s true that there exists a bijection between this space and the real numbers, but it doesn’t seem like a very natural one, though it does work (it’s measurable, etc). I’ll have to think about that one.
Each element of the set is characterized by a bunch of probabilities; for example there is p_01101, which is the probability that elements x_{i+1} through x_{i+5} are 01101, for any i. I was thinking of using the topology induced by these maps (i.e. generated by preimages of open sets under them).
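As a sketch of what these p_w mean operationally (toy data of my own, not anything from the discussion): given a finite sample of the sequence, the natural estimate of p_w counts how often w occurs among all windows of length |w|.

```python
# Hedged sketch: estimating the string probabilities p_w from a finite
# sample of a (hopefully stationary) sequence, by counting occurrences of
# w among all windows of length |w|. The toy sequence is illustrative.

def p_hat(bits, w):
    k = len(w)
    windows = [tuple(bits[i:i + k]) for i in range(len(bits) - k + 1)]
    return windows.count(tuple(w)) / len(windows)

bits = [0, 1, 1, 0, 1] * 20        # toy periodic sequence
print(p_hat(bits, [0, 1]))          # fraction of length-2 windows equal to 01
```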
How is putting a noninformative prior on the reals hard? With the usual required invariance, the uniform (improper) prior does the job. I don’t mind having the prior be improper here either, and as I said I don’t know what invariance I should want; I can’t think of many interesting group actions that apply. Though of course 0 and 1 should be treated symmetrically; but that’s trivial to arrange.
I guess you’re right that regularities can be described more generally with computational models; but I expect them to be harder to deal with than this (relatively) simple, noncomputational (though stochastic) model. I’m not looking for regularities among the models, so I’m not sure how a computational model would help me.
Right, that is a good piece. But I’m afraid I was unclear. (Sorry if I was.) I’m looking for a prior over stationary sequences of digits, not just sequences. I guess the adjective “stationary” can be interpreted in two compatible ways. The first: I’m talking about sequences such that for every possible string w, the proportion of substrings of length |w| that are equal to w, among all substrings of length |w|, tends to a limit as you consider more and more substrings (either extending forward or backward in the sequence); this would not quite be a prior over generators, and isn’t what I meant.
The cleaner thing I could have meant (and did) is the collection of stationary sequence-valued random variables, each of which (up to isomorphism) is completely described by the probabilities p_w of a string of length |w| coming up as w. These, then, are generators.
Nope; it’s the limit of the expanding sequence I(w), I(J(I(w))), I(J(I(J(I(w))))), …, where I(S) for a set S is the union of the elements of I that have nonempty intersections with S, i.e. the union of I(x) over all x in S, and J(S) is defined the same way.
Alternately if instead of I and J you think about the sigma-algebras they generate (let’s call them sigma(I) and sigma(J)), then sigma(I meet J) is the intersection of sigma(I) and sigma(J). I prefer this somewhat because the machinery for conditional expectation is usually defined in terms of sigma-algebras, not partitions.
- (11 Dec 2009 15:41 UTC; 1 point) comment on “Probability Space & Aumann Agreement”
Huh? The reference set Ω is the set of possible world histories, out of which one element is the actual world history. I don’t see what’s wrong with this.
That simplification is a situation in which there is no common knowledge. In world-state w, agent 1 knows A1 (meaning knows that the correct world is in A1), and agent 2 knows A2. They both know A1 union A2, but that’s still not common knowledge, because agent 1 doesn’t know that agent 2 knows A1 union A2.
I(w) is what agent 1 knows, if w is correct. If all you know is S, then the only thing you know agent 1 knows is I(S), and the only thing that you know agent 1 knows agent 2 knows is J(I(S)), and so forth. This is why the usual “everyone knows that everyone knows that …” definition of common knowledge translates to the limit of I(w), J(I(w)), I(J(I(w))), ….
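This iteration is easy to mechanize. A sketch (with toy partitions of a four-element state space, chosen by me, not taken from the post): alternately apply J and I to the current set until it stops growing; the fixed point is the cell of I meet J containing w.

```python
# Sketch of the common-knowledge iteration: alternately apply the two
# partitions until the set stops growing. The fixed point is the cell of
# (I meet J) containing w. The partitions below are toy examples.

def cell(partition, x):
    return next(c for c in partition if x in c)

def apply_partition(partition, s):
    # union of the partition cells that intersect s
    out = set()
    for x in s:
        out |= cell(partition, x)
    return out

def common_knowledge_cell(I, J, w):
    s = cell(I, w)
    while True:
        t = apply_partition(I, apply_partition(J, s))
        if t == s:
            return s
        s = t

# toy state space {1, 2, 3, 4}
I = [{1, 2}, {3, 4}]
J = [{1}, {2, 3}, {4}]
print(common_knowledge_cell(I, J, 1))  # grows to the whole space
```

With these partitions the meet is trivial, so nothing short of the whole space is common knowledge; with I = J, by contrast, the iteration stops immediately at I(w).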
As far as I understand, agent 1 doesn’t know that agent 2 knows A2, and agent 2 doesn’t know that agent 1 knows A1. Instead, agent 1 knows that agent 2’s state of knowledge is in J and agent 2 knows that agent 1’s state of knowledge is in I. I’m a bit confused now about how this matches up with the meaning of Aumann’s Theorem. Why are I and J common knowledge, and {P(A|I)=q} and {P(A|J)=q} common knowledge, but I(w) and J(w) are not common knowledge? Perhaps that’s what the theorem requires, but currently I’m finding it hard to see how I and J being common knowledge is reasonable.
Edit: I’m silly. I and J don’t need to be common knowledge at all. It’s not agent 1 and agent 2 who perform the reasoning about I meet J, it’s us. We know that the true common knowledge is a set from I meet J, and that therefore if it’s common knowledge that agent 1’s posterior for the event A is q1 and agent 2’s posterior for A is q2, then q1=q2. And it’s not unreasonable for these posteriors to become common knowledge without I(w) and J(w) becoming common knowledge. The theorem says that if you’re both perfect Bayesians and you have the same priors then you don’t have to communicate your evidence.
But if I and J are not common knowledge then I’m confused about why any event that is common knowledge must be built from the meet of I and J.
What I don’t like about the example you provide is: what player 1 and player 2 know needs to be common knowledge. For instance if player 1 doesn’t know whether player 2 knows whether die 1 is in 1-3, then it may not be common knowledge at all that the sum is in 2-6, even if player 1 and player 2 are given the info you said they’re given.
This is what I was confused about in the grandparent comment: do we really need I and J to be common knowledge? It seems so to me. But that seems to be another assumption limiting the applicability of the result.
It’s provided in the linked page; you need to scroll down to see it.
Echoing the others:
If we suppose these are 22 iid samples from a Poisson distribution, then the maximum-likelihood estimate of the Poisson parameter is 0.82 (the sample mean). Simulating draws from such a Poisson and looking at the sample correlation between Jan 15-Feb 4 and Jan 16-Feb 5, the p-value is 0.1. And when testing Poisson-ness against negative binomial clustering (with the same mean), the locally most powerful test uses the statistic (x-1.32)^2 and gives a simulated p-value of 0.44.
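The original data and code aren’t reproduced here, so this is only a hedged sketch of the first simulation: the correlation between the two overlapping 21-day windows is the lag-1 sample correlation of the 22 counts, and one simulated draw of that statistic looks like this (lambda = 0.82 and n = 22 come from the comment; the seed and the sampling method are my choices).

```python
import math
import random

# Hedged sketch of one simulation step described above. lambda = 0.82 and
# n = 22 are taken from the comment; the seed and method are illustrative.

def poisson_draw(lam, rng):
    # Knuth's method; fine for small lambda
    L, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= L:
            return k
        k += 1

def lag1_correlation(xs):
    # correlation of x_1..x_{n-1} with x_2..x_n (the overlapping windows)
    a, b = xs[:-1], xs[1:]
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    if va == 0 or vb == 0:
        return 0.0  # degenerate sample; correlation undefined
    return cov / (va * vb) ** 0.5

rng = random.Random(1)
sample = [poisson_draw(0.82, rng) for _ in range(22)]
r = lag1_correlation(sample)  # one simulated draw of the test statistic
```

Repeating this many times and taking the fraction of simulated correlations at least as extreme as the observed one would give the kind of p-value quoted.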
Regarding investment, my suggestion (if you work in the US) is to open a basic E*TRADE account here (basic because it doesn’t periodically charge you fees). They will provide an interface for buying and selling shares of stocks and various other things (ETFs and such; I mention stocks and ETFs because those are the only things I’ve tried doing anything with). They will charge you $10 for every transaction you make, so unless you’re going to be (or become) active/clever enough to make it worthwhile, it makes sense not to trade too frequently.
EDIT: These guys appear to charge less, though they also deal in fewer things (e.g. no bonds).
- Topics from “Procedural Knowledge Gaps” (11 Feb 2012 21:38 UTC; 58 points)
- (21 Apr 2011 3:32 UTC; 11 points) comment on “How can I make money?”
I’m mostly a lurker, but I’m in Toronto.