I think that your paraphrasing
I don’t think MIRI’s efforts are valuable because I think that AI in general has made no progress on AGI for the last 60 years, but aside from that MIRI isn’t doing anything wrong in particular, and it would be an admittedly different story if I thought that AI in general was making progress on AGI.
is pretty close to my position.
I would qualify it by saying:
I’d replace “no progress” with “not enough progress for there to be a known research program with a reasonable chance of success.”
I have high confidence that some of the recent advances in narrow AI will contribute (whether directly or indirectly) to the eventual creation of AGI (contingent on this event occurring), just not necessarily in a foreseeable way.
If I discover that there’s been significantly more progress on AGI than I had thought, then I’ll have to reevaluate my position entirely. I could imagine updating in the direction of MIRI’s FAI work being very high value, or I could imagine continuing to believe that MIRI’s FAI research isn’t a priority, for reasons different from my current ones.
A few nitpicks on choice of “Brier-boosting” as a description of CFAR’s approach:
Predictive power is maximized when Brier score is minimized
Brier score is the sum of squared differences between the probabilities assigned to events and indicator variables that are 1 or 0 according to whether the event did or did not occur. Good calibration therefore corresponds to minimizing Brier score rather than maximizing it, whereas “Brier-boosting” suggests maximization.
What’s referred to as “quadratic score” is essentially the same as the negative of Brier score, and so maximizing quadratic score corresponds to maximizing predictive power.
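To see concretely that calibration minimizes Brier score, here is a short sketch (the function name is mine, chosen for illustration): for a single binary event with true probability p, the expected Brier contribution p(1 − q)² + (1 − p)q² of a forecast q is minimized exactly at q = p.

```python
# Sketch: expected Brier-score contribution for one binary event with
# true probability p, as a function of the forecast q.
# X ~ Bernoulli(p), so E[(q - X)^2] = p*(1 - q)^2 + (1 - p)*q^2.

def expected_brier(p, q):
    """Expected squared error of forecast q when the event has probability p."""
    return p * (1 - q) ** 2 + (1 - p) * q ** 2

p = 0.7
forecasts = [i / 100 for i in range(101)]
best = min(forecasts, key=lambda q: expected_brier(p, q))
print(best)  # the minimizing forecast is the true probability, q = p = 0.7
```

Searching over a grid of forecasts, the minimizer coincides with the true probability, which is the sense in which Brier score rewards calibration.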
Brier score fails to capture our intuitions about assignment of small probabilities
A more substantive point is that even though the Brier score is minimized by being well-calibrated, the way in which it varies with the probability assigned to an event does not correspond to our intuitions about how good a probabilistic prediction is. For example, suppose four observers A, B, C and D assigned probabilities 0.5, 0.4, 0.01 and 0.000001 (respectively) to an event E occurring, and the event turns out to occur. Intuitively, B’s prediction is only slightly worse than A’s prediction, whereas D’s prediction is much worse than C’s prediction. But the difference between the increases in B’s and A’s Brier scores is 0.36 − 0.25 = 0.11, which is much larger than the corresponding difference for D and C, which is approximately 0.02.
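The arithmetic above can be checked with a few lines of Python (a throwaway sketch; the variable names are mine):

```python
# Brier-score increments for the four observers above, given that the
# event occurs (outcome indicator = 1): each increment is (1 - p)^2.
probs = {"A": 0.5, "B": 0.4, "C": 0.01, "D": 0.000001}
increments = {name: (1 - p) ** 2 for name, p in probs.items()}

print(increments["B"] - increments["A"])  # 0.36 - 0.25 = 0.11
print(increments["D"] - increments["C"])  # approximately 0.02
```

So the penalty gap between the badly wrong D and the merely wrong C is far smaller than the gap between the nearly indistinguishable A and B.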
Brier score is not constant across mathematically equivalent formulations of the same prediction
Suppose that a basketball player is to take three free throws. Observer A predicts that the player makes each one with probability p. Observer B accepts observer A’s estimate and notes that it implies that the probability of the player making all three free throws is p^3, and so makes that prediction.
Then if the player makes all three free throws, observer A’s Brier score increases by
3*(1 - p)^2
while observer B’s Brier score increases by
(1 - p^3)^2
But these two expressions are not equal in general, e.g. for p = 0.9 the first is 0.03 and the second is 0.073441. So changes in Brier score depend on the formulation of a prediction rather than on the prediction itself.
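The two increments for p = 0.9 can be verified directly (an illustrative sketch, using the formulas above):

```python
# Brier-score increments when the player makes all three free throws.
p = 0.9

# Observer A made three separate predictions, each assigning p to "make":
a_increment = 3 * (1 - p) ** 2

# Observer B made one prediction assigning p**3 to "makes all three":
b_increment = (1 - p ** 3) ** 2

print(a_increment)  # approximately 0.03
print(b_increment)  # approximately 0.073441
```

The same underlying belief, expressed as three predictions versus one, is scored very differently.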
======
The logarithmic scoring rule handles small probabilities well, and is invariant under changes in the representation of a prediction, and so is preferable. I first learned of this from Eliezer’s essay A Technical Explanation of a Technical Explanation.
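The invariance claim can be illustrated on the free-throw example (a sketch, taking log score to mean negative log probability of the observed outcome, so that lower is better):

```python
import math

# Log-score increments when the player makes all three free throws.
p = 0.9

# Observer A: three separate predictions, each assigning p to "make".
a_log_score = 3 * -math.log(p)

# Observer B: one prediction assigning p**3 to "makes all three".
b_log_score = -math.log(p ** 3)

print(a_log_score, b_log_score)  # equal, up to floating-point error
```

Because log(p^3) = 3 log(p), the two formulations of the same prediction receive the same score, unlike with the Brier score.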
Minimizing logarithmic score is equivalent to maximizing the likelihood function for logistic regression / binary classification. Unfortunately, the phrase “likelihood boosting” has one more syllable than “Brier boosting” and doesn’t have the same alliterative ring to it, so I don’t have an actionable alternative suggestion :P.