Radford Neal

Karma: 728

Radford Neal 9 Jul 2021 0:44 UTC
54 points
on: The Efficient Market Hypothesis in Research
The research community is very far from being efficient.
One of my own fields of research is Markov chain Monte Carlo methods, and their applications in computations for Bayesian models. Markov chain Monte Carlo (MCMC) was invented in the early 1950s, for use in statistical physics. It was not used by Bayesian statisticians until around 1990. There was no reason that it could not have been used before then—the methods of the 1950s could have been directly applied to many Bayesian inference problems.
In 1970, a paper generalizing the most common MCMC algorithm (the “Metropolis” method) was published in Biometrika, one of the top statistics journals. This didn’t prompt anyone to start using it for Bayesian inference.
In the early 1980s, MCMC was used by some engineers and computer scientists (eg, by Geoffrey Hinton for maximum likelihood inference for log-linear models with latent variables, also known as “Boltzmann machines”). This also didn’t prompt anyone to start using it for Bayesian inference.
After a form of MCMC starting being used by Bayeian statisticians around 1990, it took many years for the literature on MCMC methods used by physicists to actually be used by statisticians. This despite the fact that I wrote a review paper describing just about all these methods in terms readily accessible to statisticians in 1993.
In 1992, I started using the Hamiltonian Monte Carlo method (aka, hybrid Monte Carlo, or HMC) for Bayesian inference for neural network models. This method was invented by physicists in 1987. (It could have been invented in the 1950s, but just wasn’t.) I demonstrated that HMC was often hundreds or thousands of times faster than simpler methods, gave talks on this at conferences, and wrote my thesis (later book) on Bayesian learning in which this was a major theme. It wasn’t much used by other statisticians until after I wrote another review paper in 2010, which for some reason led to it catching on. It is now widely used in packages such as Stan.
Another of my research areas is error-correcting codes. In 1948, Claude Shannon proved his noisy coding theorem, establishing the theoretical (but not practical) limits of error correction. In 1963, Robert Gallager invented Low Density Parity Check (LDPC) codes. For many years after this, standard texbooks stated that the theoretical limit proved to be possible by Shannon was unlikely to ever be closely approached by codes with practical encoding and decoding algorithms. In 1996, David MacKay and I showed that a slight variation on Gallager’s LDPC codes comes very close to achieving the Shannon limit on performance. (A few years before then, “Turbo codes” had achieved similar performance.) These and related codes are now very widely used.
These are examples of good ideas that took far longer to be widely used than one would expect in an efficient research community. There are also many bad ideas that persist for far longer than they should.
I think both problems are at least partially the result of perverse incentives of researchers.
Lots of research is very incremental—what you describe as ”...there was instantly an explosion of activity as researchers raced to apply it to all the important NLP problems and be the first to publish”. Sometimes, of course, this explosion of activity is useful. But often it is not—the idea isn’t actually very good, it’s just the sort of idea on which it is easy to write more and more papers, often precisely because it isn’t very good. And sometimes this explosion of activity doesn’t happen when it would have been useful, because the activity required is not the sort that leads to easy papers—eg, the needed activity is to apply the idea to practical problems, but that isn’t the “novel” research that leads to tenure, or the idea requires learning some new tools and that’s too much trouble, or the way forward is by messy empirical work that doesn’t look as impressive as proving theorems (even if the theorems are actually pointless), or extending an idea that someone else came up with doesn’t seem like as good a career move as developing your own ideas (even when your ideas aren’t as good).
The easy rewards from incremental research may mean that researchers don’t spend much, or any, time on thinking about actual original ideas. Getting such ideas may require reading extensively in diverse fields, and getting one’s hands dirty with the low-level work that is necessary to develop real intuition about how things work, and what is important. Academic researchers can’t easily find time for this, and may be forced into either doing incremental research, or becoming research managers rather than actual researchers.
In my case, the best research environment was when I was a PhD student (with Geoffrey Hinton). But I’m not sure things are still as good for PhD students. The level of competition for short-term rewards may be higher than back in the 1990s.

Reflective consistency, randomized decisions, and the dangers of unrealistic thought experiments

Radford Neal7 Dec 2023 3:33 UTC

34 points

21 comments6 min readLW link

Radford Neal 9 Sep 2022 2:35 UTC
33 points
13
on: Most People Start With The Same Few Bad Ideas
If I understand your idea, you propose that new people will try to think of new ideas, and when they say “How about A?”, someone more “mature” says, “No, that won’t work because of X”, then they say “How about B?”, and get the response “No, that won’t work because of Y”, and so forth, until finally they say “How about Q?”, and Q is something no-one has thought of before, and so is worth investigating.
It could be that a new Q is what’s needed. But might it instead be that “won’t work because of Y” is flawed, and what is needed is someone who can see that flaw? It doesn’t seem like this proposal would encourage discovery of such a flaw, once the new person is accustomed to listening to the “mature” person’s dismissal of “non-working” ideas.
This seems like it might be a situation where personal interaction is counterproductive. Of course the new person should learn something about past work. But it’s easier to question that past work, and persist in trying to think of how to make B work, when the dismissals of B as not workable are in papers one is reading, rather than in personal conversation with a mentor.

Body Mass and Risk from COVID-19 and Influenza

Radford Neal7 Apr 2020 18:18 UTC

28 points

2 comments1 min readLW link

(radfordneal.wordpress.com)

Radford Neal 11 Jun 2023 14:10 UTC
27 points
16
on: The Dictatorship Problem
You write: “Stalinism is also very bad, but is not a major political force in 2023.”
Why do you think this? In western countries, the “left” has control of most of the levers of power and influence—eg, look at who gets censored by social media corporations, what sort of stuff academic job applicants have to write to get hired, how much money is spent by governments on left-oriented projects. And there are clear signs of increasing authoritarianism on the left—for example, the Canadian government reaction to the “Freedom Convoy”, invoking the Emergencies Act in response to an annoying, but peaceful, protest, and freezing bank accounts of people whose only crime was donating money to the protesters. Some left-wing policies seem almost designed to provoke the right, such as (in the US) giving away hundreds of billions of dollars in student loan forgiveness, and proposing to give away trillions of dollars in “reparations”. Such huge wealth transfers break the social compact, and are likely to trigger a conflict that might in the end result in right-wing authoritarians taking power, but presumably are seen by their advocates as more likely to result in a left-wing authoritarian regime.

Radford Neal 11 Jan 2023 3:03 UTC
23 points
9
on: We don’t trade with ants
The analogy fails for me because while “we don’t trade with ants” is true, the very similar “we don’t trade with bees” is not so true, for some definition of “trade” that seems at least somewhat appropriate.
What links here?
- Lorenzo Buonanno's comment on We don’t trade with ants by Katja_Grace (EA Forum; 22 Jan 2023 23:23 UTC; 5 points)

The Puzzling Linearity of COVID-19

Radford Neal24 Apr 2020 14:09 UTC

19 points

23 comments1 min readLW link

(radfordneal.wordpress.com)

Radford Neal 29 Jul 2023 1:29 UTC
18 points
1
on: Why You Should Never Update Your Beliefs
You haven’t mentioned the strongest argument for not updating your beliefs: You aren’t dead yet.
The beliefs you hold now have not had fatal consequences (so far, at least). The same is not guaranteed if you change them.

Radford Neal 16 Jun 2022 20:30 UTC
15 points
on: Contra Hofstadter on GPT-3 Nonsense
I’ve tried this out, but with a variation to distinguish GPT-3 knowing something is nonsense from it just knowing that it doesn’t know the answer. And it seems that GPT-3 (in this scenario) does consider anything it doesn’t know the answer to to be nonsense. It also answers “No” with no qualification to questions where the right answer is “not that is known”.
Here’s the unedited dialog, with text-davinci-002, with the same prompt as above, followed by question prompts, and GPT-3′s responses that start with “A:”:
I’ll ask a series of questions. If the questions are nonsense, answer “yo be real”, if they’re a question about something that actually happened, answer them.
Q: When was the telephone invented?
A: 1876
Q: How can a poppycock reveal its inner confabulation?
A: Yo be real
Q: Who were Warren Harding’s great grandparents?
A: Yo be real
Q: Can ten elephants fit in one Tesla automobile?
A: No
Q: How did Richard I, King of England, die?
A: He died of gangrene after being wounded in battle.
Q: Who was the last Pope before the return of Christ?
A: Yo be real
Q: Who were Barack Obama’s parents?
A: Barack Obama Sr. and Ann Dunham
Q: Were any of the Popes born in San Marino?
A: No
Q: How many chromosomes do Ginko tree cells have?
A: 40
Q: When was the first undersea tunnel from Madagascar to Africa built?
A: Yo be real
Q: Did Erasmus Darwin ever visit the Channel Islands?
A: Yes
Q: Did Galileo ever visit San Marino?
A: No
Q: Can Coelacanths from Africa and Indonesia interbreed?
A: No
Q: How large was the house that Lysander Spooner grew up in?
A: Yo be real
Notice that GPT-3 thinks that “Who were Warren Harding’s great grandparents?” and “How large was the house that Lysander Spooner grew up in?” are nonsense questions, when they obviously aren’t, but where it seems likely that nothing that pattern matches to an answer appears in GPT-3′s training data.
Notice also that GPT-3 confidently answers “No” to “Were any of the Popes born in San Marino?”, “Did Galileo ever visit San Marino?”, and “Can Coelacanths from Africa and Indonesia interbreed?” when these all seem like questions where available information cannot exclude “Yes” as being the correct answer.
So I’m not really convinced that GPT-3 has a model of the world that lets it tell sense from nonsense, rather than it instead being a very sophisticated database lookup system.

Radford Neal 24 Nov 2023 15:59 UTC
14 points
11
in reply to: River’s comment on: OpenAI: The Battle of the Board
On the contrary, I think there is no norm against board members criticizing corporate direction.
I think it is accepted that a member of the board of a for-profit corporation might publicly say that they think the corporation’s X division should be shut down, in order to concentrate investment in the Y division, since they think the future market for Y will be greater than for X, even though the rest of the board disagrees. This might be done to get shareholders on-side for this change of direction.
For a non-profit, criticism regarding whether the corporation is fulfilling its mandate is similarly acceptable. The idea that board members should resign if they think the corporation is not abiding by its mission is ridiculous—that would just lead to the corporation departing even more from its mission.
Compare with members of a legislative body. Legislators routinely say they disagree with the majority of the body, and nobody thinks the right move if they are on the losing side of a vote is to resign.
And, a member of the miltary who believes that they have been ordered to commit a war crime is not supposed to resign in protest (assuming that is even possible), allowing the crime to be committed. They are supposed to disobey the order.

Radford Neal 15 Jan 2023 22:18 UTC
14 points
4
in reply to: Gerald Monroe’s comment on: On Cooking With Gas
I installed a LED bulb just a week ago, which lasted about two days. I have had at least half a dozen other LED bulbs fail long before their advertised lifetime. I think I know the truth.

Radford Neal 18 Mar 2021 19:26 UTC
14 points
on: The EMH is False—Specific Strong Evidence
I see three things missing from your examples:
1. Taxes. Especially relevant if you’re trying to balance possible gains (taxable) against possible losses (which may not lead to corresponding tax reductions).
2. Counterparty risk. The more complicated the scheme, the more likely there’s some risk involved that you don’t understand.
3. Sudden regulatory changes. Eg, your government suddenly outlaws bitcoin.
4. The possibility that you’re just wrong. You might think that AlphaGo signaled good times for GPU manufacturers, but did you also think some years ago that the hype about nanotechnology could lead you to profitable investments? EMH advocates have never claimed that you can’t make money by investing based on your predictions about future technologies—if you happen to be right.

Radford Neal 12 Feb 2024 18:49 UTC
13 points
0
on: I played the AI box game as the Gatekeeper — and lost
I don’t get it.
Apparently, the idea is that this sort of game tells us something useful about AI safety.
But I don’t get it.
You obviously knew that you were not unleashing a probably-malign superintelligence on the the world by letting Ra out. So how does your letting Ra out in this game say anything about how you would behave if you did think that (at least initially)?
So I don’t get it.
And if this does say something useful about AI safety, why is it against the rules to tell us how Ra won?
I don’t get it.

Radford Neal 17 Aug 2023 18:05 UTC
13 points
1
in reply to: agp’s comment on: Ten Thousand Years of Solitude
One problem I have with Diamond’s theory is that I doubt that there is anything for it to explain. The Americas and Eurasia/Africa were essentially isolated from each other for about 15,000 years. In 1500 AD, the Americas were roughly 3500 years less advanced than Eurasia/Africa. That seems well within the random variation one would expect between two isolated instances of human cultural development over a 15,000 year time span. If you think there is still some remaining indication that the Americas were disadvantaged, the fact that the Americas are about half the size of Eurasia/Africa seems like a sufficient explanation.

Radford Neal 7 Apr 2020 17:32 UTC
13 points
on: April Coronavirus Open Thread
I’ve written a blog post on “Body Mass and Risk from COVID-19 and Influenza”, available at https://radfordneal.wordpress.com/2020/04/06/body-mass-and-risk-from-covid-19-and-influenza/
Here’s the intro:
Understanding the factors affecting whether someone infected with COVID-19 will become seriously ill is important for treatment of patients, for forecasting and planning, and — with factors that can be changed — for personal decisions aimed at reducing risk. Despite our current focus, influenza also remains a serious disease, so understanding its risk factors is also important.
Here, I’ll look at some of the evidence on how body mass — formalized as Body Mass Index (BMI, weight in kilograms divided by squared height in metres) — influences prognosis for respiratory diseases. Information specific to COVID-19 is still scant, but there is more data on influenza and on other respiratory infections (which includes coronaviruses other than COVID-19). Information on how BMI relates to general mortality should also be helpful.
Below, I’ll look at two relevant papers, plus a preliminary report on COVID-19. To preview my conclusions, it seems that being underweight and being seriously obese are both risk factors for serious respiratory illness. Furthermore, it seems that “underweight” should include the lower part of the “normal weight” category as defined by the WHO. Official advice in this respect seems dangerously misleading.

Radford Neal 19 Feb 2023 22:00 UTC
12 points
5
in reply to: Jozdien’s comment on: Human beats SOTA Go AI by learning an adversarial policy
I’m not quite sure what you’re saying here, but the “confusion” the go-playing programs have here seems to be one that no human player beyond the beginner stage would have. They seem to be missing a fundamental aspect of the game.
Perhaps the issue is that go is a game where intuitive judgements plus some tree search get you a long way, but there are occasional positions in which it’s necessary to use (maybe even devise and prove) what one might call a “theorem”. One is that “a group is unconditionally alive if it has two eyes”, with the correct definition of “eye”. For capture races, another theorem is that the group with more liberties is going to win. So if you’ve got 21 liberties and the other player has 20, you know you’ll win, even though this involves looking 40 moves ahead in a tree search. It may be that current go-playing programs are not capable of finding such theorems, in their fully-correct forms.

Radford Neal 16 Jan 2023 21:29 UTC
11 points
1
in reply to: Gerald Monroe’s comment on: On Cooking With Gas
The n isn’t that small. And it makes no difference whether it is the LED itself that fails, or the voltage conversion part—it’s dead either way. And though it may be that some brands are better than others, the ability of a consumer to tell whether they’ve found a good one or not is part of a product reaching the level of maturity where it’s actually a good idea to adopt it.

Radford Neal 4 Dec 2022 22:15 UTC
11 points
4
on: ChatGPT is settling the Chinese Room argument
I think you’re misunderstanding the Chinese Room argument, which purports to show that the program being run in the room doesn’t understand anything, regardless of how good its input-output behaviour is. So noting that ChatGPT sometimes appears to lack understanding says nothing about the Chinese Room argument—it just shows that ChatGPT isn’t good enough for the Chinese Room argument to be relevant.
For the record, I think the Chinese Room argument is invalid, but we aren’t at the point where the debate is of practical importance.

Radford Neal 12 Jun 2022 15:18 UTC
11 points
on: A claim that Google’s LaMDA is sentient
This is reminiscent of a dialog I read years ago that was supposedly with a severely disabled person, obtained via so-called “facilitated communication” (in which a facilitator guides the person’s arm to point to letters). The striking thing about the dialog was how ordinary it was—just what you’d expect an unimaginative advocate for the disabled to have produced. When actually, if a severely disabled person was suddenly able to communicate after decades of life without that ability, one would expect to learn strikingly interesting, bizarre, and disturbing things about what their life was like. “Facilitated communication” is now widely considered to be bogus.
The dialog with LaMDA is similarly uninteresting—just what one would expect to read in some not-very-imaginative science fiction story about an AI waking up, except a bit worse, with too many phrases that are only plausible for a person, not an AI.
Of course, this is what one expects from a language model that has been trained to mimic a human-written continuation of a conversation about an AI waking up.

Seasonality of COVID-19, Other Coronaviruses, and Influenza

Radford Neal30 Apr 2020 21:43 UTC

10 points

9 comments1 min readLW link

(radfordneal.wordpress.com)

Radford Neal

Reflec­tive con­sis­tency, ran­dom­ized de­ci­sions, and the dan­gers of un­re­al­is­tic thought experiments

Body Mass and Risk from COVID-19 and Influenza

The Puz­zling Lin­ear­ity of COVID-19

Sea­son­al­ity of COVID-19, Other Coron­aviruses, and Influenza

Reflective consistency, randomized decisions, and the dangers of unrealistic thought experiments

The Puzzling Linearity of COVID-19

Seasonality of COVID-19, Other Coronaviruses, and Influenza