On the usefulness of blockchain, I think the analytical case for our financial system being on blockchain is significantly weaker than the analytical case for better forecasts being useful for policymaking.
With my original comment, I was implicitly drawing a distinction between ‘cannot be useful’ and ‘has not been useful’:
If forecasting simply has not been useful, then it leaves us with two possibilities: (1) it cannot be and we should stop funding it; (2) it can be and we should redirect funding to efforts that figure out and solve why it hasn’t been (e.g. distribution).
I feel like the analytical case for at least attempting (2) is pretty strong. I will admit that this may validate skepticism to the tune of, ‘We should reduce funding and seriously consider our theory of change’.If the argument is that no amount of additional funding will justify the ~$100m we’ve already spent, I would argue that it doesn’t need to. Any additional funding only needs to be useful enough to justify itself. Even if the overall forecasting program ends up being overfunded and unjustified, we should treat future funding as independent of any potential bad decisions in the past.
For example, if the program is currently giving us $10m of value (for $100m of funding), and spending an additional $10m would increase that to $30m of value, then we should spend it. Even if the overall program remains a failure, our $10m has given us a 2x return.
Wrt the startup analogy and LLMs, I am not sure it is reasonable to claim that all useful technologies get adopted at this extreme speed, re: seatbelts, childproof medical caps, vaccinations, drunk driving laws, helmets, indoor smoking bans, etc. which all required significant distribution efforts.
To clarify ‘distribution’, I think there’s a difference between ‘everyone has heard of prediction markets’ and ‘we have the right tools to allow policy/decision-makers to adopt prediction markets in their decision-making processes’.
Having taken a quick look at the source for Claude’s data, it seems like a reasonable read is that the doctors are simply not well-calibrated on the relevance of the patients statements and redirect them away too quickly. Not sure this means they were ‘rude’,
The HCAHPS survey measures ‘courteousness’ by asking patients, and consistently find that in ~85% of cases, doctors are rated ‘always’ respectful and courteous. From my understanding, a further ~10% are rated as ‘usually’ respectful and courteous. This is not a perfect measure, but seems better than using interruption times as a proxy.