most history is done in a very humanitiespilled, academia flavored way. are there good examples of people doing very analytical, capital-intensive history research where the quality of the work is judged based on how successfully the resulting theories made good predictions/decisions?
What do you mean by predictions? In sense of predicting the “direction” of history not really—historicism is generally poor because the reality of humanity on a large scale is massively complex and filled with stochastic, unknown, unpredictable uncertainties.
Predictions for historical work are judged by their ability to predict historical observations. That is, how well a theory conforms to current and future observations derived from records (the evaluation of which is themselves extremely nuanced in many cases), archeological findings and to a somewhat more controversial extent experimental archeology.
You wouldn’t judge a theory of how stars formation by how well it predicts who the next president will be, rather you judge it by how well it conforms to our present and future observations and understanding of the material facts involved (e.g., how atoms and matter works) and our observations of the cosmos.
Huang’s work is genuinely capital-intensive, quantitative history. The standout detail from the episode: he and Chinese collaborators spent six years with around 40 research assistants digitizing Joseph Needham’s 27-volume Science and Civilisation in China to build a statistical database — Needham himself never analyzed his material quantitatively. That database powers the CDI (inventions-per-capita) scores that drive Huang’s central empirical claim that China was most inventive during its fragmented post-Han “European moment” (220–589 CE), before keju was institutionalized. He also has a co-authored paper with Clair Yang doing statistical work on civil service exams and imperial stability, plus statistical analyses of social mobility in imperial China across dynasties. This is the opposite of vibes-based humanities history — it’s a multi-year, multi-person, data-infrastructure-first research program.
It also generates falsifiable forward predictions: Huang argues Xi’s elimination of term limits has reintroduced the ancient succession problem and that current top-down industrial policy will produce Brezhnev-style stagnation. Those are bets you can score over the next decade or two.
Where it doesn’t fit:
The LW commenter’s stronger ask is for fields where quality is judged by prediction track record. Huang’s work isn’t judged that way — it’s still judged by academic peer review, theoretical elegance, and historiographical argument. Nobody is keeping a Brier score on his China forecasts. The infrastructure is quantitative; the epistemic culture is still humanities-academic.
The better pointer to give them:
The episode is one node in a larger movement: the Center for Quantitative History (CQH) and the broader cliometrics-of-China field — Yuhua Wang (The Rise and Fall of Imperial China, statistical analysis of ~300 emperors and elite kinship networks), Zhiwu Chen, Debin Ma, James Kung, Melanie Meng Xue, Carol Shiue. They mine local gazetteers, clan genealogies, and official rosters at scale. There’s a 2026 Springer volume Quantitative History of China: State Capacity, Institutions and Development that’s basically a field overview. Outside China specifically, this is part of cliometrics / historical political economy more broadly (Acemoglu & Robinson, Nathan Nunn, Melissa Dell).
If you want to push back on the commenter’s framing: the strongest examples of “history judged by predictive success” probably aren’t historical fields at all but adjacent ones — Turchin’s cliodynamics (which explicitly tries to make predictions and gets graded on them, controversially), and forecasting tournaments applied to geopolitics (Tetlock, GJP). Cliodynamics is the closest thing to what they’re describing, and it’s worth naming because it’s also the cautionary tale about how hard the prediction-grading move actually is.
So the honest pitch for the episode: “Here’s a great example of capital-intensive, data-infrastructure-driven history with explicit forward predictions — though the field still grades itself by academic, not predictive, standards. If you want the prediction-grading version, you want cliodynamics.”
most history is done in a very humanitiespilled, academia flavored way. are there good examples of people doing very analytical, capital-intensive history research where the quality of the work is judged based on how successfully the resulting theories made good predictions/decisions?
Peter Turchin et al?
I’ve found the literature on econometric history/cliometrics quite helpful. I’ve liked reading through the Handbook of Cliometrics quite useful: https://link.springer.com/referencework/10.1007/978-3-642-40458-0
What do you mean by predictions? In sense of predicting the “direction” of history not really—historicism is generally poor because the reality of humanity on a large scale is massively complex and filled with stochastic, unknown, unpredictable uncertainties.
Predictions for historical work are judged by their ability to predict historical observations. That is, how well a theory conforms to current and future observations derived from records (the evaluation of which is themselves extremely nuanced in many cases), archeological findings and to a somewhat more controversial extent experimental archeology.
You wouldn’t judge a theory of how stars formation by how well it predicts who the next president will be, rather you judge it by how well it conforms to our present and future observations and understanding of the material facts involved (e.g., how atoms and matter works) and our observations of the cosmos.
This podcast episode I enjoyed is somewhat an example: https://www.chinatalk.media/p/autocracy-and-stagnation-how-imperial
Opus 4.6 summary of the relevance
Huang’s work is genuinely capital-intensive, quantitative history. The standout detail from the episode: he and Chinese collaborators spent six years with around 40 research assistants digitizing Joseph Needham’s 27-volume Science and Civilisation in China to build a statistical database — Needham himself never analyzed his material quantitatively. That database powers the CDI (inventions-per-capita) scores that drive Huang’s central empirical claim that China was most inventive during its fragmented post-Han “European moment” (220–589 CE), before keju was institutionalized. He also has a co-authored paper with Clair Yang doing statistical work on civil service exams and imperial stability, plus statistical analyses of social mobility in imperial China across dynasties. This is the opposite of vibes-based humanities history — it’s a multi-year, multi-person, data-infrastructure-first research program.
It also generates falsifiable forward predictions: Huang argues Xi’s elimination of term limits has reintroduced the ancient succession problem and that current top-down industrial policy will produce Brezhnev-style stagnation. Those are bets you can score over the next decade or two.
Where it doesn’t fit:
The LW commenter’s stronger ask is for fields where quality is judged by prediction track record. Huang’s work isn’t judged that way — it’s still judged by academic peer review, theoretical elegance, and historiographical argument. Nobody is keeping a Brier score on his China forecasts. The infrastructure is quantitative; the epistemic culture is still humanities-academic.
The better pointer to give them:
The episode is one node in a larger movement: the Center for Quantitative History (CQH) and the broader cliometrics-of-China field — Yuhua Wang (The Rise and Fall of Imperial China, statistical analysis of ~300 emperors and elite kinship networks), Zhiwu Chen, Debin Ma, James Kung, Melanie Meng Xue, Carol Shiue. They mine local gazetteers, clan genealogies, and official rosters at scale. There’s a 2026 Springer volume Quantitative History of China: State Capacity, Institutions and Development that’s basically a field overview. Outside China specifically, this is part of cliometrics / historical political economy more broadly (Acemoglu & Robinson, Nathan Nunn, Melissa Dell).
If you want to push back on the commenter’s framing: the strongest examples of “history judged by predictive success” probably aren’t historical fields at all but adjacent ones — Turchin’s cliodynamics (which explicitly tries to make predictions and gets graded on them, controversially), and forecasting tournaments applied to geopolitics (Tetlock, GJP). Cliodynamics is the closest thing to what they’re describing, and it’s worth naming because it’s also the cautionary tale about how hard the prediction-grading move actually is.
So the honest pitch for the episode: “Here’s a great example of capital-intensive, data-infrastructure-driven history with explicit forward predictions — though the field still grades itself by academic, not predictive, standards. If you want the prediction-grading version, you want cliodynamics.”