Pablo Villalobos

Karma: 618

Previously Staff Researcher at Epoch, these days I’m mostly thinking about better frameworks for understanding superintelligence.

Pablo Villalobos 1 Apr 2026 16:03 UTC
3 points
0
on: Lesswrong Liberated
Galaxy-Brained
This design will inspire you to conjure grander and more baroque intellectual works

Pablo Villalobos 23 Mar 2026 20:02 UTC
3 points
0
on: On The Independence Axiom
I just want to flag that there is another family of non-independent decision rules called multiplier preferences or more generally variational preferences which result in a certain kind of adversarial robustness to Knightian uncertainty. Multiplier preferences in particular have a very natural interpretation in terms of information geometry and can be seen as roughly the dual of active inference. Where active inference says “my utility function can be seen as a prior”, or “I can deviate from my policy during planning by paying a KL cost”, multiplier preferences say “Nature can adversarially deviate from my model of it by paying a KL cost”.

Pablo Villalobos 22 Dec 2025 16:58 UTC
1 point
0
on: Most successful entrepreneurship is unproductive
This applies to basically any form of competition beyond the amount needed to avoid pricing power.
For example, if you are a job seeker, you can dedicate your time to learning actual skills and looking for employers with unmet needs, or you can rack up your credentials to be candidate #1 instead of #2 in an oversubscribed hiring process.
That doesn’t make these activities unproductive in most cases: if customers or employers pick you it’s because you are providing some marginal value, but the marginal value might be low compared to the amount you capture

Pablo Villalobos 19 Nov 2025 0:28 UTC
82 points
36
on: Status Is The Game Of The Losers’ Bracket
I think this is partly cope. Middle management at large companies might have a significant political component but large companies still have much higher labor productivity than small ones, they still represent like a quarter of OECD economic output, and the average middle manager is probably still creating very significant amounts of marginal value despite the political infighting.
Yes, there might be very few middle managers at the top of Forbes’ list. But look at millionaires, the people at the top 10% of the wealth distribution in the US. Most middle managers will probably be there, along with other highly paid professionals. And if you found a startup and it fails, you won’t make the 10%, which is what happens in the overwhelming majority of cases.

So purely in terms of wealth and creating value for society, marginal improvements in middle management seem quite valuable. Sure, being a founder might have much higher EV, but also vastly higher variance. And risk aversion is behaviorally indistinguishable from just having a different utility function.

Speaking of which, you might want to be immortal and go to the moon, but most people don’t. You could argue that if they’d read the right books or had the right parent/teacher/friend or had more vision, they would also want that, but at that point you’re just saying that everyone should be playing the game you like, instead of the one they like.
And I dispute the idea that there’s less politics, signaling or strategic/conflictive behavior at the “winner’s bracket”. Look at Sam Altman, look at the status competitions and fighting for credit in the highest halls of science. Look at states for God’s sake. Do you really think Vladimir Putin or Donald Trump or Xi Jinping are not in the winner’s bracket, that they will have less real power than the first person to figure out how to solve aging?

Physicists might have found the secret knowledge of how to create nuclear weapons that nobody else had, but after they had the bright idea the bottlenecks were capital, labor, natural resources, and the ability to manage their combination efficiently, and the physicists were not the ones who got to control the weapons in the end.

I think something like your thesis might be true in terms of actually having good counterfactual impact on reality vs merely capturing the resulting wealth, power and prestige, what you call leading the parade. But that doesn’t mean that you get to both have the impact and lead the parade by pursuing just the impact part!

Pablo Villalobos 18 Apr 2025 14:42 UTC
52 points
13
in reply to: habryka’s comment on: jacquesthibs’s Shortform
Personal view as an employee: Epoch has always been a mix of EAs/safety-focused people and people with other views. I don’t think our core mission was ever explicitly about safety, for a bunch of reasons including that some of us were personally uncertain about AI risk, and that an explicit commitment to safety might have undermined the perceived neutrality/objectiveness of our work. The mission was raising the standard of evidence for thinking about AI and informing people to hopefully make better decisions.

My impression is that Matthew, Tamay and Ege were among the most skeptical about AI risk and had relatively long timelines more or less from the beginning. They have contributed enormously to Epoch and I think we’d have done much less valuable work without them. I’m quite happy that they have been working with us until now, they could have moved to do direct capabilities work or anything else at any point if they wanted and I don’t think they lacked opportunities to do so.

Finally, Jaime is definitely not the only one who still takes risks seriously (at the very least I also do), even if there have been shifts in relative concern about different types of risks (eg: ASI takeover vs gradual disempowerment).

Pablo Villalobos 12 Apr 2025 9:15 UTC
1 point
0
on: Madrid – ACX Meetups Everywhere Spring 2025
We’re in the nearby bar, Casa Remigio, since the theater is occupied

Pablo Villalobos 1 Mar 2025 15:52 UTC
3 points
0
in reply to: jimrandomh’s comment on: How to Make Superbabies
I suspect the analogy does not really work that well. Much of human genetic variation is just bad mutations that take a while to be selected out. For example, maybe a gene variant slightly decreases the efficiency of your neurons and makes everything in your brain slightly slower

Pablo Villalobos 28 Feb 2025 10:05 UTC
1 point
0
in reply to: purple fire’s comment on: Market Capitalization is Semantically Invalid
I stand corrected. Although the broader point about share prices noisily approximating a discounted expected cash flow which can be added or multiplied still holds

Pablo Villalobos 27 Feb 2025 17:21 UTC
1 point
−2
on: Market Capitalization is Semantically Invalid
There is a sense in which the price approximates an intrinsic property of the shares that you can add up or multiply by the number of shares. Each share gives you a vote in the shareholder assembly and an equal portion of the dividends. If you had all the shares, you would own the company and in principle could pay yourself as much as the company can afford in dividends.
How much the company can afford to pay in dividends in the future is basically how much net operating profit after taxes (NOPAT) the company will have.
If you have a prediction of the future NOPAT of the company, it implies a present value for the whole company and its shares assuming all of it is cashed out as dividends. It is commonly assumed that in most cases the market price of shares oscillates around a rational expectation of future NOPAT, in which case it would be a reasonable approximation to something that you can semantically multiply by the number of shares to get the overall value of the company.

Madrid—ACX Meetups Everywhere Fall 2024

Pablo Villalobos5 Aug 2024 18:36 UTC

4 points

0 comments1 min readLW link

Pablo Villalobos 24 Jun 2024 10:43 UTC
14 points
2
on: A Step Against Land Value Tax
The arguments you make seem backwards to me.
All this to say, land prices represent aggregation effects / density / access / proximity of buildings. They are the cumulative result of being surrounded by positive externalities which necessarily result from other buildings not land. It is the case that as more and more buildings are built, the impact of a single building to its land value diminishes although the value of its land is still due to the aggregation of and proximity to the buildings that surround it.
Yes, this is the standard Georgist position, and it’s the reason why land owners mainly capture (positive and negative) externalities from land use around them, not in their own land.
Consider an empty lot on which you can build either a garbage dump or a theme park, each of equivalent economic value. Under SQ, the theme park is built as the excess land value is capture by the land owner. Under LVT, the garbage dump is built as the reduced land values reduces their tax burden. The SQ encourages positive externalities, LVT encourages negative externalities.
This seems wrong. The construction of a building mainly affects the value of the land around it, not the land on which it sits. Consider the following example in which instead of buildings, we have an RV and a truck, so there is no cost of building or demolishing stuff:

There’s a pristine neighborhood with two empty lots next to each other in the middle of it. Both sell for the same price. The owner of empty lot 1 rents it to a drug dealer, who places a rusty RV on the lot and sells drugs in it. The owner of empty lot 2 rents it to a well-known chef who places a stylish food truck on the lot and serves overpriced food to socialites in it.

Under SQ, who do you think would profit from selling the land now? The owner of lot 2 has to sell land next to a drug dealer that a prospective buyer can do nothing about. The owner of lot 1 has to sell land next to delicious high-status food, and if a buyer minds the drug dealer he can kick him out. Who is going to have an easier time selling? Who is going to get a higher price?

Now, suppose there is a LVT. If the tax is proportional to the selling price of the land under SQ (as it ideally should), which owner is going to pay more tax?

The case of the theme park and garbage dump is exactly the same, with the added complication of construction / demolition costs. An LVT should be proportional to the price of the land if there were no buildings on top of it (and without taking into account the tax itself), so building a garbage dump is not going to significantly reduce your tax payments.

In such a way, a land value tax has a regularisation effect on building density, necessitating a spread of concentration.
There are several separate effects here, if you are a landowner. Under LVT:
1. You are incentivized to reduce the density in surrounding land
2. You are incentivized to build as densely as possible within your own land to compensate the tax
Under SQ:
1. You are incentivized to increase the density in surrounding land
2. You are not incentivized to increase density in your own land
The question is, which of these effects is bigger? I would say that landowners have more influence over their own land than over surrounding land, so a priori I would expect more density to result from an LVT

Data on AI

Robi Rahman, Jaime Sevilla Molina, Pablo Villalobos and Ben Cottier

20 Jun 2024 6:31 UTC

1 point

0 comments1 min readLW link

(epochai.org)

Announcing Epoch’s newly expanded Parameters, Compute and Data Trends in Machine Learning database

Robi Rahman, Jaime Sevilla Molina, Tamay, Ege Erdil, Pablo Villalobos, Ben Cottier and Matthew Barnett

25 Oct 2023 2:55 UTC

18 points

0 comments1 min readLW link

(epochai.org)

EA Madrid social

Pablo Villalobos11 Oct 2023 15:34 UTC

6 points

0 comments1 min readLW link

Trading off compute in training and inference (Overview)

Pablo Villalobos31 Jul 2023 16:03 UTC

42 points

2 comments7 min readLW link

(epochai.org)

Pablo Villalobos 17 Apr 2023 15:39 UTC
1 point
0
on: ACX Meetup
We’ll be at the ground floor!

Pablo Villalobos 10 Apr 2023 10:14 UTC
11 points
2
in reply to: Daniel Kokotajlo’s comment on: Revisiting the Horizon Length Hypothesis
Not quite. What you said is a reasonable argument, but the graph is noisy enough, and the theoretical arguments convincing enough, that I still assign >50% credence that data (number of feedback loops) should be proportional to parameters (exponent=1).
My argument is that even if the exponent is 1, the coefficient corresponding to horizon length (‘1e5 from multiple-subjective-seconds-per-feedback-loop’, as you said) is hard to estimate.
There are two ways of estimating this factor
1. Empirically fitting scaling laws for whatever task we care about
2. Reasoning about the nature of the task and how long the feedback loops are
Number 1 requires a lot of experimentation, choosing the right training method, hyperparameter tuning, etc. Even OpenAI made some mistakes on those experiments. So probably only a handful of entities can accurately measure this coefficient today, and only for known training methods!
Number 2, if done naively, probably overestimates training requirements. When someone learns to run a company, a lot of the relevant feedback loops probably happen on timescales much shorter than months or years. But we don’t know how to perform this decomposition of long-horizon tasks into sets of shorter-horizon tasks, how important each of the subtasks are, etc.
We can still use the bioanchors approach: pick a broad distribution over horizon lengths (short, medium, long). My argument is that outperforming bioanchors by making more refined estimates of horizon length seems too hard in practice to be worth the effort, and maybe we should lean towards shorter horizons being more relevant (because so far we have seen a lot of reduction from longer-horizon tasks to shorter-horizon learning problems, eg expert iteration or LLM pretraining).

Revisiting the Horizon Length Hypothesis

Pablo Villalobos6 Apr 2023 6:39 UTC

25 points

4 comments3 min readLW link

ACX Meetup Madrid

Pablo Villalobos4 Apr 2023 8:53 UTC

5 points

2 comments1 min readLW link

Pablo Villalobos 21 Feb 2023 10:16 UTC
18 points
20
on: There are no coherence theorems
Note that you can still get EUM-like properties without completeness: you just can’t use a single fully-fleshed-out utility function. You need either several utility functions (that is, your system is made of subagents) or, equivalently, a utility function that is not completely defined (that is, your system has Knightian uncertainty over its utility function).
See Knightian Decision Theory. Part I
Arguably humans ourselves are better modeled as agents with incomplete preferences. See also Why Subagents?

Pablo Villalobos

Madrid—ACX Mee­tups Every­where Fall 2024

Data on AI

An­nounc­ing Epoch’s newly ex­panded Pa­ram­e­ters, Com­pute and Data Trends in Ma­chine Learn­ing database

EA Madrid social

Trad­ing off com­pute in train­ing and in­fer­ence (Overview)

Re­vis­it­ing the Hori­zon Length Hypothesis

ACX Meetup Madrid

Madrid—ACX Meetups Everywhere Fall 2024

Announcing Epoch’s newly expanded Parameters, Compute and Data Trends in Machine Learning database

Trading off compute in training and inference (Overview)

Revisiting the Horizon Length Hypothesis