I’m always interested in knowing why people disagree with me, but recognize that people have limited motivation to expend the effort to explain to me why I am wrong in a way I can understand.
In case it helps reduce that effort: I am permanently committed to Crocker’s rules.
Also, the LW user artifex0 is a different person.
Wait, unless I misunderstand you, there’s a reasoning mistake here. You request epistemic credit for implicitly predicting that the Metaculus median would drop by five years at some point in the next three years. But that’s a prediction the majority of Metaculites would also have made, and over an interval as long as three years it was practically a given. It’s a correct advance prediction, if you did make it (let’s assume so and not get into inferring implicit past predictions by retrospective text analysis), but it’s not one that is even slightly impressive.
As an example to explain why, I predict (with 80% probability) that there will be a five-year shortening in the median on the general AI question at some point in the next three years. And I also predict (with 85% probability) that there will be a five-year lengthening at some point in the next three years.
I’m predicting both that Metaculus timelines will shorten and that they will lengthen! What gives? Well, I’m predicting volatility… Should I be given much epistemic credit if I later turn out to be right on both predictions? No: the volatility is very predictable, and you don’t need to be a good forecaster to anticipate it. If you think you deserve some credit for your prediction, then I deserve much more for these two. But neither of us should get much.
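To make the point concrete, here is a toy simulation. The model is entirely my own assumption (a driftless random walk with a made-up monthly volatility, not fitted to any Metaculus data), but it shows why paired "it will shorten" and "it will lengthen" predictions are cheap: under almost any reasonable volatility, both five-year swings happen within three years most of the time.

```python
import random

# Toy model (my assumption, not real Metaculus data): the community median
# on a long-range AGI question drifts like a random walk, moving by a
# normally distributed amount each month as news arrives and people update.
random.seed(0)

MONTHS = 36          # the three-year window from the comment
MONTHLY_SD = 3.0     # assumed volatility of the median, in years
N_PATHS = 10_000

drop = rise = both = 0
for _ in range(N_PATHS):
    level = lo = hi = 0.0
    for _ in range(MONTHS):
        level += random.gauss(0.0, MONTHLY_SD)
        lo = min(lo, level)
        hi = max(hi, level)
    d = lo <= -5.0   # median fell by 5+ years at some point
    r = hi >= 5.0    # median rose by 5+ years at some point
    drop += d
    rise += r
    both += d and r

p_drop, p_rise, p_both = drop / N_PATHS, rise / N_PATHS, both / N_PATHS
print(f"P(5y drop): {p_drop:.2f}  P(5y rise): {p_rise:.2f}  P(both): {p_both:.2f}")
```

Being "right" about both swings under a model like this reflects the volatility parameter, not forecasting skill.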
Are there inconsistencies in the AGI questions on Metaculus? Within a question’s forecast timeline, with other questions, with the resolution criteria? Yes, there are plenty! Metaculus is full of glaring inconsistencies. The median on one question will contradict the median on another. An AI question with stronger operationalization will have a lower median than a question with weaker operationalization. The current median says there is a four percent chance that AGI has already been developed. A question’s resolution criteria will say it can’t resolve at the upper bound, and yet the median will assign 14% to it resolving at the upper bound anyway.
It’s commendable to notice these inconsistencies, and right to downgrade your opinion of Metaculus because of them. But it’s wrong to conclude, even with weak confidence, that you are a better forecaster than most of the Metaculites forecasting on these questions merely because you can frequently observe such glaring inconsistencies, and can predict in advance that specific ones will happen, including changes over time in the median that are predictable even in expected value after accounting for skew. (And that conclusion is only about AGI questions; the implicit claim of being “a slightly better Bayesian” actually seems far stronger and more general than that.)
Why? Because Metaculites know there are glaring inconsistencies everywhere; they identify them often; they know there are more, and they could easily find and fix most of them. It’s not that you’re a better forecaster, just that you have unreasonable expectations of a community of forecasters who are almost all effectively unpaid volunteers.
It’s not surprising that the Metaculus median changes over time in specific, predictable ways that are inconsistent with good Bayesianism. That doesn’t mean the forecasters are that bad (let’s see you do better, after all); it’s because people’s energy and interest are scarce. Questions in tournaments with money prizes get more engagement, as do questions about things currently in the news. Even these questions still have glaring inconsistencies, because even that engagement isn’t enough to fix them all. (Also because the tools for making and checking your distributions are time-consuming to use.)
There are only 601 forecasters with more than 1000 points on Metaculus: that means only 601 forecasters who have done even a pretty basic amount of forecasting. One of the two forecasters with exactly 1000 points has made predictions on only six questions, for example; you can do that in less than an hour, so it’s really not a lot.
If 601 sounds like a lot, consider that there are thousands of questions on the site, each with a wall of text describing the background and the resolution criteria, and predictions need to be updated constantly. The most active predictors on the site burn out because it takes so much time.
It’s not reasonable to expect to see no inconsistencies, no predictable changes in the median, and so on. It’s not that they’re bad forecasters. Of course you can do better on one or a few specific questions, but that doesn’t mean much. If you want even a small but worthwhile amount of evidence, from correct advance predictions, that you are a better forecaster than other Metaculites, you need to, for example, win one of the tournaments with money prizes that many people participate in.
Evaluating forecasting track records in practice is hard and depends heavily on the scoring rule you use (rankings on PredictionBook, for example, vary a lot with your methodology for evaluating relative performance). You need a lot of high-quality data to get significant evidence; with only a little low-quality data, you just aren’t going to get a useful amount.
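Here is a constructed illustration of that scoring-rule dependence. The numbers are mine, invented for the example (not real Metaculus or PredictionBook data): ten binary questions that all resolved YES, one forecaster who always said 75%, and one who said 97% nine times but badly missed once with 10%. The two standard scoring rules rank them in opposite orders.

```python
import math

# Invented example data: ten binary questions, all resolved YES (1).
outcomes = [1] * 10
a = [0.75] * 10              # forecaster A: always moderately confident
b = [0.97] * 9 + [0.10]      # forecaster B: very confident, one bad miss

def brier(preds, outs):
    """Mean squared error between probability and outcome (lower is better)."""
    return sum((p - o) ** 2 for p, o in zip(preds, outs)) / len(outs)

def log_loss(preds, outs):
    """Mean negative log-likelihood (lower is better)."""
    return sum(-math.log(p if o else 1 - p) for p, o in zip(preds, outs)) / len(outs)

print(f"Brier:    A={brier(a, outcomes):.4f}  B={brier(b, outcomes):.4f}")
print(f"Log loss: A={log_loss(a, outcomes):.4f}  B={log_loss(b, outcomes):.4f}")
# A wins under the Brier score, B wins under the log score:
# who "has the better track record" depends on the rule you pick.
```

The Brier score caps the penalty for any single miss at 1, while the log score punishes confident misses without bound, so with a small sample like this the choice of rule alone decides the ranking.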