Virtue signaling is sometimes the best or the only metric we have

Subtitle: Costly virtue signaling is an irreplaceable source of empirical information about character.

The following is cross-posted from my blog, which is written for a more general audience, but I think the topic is most important to discuss here on LW.

We all hate virtue signaling, right? Even “virtue” itself has taken on a negative connotation. When we’re too preoccupied with how we appear to others, or even too preoccupied with being virtuous, it makes us inflexible and puts us out of touch with our real values and goals.

But I believe the pendulum has swung too far against virtue signaling. A quality virtue signal shows that a person follows through on their best understanding of the right thing to do, and it is still one of the only insights we have into others’ characters and our own. I don’t care to defend empty “cheap talk” signals, but the best virtue signals offer some proof of their claim by being difficult to fake. Maybe, like being vegan, they take a great deal of forethought and awareness and require regular social sacrifices. Being vegan proves dedication to a cause like animal rights or environmentalism in proportion to the level of sacrifice required. The virtuous sacrifice of being vegan isn’t what makes veganism good for the animals or the environment, but it is a costly signal of the character traits associated with the ability to make such a sacrifice. So the virtue signal of veganism doesn’t mean you are necessarily having a positive impact or that veganism is the best choice, but it does show that you are committed, conscientious, gentle, or so deeply bought into the cause that the sacrifice is easier for you than it would be for other people. It shows character and a willingness to act on your values.

Out of your commitment to doing the most good possible, you may notice that you start to think veganism isn’t actually the best way to help animals for a lot of people.[1] I believe this represents a step forward for helping animals, but one problem is that it is now much easier to hide a lack of virtuous character traits from measurement.[2] It’s harder to know where the lines are or how to track the character of the people you may one day have to decide whether to trust. It’s harder to support virtuous norms that make it easier for the community to act out its values. And it’s harder to be accountable to yourself.

Many will think that it is good when a person stops virtue signaling, or that ostentatiously refusing to virtue signal is a greater sign of virtue. But is it really better when we stop offering others proof of positive qualities that are otherwise hard to directly assess? Is it better to give others no reason to trust us? Virtue signals are a proxy for what actually matters: what we are likely to do and the goals that are likely to guide our behavior in the future. There is much fear about goodharting (when you take the proxy measure as an end in itself, rather than the thing it was imperfectly measuring) and losing track of what really matters, but we cannot throw out the baby with the bathwater. All measures are proxy measures, and using proxies is the only way to ask empirical questions. Goodharting is always a risk when you measure things, but that doesn’t mean we shouldn’t try to measure character.

The cost of virtue signals can be high, and sometimes not worth it, but I submit that most people undervalue quality virtue signals. Imagine if Nabisco took the stance that it didn’t have anything to prove about the safety and quality of its food, and that food safety testing is just a virtue signal that wastes a bunch of product. They could be sincere, and somehow keep product quality and safety acceptably high, but they would be taking away your way of knowing that. Quality control is a huge part of what it is to sell food, and monitoring your adherence to your values should be a huge part of your process of having a positive impact on the world.

Virtue signaling is bad when signaling virtue is confused with possessing virtue, or when possessing virtue is confused with having the desired effect upon the world. It is at its worst when all your energy goes to signaling virtue at the expense of improving the world. But signals of virtue, especially costly signals that are difficult to fake, are very useful tools. Even if I don’t agree with someone else’s principles, I trust them more when I see they are committed to living by the principles they believe in, and I trust them even more if they pay an ongoing tithe in time or effort or money that forces them to be very clear about their values. I also think that person should trust themselves more if they have a track record of good virtue signals. Trust, but verify.

The most common objections to the first version of this post were not actually objections to virtue signals per se, I claim, but disagreements about what signals are virtuous. My support of virtue signals requires some Theory of Mind: a quality virtue signal demonstrates character given that person’s beliefs about what is good. Say a person virtue signals mainly as a signal of group membership. I may still judge that to show positive character traits if they believe that taking cues from the group and repping the group are good. If someone uses “virtue signals” cynically to manipulate others, I do not think they have virtuous character. Might an unvirtuous person be able to fool me with their fake virtue signals? Sure, but doing that will be a lot harder than emitting a genuine virtue signal. Signals don’t have to be 100% reliable to be useful evidence.
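For readers who like to see the arithmetic, here is a minimal sketch of why an imperfect signal can still be useful evidence. It is only an illustration: the function name `posterior_virtuous` and every probability below are hypothetical numbers chosen for the example, not measurements of anything.

```python
def posterior_virtuous(prior, p_signal_if_virtuous, p_signal_if_not):
    """Bayes' rule: probability a person is virtuous given that we saw the signal."""
    evidence = prior * p_signal_if_virtuous + (1 - prior) * p_signal_if_not
    return prior * p_signal_if_virtuous / evidence

# Hypothetical starting point: no idea either way.
prior = 0.5

# Cheap talk (assumed numbers): almost anyone can emit it, virtuous or not.
cheap = posterior_virtuous(prior, p_signal_if_virtuous=0.9, p_signal_if_not=0.8)

# Costly signal (assumed numbers): fewer virtuous people bother,
# but it is much harder for an unvirtuous person to fake.
costly = posterior_virtuous(prior, p_signal_if_virtuous=0.6, p_signal_if_not=0.1)

print(f"after cheap talk:    {cheap:.2f}")
print(f"after costly signal: {costly:.2f}")
```

Under these made-up numbers, the costly signal moves the estimate much further than cheap talk does, even though it can still be faked one time in ten. What matters is how much more likely the signal is from a virtuous person than from an unvirtuous one, not whether it is perfectly reliable.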

Why care about virtue signals? Why not just look at what people do? Because we need to make educated guesses about cooperating with people in the future, especially ourselves. “Virtue” or “character” are names we give to our models of other people, and those models give us predictions about how they will act across a range of anticipated and unanticipated situations. (In our own case, watching our virtue metrics can not only be a way to assess if we are falling into motivated reasoning or untrustworthiness, but also the metric we use to help us improve and become more aligned with our values.) Sometimes you can just look at results instead of evaluating the character of the people involved, but sometimes a person’s virtue is all we have to go on.

Take the lofty business of saving the world. It’s important to be sure that you are really trying to help the world and, for example, not just doing what makes you feel good about yourself or allows you to see the world in a way you like. Sometimes, we can track the impact of our actions and interventions well, and so it doesn’t matter if the people who implement them are virtuous or not as long as the job is getting done. But for the biggest scores, like steering the course of the longterm future, we’re operating in the dark. Someone can sketch out their logic for how a longtermist intervention should work, but there are thousands of judgment calls they will have to make, a million empirical unknowns as to how the plan will unfold over the years, and, if any of us somehow live long enough to see the result, it will be far too late to do anything about it. Beyond evaluating the idea itself, the only insight most of us realistically have into the likelihood of this plan’s success is the virtue of the person executing it. Indeed, if the person executing the plan doesn’t have any more insight into his own murky depths than the untested stories he tells, he probably just has blind confidence.

Quality virtue signals are better than nothing. We should not allow ourselves to be lulled into the false safety of dwelling in a place of moral ambiguities that doesn’t permit real measurements. It’s not good to goodhart, but we also can’t be afraid of approximation when that’s the best we have. Judging virtue and character gives us approximations that go into our complex proprietary models, developed over millennia of evolution, of other human beings, and we need to avail ourselves of that information where little else is available.

I urge you to do the prosocial thing and develop and adopt more legible and meaningful virtue signals, for others and especially for yourself.

(This post was edited after publication, which is a common practice for me. See standard disclaimer.)


1. I’m not taking a position here. In fact, I think a mixed strategy with at least some people pushing the no-animals-as-food norm and others reducing animal consumption in various ways is best for the animals. At the time of writing I am in a moral trade that involves me eating dairy, i.e. no longer being vegan, and the loss of the clean virtue signal was one of the things that prompted me to write this post.


2. I discussed this example in depth with Jacob Peacock, which partly inspired the post.