An engineering student at Northwestern University.
aaq
Why is it a stretch?
AI development is a tragedy of the commons
Per Wikipedia:
In economic science, the tragedy of the commons is a situation in which individual users, who have open access to a resource unhampered by shared social structures or formal rules that govern access and use, act independently according to their own self-interest and, contrary to the common good of all users, cause depletion of the resource through their uncoordinated action.
The usual example of a TotC is a fishing pond: Everyone wants to fish as much as possible, but fish are not infinite, and if you fish them faster than they can reproduce, you end up with fewer and fewer fish per catch.
AI development seems to have a similar dynamic: Everyone has an incentive to build more and more powerful AIs, because there is a lot of money to be made in doing so. But more and more powerful AIs being made increases the likelihood of an unstoppable AGI being made.
There are some differences, but I think this is the underlying dynamic driving AI development today. The biggest point of difference is that, whereas one person’s overfishing eventually causes a noticeable negative effect on other fishers, and at the least does not improve their own catches, one firm building a more powerful AI probably does improve the economic situation of the other people who leverage it, up until a critical point.
Are there other tragedies of the commons that exhibit such non-monotonic behavior?
Would AGI still be an x-risk under communism?
1-bit verdict
Yes.
2-bit verdict
Absolutely, yes.
Explanation
An artificial general intelligence (AGI) is a computer program that can perform at least as well as an average human being across a wide variety of tasks. The concept is closely linked to that of a general superintelligence, which can perform better than even the best human being across a wide variety of tasks.
There are reasons to believe most, perhaps almost all, general superintelligences would end up causing human extinction. AI safety is a cross-disciplinary field of mathematics, economics, computer science, and philosophy which tackles the problem of how to stop such superintelligences.
AI alignment is a subfield of AI safety which studies theoretical conditions under which superintelligences aligned with human values can emerge. Another branch, which might be called AI deterrence, aims instead to make the production of unaligned superintelligences less likely in the first place.
One of the primary reasons someone might want to create a superintelligence, even while understanding the risks involved, is the vast economic value such a program could generate. From a deterrence lens, it makes sense then to look into how this profit motive might be curtailed before catastrophe. Why not communism?
Unfortunately, this is almost certainly a bad move. To date, communism at almost every scale has never escaped the rampant black markets that appear when price signals are distorted. There is no reason to suspect such black markets wouldn’t have just as strong a profit motive to create stronger and stronger AGIs. Indeed, because black markets are already illegal, this may worsen the problem: well-funded teams producing AGI out of the public eye are likely to generate less pushback, and to be better equipped to avoid deterrence-oriented legislation, than a clear market team such as OpenAI.
Towards a #1-flavored answer, a Hansonian fine-insured bounty system seems like it might scale well for enforcing cooperation against AI research.
https://www.overcomingbias.com/2018/01/privately-enforced-punished-crime.html
OP here, talking from an older account because it was easier to log into on mobile.
Kill: I never said anything about killing them. Prisoners like this don’t pose any immediate threat to anyone, and indeed are probably very skilled white collar workers who could earn a lot of money even behind bars. No reason you couldn’t just throw them into a minimum security jail in Sweden or something and keep an eye on their Internet activity.
McCarthyism: Communism didn’t take over in the US. That provides if anything weak evidence that these kinds of policies can work, even for suppressing much more controversial ideas than preventing the building of an unsafe AI.
q1: The hardcore answer would be “Sorry kid, nothing personal.” If there was ever a domain where false positives were acceptable losses, stopping unaligned AI from being created in the first place would probably be it. People have waged wars for far less. The softcore answer, and the one I actually believe, is that you’re probably a smart enough guy that if such a bounty were announced you would be able to drop those activities quickly and find new work or hobbies within a few months.
q2: I mean, you could. You can make a bounty to disincentivize any behavior. But who would have that kind of goal or support such a bounty, much less fund one? If you’re worried about Goodhart’s law here, just use a coarse enough metric like “gets paid to work on something AI-related” and accept there would be some false positives.
[Productivity] How not to use “Important // Not Urgent”
[ELDR Tactics] Consider switching to (mostly) decaf.
Metcalfe’s (revised!) law states that the value of a communications network grows at about $n \log n$, where $n$ is the number of users.
I frequently give my friends the advice that they should aim to become pretty good at 2 synergistic disciplines (CS and EE for me, for example), but I have wondered in the past why I don’t give them the advice to become okay at 4 or 5 synergistic disciplines instead.
It just struck me these ideas might be connected in some way, but I am having trouble figuring out exactly how.
Try to think about this in terms of expected value. On your specific example, they do score more, but this is probabilistic thinking, so we want to think about it in terms of the long run trend.
Suppose we no longer know what the answer is, and you are genuinely 50⁄50 on it being either A or B. This is what you truly believe, you don’t think there’s a chance in hell it’s C. If you sit there and ask yourself, “Maybe I should do a 50-25-25 split, just in case”, you’re going to immediately realize “Wait, that’s moronic. I’m throwing away 25% of my points on something I am certain is wrong. This is like betting on a 3-legged horse.”
Now let’s say you do a hundred of these questions, and most of your 50-50s come up correct-ish as one or the other. Your opponent consistently does 50-25-25s, and so they end up more wrong than you overall, because half the time the answer lands on one of their two 25s, not their single 50.
It’s not a game of being more correct, it’s a game of being less wrong.
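To put numbers on the expected-value argument, here is a minimal sketch. It assumes the exam is graded with a logarithmic scoring rule, which is my assumption rather than something stated above; any strictly proper scoring rule should rank the two reports the same way. The function name `expected_log_score` is made up for illustration.

```python
import math

def expected_log_score(belief, report):
    """Expected log score of `report`, averaged over what you actually believe.

    Assumes a logarithmic scoring rule: you earn ln(q) where q is the
    probability you reported for the answer that turns out to be correct.
    """
    total = 0.0
    for p, q in zip(belief, report):
        if p == 0:
            continue  # outcomes you consider impossible contribute nothing
        if q == 0:
            return float("-inf")  # zero weight on something you think can happen
        total += p * math.log(q)
    return total

# You are genuinely 50/50 between A and B, and certain it is not C.
belief = [0.5, 0.5, 0.0]

honest = expected_log_score(belief, [0.5, 0.5, 0.0])    # report what you believe
hedged = expected_log_score(belief, [0.5, 0.25, 0.25])  # the "just in case" split

print(f"honest 50-50-0:  {honest:.3f}")  # -0.693
print(f"hedged 50-25-25: {hedged:.3f}")  # -1.040
```

Both scores are negative, and the honest report is the less negative of the two, which is the "less wrong" point in miniature.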
I disagree with your first point; I consider the 50:25:25:0 thing to be the point. It’s hard to swallow, because admitting ignorance rather than appearing falsely confident always is, but that’s why it makes for such a good value to train.
Agreed on the difference. Different subcultures, I think, all try to push different narratives about how they are significantly different from other subcultures; they are in competition with other subcultures for brain-space. On that observation, my priors that rationalist content is importantly different to other subcultures in that regard are low.
I suppose my real point in writing this is to advise against a sort of subcultural Fear Of Being Ordinary—rationalism doesn’t have to be qualitatively different from other subcultures to be valuable. For people under its umbrella, it can be very valuable, for reasons that have almost nothing to do with the quirks of the subculture itself.
This actually seems like a really, really good idea. Thanks!
Great post! Simple and useful. For spaced-repetition junkies in the crowd, I created a small Anki deck from this post to help me retain the basics.
You could normalize the scoring rule back to 1, so that should be fine.
Scattered thoughts on how the rationalist movement has helped me:
On the topic of rationalist self-improvement, I would like to raise the point that simply feeling as though there’s a community of people who get me and that I can access when I want to has been hugely beneficial to my sense of happiness and belonging in the world.
That generates a lot of hedons for me, which then on occasion allow me to “afford” doing other things I wouldn’t otherwise, like spend a little more time studying mathematics or running through Anki flashcards. There’s a part of me that feels like I’m not just building up this knowledge for myself, but for the future possible good of “my people”. I might tie together stuff in a way that other people find interesting, or insightful, or at least enjoy reading about, and that’s honestly fricking awesome and blows standard delayed-gratification “self improvement” tactics outta the water 10⁄10 would recommend.
Also there’s the whole thing that Ozy who is rat-almost-maybe-adjacent wrote the greatest summary of the greatest dating advice book I ever read, and I literally read that effortpost every day for like 8 months while I was learning how to be a half-decent romantic option, and holy SHIT is my life better for that. But again—nothing specific to the rationalist techniques themselves there; the value of the community was pointing me to someone who thinks and writes in a way my brain sees and says “mmm yes tasty good word soup i liek thanke” and then that person happened to write a post that played a big role in helping me with a problem that was causing me a ton of grief.
TLDR rationalists > rationalism
aaq’s Shortform
Reading list: Starting links and books on studying ontology and causality
When I stop to think of people I support who I would peg as “extreme in words, moderate in actions”, I think I feel a sense of overall safety that might be relevant here.
Let’s say I’m in a fierce, conquering mood. I can put my weight behind their extremism, and feel powerful. I’m Making A Difference, going forth and reshaping the world a little closer to utopia.
When I’m in a defeatist mood, where nothing makes sense and I feel utterly hopeless, I can *also* get behind the extremism—but it’s in a different light, now. It’s more, “I am so small, and the world is so big, but I can still live by what I feel is right”.
Those are really emotionally powerful and salient times for me, and ones that have a profound effect on my sense of loyalty to certain causes. But most of the time, I’m puttering along and happy to be in the world of moderation. Intellectually, I understand that moderation is almost always going to be the best way forward; emotionally, it’s another story entirely.
Upon first reading, I had the thought that a lot of people don’t notice the extreme/moderate dichotomy of most of their leaders. I still think that’s true. And then a lot of people do learn of that dichotomy, and they become disgusted by it, and turn away from anyone who falls in that camp. Which makes sense, honesty is a great virtue, why can’t they just say what they mean? But then I look at myself, and while it doesn’t feel *optimal* to me, it does feel like just another element of playing the game of power. There’s this skill of reading between the lines that I think most people know is there, but they’re a little reluctant to look straight at it.
1a → Broadly agree. “Weaker” is an interesting word to pick here; I’m not sure whether an anarcho-primitivist society would be considered weaker or stronger than a communist one systemically. Maybe it depends on timescale. Of course, if this were the only size lever we had to move x-risk up and down, we’d be in a tough position—but I don’t think anyone takes that view seriously.
1b → Logically true, but I do see strong reason to think short term x-risk is mostly anthropogenic. That’s why we’re all here.
2 → I do agree it would probably take a while.
3a → Depends on how coarse or fine-grained the distribution of resources is; a simple linear optimizer program would probably do the same job better for most coarser distribution schemes.
3b → Kind of. I’m looking into them as a curiosity.