> 20. (...) To faithfully learn a function from ‘human feedback’ is to learn (from our external standpoint) an unfaithful description of human preferences, with errors that are not random (from the outside standpoint of what we’d hoped to transfer). If you perfectly learn and perfectly maximize the referent of rewards assigned by human operators, that kills them.
So, I take this to be a critique of proposals to teach an AI ethics by co-training it with humans.
There seem to be many obvious solutions to the problem that lots of people won’t answer correctly when asked to “Point out any squares of people behaving badly” or “Point out any squares of people acting against their self-interest” etc.:
- make the AI’s model expect more random errors in the feedback
- after noticing that some responders give better answers, give those responders’ answers more weight
- limit the number of people who co-train the AI
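The second idea above (weighting responders by demonstrated answer quality) could be sketched as a toy reliability-weighted vote. This is only an illustration; the function name, the default weight, and the example reliability scores are all made up:

```python
from collections import defaultdict

def aggregate_labels(responses, reliability):
    """Combine labels from several human responders, weighting each
    responder's vote by an estimated reliability score.

    responses: dict mapping responder id -> label given
    reliability: dict mapping responder id -> weight in [0, 1]
    Returns the label with the highest total weight.
    """
    scores = defaultdict(float)
    for responder, label in responses.items():
        # Unknown responders get a neutral default weight (an arbitrary choice here).
        scores[label] += reliability.get(responder, 0.5)
    return max(scores, key=scores.get)

# Toy example: two responders judged reliable outvote three judged unreliable.
responses = {"a": "bad", "b": "bad", "c": "fine", "d": "fine", "e": "fine"}
reliability = {"a": 0.9, "b": 0.9, "c": 0.3, "d": 0.3, "e": 0.3}
print(aggregate_labels(responses, reliability))  # -> "bad" (1.8 vs 0.9)
```

Of course, this just pushes the problem back a step: something still has to decide which responders count as “giving better answers,” which is itself a value-laden judgment.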
What’s the problem with these ideas?
Has anyone created an ethical framework for developing an AGI from the AI’s own perspective?
That is, are any developers trying to establish principles for not creating someone like Marvin from The Hitchhiker’s Guide to the Galaxy, similar to how MIRI is trying to establish principles for not creating a misaligned AI?
EDIT: The alignment problem is definitely more pressing at the moment, and I would guess that an AI would become a threat to humans before it warranted ethical consideration in its own right... but better to be on the safe side.