Nnotm

Karma: 103

Nnotm 19 Aug 2013 20:47 UTC
0 points
on: The Quick Bayes Table
I know this is over a year old, but I still feel like this is worth pointing out:

If you can get the positive likelihood ratio as the meaning of a positive result, then you can use the negative likelihood ratio as the meaning of the negative result just reworking the problem.

You weren’t using the likelihood ratio, which is one value, 8.33… in this case. You were using the numbers you use to get the likelihood ratio.

But the same likelihood ratio would also occur if you had 8% and 0.96%, and then the “negative likelihood ratio” would be about 0.93 instead of 0.22.

You simply need three numbers. Two won’t suffice.

Nnotm 26 Nov 2013 14:42 UTC
26 points
on: 2013 Less Wrong Census/Survey
I took it. I was surprised how far I was off with Europe.

Nnotm 26 Nov 2013 14:49 UTC
2 points
in reply to: Lion’s comment on: 2013 Less Wrong Census/Survey
There are no “correct” or “incorrect” definitions, though, are there? Definitions are subjective, it’s only important that participants of a discussion can agree on one.

Nnotm 26 Nov 2013 23:13 UTC
0 points
in reply to: Lumifer’s comment on: 2013 Less Wrong Census/Survey
That’s true, though I think “optimal” would be a better word for that than “correct”.

Nnotm 10 Apr 2014 1:26 UTC
4 points
in reply to: Eliezer Yudkowsky’s comment on: How to Seem (and Be) Deep
Working links on yudkowsky.net and acceleratingfuture.com:

Transhumanism as Simplified Humanism The Meaning That Immortality Gives to Life

Nnotm 12 Apr 2014 21:01 UTC
3 points
on: Botworld: a cellular automaton for studying self-modifying agents embedded in their environment
Sounds pretty cool, definitely going to try it out some.

Oh, and by the way, you wrote “Inpsect” instead of “Inspect” at the end of page 27.

Nnotm 21 Apr 2014 13:37 UTC
1 point
on: Pascal’s Mugging: Tiny Probabilities of Vast Utilities
Why wait until someone wants the money? Shouldn’t the AI try to send 5 Dollars to everyone with a note attached reading “Here is a tribute; please don’t kill a huge number of people” regardless of whether they ask for it or not?

Nnotm 12 May 2018 18:04 UTC
9 points
in reply to: Jeevan’s comment on: AI Alignment is Alchemy.
What you’re saying goes against the here widely believed orthogonality thesis, which essentially states that what goal an agent has is independent of how smart it is. If the agent has programmed in a certain set of goals, there is no reason for it to change this set of goals if it becomes smarter (this is because changing its goals would not be beneficial to achieving its current goals).
In this example, if an agent has the sole goal of fulfilling the wishes of a particular human, there is no reason for it to change this goal once it becomes an ASI. As far as the agent is concerned, using resources for this purpose wouldn’t be a waste, it would be the only worthwhile use for them. What else would it do with them?
You seem to be assigning some human properties to the hypothetical AI (e.g. “scorn”, viewing something as “petty”), which might be partially responsible for the disagreement here.

Nnotm 12 May 2018 19:06 UTC
3 points
in reply to: Jeevan’s comment on: AI Alignment is Alchemy.
It would need a reason of some kind of reason to change its goals—one might call it a motivation. The only motivation it has available though, are its final goals, and those (by default) don’t include changing the final goals.
Humans never had the final goal replicating their genes. They just evolved to want to have sex. (One could perhaps say that the genes themselves had the goal of replicating, and implemented this by giving the humans the goal of having sex.) Reward hacking doesn’t involve changing the terminal goal, just fulfilling it in unexpected ways (which is one reason why reinforcement learning might be a bad idea for safe AI.)

Nnotm 13 May 2018 21:19 UTC
2 points
in reply to: Jeevan’s comment on: AI Alignment is Alchemy.
Whether or not it would question its reality mostly depends on what you mean by that—it would almost certainly be useful to figure out how the world works, and especially how the AI itself works, for any AI. It might also be useful to figure out the reason for which it was created.
But, unless it was explicitly programmed in, this would likely not be a motivation in and of itself, rather, it would simply be useful for accomplishing its actual goal.
I’d say the reason why humans place such high value in figuring out philosophical issues is to a large extent because evolution produces messy systems with inconsistent goals. This *could* be the case for AIs too, but to me it seems more likely that some more rational thought will go into their design.
(That’s not to say that I believe it will be safe by default, but simply that it will have more organized goals than humans have.)

Nnotm 14 May 2018 18:45 UTC
2 points
in reply to: Jeevan’s comment on: AI Alignment is Alchemy.
AI alignment is not about trying to outsmart the AI, it’s about making sure that what the AI wants is what we want.
If it were actually about figuring out all possible loopholes and preventing them, I would agree that it’s a futile endeavor.
A correctly designed AI wouldn’t have to be banned from exploring any philosophical or introspective considerations, since regardless of what it discovers there, it’s goals would still be aligned with what we want. Discovering *why* it has these goals is similar to humans discovering why we have our motivations (i.e., evolution), and similarly to how discovering evolution didn’t change much what humans desire, there’s no reason to assume that an AI discovering where its goals come from should change them.
Of course, care will have to be taken to ensure that any self-modifications don’t change the goals. But we don’t have to work *against* the AI to accomplish that—the AI *also* aims to accomplish its current goals, and any future self-modification that changes its goals would be detrimental in accomplishing its current goals, so (almost) any rational AI will, to the best of its ability, aim *not* to change its goals. Although this doesn’t make it easy, since it’s quite difficult to formally specify the goals we would want an AI to have.

Nnotm 17 May 2018 3:53 UTC
2 points
in reply to: Jeevan’s comment on: AI Alignment is Alchemy.
As I understand it, the idea with the problems listed in the article is that their solutions are supposed to be fundamental design principles of the AI, rather than addons to fix loopholes.
Augmenting ourselves is probably a good idea to do *in addition* to AI safety research, but I think it’s dangerous to do it *instead* of AI safety research. It’s far from impossible that artificial intelligence could gain intelligence much faster at some point than augmenting the rather messy human brain, at which point it *needs* to be designed in a safe way.

Nnotm 6 Sep 2021 1:06 UTC
10 points
in reply to: James_Miller’s comment on: Rough notes on the Sam Altman Q&A: GPT and AGI
Is that to be interpreted as “finding out whether UFOs are aliens is important” or “the fact that UFOs are aliens is important”?

Nnotm 6 Sep 2021 1:13 UTC
13 points
on: Rough notes on the Sam Altman Q&A: GPT and AGI
One question was whether it’s worth working on anything other than AGI given that AGI will likely be able to solve these problems; he agreed, saying he used to work with 1000 companies at YC but now only does a handful of things, partially just to get a break from thinking about AGI.

Nnotm 14 Jan 2022 12:06 UTC
1 point
NIL
on: Open Thread—Jan 2022 [Vote Experiment!]
Is there a post as part of the sequences that’s roughly about how your personality is made up of different aspects, and some of them you consider to be essentially part of who you are, and others (say, for example, maybe the mechanisms responsible for akrasia) you wouldn’t mind dropping without considering that an important difference to who you are?

For years I was thinking Truly Part Of You was about that, but it turns out, it’s about something completely different.

Now I’m wondering if I had just imagined that post existing or just mentally linked the wrong title to it.

Nnotm 22 Jan 2022 8:09 UTC
1 point
in reply to: hamnox’s comment on: Open Thread—Jan 2022 [Vote Experiment!]
I haven’t read the luminosity sequence, but I just spent some time looking at the list of all articles seeing if I can spot a title that sounds like it could be it, and I found it: Which Parts are “Me”? - I suppose the title I had in mind was reasonably close.

Nnotm 6 Apr 2022 19:48 UTC
5 points
in reply to: Rabrg’s comment on: DALL·E 2 by OpenAI
The original DALL-E was capable of having almost the same image with slight variations in one generation, so I’d be interested to see something like “A photograph of a village in 1900 on the top, and the same photo colorized on the bottom”.

Nnotm 6 Apr 2022 19:54 UTC
1 point
in reply to: Rabrg’s comment on: DALL·E 2 by OpenAI
Very cool, thanks!

Nnotm 27 Jun 2022 17:10 UTC
1 point
0
on: Contest: An Alien Message
Interpreting the data as unsigned 8-bit integers and plotting it as an image with width 8 results in this (only the first few rows shown):
The rest of the image looks pretty similar. There is a almost continuous high-intensity column (yellow, the second-to-last column), and the values in the first 6 columns repeat exactly in the next row pretty often, but not always.

Nnotm 28 Jun 2022 8:58 UTC
7 points
0
in reply to: Rafael Harth’s comment on: Contest: An Alien Message
For what it’s worth, colored by how soon in the sequence they appear (blue is early, red is late) (Also note I interpreted it as 2094 points, with each number first used in the x-dimension and then in the y-dimension):

Note that one line near the top appears to be drawn twice, confirming if nothing else that it’s not a rule that it’s not a succession rule that only depends on the previous value, since the paths diverge afterwards.
Still, comparing those two sections could be interesting.