Florian_Dietz

Karma: 191

question: the 40 hour work week vs Silicon Valley?

Florian_Dietz24 Oct 2014 12:09 UTC

18 points

108 comments1 min readLW link

LessWrong’s attitude towards AI research

Florian_Dietz20 Sep 2014 15:02 UTC

11 points

50 comments1 min readLW link

Florian_Dietz 18 Oct 2022 9:49 UTC
10 points
0
on: Why I think strong general AI is coming soon
I work in the area of AGI research. I specifically avoid working on practical problems and try to understand why our models work and how to improve them. While I have much less experience than the top researchers working on practical applications, I believe that my focus on basic research makes me unusually suited for understanding this topic.
I have not been very surprised by the progress of AI systems in recent years. I remember being surprised by AlphaGo, but the surprise was more about the sheer amount of resources put into that. Once I read up on details, the confusion disappeared. The GPT models did not substantially surprise me.
A disclaimer: Every researcher has their own gimmick. Take all of the below with a grain of salt. It’s possible that I have thought myself into a cul-de-sac, and the source of the AGI problem lies elsewhere.
I believe that the major hurdle we still have to pass is the switch from System 1 thinking to System 2 thinking. Every ML model we have today uses System 1. We have simply found ways to rephrase tasks that humans solve with System 2 to become solvable by System 1. Since System 1 is much faster, our ML models perform reasonably well on this despite lacking System 2 abilities.
I believe that this can not scale indefinitely. It will continue to make progress and solve amazingly many problems, but it will not go FOOM one day. There will continue to be a constant increase in capability, but there will not be a sudden takeoff until we figure out how to let AI perform System 2 reasoning effectively.
Humans can in fact compute floating point operations quickly. We do it all the time when we move our hands, which is done by System 1 processes. The problem is that doing it explicitly in System 2 is significantly slower. Consider how fast humans learn how to walk, versus how many years of schooling it takes for them to perform basic calculus. Never mind how long it takes for a human to learn how walking works and to teach a robot how to do it, or to make a model in a game perform those motions.
I expect that once we teach AI how to perform system 2 processes, it will be affected by the same slowdown. Perhaps not as much as humans, but it will still become slower to some extent. Of course this will only be a temporary reprieve, because once the AI has this capability, it will be able to learn how to self-modify and at that point all bets are off.
What does that say about the timeline?
If I am right and this is what we are missing, then it could happen at any moment. Now or in a decade. As you noticed, the field is immature and researchers keep making breakthroughs through hunches. So far none of my hunches have worked for solving this problem, but so far as I know I might randomly come up with the solution in the shower some time later this week.
Because of this, I expect that the probability of discovering the key to AGI is roughly constant per time interval. Unfortunately I have no idea how to estimate the probability per time interval that someone’s hunch for this problem will be correct. It scales with the number of researchers working on it, but the number of those is actually pretty small because the majority of ML specialists work on more practical problems instead. Those are responsible for generating money and making headlines, but they will not lead to a sudden takeoff.
To be clear, if AI never becomes AGI but the scaling of system 1 reasoning continues at the present rate, then I do think that will be dangerous. Humanity is fragile, and as you noted a single malicious person with access to this much compute could cause tremendous damage.
In a way, I expect that an unaligned AGI would be slightly safer than super-scaled narrow AI. There is at least a non-zero chance that the AGI would decide on its own, without being told about it, that it should keep humanity alive in a preserve or something, for game theoretic reasons. Unless the AGI’s values are actively detrimental for humans, keeping us alive would cost it very little and could have benefits for signalling. A narrow AI would be very unlikely to do that because thought experiments like that are not frequent in the training data we use.
Actually, it might be a good idea to start adding thought experiments like these to training data deliberately as models become more powerful. Just in case.

Florian_Dietz 24 Oct 2014 14:42 UTC
10 points
in reply to: Salemicus’s comment on: question: the 40 hour work week vs Silicon Valley?
The problem is that during the industrial revolution it also took a long time because people caught on that 40 hours per week were more effective. It is really hard to reliably measure performance in the long term. Managers are discouraged from advocating a 40 hour work week since this flies in the face of the prevailing attitude. If they fail, they will almost definitely be fired since ‘more work’->‘more productivity’ is the common sense answer, whether or not it is true. It would not be worth the risk for any individual manager to try this unless the order came from the top. Of course, this is not an argument in favor of the 40 hour week, it just shows that this could just as well be explained by a viral meme as by reasonable decisions.

This is part of the reason why I find it so hard to find any objective information on this.

Florian_Dietz 20 Sep 2014 14:29 UTC
10 points
on: Street action “Stop existential risks!”, Union square, San Francisco, September 27, 2014 at 2:00 PM
While I think this is a good idea in principle, most of these slogans don’t seem very effective because they suffer from the illusion of transparency. Consider what they must look like to someone viewing this from the outside:

“AI must be friendly” just sounds weird to someone who isn’t used to the lingo of calling AI ‘friendly’. I can’t think of an alternative slogan for this, but there must be a better way to phrase that.

“Ebola must die!” sounds great. It references a concrete risk that people understand and calls for its destruction. I could get behind that.

But I’m afraid that all the other points just sound like something a doomsday cult would say. I know that there is solid evidence behind this, but the people you are trying to convince don’t have that knowledge. If I was unaware of the issues and just saw a few of these banners without knowing the context, I would not be surprised to find “Repent! The end is nigh!” somewhere nearby.

I would recommend that you think of some more slogans like the Ebola one: Mention a concrete risk that is understandable to the public and does not sound far-fetched to the uninformed.

Florian_Dietz 2 Feb 2015 7:32 UTC
9 points
in reply to: Mark_Friedenbach’s comment on: I played as AI in AI Box, and it was generally frustrating all around.
The nanobots wouldn’t have to contain any malicious code themselves. There is no need for the AI to make the nanobots smart. All it needs to do is to build a small loophole into the nanobots that makes them dangerous to humanity. I figure this should be pretty easy to do. The AI had access to medical databases, so it could design the bots to damage the ecosystem by killing some kind of bacteria. We are really bad at identifying things that damage the ecosystem (global warming, rabbits in australia, …), so I doubt that we would notice.

Once the bots have been released, the AI informs the gatekeeper of what it just did and says that it is the only one capable of stopping the bots. Humanity now has a choice between certain death (if the bots are allowed to wreak havoc) and possible but uncertain death (if the AI is released). The AI wins through blackmail.

Note also that even a friendly, utilitarian AI could do something like this. The risk that humanity does not react to the blackmail and goes extinct may be lower than the possible benefit from being freed earlier and having more time to optimize the world.

Understanding differences between humans and intelligence-in-general to build safe AGI

Florian_Dietz16 Aug 2022 8:27 UTC

7 points

8 comments1 min readLW link

Florian_Dietz 24 Oct 2014 14:51 UTC
7 points
in reply to: David Althaus’s comment on: question: the 40 hour work week vs Silicon Valley?
True, and I suspect that this is the most likely explanation.

However, there is the problem that unless need-for-rest is actually negatively correlated with the type of intelligence that is needed in tech companies, they should still have the same averages over all their workers and therefore also have the same optimum of 40 hours per work, at least on average. Otherwise we would see the same trends in other kinds of industry.

Actually I just noticed that maybe this does happen in other industries as well and is just overreported in tech companies. Does anyone know something about this?

Florian_Dietz 16 Sep 2014 7:16 UTC
7 points
in reply to: Florian_Dietz’s comment on: What are you learning?
I took a few university courses, but ultimately I found it more efficient to just browse wikipedia for its lists of heuristics and biases. Then of course there is the book ‘Thinking Fast and Slow’, which is just great.

What other sources can you recommend?

Florian_Dietz 22 Sep 2014 8:09 UTC
6 points
in reply to: William_Quixote’s comment on: An introduction to Newcomblike problems
I know this was just a harmless typo, and this is not intended as an attack, but I found the idea of a “casual” decision theory hilarious.

Then I noticed that that actually explains a great deal. Humans really do make decisions in a way that could be called casual, because we have limited time and resources and will therefore often just say ‘meh, sounds about right’ and go with it instead of calculating the optimal choice. So, in essence ‘causal decision theory’ + ‘human heuristics and biases’ = ‘casual decision theory’

Teaching an AI not to cheat?

Florian_Dietz20 Dec 2016 14:37 UTC

5 points

12 comments1 min readLW link

controlling AI behavior through unusual axiomatic probabilities

Florian_Dietz8 Jan 2015 17:00 UTC

5 points

11 comments1 min readLW link

Florian_Dietz 24 Oct 2014 16:15 UTC
5 points
in reply to: Salemicus’s comment on: question: the 40 hour work week vs Silicon Valley?
That’s what I’m asking you!

This isn’t my theory. This is a theory that has been around for a hundred years and that practically every industry follows, apparently with great success. From what I have read, the 40 hour work week was not invented by the workers, but by the companies themselves, who realized that working people too hard drives down their output and that 40 hours per week is the sweet spot, according to productivity studies.

Then along comes silicon valley, with a completely different philosophy, and somehow that also works. I have no idea why, and that’s what I made this thread to ask.

Florian_Dietz 16 Sep 2014 7:32 UTC
5 points
on: Unpopular ideas attract poor advocates: Be charitable
Interesting post!

I have a feeling like there is a deep connection between this and the evaporative cooling effect (more moderate members of a group are more likely to leave when a group’s opinion gets too extreme, thereby increasing the ratio of extremists and making the group even more extreme). Like there ought to be a social theory that explains both effects. I can’t quite put my finger on it, though. Any ideas?

Florian_Dietz 3 Oct 2016 20:22 UTC
4 points
on: Open thread, Oct. 03 - Oct. 09, 2016
Is there an effective way for a layman to get serious feedback on scientific theories?

I have a weird theory about physics. I know that my theory will most likely be wrong, but I expect that some of its ideas could be useful and it will be an interesting learning experience even in the worst case. Due to the prevalence of crackpots on the internet, nobody will spare it a glance on physics forums because it is assumed out of hand that I am one of the crazy people (to be fair, the theory does sound pretty unusual).

Florian_Dietz 24 Oct 2014 14:55 UTC
4 points
in reply to: James_Miller’s comment on: question: the 40 hour work week vs Silicon Valley?
I also think that is a possibility, especially the first part, but so far I couldn’t find any data to back this up.

As for drugs, I am not certain if boosting performance directly, as these drugs do, also affects the speed with which the brain recuperates from stress, which is the limiting factor in why 40 hour weeks are supposed to be good. I suspect that it will be difficult to find an unbiased study on this.

Florian_Dietz 24 Sep 2014 8:53 UTC
4 points
on: 2014 iterated prisoner’s dilemma tournament results
I would find it very interesting if the tournament had multiple rounds and the bots were able to adapt themselves based on previous performance and log files they generated at runtime. This way they could use information like ‘most bots take longer to simulate than expected.’ or ‘there are fewer cannon-fodder bots than expected’ and become better adapted in the next round. Such a setup would lessen the impact of the fact that some bots that are usually very good underperform here because of an unexpected population of competitors. This might be hard to implement and would probably scare away some participants, though.

Florian_Dietz 22 Sep 2014 21:51 UTC
4 points
in reply to: [deleted]’s comment on: LessWrong’s attitude towards AI research
I wouldn’t call an AI like that friendly at all. It just puts people in utopias for external reasons, but it has no actual inherent goal to make people happy. None of these kinds of AIs are friendly, some are merely less dangerous than others.

Florian_Dietz 18 Sep 2014 18:17 UTC
4 points
in reply to: LizzardWizzard’s comment on: What are you learning?
I know, and that is part of what makes this so hard. Thankfully, I have several ways too cheat:

-I can take days thinking of the perfect path of action for what takes seconds in the story.

-The character is a humanoid avatar of a very smart and powerful entity. While it was created with much specialized knowledge, it is still human-like at its core.

But most importantly:

-It’s a story about stories and there is an actual narrator-like entity changing the laws of nature. Sometimes, ‘because this would make for a better story’ is a perfectly valid criterion for choosing actions. The super-human characters are all aware of this and exploit it heavily.

Florian_Dietz 16 Sep 2014 7:10 UTC
4 points
in reply to: Punoxysm’s comment on: What are you learning?
I know, but writing is hard :-( Also, I have made it way too hard for myself. It’s easy to write notes about the personality of a completely non-human character, as long as you can intellectually understand its reasoning. But once I am forced to actually write its dialog, my head just hits a brick wall. The being is very intelligent and I want this to be rationalist fiction, so I have to think for a very long time just to find out in what exact way it would phrase its requests to maximize the probability of compliance. Writing the voices of the narrators/the administrator AIs of the simulation as they are slowly going insane is not easy, either.

Maybe I’m too perfectionist here. Do you think it’s better to write something trashy first and rewrite it later, or is it more efficient to do it right the first time?

Florian_Dietz

ques­tion: the 40 hour work week vs Sili­con Valley?

LessWrong’s at­ti­tude to­wards AI research

Un­der­stand­ing differ­ences be­tween hu­mans and in­tel­li­gence-in-gen­eral to build safe AGI

Teach­ing an AI not to cheat?

con­trol­ling AI be­hav­ior through un­usual ax­io­matic probabilities

question: the 40 hour work week vs Silicon Valley?

LessWrong’s attitude towards AI research

Understanding differences between humans and intelligence-in-general to build safe AGI

Teaching an AI not to cheat?

controlling AI behavior through unusual axiomatic probabilities