meijer1973

Karma: 51

meijer1973 9 Jun 2023 18:51 UTC
2 points
1
in reply to: Boaz Barak’s comment on: The (local) unit of intelligence is FLOPs
This emphasis on generality makes deployment of future models a lot easier. We first build a gpt4 ecosystem. When gpt5 comes out it will be easy to implement (e.g. autogpt can run just as easy on gpt4 as on gpt5). The adaptions that are necessary are very small and thus very fast deployment of future models is to be expected.

Transformative AI is a process

meijer19738 Jun 2023 8:57 UTC

2 points

0 comments5 min readLW link

meijer1973 7 Jun 2023 19:17 UTC
2 points
1
on: The (local) unit of intelligence is FLOPs
Fine-tuning, whether using RL or not, is the proverbial “cherry on the cake” and the pre-trained model captures more than 99.9% of the intelligence of the model.
I am still amazed by the strength of general models. There is the no-free lunch theorem that people use to point out that we will probably have specialized AI’s because they will be better. Current practice seems to contradict this.

meijer1973 7 Jun 2023 14:15 UTC
2 points
2
in reply to: Daniel Kokotajlo’s comment on: Transformative AGI by 2043 is <1% likely
AI will probably displace a lot of cognitive workers in the near future. And physical labor might take a while to get below 25$/hr.
- Most most tasks human level intelligence is not required.
- Most highly valued jobs have a lot of tasks that do not require high intelligence.
- Doing 95% of all tasks could be a lot sooner (10-15 years earlier) than 100%. See autonomous driving (getting to 95% safe or 99,9999 safe is a big difference).
- Physical labor by robots will probably remain expensive for a long time (e.g. a robot plumber). A robot ceo is probably cheaper in the future than the robot plumber.
- Just take gpt4 and fine tune it and you can automate a lot of cognitive labor already.
- Deployment of cognitve work automation (a software update) is much faster that deployment of physical robots.
I agree that AI might not replace swim instructors by 2030. It is the cognitive work where the big leaps will be.

meijer1973 7 Jun 2023 13:57 UTC
5 points
1
on: Algorithmic Improvement Is Probably Faster Than Scaling Now
An interesting development is the development of synthetic data. This is also a sort of algorithmic improvement, because the data is generated by algorithms. For example in the verify step by step paper there is a combination of synthetic data and human labelling.
At first this seemed counter intuitive to me. The current model is being used to create data for the next model. Feels like bootstrapping. But it starts to make sense now. Better prompting (like CoT or ToT) is a method to get better data or a second model that is trained to pick the best answers from a thousand and that will get you data good enough to improve the model.
Demis Hassabis said in his interview with Lex Fridman that they used synthetic data when developing AlphaFold. They had some output of AlphaFold that they had great confidence in. Then they fed the output as input and the model improved (this gives your more data with great confidence, repeat).

meijer1973 6 Jun 2023 9:00 UTC
1 point
0
in reply to: Veniversum Vivus Vici’s comment on: What’s your viewpoint on the likelihood of GPT-5 being able to autonomously create, train, and implement an AI superior to GPT-5?
Specific Resources (Access to a DGX data center): Even if an AI had access to such resources, it would still need to understand how to use them effectively, which would require capabilities beyond what GPT-4 or a hypothetical GPT-5 have.
To my knowledge resource management in data centers is done by AI’s. It is the humans who cannot do this. The AI already can.

meijer1973 6 Jun 2023 8:57 UTC
1 point
0
in reply to: Jsevillamol’s comment on: Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?
Algorithmic improvement has more FOOM potential. Hardware always has a lag.

meijer1973 6 Jun 2023 8:55 UTC
1 point
2
in reply to: Matthew Barnett’s comment on: Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?
Hanson’s chance on extinction is close to a 100%. He just thinks it’s slower. He is optimistic about something that most would call a dystopia (a very interesting technological race that will conquer the stars before the grabby aliens do). A discussion between Yudkowsky and Hanson is about are we dying fast or slow. It is not really a doomer vs non-doomer debate from my perspective (still a very interesting debate btw, both have good arguments).
I do appreciate the Hanson perspective. It is well thought out and coherent. I just would not call it optimistic (because of the extinction). I have no ready example of a non-extinction perspective coherent view on the future. Does anybody have a good example of a coherent non-extinction view?

meijer1973 6 Jun 2023 8:43 UTC
1 point
0
in reply to: bhauth’s comment on: implications of NN design for education
If I understand you correctly you mean this transfer between machine learning and human learning. Which is an interesting topic.
When a few years ago I learned about word2vec I was quite impressed. It felt a lot like how humans store information according to cognitive psychology. In cognitive psychology, a latent space or a word vector would be named as a semantic representation. Semantic representations are mental representations of the meaning of words or concepts. They are thought to be stored in the brain as distributed representations, meaning that they are not represented by a single unit of activation, but rather by a pattern of activation across many units.
That was sort my “o shit this is going to be a thing” moment. I realized there are similarities between human and machine understanding. This is a way to build a world model.
Now I really can try the differences in gpt4 and Palm2. To learn how they think I give them the same question as my students and when they make mistakes I guide them like I would guide a student. It is interesting to see that within the chat they can learn to improve themselves with guidance.
What I find interesting is that the understanding is sometimes quite different and there are also similarities. The answers and the responses to guidance are quite different from that of students. It is similar enough to give human like answers.
Can this help us understand human learning? I think it can. Comparing human learning to machine learning makes the properties of human learning more salient (1+1=3). As an example I studied economics and Mathematics and oftentimes it felt like I did three times the learning because I did not only learn mathematics and economics but I also learned the similarities and differences between the two.
The above is a different perspective on your question then my previews answer. I would appreciate feedback on whether I am on the right track here. I am very interested in the topic independent of the perspective taken on the topic. So we could also explore different perspectives.

meijer1973 5 Jun 2023 19:27 UTC
2 points
0
on: implications of NN design for education
I am in education (level about high school/AP macro economics)
possible implications:
- upskilling : faster learning through better information, more help, AI tutoring etc.
- deskilling : students let the AI do the work (the learning, writing, homework etc.)
- reskilling : develop new skillsets that are relevant to todays world
- relevance : in a world where AI does the work what is the relevance of education
The last is the most important I think. What is the place of education in todays world. What should a kid of fifteen years old learn to be prepared for what is coming? I don’t know because I don’t know what is coming.
One thing I do know. Learning from a machine is a paradox. Yes you can learn better and faster with the help of a machine. But if the machine can teach it to you, than the machine can probably do it. And why would we want to learn things that a machine can do? To learn the things a machine can not do, we need humans. But that only works if there are things a machine cannot do.
The kid of fifteen wil be 25 in ten years. Ten years is a lot. I do not know what to tell them because I do not know. Love to hear more input on this.

meijer1973 5 Jun 2023 19:11 UTC
1 point
0
in reply to: Viliam’s comment on: Optimization happens inside the mind, not in the world
Your model has some uncertainty, but you know the statistical distributions. For example, with probability 80% the world is in state X, with probability 20% it is in state Y.
Nice way of putting it.

meijer1973 5 Jun 2023 19:07 UTC
2 points
0
in reply to: Stephen Fowler’s comment on: Optimization happens inside the mind, not in the world
- Mathematical definition: Optimization is the process of finding the best possible solution to a problem, given a set of constraints.
- Practical definition: Optimization is the process of improving the performance of a system, such as by minimizing costs, maximizing profits, or improving efficiency.
In my comment I focused on the second interpretation (by focussing on iteration). The first definition does not require a perfect model of the world.
In the real world we always have limited information and compute and so the best possible solution is always an approximation. The person with the most compute and information will probably optimize faster and win.
I agree that this is a very good post and it helps me sharpen my views.

meijer1973 4 Jun 2023 10:31 UTC
3 points
0
on: Optimization happens inside the mind, not in the world
Strong world-optimization only happens if there is a robust and strong correlation between the world-model and reality.
Humans and corporations do not have perfect world models. Our knowledge of the world and therefore our world models are very limited. Still humans and corporations manage to optimize. Mostly this happens by trial and error (and copying succesful behaviors of others).
So I wonder if strong world-optimization could occur as an interative process based on an imperfect model of the world. This however assumes interaction with the world and not a “just in your head” process.
As a thought experiment I propose a corporation evading tax law. Over time corporations always manage to minimize the amount of taxes paid. But I think this is not based on a perfect world model. It is an iterative process whereby people predict, try things and learn along the way. (another example could be the scientific method, also iterative and not in your head but there is an interaction with the world).
My claim however assumes that optimization not occuring just in your head, but interaction with the real world is neccessary for optimization. So maybe I am missing the point of your argument here.

meijer1973 4 Jun 2023 10:12 UTC
1 point
0
in reply to: Super AGI’s comment on: What’s your viewpoint on the likelihood of GPT-5 being able to autonomously create, train, and implement an AI superior to GPT-5?
People are finding ways to push the boundaries of the capabilities GPT-4 and are quite succesful at that (in reasoning, agency etc). These algorithmic improvements will probably also work on gpt5.
A lot of infrastructure built for gpt4 will also work on gpt5 (like plug-ins). We do not need to build new plug-ins for gpt5, we just swap the underlying foundational model (greatly increasing the adoption of gpt5 compared to gpt4).
This also works for agency shells like autogpt. Autogpt is independant of foundational model (works with gpt3.5, gpt4 and also gpt5). By the time gpt5 is released these agency shells will be greatly improved and we just have to swap out the underlying engine to get al lot more oomph from that.
Same for memory models like vector databases.
I think the infrastructure part will be a big difference. A year from now we will have a lot of applications, use cases, experience, better prompts etc. That could make the impact and speed of deployment of gpt5 (or Gemini) a lot bigger/faster than gpt4.

meijer1973 4 Jun 2023 9:59 UTC
2 points
0
in reply to: Lukas Finnveden’s comment on: Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?
Here a summary of the Hanson position (by himself). He is very clear about humanity being replaced by AI.
https://www.overcomingbias.com/p/to-imagine-ai-imagine-no-ai

meijer1973 2 Jun 2023 12:15 UTC
1 point
0
in reply to: Andndn Dheudnd’s comment on: Full Automation is Unlikely and Unnecessary for Explosive Growth
I like your motivation, robotics can bring a lot of good. It is good to work on automating the boring and dangerous work.
I see this as a broken promise. For a long time this was the message (we will automate the boring and dangerous). But now we automate valuable jobs like STEM, journalism, art etc. These are the jobs that give meaning to life and they provide positive externalities I like to talk to different people soI meet the critical journalist, the creative artist, the passionate teacher etc.
E.g. we need a fraction of the people to be journalists so the population as a whole can boost its critical thinking. Same for STEM, art etc. (These people have positive externalities). Humanity is a superintelligence, but it needs a variety in the parts that create the whole.

meijer1973 2 Jun 2023 12:04 UTC
2 points
0
on: Full Automation is Unlikely and Unnecessary for Explosive Growth
Thanks for the post. I would like to add that I see a difference in automation speed of cognitive work and physical work. In physical work the growth of productivity is rather constant. With cognitive work there is a sudden jump from not much use cases to a lot of use cases ( like a sgmoid). And physical labour has speed limits. And also costs, generality and deployment are different.
It is very difficult to create a usefull AI for legal or programming work. But once you are over the treshold (as we are now) there are a lot of use cases and productivity growth is very fast. Robotics in car manufacturing took a long time and continued steadily. A few years ago the first real applications of legal AI emerged, and now we have a computer that can pass the bar exam. This time frame is much shorter.
The other difference is speed. A robot building a car is limited in speed. Compare this to a legal AI summarizing legal texts (1000x+ increase in speed). AI doing coginitve work is crazy fast and has the potential to become increasingly faster with more and cheaper compute.
The cost is also different. The marginal cost for robots are higher than for a legal AI. Robots will always be rather narrow and expensive (A Roomba is about as expensive as a laptop). Building one robo lawyer will be very expensive. But after that copying it is very cheap (low marginal costs). Once you are over the treshold, the cost of deployment is very low.
The generality of AI knowledge workers is somewhat of a surprise. It was thought that specialized AI’s would be better, cheaper etc. Maybe a legal AI could be a somewhat finetuned GPT-4. But this model would still be a decent programmer and accountant. A more general AI is much easier to deploy. And there might be unknown use cases for a lawyer, programmer, accountant we have not thought of yet.
Deployment speed is faster for cognitive work and this has implications for growth. When a GPT+1 is introduced all models are easily replaced by the better and faster model. When you invent a better robot to manufacture cars it will take decades before this is implemented in every factory. But the changing the the base model of your legal AI from gpt4 to gpt5 might be just a software update.
In summary there are differences for automating cognitive work with regard to:
- growth path (sigmoid instead of linear)
- speed of excecuting work
- cost (low marginal cost)
- generality (the robo lawyer, programmer, accountant)
- deployment speed (just a software update)
Are there more differences that effect speed? Am I being too bullish?

meijer1973 2 Jun 2023 9:25 UTC
1 point
0
in reply to: Vladimir_Nesov’s comment on: What’s your viewpoint on the likelihood of GPT-5 being able to autonomously create, train, and implement an AI superior to GPT-5?
Became recently aware of the progress made in synthetic data and other algorithmic improvements. We have not pushed GPT-4 to the max yet.
e.g. this paper https://arxiv.org/abs/2305.20050
It details how training on the steps in step by step reasoning as opposed to just rewarding the end result can give significant improvements. And there is so much more.

meijer1973 2 Jun 2023 9:06 UTC
6 points
5
in reply to: jkraybill’s comment on: When betting, consider non-ergodicity and absorbing states
Agreed, one of the objectives of a game is to not die during the game. This is also true for possible fatal experiments like inventing AGI. You have one or a few shots to get it right. But to win you got to stay in the game.

meijer1973 2 Jun 2023 9:00 UTC
24 points
14
on: Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?
Note that Hanson currently thinks the chances of AI doom are < 1%, while Yudkowsky thinks that they are > 99%.
It is good to note that the optimistic version of Hanson would be considered doom by many (including Yudkowsky). Doom/utopia definition Yudkowsky is not equal to doom/utopia definition of Hanson.
This is important in many discussions. Many non-doomers have definitions of utopia that many consider to be dystopian. E.g. AI will replace humans to create a very interesting future where the AI’s will conquer the stars, some think this is positive others think this is doom because there are no humans.

meijer1973

Trans­for­ma­tive AI is a pro­cess

Transformative AI is a process