lorepieri (Lorenzo Pieri) | Knowledge Seeker | https://lorenzopieri.com/
Possibly the first truly AGI paper.
Even though it is just exploiting the fact that all the narrow problems can be solved as sequence problems via tokenisation, it’s remarkable that the tasks do not interfere destructively with each other. My gut feeling is that this is due to the very high-dimensional space of the embedding vectors.
It leaves ample room to grow.
The fact that adding new tasks doesn’t diminish performance on previous tasks is highly non-trivial!
It may be that there is a lot of room in the embedding space to store them. The wild thing is that nothing (apart from a few hardware iterations) stops us from increasing the embedding space if really needed.
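A quick numerical sanity check of that intuition (my own sketch, not from the paper): random vectors in a high-dimensional embedding space are nearly orthogonal, so many distinct tasks or skills can be stored with very little overlap.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 1024       # embedding dimension (illustrative)
n_vecs = 2000    # number of task/skill vectors to store

# Sample random unit vectors in the embedding space.
vecs = rng.standard_normal((n_vecs, dim))
vecs /= np.linalg.norm(vecs, axis=1, keepdims=True)

# Pairwise cosine similarities concentrate around 0 (std ~ 1/sqrt(dim)),
# so thousands of vectors coexist while staying nearly orthogonal.
sims = vecs @ vecs.T
np.fill_diagonal(sims, 0.0)
print("max |cosine| among", n_vecs, "vectors in", dim, "dims:",
      round(float(np.abs(sims).max()), 3))
```

Doubling the dimension shrinks the typical overlap by a factor of about sqrt(2), which is why growing the embedding space buys more "room" so cheaply.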
I’m working along these lines to create an easy-to-understand numeric evaluation scale for AGIs. The dream would be something like: “Gato is AGI level 3.5, while the average human is 8.7.” I believe the scale should factor in that no single static test can be a reliable test of intelligence (any test can be gamed and overfitted).
A good reference on the subject is “The Measure of All Minds” by Orallo.
Happy to share a draft, send me a DM if interested.
I would agree with “proto-AGI”. I might soon write a blog post on this, but ideally we could define a continuous value to track how close we are to AGI, which increases if:
-the tasks to solve are very different from each other
-the tasks are complex
-the tasks are solved well
-little experience (or info) is fed to the system
-the experience is not directly related to the task
-the experience is very raw
-the computation is done in few steps
The score would then be updated as new tasks are added and the environment changes.
This would not be a conclusive test, but definitely a cool one, and it may spark a lot of research. Perhaps we could get started with something NLP-based, opening up more and more knowledge access to the AI in the form of training data. Probably still not feasible as of 2022 in terms of raw compute required.
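As a toy illustration of such a continuous value (entirely my own sketch; the factor names, weights, and 0–10 scaling are hypothetical, not an established metric), the criteria above could be combined into a single per-system score:

```python
# Hypothetical "generality score": each task is rated on the criteria
# from the list above, each normalised to [0, 1].
KEYS = ["diversity", "complexity", "performance",
        "data_efficiency", "transfer", "rawness", "compute_efficiency"]

def generality_score(tasks):
    """tasks: list of dicts mapping each key in KEYS to a value in [0, 1].
    A geometric mean per task rewards doing well on *all* axes at once,
    so narrow-but-strong systems score low."""
    per_task = []
    for t in tasks:
        prod = 1.0
        for k in KEYS:
            prod *= max(t[k], 1e-9)  # avoid zeroing the whole product
        per_task.append(prod ** (1 / len(KEYS)))
    return 10 * sum(per_task) / len(per_task)  # scale to a 0-10 range

# A narrow system: superhuman on one task, poor on diversity and transfer.
narrow = [{"diversity": 0.1, "complexity": 0.9, "performance": 0.95,
           "data_efficiency": 0.2, "transfer": 0.1, "rawness": 0.8,
           "compute_efficiency": 0.7}]
print(round(generality_score(narrow), 2))
```

The geometric mean is one deliberate design choice here: unlike an arithmetic mean, it cannot be gamed by maxing out a single easy axis.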
The newly-created AGI will immediately kill everyone on the planet, and proceed to the destruction of the universe. Its sphere of destruction will expand at light speed, eventually encompassing everything reachable.
Why?
In fact, if not consensus, then at least the majority opinion amongst those mathematicians, computer scientists, and AI researchers who have given the subject more than a few days thought.
Is this true, or have you only asked inside an AI-pessimistic bubble?
And if true, why should opinions matter at all? Opinions cannot influence a reality that is outside human control.
Overall I don’t see a clear argument for why we should be worried about AGI. Quite the contrary: building AGI is still an active area of research with no clear solution.
Yea, what I meant is that the slides of Full Stack Deep Learning course materials provide a decent outline of all of the significant architectures worth learning.
I would personally not go to that low level of abstraction (e.g. implementing NNs in a new language) unless you really feel your understanding is shaky. Try building an actual side project, e.g. an object classifier for cars, and problems will arise naturally.
If by “sort of general, flexible learning ability that would let them tackle entirely new domains” we include adding new tokenised vectors to the training set, then this fits the definition. Of course this is “cheating”, since the system is not learning purely by itself, but for the purpose of building a product or getting tasks done this does not really matter.
And it’s not inconceivable to imagine self-supervised token generation to acquire more skills, and perhaps a k-means algorithm to make sure that the new embeddings do not interfere with previous knowledge. It’s a dumb way of getting smarter, but apparently it works thanks to scale effects!
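A minimal sketch of that interference check (my own toy construction, assuming skill embeddings are plain numpy vectors): cluster the existing embeddings with a tiny k-means, then gate each candidate embedding on its distance from every cluster centre.

```python
import numpy as np

rng = np.random.default_rng(1)

def kmeans(X, k, iters=20):
    """Tiny Lloyd's-algorithm k-means (illustrative, not production-grade)."""
    centroids = X[rng.choice(len(X), k, replace=False)]
    for _ in range(iters):
        # Assign each point to its nearest centroid, then recompute means.
        labels = np.argmin(((X[:, None] - centroids[None]) ** 2).sum(-1), axis=1)
        for j in range(k):
            pts = X[labels == j]
            if len(pts):
                centroids[j] = pts.mean(axis=0)
    return centroids

# Existing skill embeddings, clustered into "knowledge regions".
old = rng.standard_normal((500, 64))
centroids = kmeans(old, k=8)

def accepts(new_vec, centroids, min_dist=2.0):
    """Hypothetical gate: accept a new embedding only if it keeps some
    distance from every existing cluster centre (low interference)."""
    d = np.linalg.norm(centroids - new_vec, axis=1)
    return d.min() > min_dist

new = rng.standard_normal(64)
print("accept new embedding:", bool(accepts(new, centroids)))
```

The `min_dist` threshold is arbitrary here; a real system would need a principled way to pick it, which is exactly where this dumb scheme gets non-trivial.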
Matthew, Tamay: Refreshing post, with actual hard data and benchmarks. Thanks for that.
My predictions:
A model/ensemble of models achieves >80% on all tasks in the MMLU benchmark
No in 2026, no in 2030. Mainly because we don’t have much structured data, or incentives, to solve some of the categories. A powerful unsupervised AI, or more time, would be needed to clear those categories.
A credible estimate reveals that an AI lab deployed EITHER >10^30 FLOPs OR hardware that would cost $1bn if purchased through competitive cloud computing vendors at the time on a training run to develop a single ML model (excluding autonomous driving efforts)
This may actually happen (the $1bn one, not the 10^30), partly due to inflation and USD created out of thin air and injected into the market. I would go for no in 2026 and yes in 2030.
A model/ensemble of models will achieve >90% on the MATH dataset using a no-calculator rule
No in 2026, no in 2030. Significant algorithmic improvements needed. It may be done if prompt engineering is allowed.
A model/ensemble of models achieves >80% top-1 strict accuracy on competition-level problems on the APPS benchmark
No in 2026, no in 2030. Similar to the above, but there will be more progress, as a lot of data is available.
A gold medal for the IMO Grand Challenge (conditional on it being clear that the questions were not in the training set)
No in 2026, no in 2030.
A robot that can, from beginning to end, reliably wash dishes, take them out of an ordinary dishwasher and stack them into a cabinet, without breaking any dishes, and at a comparable speed to humans (<120% the average time)
I work with smart robots; this cannot happen so fast, partly due to hardware limitations. The speed requirement is particularly harsh. Without the speed limit, and with the system known in advance, I would say yes in 2030. As the bet stands, I go for no in 2026, no in 2030.
Tesla’s full-self-driving capability makes fewer than one major mistake per 100,000 miles
Not sure about this one, but I lean towards no in 2026, no in 2030.
Fair analysis, I agree with the conclusions. The main contribution seems to be a proof that transformers can handle many tasks at the same time.
Not sure if you sorted the tests in order of relevance, but I also consider the “held-out” test the most revealing. Besides fine-tuning, it would be interesting to test the zero-shot capabilities.
Apparently many records have been tainted by cheating:
Let’s start with one of those insights that are as obvious as they are easy to forget: if you want to master something, you should study the highest achievements of your field.
Even if we assume this, it does not follow that we should try to recreate the subjective conditions that led to (perceived) “success”. The environment is always changing (tech, knowledge base, tools), so many lessons will not apply. Moreover, biographies tend to create a narrative after the fact, emphasising the message the writer wants to convey.
I prefer the strategy of mastering the basics from previous work and then figuring out for yourself how to innovate and improve the state of the art.
ARC is a nice attempt. I also participated in the original challenge on Kaggle. The issue is that the test can be gamed (as everyone on Kaggle did) by brute-forcing over solution strategies.
An open-ended or interactive version of ARC may solve this issue.
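For concreteness, the gaming strategy looks roughly like this (a hypothetical miniature of the DSL-search solvers used on Kaggle; the primitive set is made up): enumerate short compositions of grid transforms and keep the first program that reproduces every training pair.

```python
from itertools import product

import numpy as np

# A tiny, made-up DSL of grid transformations.
PRIMITIVES = {
    "identity": lambda g: g,
    "flip_h": lambda g: np.fliplr(g),
    "flip_v": lambda g: np.flipud(g),
    "rot90": lambda g: np.rot90(g),
    "transpose": lambda g: g.T,
}

def solves(program, pairs):
    """Check whether a composition of primitives maps every input to its output."""
    for inp, out in pairs:
        g = inp
        for name in program:
            g = PRIMITIVES[name](g)
        if g.shape != out.shape or not np.array_equal(g, out):
            return False
    return True

def search(pairs, max_len=3):
    """Brute-force: return the first program (tuple of names) fitting all pairs."""
    for length in range(1, max_len + 1):
        for program in product(PRIMITIVES, repeat=length):
            if solves(program, pairs):
                return program
    return None

# Toy "task": the hidden rule is a horizontal flip.
train = [(np.array([[1, 2], [3, 4]]), np.array([[2, 1], [4, 3]]))]
print(search(train))  # the search recovers the horizontal flip
```

With a rich enough primitive set, this finds *a* fitting program for many tasks without any abstraction or reasoning, which is exactly why a static ARC can be gamed.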
I have always been cautious, but I would say yes this time.
With the caveat that it learns new tasks only from supervised data, not by reusing previous experience.
Geniuses or talented researchers are not as impactful as the right policy. Contribute by creating the right conditions (work environment, education, cross-pollination, funding, etc.) to make good research flourish. At the same time, if fundamentals are not covered (healthcare, housing, etc.), people are not able to focus on much more than survival. So pretty much anything that makes the whole system work better helps.
As an example, there are plenty of smart individuals in poor countries who are not able to express their potential.
Thanks for the link, I will check it out.
A close call, but I would still lean towards no. Engineering the prompt is where humans leverage all their common sense and vast (w.r.t. the AI) knowledge.
The downvotes are excessive; the post is provocative, but interesting.
I think you will not even need to “push the fat man”. The development of AGI will be slow and gradual (like any other major technology) and there will be incidents along the way (e.g. an AGI chatbot harassing someone). Those incidents will periodically mandate new regulations, so that measures to tackle real AGI-related dangers will be enacted, similarly to what happens in the nuclear energy sector. They will not be perfect, but there will be regulations.
The tricky part is that not all nations will set similar safety levels; in fact, some may encourage the development of unsafe but high-reward AGI. So overall it looks like “pushing the fat man” will not even work that well.
This is a possible AGI scenario, but it’s not clear why it should be particularly likely. For instance, the AGI may reason that going aggressive would also be the fastest route to being terminated. Or the AGI may consider that keeping humans alive is good, since they were responsible for the AGI’s creation in the first place.
What you describe is the paper-clip maximiser scenario, which is arguably the most extreme end of the spectrum of super-AGI behaviours.
Unpopular opinion (on this site, I guess): AI alignment is not a well-defined problem, and there is no clear-cut resolution to it. It will be an incremental process, similar to cybersecurity research.
About the money, I would do the opposite: select researchers who would do it for free, just pay their living expenses, and give them arbitrary resources.