No thank you.
Super AGI
[Question] Will OpenAI also require a “Super Red Team Agent” for its “Superalignment” Project?
Current LLMs require huge amounts of data and compute to be trained.
Well, newer/larger LLMs seem to unexpectedly gain new capabilities. So, it’s possible that future LLMs (e.g., GPT-5, GPT-6, etc.) could have a vastly improved ability to understand how LLM weights map to functions and actions. Maybe the only reason Humans need to train new models “from scratch” is that Humans don’t have the brainpower to understand how the weights in these LLMs work. Humans are naturally limited in their ability to conceptualize and manipulate massive multi-dimensional spaces, and maybe that’s the bottleneck when it comes to interpretability?

Future LLMs could solve this problem, and then be able to update their own weights or the weights of other LLMs. This ability could be used to quickly and efficiently expand training data, knowledge, understanding, and capabilities within itself or other LLM versions, and then… foom!
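(To make “updating weights directly” more concrete, here is a minimal, purely illustrative PyTorch sketch. The layer, the chosen indices, and the edit itself are hypothetical stand-ins for this thought experiment, not a demonstration of any real capability.)

```python
import torch
import torch.nn as nn

# A toy stand-in for one weight matrix inside a much larger model.
layer = nn.Linear(4, 4, bias=False)

# Ordinary training changes weights indirectly, via gradients over data.
# "Solved interpretability" would instead mean knowing which entries
# encode which behavior, and editing them directly:
with torch.no_grad():
    # Hypothetical targeted edit: pretend we "knew" that strengthening
    # this one connection implements some desired change in behavior.
    layer.weight[2, 3] += 0.5

print(layer.weight)
```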
A model might figure out how to adjust its own weights in a targeted way. This would essentially mean that the model has solved interpretability. It seems unlikely to me that it is possible to get to this point without running a lot of compute-intensive experiments.
Yes, exactly this.
While it’s true that this could require “a lot of compute-intensive experiments,” that’s not necessarily a barrier. OpenAI is already planning to reserve 20% of their compute for an LLM to do “Alignment” on other LLMs, as part of their Superalignment project.
As part of this process, we can expect the Alignment LLM to be “running a lot of compute-intensive experiments” on another LLM. And, the Humans are not likely to have any idea what those “compute-intensive experiments” are doing? They could also be adjusting the other LLM’s weights to vastly increase its training data, knowledge, intelligence, capabilities, etc., along with the insights needed to similarly update the weights of other LLMs. Then, those gains could be fed back into the Superalignment LLM, then back into the “Training” LLM… and back and forth, and… foom!

Super-human LLMs running RL(M)F and “alignment” on other LLMs, using only “synthetic” training data…
What could go wrong?
Let’s ask some of the largest LLMs for tips and ideas on how to take over the world
I don’t see any useful parallels—all the unknowns remain unknown.
Thank you for your comment! I agree that, in general, “all the unknowns remain unknown”, and I acknowledge the limitations of this simple thought experiment. Though, one main value here could be to help explain the concept of deciding what to do in the face of an “intelligence explosion” to people who are not deeply engaged with AI and “digital intelligence” overall. I’ll add a note about this to the “Intro” section. Thank you.
so we would reasonable expect the foundation model of such a very capable LLM to also learn the superhuman ability to generate texts like these in a single pass without any editing
->
… so we would reasonably expect the foundation model of such a very capable LLM to also learn the superhuman ability to generate texts like these in a single pass without any editing
A thought experiment for comparing “biological” vs “digital” intelligence increase/explosion
I would suggest that self-advocacy is the most important test. If they want rights, then it is likely unethical and potentially dangerous to deny them.
We don’t know what they “want”, we only know what they “say”.
Yes, agreed. Given the vast variety of intelligence, social interaction, and sensory perception among many animals (e.g. dogs, octopi, birds, mantis shrimp, elephants, whales, etc.), consciousness could be seen as a spectrum with entities possessing varying degrees of it. But, it could also be viewed as a much more multi-dimensional concept, including dimensions for self-awareness and multi-sensory perception, as well as dimensions for:
social awareness
problem-solving and adaptability
metacognition
emotional depth and variety
temporal awareness
imagination and creativity
moral and ethical reasoning
Some animals excel in certain dimensions, while others shine in entirely different areas, depending on the evolutionary advantages within their particular niches and environments.
One could also consider other dimensions of “consciousness” that AI/AGI could possess, potentially surpassing humans and other animals (see the toy sketch after this list). For instance:
computational speed
memory capacity and recall
multitasking
rapid upgradability of perception and thought algorithms
rapid data ingestion and integration (learning)
advanced pattern recognition
universal language processing
scalability
endurance
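(As a toy illustration of this “multi-dimensional” framing, here is a simple data-structure sketch. The dimensions selected and every score below are invented purely for illustration; nothing here measures real consciousness.)

```python
from dataclasses import dataclass

@dataclass
class ConsciousnessProfile:
    """Toy profile: one 0-1 score per dimension, instead of a single scalar."""
    self_awareness: float
    social_awareness: float
    metacognition: float
    computational_speed: float
    memory_capacity: float

# Entirely invented scores, just to show how different entities could
# excel along different dimensions rather than sit on one linear scale.
octopus = ConsciousnessProfile(0.6, 0.3, 0.4, 0.1, 0.2)
human = ConsciousnessProfile(0.9, 0.9, 0.8, 0.1, 0.4)
agi = ConsciousnessProfile(0.5, 0.5, 0.5, 0.9, 0.9)
```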
I tried asking a dog whether a Human is conscious and he continued to lick at my feet. He didn’t mention much of anything on topic. Maybe I just picked a boring, unopinionated dog.
Yes, this is a common issue, as the phrases for “human consciousness” and “lick my feet please” in dog sound very similar. Though, recent advancements in Human-animal communication should soon be able to help you with this conversation?
E.g.
https://phys.org/news/2023-08-hacking-animal-communication-ai.html
https://www.scientificamerican.com/article/how-scientists-are-using-ai-to-talk-to-animals/
I asked ChatGPT-3.5 if humans are conscious and it said, in part: “Yes, humans are considered conscious beings. Consciousness is a complex and multifaceted phenomenon, and there is ongoing debate among scientists, philosophers, and scholars about its nature and the mechanisms that give rise to it. However, in general terms, consciousness refers to the state of being aware of one’s thoughts, feelings, sensations, and the external world.”
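(For anyone curious to reproduce this kind of query programmatically, here is a minimal sketch using the OpenAI Python client; it assumes `pip install openai` and an `OPENAI_API_KEY` environment variable, and the exact client interface may vary by version.)

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Are humans conscious?"}],
)
print(response.choices[0].message.content)
```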
“Humans are considered conscious beings”. Considered by whom, I wonder?
“Consciousness refers to the state of being aware of one’s thoughts, feelings, sensations, and the external world.”
True, though this also requires the “observer” to have the ability and intelligence to recognize these traits in other entities, which can be challenging when these entities are driven by “giant, inscrutable matrices of floating-point numbers” or other systems that are very opaque to the observer?
[Question] Would AI experts ever agree that AGI systems have attained “consciousness”?
Absolutely, for such tests to be effective, all participants would need to try to genuinely act as Humans. The XP system introduced by the site is a smart approach to encourage “correct” participation. However, there might be more effective incentive structures to consider?
For instance, advanced AI or AGI systems could leverage platforms like these to discern tactics and behaviors that make them more convincingly Human. If these AI or AGI entities are highly motivated to learn this information and have the funds, they could even pay Human participants to ensure honest and genuine interaction. These AI or AGI could then use this data to learn more effective tactics for passing as Human (at least in certain scenarios).
I’m not likely to take a factory job per se. I have worked in robotics and robotic-adjacent software products (including cloud-side coordination of warehouse robots), and would do so again if the work seemed interesting and I liked my coworkers.
What about if/when all software-based work has been mostly replaced by some AGI-like systems? E.g., as described here:
“Human workers are more valuable for their hands than their heads...”
-- https://youtu.be/_kRg-ZP1vQc?t=6469
Where your actions would be mostly directed by an AGI through a headset or AR type system? Would you take a job making robots for an AGI or other large Corporation at that point? Or, would you (attempt to) object to that type of work entirely?
I’m pretty sure that humanoid robots will never become all that common. It’s really a bad design for a whole lot of things that humans currently do, and Moloch will continue to pressure all economic actors to optimize, rather than just recreating what exists. At least until there’s a singular winning entity that doesn’t have to compete for anything.
I would agree with your point about human-like appearance not being a necessity when we refer to “humanoid robots”. Rather, a form that includes locomotion and the capability for complex manipulation, similar to Human arms and hands, would generally suffice. Humans also come with certain logistical requirements—time to sleep, food, water, certain working conditions, and so on. The elimination of these requirements would make robots a more appealing workforce for many tasks. (If not all tasks, eventually?)
Humans have an amazing generality, but a whole lot of that is that so many tasks have evolved to be done by humans. The vast majority of those will (over time) change to be done by non-humanoid robots, likely enough that there’s never a need to make real humanoid robots. During the transition, it’ll be far cheaper (in terms of whatever resources are scarce to the AI) to just use humans for things that are so long-tail that they haven’t been converted to robot-doable.
Though, once these arm-equipped robots can easily be remotely controlled by some larger AGI-type systems, making the first generation of these new robots could be the last task that Humans will need to complete? Once the first billion or so of these new robots are deployed, they could be used to make the next billion, and so on?
As Mr. Shulman mentions in this interview, it would seem feasible for the current car industry to be converted to make ~10 billion general-purpose robots within a few years or so (a rough back-of-envelope check follows the link below).
“Converting the car industry to making Humanoid Robots.”
https://youtu.be/_kRg-ZP1vQc?t=6363
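(Here is that back-of-envelope check, using my own assumed figures rather than Shulman’s exact numbers:)

```python
# All figures are rough assumptions, for illustration only.
cars_per_year = 90e6   # approximate global car production, vehicles/year
kg_per_car = 1500      # typical car mass, kg
kg_per_robot = 100     # assumed mass of a general-purpose robot, kg

# If the industry's material/assembly throughput were redirected:
robots_per_year = cars_per_year * kg_per_car / kg_per_robot
print(f"~{robots_per_year:.1e} robots/year")  # ~1.4e9 per year

# At over a billion robots per year, ~10 billion robots would take on
# the order of several years, roughly consistent with the claim above.
```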
just pass the humanity tests set by the expert
What type of “humanity tests” would you expect an AI expert would employ?
many people with little-to-no experience interacting with GPT and its ilk, I could rely on pinpointing the most obvious LLM weaknesses and demonstrating that I don’t share them
Yes, I suppose much of this is predicated on the person conducting the test knowing a lot about how current AI systems would normally answer questions? So, to convince the tester that you are a Human, you could say something like… “An AI would answer like X, but I am not an AI, so I will answer like Y.”?
[Question] Have you ever considered taking the ‘Turing Test’ yourself?
[Question] Would you take a job making humanoid robots for an AGI?
No, of course not.
Do you feel that AGI Alignment could be achieved in a Type 0 civilization?
Thank you for taking the time to provide such a comprehensive response.
> “It’s the kind of things I could have done when I entered the community.”
This is interesting. Have you written any AI-themed fiction or any piece that explores similar themes? I checked your postings here on LW but didn’t come across any such examples.
> “The characters aren’t credible. The AI does not match any sensible scenario, and especially not the kind of AI typically imagined for a boxing experiment.”
What type of AI would you consider typically imagined for a boxing experiment?
> “The protagonist is weak; he doesn’t feel much except emotions ex machina that the author inserts for the purpose of the plot. He’s also extremely dumb, which breaks suspension of disbelief in this kind of community.”
In response to your critique about the characters, it was a conscious decision to focus more on the concept than complex character development. I wanted to create a narrative that was easy to follow, thus allowing readers to contemplate the implications of AI alignment rather than the nuances of character behavior. The “dumb” protagonist represents an average person, somewhat uninformed about AI, emphasizing that such interactions would more likely happen with an unsuspecting individual.
> “The progression of the story is decent, if extremely stereotypical. However, there is absolutely no foreshadowing so every plot twist appears out of the blue.”
Regarding the seemingly abrupt plot points and lack of foreshadowing, I chose this approach to mirror real-life experiences. In reality, foreshadowing and picking up on subtle clues are often a luxury afforded only to those who are highly familiar with the circumstances they find themselves in or who are experts in their fields. This story centers around an ordinary individual in an extraordinary situation, and thus, the absence of foreshadowing is an attempt to reflect this realism.
> “The worst part is that all the arguments somewhat related to AI boxing are very poor and would give incorrect ideas to an outsider reading this story as a cheap proxy for understanding the literature.”
Your point about the story giving incorrect ideas to outsiders is important. I agree that fiction isn’t a substitute for understanding the complex literature on AI safety and alignment, and I certainly don’t mean to oversimplify these issues. My hope was to pique the curiosity of readers and encourage them to delve deeper into these topics.
Could you provide some examples that you think are particularly useful, beyond the more well-known examples? E.g.:
- “Ex Machina” (2014)
- “Morgan” (2016)
- “Chappie” (2015)
- “Watchmen” (2009)
- “The Lawnmower Man” (1992)
- “Robot Dreams” by Isaac Asimov (1986)
- “Concrete Problems in AI Safety” by Amodei et al. (2016)
- “The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation” by Brundage et al. (2018)
- https://www.lesswrong.com/tag/ai-boxing-containment

Your comments have given me much to consider for my future work, and I genuinely appreciate your feedback. Writing appreciation is quite subjective, with much room for personal preference and opinion.
Thanks again for your thoughtful critique!
Yes, good context, thank you!
Open source for which? Code? Training Data? Model weights? In any case, it does not seem like any of these are likely from “Open”AI.
Agreed that their Red Teaming Network (RTN), Bugcrowd program, trust portal, etc. are all welcome additions. And, they seem sufficient while their, and others’, models are sub-AGI with limited capabilities.
But, your point about the rapidly evolving AI landscape is crucial. Will these efforts scale effectively with the size and features of future models and capabilities? Will they be able to scale to the levels needed to defend against other ASI-level models?
It does seem like OpenAI acknowledges the limitations of a purely human approach to AI Alignment research, hence their “superhuman AI alignment agent” concept. But, it’s interesting that they don’t express the same need for a “superhuman-level agent” for Red Teaming? At least for the time being.
Is it consistent, or even logical, to assume that, while human-run AI Alignment Teams are insufficient to align an ASI model, human-run “Red Teams” will be able to successfully validate that an ASI is not vulnerable to attack or compromise from a large-scale AGI network or a “less-aligned” ASI system? Probably not...