Making AIs wiser seems most important in worlds where humanity stays in control of AI. It’s unclear to me what the sign of this work is if humanity doesn’t stay in control of AI.
A significant fraction of work on AI assumes that humans will somehow be able to control entities which are far smarter than we are, and maintain such control indefinitely. My favorite flippant reply to that is, “And how did that work out for Homo erectus? Surely they must have benefited enormously from all the technology invented by Homo sapiens!” Intelligence is the ultimate force multiplier.
If there’s no mathematical “secret” to alignment, and I strongly suspect there isn’t, then we’re unlikely to remain in control.
So I see four scenarios if there’s no magic trick to stay in control:
1. We’re wise enough to refrain from building anything significantly smarter than us.
2. We’re pets. (Loss of control)
3. We’re dead. (X-risk)
4. We envy the dead. (S-risk)
I do not have a lot of hope for (1) without dramatic changes in public opinion and human society. I’ve phrased (2) provocatively, but the essence is that we would lose control. (Fictional examples are dangerous, but this category would include the Culture, CelestAI, or arguably the Matrix.) Pets might be beloved or they might be abused, but they rarely get asked to participate in human decisions. And sometimes pets get spayed or euthanized based on logic they don’t understand. They might even be happier than wild animals, but they’re not in control of their own fate.
Even if we could control AI indefinitely (and I don’t think we can), there is literally no human organization or institution I would trust with that power. Not governments, not committees, and certainly not a democratic vote.
So if we must regrettably build AI, and lose all control over the future, then I do think it matters that the AI has a decent moral and philosophical system. What kind of entity would you trust with vast, unaccountable, inescapable power? If we’re likely to wind up as pets of our own creations, then we should definitely try to create kind, ethical and what you call “unfussy” pet owners, and ones that respect real consent.
Or to use a human analogy, try to raise the sort of children you’d want to pick your nursing home. So I do think the philosophical and moral questions matter even if humans lose control.
If I had to summarize your argument, it would be something like, “Many people’s highest moral good involves making their ideological enemies suffer.” This is indeed a thing that happens, historically.
But another huge share of the damage is caused by people who believe things like “the ends justify the means” or “you can’t make an omelette without breaking a few eggs.” Or “We only need 1 million surviving Afghans [out of 15 million] to build a paradise for the proletariat,” to paraphrase an alleged historical statement I read once. The people who say things like this cause immediate, concrete harm. They attempt to justify this harm as being outweighed by the expected future value of their actions. But that expected future value is often theoretical, and based on dubious models of the world.
I do suspect that a significant portion of the suffering in the world is created by people who think like this. Combine them with the people you describe whose conception of “the good” actually involves many people suffering (and with people who don’t really care about acting morally at all), and I think you account for much of the human-caused suffering in the world.
One good piece of advice I heard from someone in the rationalist community was something like, “When you describe your proposed course of action, do you sound like a monologuing villain from a children’s TV show, someone who can only be defeated by the powers of friendship and heroic teamwork? If so, you would be wise to step back and reconsider the process by which you arrived at your plans.”