OneManyNone

Karma: 121

I am a Computer Science PhD who has worked in Machine Learning at both Amazon and Google Brain.

I have a blog at https://onemanynone.substack.com/ where I publish posts aimed at a broader and less technical audience.

Inference Speed is Not Unbounded

OneManyNone8 May 2023 16:24 UTC

35 points

31 comments16 min readLW link

We Shouldn’t Expect AI to Ever be Fully Rational

OneManyNone18 May 2023 17:09 UTC

19 points

31 comments6 min readLW link

Proposal: Align Systems Earlier In Training

OneManyNone16 May 2023 16:24 UTC

18 points

0 comments11 min readLW link

Proposal: Tune LLMs to Use Calibrated Language

OneManyNone7 Jun 2023 21:05 UTC

9 points

0 comments5 min readLW link

Why I Believe LLMs Do Not Have Human-like Emotions

OneManyNone22 May 2023 15:46 UTC

8 points

6 comments7 min readLW link

OneManyNone 8 May 2023 18:21 UTC
7 points
2
in reply to: Max H’s comment on: Inference Speed is Not Unbounded
Yes, I wasn’t sure if it was wise to use TSP as an example for that reason. Originally I wrote it using the Hamiltonian Path problem, but thought a non-technical reader would be more able to quickly understand TSP. Maybe that was a mistake. It also seems I may have underestimated how technical my audience would be.

But your point about heuristics is right. That’s basically what I think an AGI based on LLMs would do to figure out the world. However, I doubt there would be one heuristic which could do Solomonoff induction in all scenarios, or even most. Which means you’d have to select the right one, which means you’d need a selection criteria, which takes us back to my original points.

OneManyNone 26 Apr 2023 16:04 UTC
6 points
1
in reply to: DirectedEvolution’s comment on: On AutoGPT
I think there’s definitely some truth to this sometimes, but I don’t think you’ve correctly described the main driver of genius. I actually think it’s the opposite: my guess is that there’s a limit to thinking speed, and genius exists precisely because some people just have better thoughts. Even Von Neumann himself attributed much of his abilities to intuition. He would go to sleep and in the morning he would have the answer to whatever problem he was toiling over.
I think, instead, that ideas for the most part emerge through some deep and incomprehensible heuristics in our brains. Think about a chess master recognizing the next move at just a glance. However much training it took to give him that ability, he is not doing a tree search at that moment. It’s not hard to imagine a hypothetical where his brain, with no training, came pre-configured to make the same decisions, and indeed I think that’s more or less what happens with Chess prodigies. They don’t come preconfigured, but their brains are better primed to develop those intuitions.
In other words, I think that genius is making better connections with the same number of “cycles”, and I think there’s evidence that LLMs do this too as they advance. For instance, part of the significance of DeepMind’s Chinchilla paper was that by training longer they were able to get better performance in a smaller network. The only explanation for this is that the quality of the processing had improved enough to counteract the effects of the lost quantity.

OneManyNone 19 May 2023 0:29 UTC
5 points
0
in reply to: the gears to ascension’s comment on: We Shouldn’t Expect Superintelligent AI to be Fully Rational
I see, but I’m still not convinced. Humans behave in anger as a way to forcibly change a situation into one that is favorable to itself. I don’t believe that’s what the AI was doing, or trying to do.
I feel like there’s a thin line I’m trying to walk here, and I’m not doing a very good job. I’m not trying to comment on whether or not the AI has any sort of subjective experience. I’m just saying that even if it did, I do not believe it would bare any resemblance to what we as humans experience as anger.

OneManyNone 12 May 2023 15:38 UTC
5 points
1
in reply to: M. Y. Zuo’s comment on: Inference Speed is Not Unbounded
I think you’re broadly right, but I think it’s worth mentioning that DNA is a probabilistic compression (evidence: differences in identical twins), so it gets weird when you talk about compressing an adult at age 25 - what is probabilistic compression at that point?

But I think you’ve mostly convinced me. Whatever it takes to “encode” a human, it’s possible to compress it to be something very small.

OneManyNone 7 Jun 2023 22:18 UTC
4 points
3
in reply to: O O’s comment on: Algorithmic Improvement Is Probably Faster Than Scaling Now
Fair enough. But for the purposes of this post, the point is that capability increased without increased compute. If you prefer, bucket it as “compute” vs “non-compute” instead of “compute” vs “algorithmic”.

OneManyNone 7 Jun 2023 21:26 UTC
4 points
4
in reply to: O O’s comment on: Algorithmic Improvement Is Probably Faster Than Scaling Now
I think whether or not it’s trivial isn’t the point: they did it, it worked, and they didn’t need to increase the compute to make it happen.

OneManyNone 8 May 2023 20:08 UTC
4 points
2
in reply to: rotatingpaguro’s comment on: Inference Speed is Not Unbounded
Yeah, I agree that it’s a surprising fact requiring a bit of updating on my end. But I think the compression point probably matters more than you would think, and I’m finding myself more convinced the more I think about it. A lot of processing goes into turning that 1GB into a brain, and that processing may not be highly reducible. That’s sort of what I was getting at, and I’m not totally sure the complexity of that process wouldn’t add up to a lot more than 1GB.

It’s tempting to think of DNA as sufficiently encoding a human, but (speculatively) it may make more sense to think of DNA only as the input to a very large function which outputs a human. It seems strange, but it’s not like anyone’s ever built a human (or any other organism) in a lab from DNA alone; it’s definitely possible that there’s a huge amount of information stored in the processes of a living human which isn’t sufficiently encoded just by DNA.

You don’t even have to zoom out to things like organs or the brain. Just knowing which bases match to which amino acids is an (admittedly simple) example of processing that exists outside of the DNA encoding itself.

OneManyNone 19 May 2023 0:34 UTC
3 points
0
in reply to: the gears to ascension’s comment on: We Shouldn’t Expect Superintelligent AI to be Fully Rational
Hmmm… I think I still disagree, but I’ll need to process what you’re saying and try to get more into the heart of my disagreement. I’ll respond when I’ve thought it over.
Thank you for the interesting debate. I hope you did not perceive as me being overly combative.

OneManyNone 19 May 2023 0:24 UTC
3 points
0
in reply to: the gears to ascension’s comment on: We Shouldn’t Expect Superintelligent AI to be Fully Rational
Ah okay. My apologies for misunderstanding.

OneManyNone 19 May 2023 0:05 UTC
3 points
0
in reply to: the gears to ascension’s comment on: We Shouldn’t Expect Superintelligent AI to be Fully Rational
I would argue that “models generated by RL-first approaches” are not more likely to be the primary threat to humanity, because those models are unlikely to yield AGI any time soon. I personally believe this is a fundamental fact about RL-first approaches, but even if it wasn’t it’s still less likely because LLMs are what everyone is investing in right now and it seems plausible that LLMs could achieve AGI.
Also, by what mechanism would Bing’s AI actually be experiencing anger? The emotion of anger in humans is generally associated with a strong negative reward signal. The behaviors that Bing exhibited were not brought on by any associated negative reward, it was just contextual text completion.

OneManyNone 18 May 2023 18:07 UTC
3 points
2
in reply to: Vladimir_Nesov’s comment on: We Shouldn’t Expect Superintelligent AI to be Fully Rational
I was aware of that, and maybe my statement was too strong, but fundamentally I don’t know if I agree that you can just claim that it’s rational even though it doesn’t produce rational outputs.
Rationality is the process of getting to the outputs. What I was trying to talk about wasn’t scholarly disposition or non-eccentricity, but the actual process of deciding goals.
Maybe another way to say it is this: LLMs are capable of being rational, but they are also capable of being extremely irrational, in the sense that, to quote EY, their behavior is not a form of “systematically promot[ing] map-territory correspondences or goal achievement.” There is nothing about the pre-training that directly promotes this type of behavior, and any example of this behavior in fundamentally incidental.

OneManyNone 12 May 2023 21:13 UTC
3 points
1
in reply to: M. Y. Zuo’s comment on: Inference Speed is Not Unbounded
To that first sentence, I don’t want to get lost in semantics here. My specific statement is that the process that takes DNA into a human is probabilistic with respect to the DNA sequence alone. Add in all that other stuff, and maybe at some point it becomes deterministic, but at that point you are no longer discussing the <1GB that makes DNA. If you wanted to be truly deterministic, especially up to the age of 25, I seriously doubt it could be done in less than millions of petabytes, because there are such a huge number of miniscule variations in conditions and I suspect human development is a highly chaotic process.
As you said, though, we’re at the point of minor nitpicks here. It doesn’t have to be a deterministic encoding for your broader points to stand.

OneManyNone 26 Apr 2023 16:14 UTC
3 points
2
in reply to: David Scott Krueger (formerly: capybaralet)’s comment on: On AutoGPT
In the context of his argument I think the claim is reasonable, since I interpreted it as the claim that, since it can be used a tool that designs plans, it has already overcome the biggest challenge of being an agent.
But if we take that claim out of context and interpret it literally, then I agree that it’s not a justified statement per se. It may be able to simulate a plausible causal explanation, but I think that is very different from actually knowing it. As long as you only have access to partial information, there are theoretical limits to what you can know about the world. But it’s hard to think of contexts where that gap would matter a lot.

OneManyNone 8 May 2023 17:20 UTC
2 points
1
in reply to: rotatingpaguro’s comment on: Inference Speed is Not Unbounded
You’re right that my points lack a certain rigor. I don’t think there is a rigorous answer to questions like “what does slow mean?”.

However, there is a recurring theme I’ve seen in discussions about AI where people express incredulity about neural networks as a method for AGI since they require so much “more data” than humans to train. My argument was merely that we should expect things to take a lot of data, and situations where they don’t are illusory. Maybe that’s less common in this space, so it I should have framed it differently. But I wrote this mostly to put it out there and get people’s thoughts.

Also, I see your point about DNA only accounting for 1GB. I wasn’t aware it was so low. I think it’s interesting and suggests the possibility of smaller learning systems than I envisioned, but that’s as much a question about compression as anything else. Don’t forget that that DNA still needs to be “uncompressed” into a human, and at least some of that process is using information stored in the previous generation of human. Admittedly, it’s not clear how much that last part accounts for, but there is evidence that part of a baby’s development is determined by the biological state of the mother.

But I guess I would say my argument does rely on us not getting that <1GB of stuff, with the caveat that that 1GB is super highly compressed through a process that takes a very complex system to uncompress.

I should add as well that I definitely don’t believe that LLMs are remotely efficient, and I wouldn’t necessarily be surprised if humans are as close to the maximum on data efficiency as possible. I wouldn’t be surprised if they weren’t, either. But we were built over millions (billions?) of years under conditions that put a very high price tag on inefficiency, so it seems reasonable to believe our data efficiency is at least at some type of local minima.

EDIT: Another way to phrase the point about DNA: You need to account not just for the storage size of the DNA, but also the Kolmogrov complexity of turning that into a human. No idea if that adds a lot to its size, though.

OneManyNone 8 Jun 2023 19:45 UTC
1 point
0
on: Sentience matters
I feel as if I can agree with this statement in isolation, but can’t think of a context where I would consider this point relevant.

I’m not even talking about the question of whether or not the AI is sentient, which you asked us to ignore. I’m talking about how do we know that an AI is “suffering,” even if we do assume it’s sentient. What exactly is “suffering” in something that is completely cognitively distinct from a human? Is it just negative reward signals? I don’t think so, or at least if it was, that would likely imply that training a sentient AI is unethical in all cases, since training requires negative signals.

That’s not to say that all negative signals are the same or that maybe in some contexts it’s painful or not, just that I think determining that is an even harder problem than determining if the AI is sentient.

OneManyNone

In­fer­ence Speed is Not Unbounded

We Shouldn’t Ex­pect AI to Ever be Fully Rational

Pro­posal: Align Sys­tems Ear­lier In Training

Pro­posal: Tune LLMs to Use Cal­ibrated Language

Why I Believe LLMs Do Not Have Hu­man-like Emotions

Inference Speed is Not Unbounded

We Shouldn’t Expect AI to Ever be Fully Rational

Proposal: Align Systems Earlier In Training

Proposal: Tune LLMs to Use Calibrated Language

Why I Believe LLMs Do Not Have Human-like Emotions