Robbo

Karma: 296

I’m Rob Long. I work on AI consciousness and related issues.

http://robertlong.online/

https://experiencemachines.substack.com/

Key questions about artificial sentience: an opinionated guide

Robbo 25 Apr 2022 12:09 UTC

51 points

31 comments 18 min read LW link

Using Brain-Computer Interfaces to get more data for AI alignment

Robbo 7 Nov 2021 0:00 UTC

43 points

10 comments 7 min read LW link

Robbo 24 Nov 2021 12:02 UTC
19 points
in reply to: Daniel Kokotajlo’s comment on: Yudkowsky and Christiano discuss “Takeoff Speeds”

The core part of Ajeya’s model is a probability distribution over how many OOMs of compute we’d need with today’s ideas to get to TAI / AGI / APS-AI / AI-PONR / etc.

I didn’t know the last two acronyms despite reading a decent amount of this literature, so thought I’d leave this note for other readers. Listing all of them for completeness (readers will of course know the first two):

TAI: transformative AI

AGI: artificial general intelligence

APS-AI: Advanced, Planning, Strategically aware AI [1]

AI-PONR: AI point of no return [2]

[1] from Carlsmith, which Daniel does link to

[2] from Daniel, which he also linked

Robbo 23 Nov 2021 18:11 UTC
19 points
in reply to: Matthew Barnett’s comment on: Yudkowsky and Christiano discuss “Takeoff Speeds”
In general, I don’t yet see a strong reason to think that our general brain architecture is the sole, or potentially even primary reason why we’ve developed civilization, discontinuous with the rest of the animal kingdom. A strong requirement for civilization is the development of cultural accumulation via language, and more specifically, the ability to accumulate knowledge and technology over generations.

In The Secrets of Our Success, Joe Henrich argues that without our stock of cultural knowledge, individual humans are not particularly more generally intelligent than apes. (Neanderthals may very well have been more generally intelligent than humans—and indeed, their brains are bigger than ours.)

And, he claims, to the extent that individual humans are now especially intelligent, this was because of culture-driven natural selection. For Henrich, the story of human uniqueness is a story of a feedback loop: increased cultural know-how, which drives genetic selection for bigger brains and better social learning, which leads to increased cultural know-how, which drives genetic selection for bigger brains… and so forth, until you have a very weird great ape that is weak, hairless, and has put a flag on the moon.

Note: this evolution + culture feedback loop is still a huge discontinuity that led to massive changes in relatively short evolutionary time!

Just having a generalist brain doesn’t seem like enough; for example, could there have been a dolphin civilization?

Henrich speculates that a bunch of idiosyncratic features came together to launch us into the feedback loop that led to us being a cultural species. Most species, including dolphins, do not get onto this feedback loop because of a “startup” problem: bigger brains will give a fitness advantage only up to a certain point, because individual learning can only be so useful. For there to be further selection for bigger brains, you need a stock of cultural know-how (cooking, hunting, special tools) that makes individual learning very important for fitness. But, to have a stock of cultural know-how, you need big brains.

Henrich speculates that humans overcame the startup problem due to a variety of factors that came together when we descended from the trees and started living on the ground. The important consequences of a species being on the ground (as opposed to in the trees):
1. It frees up your hands for tool use. Captive chimps, which are more “grounded” than wild chimps, make more tools.
2. It’s easier for you to find tools left by other people.
3. It’s easier for you to see what other people are doing and hang out with them. (“Hang out” being inapt, since that’s precisely not what you’re doing).
4. You need to group up with people to survive, since there are terrifying predators on the ground. Larger groups offer protection; these larger groups will accelerate the process of people messing around with tools and imitating each other.
Larger groups also produce new forms of social organization. Apparently, in smaller groups of chimps, the reproductive strategy that every male tries to follow is “fight as many males as you can for mating opportunities.” But in a larger group, it becomes better for some males to try to pair bond – to get multiple reproductive opportunities with one female, by hanging around her and taking care of her.

Pair bonding in turn allows for more kinship relationships. Kinship relationships mean you grow up around more people; this accelerates learning. Kinship also allows for more genetic selection for big-brained, slow-developing learners: it becomes less prohibitively costly to give birth to big-brained, slow-growing children, because more people are around to help out and pool food resources.

This story is, by Henrich’s own account, quite speculative. You can find it in Chapter 16 of the book.

Robbo 10 Dec 2021 21:10 UTC
17 points
in reply to: Steven Byrnes’s comment on: There is essentially one best-validated theory of cognition.
would read a review!

Robbo 1 Oct 2021 15:12 UTC
17 points
on: You can talk to EA Funds before applying
I scheduled a conversation with Evan based on this post and it was very helpful. If you’re on the fence, do it! For me, it was helpful as a general career / EA strategy discussion, in addition to being useful for thinking about specifically Long-Term Future Fund concerns.
And I can corroborate that Evan is indeed not that intimidating.

80k podcast episode on sentience in AI systems

Robbo 15 Mar 2023 20:19 UTC

15 points

0 comments 13 min read LW link

(80000hours.org)

Robbo 12 Jan 2024 14:52 UTC
11 points
in reply to: sapphire’s comment on: Universal Love Integration Test: Hitler
That poem was not written by Hitler.

According to this website and other reputable-seeming sources, the German poet Georg Runsky published that poem, “Habe Geduld”, around 1906.

On 14 May 1938 a copy of this poem was printed in the Austrian weekly Agrarische Post, under the title ‘Denke es’. It was then falsely attributed to Adolf Hitler.

In the Hitler biography of John Toland (1976) it appeared for the first time in English translation. Toland made the mistake in identifying it as a true Hitler poem, supposedly written in 1923.

Robbo 24 Nov 2021 17:28 UTC
10 points
in reply to: Gram Stone’s comment on: Yudkowsky and Christiano discuss “Takeoff Speeds”

I guess even though I don’t disagree that knowledge accumulation has been a bottleneck for humans dominating all other species, I don’t see any strong reason to think that knowledge accumulation will be a bottleneck for an AGI dominating humans, since the limits to human knowledge accumulation seem mostly biological. Humans seem to get less plastic with age, mortality among other things forces us to specialize our labor, we have to sleep, we lack serial depth, we don’t even approach the physical limits on speed, we can’t run multiple instances of our own source, we have no previous example of an industrial civilization to observe, I could go on: a list of biological fetters that either wouldn’t apply to an AGI or that an AGI could emulate inside of a single mind instead of across a civilization.

I agree with this, and I think that you are hitting on a key reason that these debates don’t hinge on what the true story of the human intelligence explosion ends up being. Whichever of these is closer to the truth:

a) the evolution of individually smarter humans using general reasoning ability was the key factor

b) the evolution of better social learners and the accumulation of cultural knowledge was the key factor

...either way, there’s no reason to think that AGI has to follow the same kind of path that humans did. I found an earlier post on the Henrich model of the evolution of intelligence, Musings on Cumulative Cultural Evolution and AI. I agree with Rohin Shah’s takeaway on that post:

I actually don’t think that this suggests that AI development will need both social and asocial learning: it seems to me that in this model, the need for social learning arises because of the constraints on brain size and the limited lifetimes. Neither of these constraints apply to AI—costs grow linearly with “brain size” (model capacity, maybe also training time) as opposed to superlinearly for human brains, and the AI need not age and die. So, with AI I expect that it would be better to optimize just for asocial learning, since you don’t need to mimic the transmission across lifetimes that was needed for humans.

What to think when a language model tells you it’s sentient

Robbo 21 Feb 2023 0:01 UTC

9 points

6 comments 6 min read LW link

[Question] Who has argued in detail that a current AI system is phenomenally conscious?

Robbo 14 May 2021 22:03 UTC

8 points

2 comments 1 min read LW link

Robbo 27 Sep 2021 13:10 UTC
8 points
on: Brain-Computer Interfaces and AI Alignment
Thanks for collecting these things! I have been looking into these arguments recently myself, and here are some more relevant things:
1. EA forum post “A New X-Risk Factor: Brain-Computer Interfaces” (August 2020) argues for BCI as a risk factor for totalitarian lock-in.
2. In a comment on that post, Kaj Sotala excerpts a section of Sotala and Yampolskiy (2015), “Responses to catastrophic AGI risk: a survey”. This excerpt contains links to many other relevant discussions:
  1. “De Garis [82] argues that a computer could have far more processing power than a human brain, making it pointless to merge computers and humans. The biological component of the resulting hybrid would be insignificant compared to the electronic component, creating a mind that was negligibly different from a ‘pure’ AGI. Kurzweil [168] makes the same argument, saying that although he supports intelligence enhancement by directly connecting brains and computers, this would only keep pace with AGIs for a couple of additional decades.
  2. “The truth of this claim seems to depend on exactly how human brains are augmented. In principle, it seems possible to create a prosthetic extension of a human brain that uses the same basic architecture as the original brain and gradually integrates with it [254]. A human extending their intelligence using such a method might remain roughly human-like and maintain their original values. However, it could also be possible to connect brains with computer programs that are very unlike human brains and which would substantially change the way the original brain worked. Even smaller differences could conceivably lead to the adoption of ‘cyborg values’ distinct from ordinary human values [290].
  3. “Bostrom [49] speculates that humans might outsource many of their skills to non-conscious external modules and would cease to experience anything as a result. The value-altering modules would provide substantial advantages to their users, to the point that they could outcompete uploaded minds who did not adopt the modules. [...]
  4. “Moravec [194] notes that the human mind has evolved to function in an environment which is drastically different from a purely digital environment and that the only way to remain competitive with AGIs would be to transform into something that was very different from a human.”
3. The sources in question from the above are:
  1. de Garis H 2005 The Artilect War: Cosmists vs Terrans (Palm Springs, CA: ETC Publications)
  2. Kurzweil, R. (2001). Response to Stephen Hawking. Kurzweil Accelerating Intelligence. September, 5.
  3. Sotala K and Valpola H 2012 Coalescing minds Int. J. Machine Consciousness 4 293–312
  4. Warwick K 2003 Cyborg morals, cyborg values, cyborg ethics Ethics Inf. Technol. 5 131–7
  5. Bostrom N 2004 The future of human evolution ed C Tandy pp 339–71 Two Hundred Years After Kant, Fifty Years After Turing (Death and Anti-Death vol 2)
  6. Moravec H P 1992 Pigs in cyberspace www.frc.ri.cmu.edu/~hpm/project.archive/general.articles/1992/CyberPigs.html
4. Here’s a relevant comment on that post from Carl Shulman, who notes that FHI has periodically looked into BCI in unpublished work: “I agree the idea of creating aligned AGI through BCI is quite dubious (it basically requires having aligned AGI to link with, and so is superfluous; and could in any case be provided by the aligned AGI if desired long term)”

Robbo 9 Sep 2021 12:37 UTC
7 points
on: Pleasure and Pain are Long-Tailed
Thank you for writing about this. It’s a tremendously interesting issue.
I feel qualitatively more conscious, which I mean in the “hard problem of consciousness” sense of the word. “Usually people say that high-dose psychedelic states are indescribably more real and vivid than normal everyday life.” Zen practitioners are often uninterested in LSD because it’s possible to reach states that are indescribably more real and vivid than (regular) real life without ever leaving real life. (Zen is based around being totally present for real life. A Zen master meditates eyes open.) It is not unusual for proficient meditators to describe mystical experiences as at least 100× more conscious than regular everyday experience.
I’m very curious about the issue of what it means to say that one creature is “more conscious” than another—or, that one person is more conscious while meditating than while surfing Reddit. Especially if this is meant in the sense of “more phenomenally conscious”. (I take it that you do mean “more phenomenally conscious”, and that’s what you are saying by invoking the hard problem. But let me know if that’s not right). Can you say more about what you mean? Some background:
Pautz (2019) has been influential on my thinking about this kind of talk about ‘more conscious’ or ‘level of consciousness’ or ‘degree of consciousness’. Pautz distinguishes between many consciousness-related things that certainly do come in degrees.
On the one hand, we have certain features of the particular character of phenomenally conscious experiences:
- Intensity level (193)
  - A whisper is less intense than a heavy metal concert; faint pink is less intense than bright red. And of course, certain pleasures and pains are more intense than others
- Complexity level
  - The whiff of mint is a ‘simpler’ experience than visual experience of a bustling London street
- Determinacy level
  - A tomato in the center of vision is represented more determinately than a tomato in the periphery
- Access level
  - If you think that there can be more or less ‘access’ to phenomenally conscious experiences, then there might be some experiences that are not accessed at all, versus those that are fully accessed, e.g. something right in front of you that you are paying full attention to.
And then there is a ‘global’ feature of a creature’s phenomenal consciousness:
- Richness of experiential repertoire: the ‘number’ of distinct experiences (types and tokens) the creature has the capacity to have (194). Adult humans probably have a greater richness of experiential repertoire than a worm (if indeed worms are phenomenally conscious).
In light of this, my questions for you:
1. Along which of these dimensions are you ‘more’ conscious when meditating? Would love to hear more. (I’m guessing: intensity, complexity, and access?)
2. Do you think there is some further way in which you are ‘more conscious’, that is not cashed out in these terms? (Pautz does not, and he uses this to criticize Integrated Information Theory)
Finally: this post has inspired me to be more ambitious about exploring the broader regions of consciousness space for myself. (“Our normal waking consciousness, rational consciousness as we call it, is but one special type of consciousness, whilst all about it, parted from it by the filmiest of screens, there lie potential forms of consciousness entirely different.” -William James). And for that, I am grateful.

Robbo 20 May 2021 15:45 UTC
6 points
in reply to: Willa’s comment on: Willa’s Shortform
I enjoyed reading this and skimming through your other shortforms. I’m intrigued by this idea of using the short form as something like a journal (albeit a somewhat public facing one).

Any tips, if I might want to start doing this? How helpful have you found it? Any failure modes?

Robbo 11 Nov 2021 21:56 UTC
4 points
in reply to: niplav’s comment on: Using Brain-Computer Interfaces to get more data for AI alignment
fixed the “Samberg” typo—thanks!

Robbo 29 Sep 2021 15:33 UTC
4 points
on: A review of Steven Pinker’s new book on rationality
“I’m tempted to recommend this book to people who might otherwise be turned away by Rationality: From A to Z.”
Within the category of “recent accessible introduction to rationality”, would you recommend this Pinker book, or Julia Galef’s “Scout Mindset”? Do you have thoughts on the pros and cons of each, or who would benefit more from each?

Robbo 8 May 2021 17:32 UTC
4 points
on: Interview with Christine M. Korsgaard: Animal Ethics, Kantianism, Utilitarianism
Thanks for this! People interested in the claim (which Korsgaard takes to be a deficiency of utilitarianism) that for utilitarians “people and animals don’t really matter at all; they are just the place where the valuable things happen”, might be interested in Richard Yetter Chappell’s [1] paper “Value Receptacles” (pdf). It’s an exploration of what this claim could even mean, and a defense of utilitarianism in light of it.
[1] Not incidentally, a long-time effective altruist. Whose blog is great.

Robbo 22 Feb 2023 0:45 UTC
3 points
in reply to: Gunnar_Zarncke’s comment on: What to think when a language model tells you it’s sentient
To clarify, what question were you thinking that is more interesting than? I see that as one of the questions that is raised in the post. But perhaps you are contrasting “realize it is conscious by itself” with the methods discussed in “Could we build language models whose reports about sentience we can trust?”

Robbo 9 Nov 2021 11:01 UTC
3 points
in reply to: Steven Byrnes’s comment on: Using Brain-Computer Interfaces to get more data for AI alignment
Even if we were able to get good readings from insula & cingulate cortex & amygdala et alia, do you have thoughts on how and whether we could “ground” these readings? Would we calibrate on someone’s cringe signal, then their gross signal, then their funny signal—matching various readings to various stimuli and subjective reports?

Robbo 8 Nov 2021 11:01 UTC
3 points
in reply to: Steven Byrnes’s comment on: Using Brain-Computer Interfaces to get more data for AI alignment
Hi Steven, thanks!
1. On terminology, I agree.
Wait But Why, which of course is not an authoritative neuroscience source, uses “scale” to mean “how many neurons can be simultaneously recorded”. But then it says fMRI and EEG have “high scale” but “low spatial resolution”, which is somewhat confusing, since low spatial resolution means that fMRI and EEG don’t record any individual neurons. So, my gloss on “scale” is more like what WBW actually is talking about, and probably is better called “coverage”. And then it’s best to just talk about “number of simultaneously recorded [individual] neurons” without giving that a shorthand, and only talk about that when we really are recording individual neurons. That’s what Stevenson and Kording (2011) do in “How advances in neural recording affect data analysis”.
2. Good call on Kernel, I’ll edit to reflect that.
3. Yep, invasive techniques are necessary, but not sufficient, as the case of ECoG shows.
