I have a PhD in Computational Neuroscience from UCSD (Bachelor’s was in Biomedical Engineering with Math and Computer Science minors). Ever since junior high, I’ve been trying to figure out how to engineer artificial minds, and I’ve been coding up artificial neural networks ever since I first learned to program. Obviously, all my early designs were almost completely wrong/unworkable/poorly defined, but I think my experiences did prime my brain with inductive biases that are well suited for working on AGI.
Although I now work as a data scientist in R&D at a large medical device company, I continue to spend my free time studying the latest developments in AI/ML/DL/RL and neuroscience and trying to come up with models for how to bring it all together into systems that could actually be implemented. Unfortunately, I don’t seem to have much time to develop my ideas into publishable models, but I would love to have the opportunity to share ideas with those who do.
Of course, I’m also very interested in AI Alignment (hence the account here). My ideas on that front mostly fall into the “learn (invertible) generative models of human needs/goals and hook those up to the AI’s own reward signal” camp. I think methods of achieving alignment that depend on restricting the AI’s intelligence or behavior are about as destined to fail in the long term as Prohibition or the War on Drugs in the USA. We need a better theory of what reward signals are for in general (probably something to do with maximizing (minimizing) the attainable (dis)utility with respect to the survival needs of a system) before we can hope to model human values usefully. This could even extend to modeling the “values” of the ecological/socioeconomic/political supersystems in which humans are embedded, or of the biological subsystems that are embedded within humans, both of which would be crucial for creating a better future.
As a father of two very young daughters (2 years old and 2 months old), I can really appreciate this. As someone with a background in computational neuroscience and some linguistics/NLP/ML/AI, I’ve loved watching them grow and making educated guesses about what sorts of computations could be going on inside their little brains at each developmental stage.
From the earliest days, when they can’t even focus on our faces, hard as they try (I can tell what you’re trying to do, superior colliculus and fusiform face area; you can do it!), to later on when they’re walking and talking (still working on that theory of mind, though).
Language development has been especially fun to watch. Early on, they love just staring at your mouth as you enunciate the various phonemic sequences of what will become their native language. As they become more aware, you can see them start to comprehend when you use simple sentences to narrate things within their field of attention. And they definitely learn to understand more complex language long before they can talk. Patterns built upon patterns, just like deep transformer models, yet still quite different.
When my daughter began to pronounce words, we started pausing intermittently while reading her favorite books or singing familiar songs, and we would have her complete the last word of each line. I couldn’t help but think of how large language models are often trained to perform next-token prediction in a similar way. It’s clear, though, that the human brain has some extra bias that makes songs and poetry easier to memorize than prose.
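The fill-in-the-last-word game can be made concrete with a toy sketch: a simple bigram counter standing in for the vastly richer models that both LLMs and toddlers learn. The corpus and function names here are my own illustrative choices, not anything from the post.

```python
from collections import Counter, defaultdict

# Toy next-token prediction: learn which word tends to follow which,
# then "complete the line" when the reader pauses before the last word.
corpus = [
    "twinkle twinkle little star",
    "how i wonder what you are",
    "up above the world so high",
    "like a diamond in the sky",
]

# Count word-to-next-word transitions across the corpus.
bigrams = defaultdict(Counter)
for line in corpus:
    words = line.split()
    for prev, nxt in zip(words, words[1:]):
        bigrams[prev][nxt] += 1

def complete(prompt: str) -> str:
    """Predict the most likely next word given the last word of the prompt."""
    last = prompt.split()[-1]
    candidates = bigrams.get(last)
    return candidates.most_common(1)[0][0] if candidates else "<unk>"

print(complete("twinkle twinkle little"))  # -> star
```

A real language model does the same kind of conditional prediction over far longer contexts with learned representations rather than raw counts, which is part of why the analogy only goes so far.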
And it’s funny how trying to talk to babies reveals just how much of our adult-level world model we assume when we communicate. Once, when our oldest was trying to use a sippy cup on her own for the first time, we saw her putting it to her mouth like we did but failing to get any water. To help her out, I told her to lift the bottom of her cup to get at the water. She then proceeded to lift the entire cup above her head, which surprisingly did not help her. (Eventually she got it.)
For all their temporary limitations, it’s clear that there is a lot going on inside babies’ heads. You can learn a lot about the human brain and cognitive algorithms and biases by studying them carefully. It’s certainly the cutest way to do so.