learn math or hardware
mesaoptimizer
I had the impression that SPAR was focused on UC Berkeley undergrads and had therefore dismissed the idea of being a SPAR mentor or mentee. Only recently, when someone mentioned wanting to learn from a particular SPAR mentor, did I look at the website, and SPAR now seems to focus on the same niche as AI Safety Camp.
Did SPAR pivot in the past six months, or did I just misinterpret SPAR when I first encountered it?
Somewhat off-topic, so feel free to move this comment elsewhere.
I’m quite surprised to see that you have just shipped an MSc thesis, because I didn’t expect you to be doing an MSc (or anything in traditional academia). I didn’t think you needed one, since I think you have enough career capital to continue to work indefinitely on the things you want to work on and get paid well for it. I also assumed that you might find academia somewhat a waste of your time in comparison to doing stuff you wanted to do.
Perhaps you could help clarify what I’m missing?
fiber at Tata Industries in Mumbai
Could you elaborate on how Tata Industries is relevant here? Based on a DDG search, the only news I find involving Tata and AI infrastructure is one where a subsidiary named TCS is supposedly getting into the generative AI gold rush.
My thought is that I don’t see why a pivotal act needs to be that.
Okay. Why do you think Eliezer proposed that, then?
Note that I agree with your sentiment here, although my concrete argument is basically what LawrenceC wrote as a reply to this post.
Ryan, this is kind of a side-note but I notice that you have a very Paul-like approach to arguments and replies on LW.
Two things that come to notice:
You have a tendency to reply to certain posts or comments with “I don’t quite understand what is being said here, and I disagree with it,” or “It doesn’t track with my views,” or equivalent replies that seem not very useful for understanding your object-level arguments. (Although I notice that in your recent comments, you usually follow this up with some elaboration on your model.)
In the comment I’m replying to, you use a strategy of black-box-like abstract modeling of a situation to argue for a conclusion, one that usually involves numbers such as multipliers or percentages. (I have the impression that Paul uses this a lot; one concrete example that comes to mind is the takeoff speeds essay.) I usually consider such arguments invalid when they seem to throw away information we already have, or when they use a set of abstractions that don’t feel appropriate to the information I believe we have.
I just found this interesting and plausible enough to highlight to you. It would be a moderate investment of my time to dig through your comment history and find examples of all these instances, but writing this comment still seemed valuable.
This is a really well-written response. I’m pretty impressed by it.
If your acceptable lower limit for basically anything is zero, you won’t be allowed to do anything, really anything. You have to name some quantity of capabilities progress that’s okay to cause before you’ll be allowed to talk about AI in a group setting.
Okay, I just read the entire thing. Have you looked at Eric Drexler’s CAIS proposal? It seems to have served as a precursor to the davidad / Evan OAA proposal, and it involves the use of composable narrow AI systems.
but I’m a bit disappointed that x-risk-motivated researchers seem to be taking the “safety”/”harm” framing of refusals seriously
I’d say a more charitable interpretation is that it is a useful framing: both in terms of a concrete thing one could use as scaffolding for alignment-as-defined-by-Zack research progress, and also a thing that is financially advantageous to focus on since frontier labs are strongly incentivized to care about this.
Haven’t read the entire post, but here are my thoughts on seeing the first image: I’m pretty sure this is priced into the Anthropic / Redwood / OpenAI cluster of strategies, where you use an aligned, boxed (or ‘mostly aligned’) generative LLM-style AGI to help you figure out what to do next.
e/acc is not a coherent philosophy and treating it as one means you are fighting shadows.
Landian accelerationism at least is somewhat coherent. “e/acc” is a bundle of memes that support the self-interest of the people supporting and propagating it, both financially (VC money, dreams of making it big) and socially (the non-Beff e/acc vibe is one of optimism and hope and to do things—to engage with the object level—instead of just trying to steer social reality). A more charitable interpretation is that the philosophical roots of “e/acc” are founded upon a frustration with how bad things are, and a desire to improve things by yourself. This is a sentiment I share and empathize with.
I find the term “techno-optimism” to be a more accurate description of the latter, and perhaps “Beff Jezos philosophy” a more accurate description of what you have in your mind. And “e/acc” to mainly describe the community and its coordinated movements at steering the world towards outcomes that the people within the community perceive as benefiting them.
I use GreaterWrong as my front-end to interface with LessWrong, AlignmentForum, and the EA Forum. It is significantly less distracting and also doesn’t make my ~decade-old laptop scream in agony when multiple LW tabs are open in my browser.
The main part of the issue was actually that I was not aware I had internal conflicts. I just mysteriously felt less emotions and motivation.
Yes, I believe that one can learn to entirely stop even considering certain potential actions as actions available to us. I don’t really have a systematic solution for this right now aside from some form of Noticing practice (I believe a more refined version of this practice is called Naturalism but I don’t have much experience with this form of practice).
What do you think antidepressants would be useful for?
In my experience, I’ve gone months through a depressive episode while remaining externally functional and convincing myself (and the people around me) that I wasn’t going through one. Another thing I’ve noticed is that with medication (whether anxiolytics, antidepressants, or ADHD medication), I regularly underestimated how ‘blocked’ I was by some mental issue until the medication removed it; I would only realize the block had existed because of the (positive) changes in my behavior and cognition.
Essentially, I’m positing that you may be in a similar situation.
Have you considered antidepressants? I recommend trying them out to see if they help. In my experience, antidepressants can have non-trivial positive effects that can be hard-to-put-into-words, except you can notice the shift in how you think and behave and relate to things, and this shift is one that you might find beneficial.
I also think that slowing down and taking care of yourself can be good—it can help build a generalized skill of noticing the things you didn’t notice before that led to the breaking point you describe.
Here’s an anecdote that might be interesting to you: There’s a core mental shift I made over the past few months that I haven’t tried to elicit and describe to others until now, but in essence it involves a sort of understanding that the sort of self-sacrifice that usually is involved in working as hard as possible leads to globally unwanted outcomes, not just locally unwanted outcomes. (Of course, we can talk about hypothetical isolated thought experiments and my feelings might change, but I’m talking about a holistic relating to the world here.)
Here’s one argument for this, although I don’t think it captures the entire source of my feelings about this: When parts of someone are in conflict, and they regularly reject a part of them that wants one thing (creature comforts) to privilege the desires of another part that wants something else (working more), I expect that their effectiveness in navigating and affecting reality is lower than if they took the time to integrate the desires and beliefs of the conflicting parts. In extreme circumstances it makes sense to ‘override’ other parts (which is how I model the fight-flight-freeze-fawn response, for example), but this seems unsustainable and potentially detrimental when navigating a reality where sense-making is extremely important.
This is a very interesting paper, thanks.
What was the requirement? Seems like this was a deliberate effect instead of a side effect.
which I know you object to
Buck, could you (or habryka) elaborate on this? Do you call the set of things that ARC Theory and METR (formerly known as ARC Evals) do “AI control research”?
My understanding is that while Redwood clearly does control research, METR evals seem more of an attempt to demonstrate dangerous capabilities than to help with control. I haven’t wrapped my head around ARC’s research philosophy and output well enough to confidently state anything.
On the other hand, institutional scars can cause what effectively looks like institutional traumatic responses, ones that block the ability to explore and experiment and to try to make non-incremental changes or improvements to the status quo, to the system that makes up the institution, or to the system that the institution is embedded in.
There’s a real and concrete issue with the number of roadblocks that seem to be in place to prevent people from making gigantic changes to the status quo. Here’s a simple example: would it be possible to get a nuclear plant set up in the United States within the next decade, barring financial constraints? Seems pretty unlikely to me. What about the FDA’s response to the COVID crisis? That sure seemed like a concrete example of how ‘institutional memories’ serve as gigantic roadblocks to our civilization’s ability to orient and act fast enough to deal with the sort of issues we are and will be facing this century.
In the end, capital flows toward AGI companies for the sole reason that they are the least bottlenecked / regulated way to multiply capital, the one that seems to have the highest upside for investors. If you could modulate this, you wouldn’t need to worry as much about the incentives and culture of these startups.