Ben, I want to say thank you for putting in a tremendous amount of work, and also for being willing to risk attempts at retaliation when that’s a pretty clear threat.
You’re in a reasonable position to take this on, having earned the social standing to make character smears unlikely to stick, and having the institutional support to fight a spurious libel claim. And you’re also someone I trust to do a thorough and fair job.
I wish there were someone whose opportunity cost were lower who could handle retaliation-threat reporting, but it’s pretty likely that anyone with those attributes will have other important opportunities.
Thank you for writing this, Jessica. First, you’ve had some miserable experiences in the last several years, and regardless of everything else, those times sound terrifying and awful. You have my deep sympathy.
Although I see a large distinction between the Leverage situation and MIRI/CFAR, I agree with Jessica that this is a good time to revisit the safety of various orgs in the rationality/EA space.
I almost perfectly overlapped with Jessica at MIRI from March 2015 to June 2017. (Yes, this uniquely identifies me. Don’t use my actual name here anyway, please.) So I think I can speak to a great deal of this.
I’ll run down a summary of the specifics first (or at least, the specifics I know enough about to speak meaningfully), and then at the end discuss what I see overall.
Claim: People in and adjacent to MIRI/CFAR manifest major mental health problems, significantly more often than the background rate.
I think this is true; I believe I know two of the first cases to which Jessica refers; and I’m probably not plugged-in enough socially to know the others. And then there’s the Ziz catastrophe.
Claim: Eliezer and Nate updated sharply toward shorter timelines, other MIRI researchers became similarly convinced, and they repeatedly tried to persuade Jessica and others.
This is true, but in my honest opinion non-nefarious: the belief was sincerely held, and given that belief, you'll have better odds of success if the whole team at least takes the hypothesis quite seriously.
(As for me, I’ve stably been at a point where near-term AGI wouldn’t surprise me much, but the lack of it also wouldn’t surprise me much. That’s all it takes, really, to be worried about near-term AGI.)
Claim: MIRI started getting secretive about their research.
This is true, to some extent. Nate and Eliezer discussed with the team that some things might have to be kept secret, and applied some basic secrecy to things we thought at the time might be AGI-relevant rather than only FAI-relevant. I think the concern here was less about AGI timelines and more about the multipolar race between DeepMind and OpenAI. In our current world, basically any new advance gets deployed immediately.
However, I don’t recall ever being told that I wasn’t allowed to know what someone else was working on, at least in broad strokes. Maybe my memory is faulty here, but it diverges from Jessica’s.
(I was sometimes coy about whether I knew anything secret or not, in true glomarization fashion; I hope this didn’t contribute to that feeling.)
There are surely things that Eliezer and Nate only wanted to discuss with each other, or with a specific researcher or two.
Claim: MIRI had rarity narratives around itself and around Eliezer in particular.
This is true. It would be weird if, given MIRI’s reason for being, it didn’t at least have the institutional rarity narrative—if one believed somebody else were just as capable of causing AI to be Friendly, clearly one should join their project instead of starting one’s own.
About Eliezer, there was a large but not infinite rarity narrative. We sometimes joked about the “bus factor”: if researcher X were hit by a bus, how much would the chance of success drop? Setting aside that this is a ridiculous and somewhat mean thing to joke about, the usual consensus was that Eliezer’s bus factor was the highest, but that a couple of MIRI’s researchers put together exceeded it. (Nate’s was also quite high.)
(My expectation is that the same would not have been said about Geoff within Leverage.)
Claim: Working at MIRI/CFAR made it harder to connect with people outside the community.
There’s an extent to which this is true of any community that includes an idealistic job (e.g. a paid political activist probably has likeminded friends and finds it a bit more difficult to connect outside that circle). Is it true beyond that?
Not for me, at least. I maintained my ties with the other community I’d been plugged into (social dancing) and kept in good touch with my family (it helps that I have a really good family). As with the above example, the social path of least resistance would have been to just be friends with the same network of people in one’s work orbit, but there wasn’t anything beyond that level of gravity in effect for me.
Claim: CFAR got way too far into Shiny-Woo-Adjacent-Flavor-Of-The-Week.
This is an unfair framing… because I agree with Jessica’s claim 100%. Besides Kegan levels and the MAPLE dalliance, there was the Circling phase and probably much else I wasn’t around for.
As for causes, I’ve been of the opinion that Anna Salamon has a lot of strengths around communicating ideas, but that her hiring has had as many hits as misses. There’s massive churn, people come in with their Big Ideas and nobody to stop them, and also people come in who aren’t in a good emotional place for their responsibilities. I think CFAR would be better off if Anna delegated hiring to someone else. [EDIT: Vaniver corrects me to say that Pete Michaud has been mostly in charge of hiring for the past several years, in which case I’m criticizing him rather than Anna for any bad hiring decisions during that time.]
Overall Thoughts
Essentially, I think there’s one big difference between issues with MIRI/CFAR and issues at Leverage:
The actions of CFAR/MIRI harmed people unintentionally, as evidenced by the fact that people frequently burned out and left quickly. The churn, especially at CFAR, hurt the mission, so it was definitely not the successful result of any strategic process.
Geoff Anders and others at Leverage harmed people intentionally, in ways that were intended to maintain control over those people. And to a large extent, that seems to have succeeded until Leverage fell apart.
Specifically, [accidentally triggering psychotic mental states by conveying a strange but honestly held worldview without adding adequate safeties] is different from [intentionally triggering psychotic mental states in order to pull people closer and prevent them from leaving], which is Zoe’s accusation. Even if it’s possible for a mental breakdown to be benign under the right circumstances, and even if an unplanned one is more likely to happen under very wrong circumstances, I’m far more terrified of a group that strategically plans for its members to have psychosis with the intent of molding those members further toward the group’s mission.
Unintentional harm is still harm, of course! It might have even been greater harm in total! But it makes a big difference when it comes to assessing how realistic a project of reform might be.
There are surely some deep reforms along these lines that CFAR/MIRI must consider. For one thing: scrupulosity, in the context of AI safety, seems to be a common thread in several of these breakdowns. I’ve taken this seriously enough in the past to post extensively on it here. I’d like CFAR/MIRI leadership to carefully update on how scrupulosity hurts both their people and their mission, and think about changes beyond surface-level things like adding a curriculum on scrupulosity. The actual incentives ought to change.
Finally, a good amount of Jessica’s post (like Zoe’s) concerns her inner experiences, on which she is the undisputed expert. I’m not ignoring those parts above; I just can’t say anything about them, because as a third-person observer it’s much easier to discuss external realities than internal ones. (Likewise with Zoe and Leverage.)