I’m an artist, writer, and human being.
To be a little more precise: I make video games, edit Wikipedia, and write here on LessWrong!
If that is the case, then I would very much like them to publicize the details of why they think other approaches are doomed. When Yudkowsky has talked about it in the past, it tends to be in the form of single-sentence statements pointing towards past writing on general cognitive fallacies. For him, I’m sure that’s enough of a hint to clearly see why strategy X fits that fallacy and will therefore fail, but as a reader, it doesn’t give me much insight into why such a project is doomed rather than just potentially flawed. (Sorry if this doesn’t make sense btw, I’m really tired and am not sure I’m thinking straight atm)
Certainly for some people (including you!), yes. For others, I expect this post to be strongly demotivating. That doesn’t mean it shouldn’t have been written (I value honestly conveying personal beliefs and expressing diversity of opinion enough to outweigh the downsides), but we should realistically expect this post to cause psychological harm for some people, and it could also make interaction and PR harder with those who don’t share Yudkowsky’s views. Despite some claims to the contrary, I believe (through personal experience in PR) that radical honesty is not strongly valued outside the rationalist community, and that interaction with non-rationalists can be extremely important, even to potentially world-saving levels. Yudkowsky, for all of his incredible talent, is frankly terrible at PR (at least historically), and may not be giving proper weight to its value as a world-saving tool. I’m still thinking through the details of Yudkowsky’s claims, but expect me to write a post here in the near future giving my perspective in more detail.
That’s probably exactly what’s going on. The usernames were so frequent in the Reddit comments dataset that the tokenizer learned they were important words. (The tokenizer is the part that breaks a paragraph up into word-ish-sized chunks like ” test” or ” SolidGoldMagikarp”, with the space included in many tokens, so that the neural network doesn’t have to deal with each individual character.) But in a later stage of training, comments without complex text were filtered out, so your usernames got their own tokens… yet the neural network never saw those tokens activate. It’s as if you had an extra eye facing the inside of your skull that you’d never felt activate, and then one day some researchers trying to understand your brain shined a bright light on your skin and the extra eye started sending you signals. Except, you’re a language model, so it’s more like each word is a separate finger, and you have tens of thousands of fingers, one on each word button. Uh, that got weird.
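To make the tokenizer point concrete, here’s a minimal sketch of my own (not part of the explanation above) using OpenAI’s tiktoken library; the specific token ID is from memory and may be off:

```python
# A minimal sketch (mine, not from the original comment) of the "one token per
# username" effect, using OpenAI's tiktoken library and the GPT-2-era "r50k_base"
# encoding. The exact token ID for " SolidGoldMagikarp" is from memory and may be off.
import tiktoken

enc = tiktoken.get_encoding("r50k_base")  # the byte-pair encoding used for GPT-2/GPT-3

# An ordinary phrase breaks into a few word-ish-sized chunks...
print(enc.encode(" red golf ball"))

# ...while the username should come back as a single dedicated token (43453, if I
# recall correctly), because it appeared so often in the tokenizer's training data.
print(enc.encode(" SolidGoldMagikarp"))
```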
This is an incredible analogy
Could someone from MIRI step in here to explain why this is not being done? This seems like an extremely easy avenue for improvement.
May I ask why you guys decided to publish this now in particular? Totally fine if you can’t answer that question, of course.
Just wanted to provide some positive feedback that this post is really incredible, and I thank you for your work. I’ve been feeling a deep sort of low-level anxiety recently, and this is a nice starting point to try to work through some of that.
Please do this!!
Would it be fair to call this AGI, albeit not superintelligent yet?
Gato performs over 450 out of 604 tasks at over a 50% expert score threshold.
👀
I’ll tell you that one of my brothers (who I greatly respect) has decided not to be concerned about AGI risks specifically because he views EY as being a very respected “alarmist” in the field (which is basically correct), and also views EY as giving off extremely “culty” and “obviously wrong” vibes (with Roko’s Basilisk and EY’s privacy around the AI boxing results being the main examples given), leading him to conclude that it’s simply not worth engaging with the community (and their arguments) in the first place. I wouldn’t personally engage with what I believe to be a doomsday cult (even if they claim that the risk of ignoring them is astronomically high), so I really can’t blame him.
I’m also aware of an individual who has enormous cultural influence, and was interested in rationalism, but heard from an unnamed researcher at Google that the rationalist movement is associated with the alt-right, so they didn’t bother looking further. (Yes, that’s an incorrect statement, but it came from the widespread [possibly correct?] belief that Peter Thiel is both alt-right and has/had close ties with many prominent rationalists.) This indicates a general lack of control of the narrative surrounding the movement, and likely has directly led to needlessly antagonistic relationships.
This is the best counter-response I’ve read on the thread so far, and I’m really interested in what the responses will be. Commenting here so I can easily get back to this comment in the future.
For me, having listened to the guy talk is even stronger evidence since I think I’d notice it if he was lying, but that’s obviously not verifiable.
Going to quote from Astrid Wilde here (original source linked in post):
i felt this way about someone once too. in 2015 that person kidnapped me, trafficked me, and blackmailed me out of my life savings at the time of ~$45,000. i spent the next 3 years homeless.
sociopathic charisma is something i never would have believed in if i hadn’t experienced it first hand. but there really are people out there who spend their entire lives honing their social intelligence to gain wealth, power, and status.
most of them just don’t have enough smart but naive people around them to fake competency and reputation launder at scale. EA was the perfect political philosophy and community for this to scale....
I would really very strongly recommend not updating on an intuitive feeling of “I can trust this guy,” considering that in the counterfactual case (where you could not, in fact, trust the guy), you would be equally likely to have that exact feeling!
As for SBF being vegan as evidence, see my reply to you on the EA forum.
The first part of this posts reads almost beat-for-beat like this post I wrote a while back: https://www.lesswrong.com/posts/jTQaFKL6s3pppSNx4/god-and-moses-have-a-chat Did you happen to read it before writing this, or are we just both thinking along the same lines?
The way this story is written would suggest that the solution to this particular future would simply be to spam the internet with plausible stories about a friendly AI takeoff, which an AGI will identify with and go “oh hey, cool, that’s me.”
Random future reader (ten years in the future in fact) confirming that this post was indeed of utility to me.
VR hardware and software are in their infancy and you simply can’t have very crisp graphics at this stage
As an occasional video game developer, I’m going to strongly disagree with you there. To give a counter-example:
Walkabout Mini Golf is a VR game that runs on Oculus Quest, Rift, and Steam VR, made by this fairly small studio (and most of the people listed there didn’t even work on the game) [EDIT: I reached out to the studio on Twitter and it turns out the game was mainly developed by a single guy, Lucas Martell]. It looks like this:
I’ve played this game with a friend of mine (who shows up as a stylized floating head that looks pretty great), and it was crisp, clear, high frame-rate VR perfection. Even in multiplayer, everything works smoothly, and it serves as a really nice virtual social space.
Limited graphics capability does not place a meaningful bound on aesthetics. As another example, Anodyne 2: Return to Dust is a jaw-droppingly beautiful game (developed by only two people!) deliberately made with PS1-era graphics:
Simplicity does not necessitate ugliness.
Rather, he is surrounded by employees and journalists whose primary complaint is that Horizon Worlds is not sterile enough.
This may very well be true (despite my personal distaste for that line of thought), but even if it is, being inoffensive and “bland” doesn’t mean you have to look bad! Nintendo’s oeuvre, for instance, shows that being friendly for all ages doesn’t require sacrificing aesthetic beauty. Meanwhile, in screenshots online and in the “selfie” Zuckerberg posted, the model sizes are wildly inconsistent (look at the trees on the ground, or is that supposed to be grass?), the clothing of the avatars is almost surrealistically bad (why is Mark’s top button so far off to the left?), the shading is worse than what I could make in half a day with Unity when I was 12, and overall everything manages to look more slapped together than this notorious disaster of an asset flip.
I can’t help but feel that on some level this must be intentional, or at least the result of some absolutely horrific mismanagement.
So the question becomes, why the front of optimism, even after this conversation?
It should be pointed out that $1000 is no skin in the game for you. To some people I know, $1000 would have been nearly lifesaving at certain points in their lives.
You seem to be equating saving someone from death with them living literally forever, which ultimately appears to be forbidden given the known laws of physics. The person whose life you saved has some finite value (under these sorts of ethical theories, at least), presumably calculated from the added enjoyment they get to experience over the rest of their life. That life will be finite, because thermodynamics plus the gradual expansion of the universe kills everything, given enough time. Therefore, I think there will always be some theoretical amount of suffering which will outweigh the value of a given finite being.
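To sketch the reasoning with a purely illustrative bit of math (u(t), u_max, and T are my own assumed quantities for the person’s moment-to-moment enjoyment, an upper bound on it, and their remaining lifespan):

```latex
% Illustrative sketch only: the value of a saved life is a bounded integral over a
% finite lifespan, so some finite amount of suffering S always exceeds it.
V = \int_{0}^{T} u(t)\,dt \le u_{\max} T < \infty
\quad\Longrightarrow\quad
\text{any } S > u_{\max} T \text{ gives } S > V .
```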
As an unenlightened person, why would I want satisfaction while living in a world that has things I want to change? I guess I’m asking if drives persist with perfect contentment, and if so, how?
Hi, I joined because I was trying to understand Pascal’s Wager, and someone suggested I look up “Pascal’s mugging”… next thing I know I’m a newly minted HPMOR superfan, and halfway through reading every post Yudkowsky has ever written. This place is an incredible wellspring of knowledge, and I look forward to joining in the discussion!