Yitz

Karma: 2,416

I’m an artist, writer, and human being.

To be a little more precise: I make video games, edit Wikipedia, and write here on LessWrong!

Public-facing Censorship Is Safety Theater, Causing Reputational Damage

Yitz23 Sep 2022 5:08 UTC

149 points

42 comments6 min readLW link

Testing PaLM prompts on GPT3

Yitz6 Apr 2022 5:21 UTC

103 points

14 comments8 min readLW link

[Question] Convince me that humanity is as doomed by AGI as Yudkowsky et al., seems to believe

Yitz10 Apr 2022 21:02 UTC

92 points

141 comments2 min readLW link

An Introduction To The Mandelbrot Set That Doesn’t Mention Complex Numbers

Yitz17 Jan 2024 9:48 UTC

81 points

11 comments9 min readLW link

[Linkpost] Solving Quantitative Reasoning Problems with Language Models

Yitz30 Jun 2022 18:58 UTC

76 points

15 comments2 min readLW link

(storage.googleapis.com)

I Am Scared of Posting Negative Takes About Bing’s AI

Yitz17 Feb 2023 20:50 UTC

63 points

27 comments1 min readLW link

[Question] What’s the contingency plan if we get AGI tomorrow?

Yitz23 Jun 2022 3:10 UTC

61 points

23 comments1 min readLW link

[Question] Convince me that humanity isn’t doomed by AGI

Yitz15 Apr 2022 17:26 UTC

61 points

49 comments1 min readLW link

Noting an unsubstantiated communal belief about the FTX disaster

Yitz13 Nov 2022 5:37 UTC

50 points

52 comments1 min readLW link

What’s up with the bad Meta projects?

Yitz18 Aug 2022 5:34 UTC

42 points

29 comments1 min readLW link

Yitz 9 Jun 2020 3:16 UTC
41 points
on: Open & Welcome Thread—June 2020
Hi, I joined because I was trying to understand Pascal’s Wager, and someone suggested I look up “Pascal’s mugging”… next thing I know I’m a newly minted HPMOR superfan, and halfway through reading every post Yudkowsky has ever written. This place is an incredible wellspring of knowledge, and I look forward to joining in the discussion!

The Problem With The Current State of AGI Definitions

Yitz29 May 2022 13:58 UTC

40 points

22 comments8 min readLW link

Yitz 6 Apr 2022 2:30 UTC
37 points
in reply to: Daniel Kokotajlo’s comment on: The case for Doing Something Else (if Alignment is doomed)
If that is the case, then I would very much like them to publicize the details for why they think other approaches are doomed. When Yudkowsky has talked about it in the past, it tends to be in the form of single-sentence statements pointing towards past writing on general cognitive fallacies. For him I’m sure that would be enough of a hint to clearly see why strategy x fits that fallacy and will therefore fail, but as a reader, it doesn’t give me much insight as to why such a project is doomed, rather than just potentially flawed. (Sorry if this doesn’t make sense btw, I’m really tired and am not sure I’m thinking straight atm)

Yitz 3 Apr 2022 2:38 UTC
36 points
in reply to: P.’s comment on: MIRI announces new “Death With Dignity” strategy
Certainly for some people (including you!), yes. For others, I expect this post to be strongly demotivating. That doesn’t mean it shouldn’t have been written (I value honestly conveying personal beliefs and are expressing diversity of opinion enough to outweigh the downsides), but we should realistically expect this post to cause psychological harm for some people, and could also potentially make interaction and PR with those who don’t share Yudkowsky’s views harder. Despite some claims to the contrary, I believe (through personal experience in PR) that expressing radical honesty is not strongly valued outside the rationalist community, and that interaction with non-rationalists can be extremely important, even to potentially world-saving levels. Yudkowsky, for all of his incredible talent, is frankly terrible at PR (at least historically), and may not be giving proper weight to its value as a world-saving tool. I’m still thinking through the details of Yudkowsky’s claims, but expect me to write a post here in the near future giving my perspective in more detail.

Yitz 8 Feb 2023 6:57 UTC
35 points
8
in reply to: gwern’s comment on: SolidGoldMagikarp (plus, prompt generation)

that’s probably exactly what’s going on. The usernames were so frequent in the reddit comments dataset that the tokenizer, the part that breaks a paragraph up into word-ish-sized-chunks like ” test” or ” SolidGoldMagikarp” (the space is included in many tokens) so that the neural network doesn’t have to deal with each character, learned they were important words. But in a later stage of learning, comments without complex text were filtered out, resulting in your usernames getting their own words… but the neural network never seeing the words activate. It’s as if you had an extra eye facing the inside of your skull, and you’d never felt it activate, and then one day some researchers trying to understand your brain shined a bright light on your skin and the extra eye started sending you signals. Except, you’re a language model, so it’s more like each word is a separate finger, and you have tens of thousands of fingers, one on each word button. Uh, that got weird,

This is an incredible analogy

Null-boxing Newcomb’s Problem

Yitz13 Jul 2020 16:32 UTC

33 points

9 comments4 min readLW link

Short story speculating on possible ramifications of AI on the art world

Yitz1 Sep 2022 21:15 UTC

30 points

8 comments3 min readLW link

(archiveofourown.org)

[Question] What could one do with truly unlimited computational power?

Yitz11 Nov 2020 10:03 UTC

30 points

22 comments2 min readLW link

[Question] What are some low-cognitive -workload tasks that can help improve the world?

Yitz1 Mar 2022 17:47 UTC

29 points

11 comments1 min readLW link

A Tentative Timeline of The Near Future (2022-2025) for Self-Accountability

Yitz5 Dec 2022 5:33 UTC

26 points

0 comments4 min readLW link