Would you mind if I rewrote this in a less “manic” tenor, keeping the content and mood largely the same, and reposted? I like this essay and think the core of what you’re suggesting is reasonable, for reasons both stated and unstated, but I would like to try to say it differently, in a way that I think will be taken better.
lc
What an actually pessimistic containment strategy looks like
A non-magical explanation of Jeffrey Epstein
You can just spontaneously call people you haven’t met in years
POC || GTFO culture as partial antidote to alignment wordcelism
Stop posting prompt injections on Twitter and calling it “misalignment”
The U.S. is becoming less stable
My simple AGI investment & insurance strategy
Announcing $5,000 bounty for (responsibly) ending malaria
Yes, AI research will be substantially curtailed if a lab causes a major disaster
There are certain claims here that are concretely bad, but they’re also mixed in confusingly with what seem like nonsense complaints that are just… the reality of people spending extended time with other people, like:
“My roommates didn’t buy me vegan food while I was sick”
“Someone gives a lot of compliments to me but I don’t think they’re being genuine”
“I feel ‘low-value’”
If someone is being defrauded, yeah that’s one thing, but I’d rather not litigate “Is Kat/Emerson an asshole” in the court of public opinion.
In defense of flailing, with foreword by Bill Burr
FWIW you taking off the Mr. Nice guy gloves has actually made me make different life decisions. I’m glad you tried it even if it doesn’t work.
Addendum: A non-magical explanation of Jeffrey Epstein
If he is looking at as many profiles as he can and swiping right a reasonable portion of the time, this should never happen. As a paying customer, a large portion of the women he swipes right on will see his profile. He can view hundreds of profiles a day.
So if you go even one month with zero preliminary matches, and you are neither super selective nor unusually ugly, you know you are doing something very wrong with your profile or your profile pictures.
This is extremely embarrassing to admit, but it seems like an important anecdote, so I’ll say it anyways: in 2022 I spent a sizable amount of money on a “paid dating app service”. What that means is, I got on a call with a dating advisor who asked me questions for a few hours and took notes. They helped me pick out new clothing, arranged a photographer to take my profile photos (which they vetted on Photofeeler), and helped me write and set up profiles on Hinge, OkCupid, Tinder, Bumble, and Coffee Meets Bagel, for which I also elected to get premium subscriptions.
Then the service had another worker use those apps on my behalf full-time, around 50 hours a week, swiping on anything that seemed reasonable and then arranging text communication so that I could set up dates. While I was using this service, I checked in regularly to make sure that this worker was doing their job, that they were viewing new profiles, and that the conversations seemed solid.
I didn’t get literally zero matches, but after six months, I had only gone on one date. My advisor was so embarrassed by my lack of success that when I told her I had to cancel for obvious reasons, she gave me free extensions for about two and a half months, until she finally couldn’t swing that anymore.
I don’t think I’m particularly ugly; I’m around 23 and I’ve been in three relationships, with women who were not model gorgeous but (I think) were fairly attractive, one of whom I dated for six years. But I tried the entire motley crew of apps under what seemed like unnaturally strong conditions, and it just didn’t work.
I upvoted your post because it seems relatively lucid and raises some important points, but would like to say that I’m in the middle of writing a pretty long, detailed explanation of why I agree with most of the gripes (e.g. AIs can’t use magic to mine coal/build nanobots) and yet the object-level conclusions here are still untrue. In practice, I seriously doubt we would have more than a year to live after the release of AGI with the long term planning and reasoning abilities of most accountants, even without FOOM. People here shouldn’t assume that, because Eliezer never posted a detailed analysis on LessWrong, everyone on the doomer train is starting from unreasonable premises regarding how robot building and research could function in practice.
You say
Eliezer sounds good whenever he’s talking about a topic that I don’t know anything about.
But then you go on to talk about a bunch of philosophy & decision theory questions that no one has actual “expertise” in, except the sort that comes from reading other people talk about the thing. I was hoping Eliezer had said something about, say, carpentry that you disagreed with, because then the dispute would be much more obvious and concrete. As it stands, I disagree with your reasoning on the sample of questions I scanned, and so it seems to me like this is sufficient to explain the dispute.
Suppose you’re in middle school, and one day you learn that your teachers are planning a mandatory field trip, during which the entire grade will jump off of a skyscraper without a parachute. You approach a school administrator to talk to them about how dangerous that would be, and they say, “Don’t worry! We’ll all be wearing hard hats the entire time.”
Hearing that probably does not reassure you even a little bit, because hard hats alone would not nudge the probability of death below ~100%. It might actually make you more worried, because the fact that they have a prepared response means school administrators were aware of potential issues and then decided the hard hat solution was appropriate. It’s generally harder to argue someone out of believing in an incorrect solution to a problem than into believing the problem exists in the first place.
This analogy overstates the obviousness of (and my personal confidence in) the risk, but to a lot of alignment researchers it’s an essentially accurate metaphor for how ineffective they think OpenAI’s current precautions will turn out in practice, even if making a doomsday AI feels like a more “understandable” mistake.
When you take this idea seriously and commit to stopping this with all your heart, you get Ziz.
No, you don’t, because Ziz-style violence is completely ineffective at improving animal welfare. It’s dramatic and self-destructive, and it may loudly signal their factional leanings, but that doesn’t make it accomplish the thing in question.
Further, none of the murders & attempted murders the gang has committed so far seem to have targeted factory farm workers, so I don’t understand this idea that Ziz is motivated by ambitions of political terrorism at all. Reading their posts, it sounds more like Ziz misunderstood decision theory as saying “retaliate aggressively all the time” and started a cult around that.
I’ve been impressed lately by how, while the EA forum has become basically overrun with useless scandal discussion, LessWrong has stayed virtually unafflicted. I think I’m the only person who ever commented about the Bostrom fiasco (in a shortform), and I feel bad about that and won’t do so again. We must preserve our garden of autistic truth seeking and alignmentposts.