Wei Dai 19 Oct 2020 7:12 UTC
125 points
0
on: A tale from Communist China
Lessons I draw from this history:
1. To predict a political movement, you have to understand its social dynamics and not just trust what people say about their intentions, even if they’re totally sincere.
2. Short term trends can be misleading so don’t update too much on them, especially in a positive direction.
3. Lots of people who thought they were on the right side of history actually weren’t.
4. Becoming true believers in some ideology probably isn’t good for you or the society you’re hoping to help. It’s crucial to maintain empirical and moral uncertainties.
5. Risk tails are fatter than people think.

AI Safety “Success Stories”

Wei Dai7 Sep 2019 2:54 UTC

124 points

27 comments4 min readLW link 1 review

Shut Up and Divide?

Wei Dai9 Feb 2010 20:09 UTC

113 points

276 comments1 min readLW link

[Question] Where are people thinking and talking about global coordination for AI safety?

Wei Dai22 May 2019 6:24 UTC

112 points

22 comments1 min readLW link

A broad basin of attraction around human values?

Wei Dai12 Apr 2022 5:15 UTC

109 points

17 comments2 min readLW link

Wei Dai 24 Jul 2009 23:02 UTC
102 points
in reply to: Wei Dai’s comment on: Eliezer Yudkowsky Facts
- It was easier for Eliezer Yudkowsky to reformulate decision theory to exclude time than to buy a new watch.
- Eliezer Yudkowsky’s favorite sport is black hole diving. His information density is so great that no black hole can absorb him, so he just bounces right off the event horizon.
- God desperately wants to believe that when Eliezer Yudkowsky says “God doesn’t exist,” it’s just good-natured teasing.
- Never go in against Eliezer Yudkowsky when anything is on the line.
What links here?
- Agreeing With Stalin in Ways That Exhibit Generally Rationalist Principles by Zack_M_Davis (2 Mar 2024 22:05 UTC; 37 points)

[Question] Why is so much discussion happening in private Google Docs?

Wei Dai12 Jan 2019 2:19 UTC

100 points

22 comments1 min readLW link

Wei Dai 14 Apr 2023 19:19 UTC
99 points
76
on: Moderation notes re: recent Said/Duncan threads
(Tangentially) If users are allowed to ban other users from commenting on their posts, how can I tell when the lack of criticism in the comments of some post means that nobody wanted to criticize it (which is a very useful signal that I would want to update on), or that the author has banned some or all of their most prominent/frequent critics? In addition, I think many users may be mislead by lack of criticism if they’re simply not aware of the second possibility or have forgotten it. (I think I knew it but it hasn’t entered my conscious awareness for a while, until I read this post today.)

(Assuming there’s not a good answer to the above concerns) I think I would prefer to change this feature/rule to something like allowing the author of a post to “hide” commenters or individual comments, which means that those comments are collapsed by default (and marked as “hidden by the post author”) but can be individually expanded, and each user can set an option to always expand those comments for themselves.

Wei Dai 27 Feb 2020 22:08 UTC
99 points
in reply to: Wei Dai’s comment on: Open & Welcome Thread—February 2020
The option I bought is up 700% since I bought them, implying that as of 2/10/2020 the market thought there was less than ¹⁄₈ chance things would be as bad as they are today. At least for me this puts a final nail in the coffin of EMH.

Added on Mar 24: Just in case this thread goes viral at some point, to prevent a potential backlash against me or LW (due to being perceived as caring more about making money than saving lives), let me note that on Feb 8 I thought of and collected a number of ideas for preventing or mitigating the pandemic that I foresaw and subsequently sent them to several people working in pandemic preparedness, and followed up with several other ideas as I came across them.
What links here?

Two Neglected Problems in Human-AI Safety

Wei Dai16 Dec 2018 22:13 UTC

98 points

24 comments2 min readLW link

Tips and Tricks for Answering Hard Questions

Wei Dai17 Jan 2010 23:56 UTC

97 points

54 comments2 min readLW link

Wei Dai 18 Mar 2019 7:59 UTC
LW: 95 AF: 24
0
AF
on: More realistic tales of doom
I think AI risk is disjunctive enough that it’s not clear most of the probability mass can be captured by a single scenario/story, even as broad as this one tries to be. Here are some additional scenarios that don’t fit into this story or aren’t made very salient by it.
1. AI-powered memetic warfare makes all humans effectively insane.
2. Humans break off into various groups to colonize the universe with the help of their AIs. Due to insufficient “metaphilosophical paternalism”, they each construct their own version of utopia which is either directly bad (i.e., some of the “utopias” are objectively terrible or subjectively terrible according to my values), or bad because of opportunity costs.
3. AI-powered economies have much higher economies of scale because AIs don’t suffer from the kind of coordination costs that humans have (e.g., they can merge their utility functions and become clones of each other). Some countries may try to prevent AI-managed companies from merging for ideological or safety reasons, but others (in order to gain a competitive advantage on the world stage) will basically allow their whole economy to be controlled by one AI, which eventually achieves a decisive advantage over the rest of humanity and does a treacherous turn.
4. The same incentive for AIs to merge might also create an incentive for value lock-in, in order to facilitate the merging. (AIs that don’t have utility functions might have a harder time coordinating with each other.) Other incentives for premature value lock-in might include defense against value manipulation/corruption/drift. So AIs end up embodying locked-in versions of human values which are terrible in light of our true/actual values.
5. I think the original “stereotyped image of AI catastrophe” is still quite plausible, if for example there is a large amount of hardware overhang before the last piece of puzzle for building AGI falls into place.
What links here?