Does anyone here have good posts/articles on proliferation risks attached to embracing nuclear as a source of energy? Either on this site or otherwise.
I've read a few articles but haven't formed a strong opinion for or against nuclear on this point. There's definitely a pro-nuclear slant on this site, hence I wished to know people's reasoning.
Is there an LW post on "things in our brain we can edit using choice/free will, versus things we can't", either as two distinct categories or as a spectrum?
For instance I feel like: I can choose to lift my leg instantly, I can't choose to fall out of love with someone instantly, I can choose to do this over a period of time, I can't choose to enjoy eating shit. This seems unambiguous irrespective of your stance on free will, and I often find myself having to refer to these two categories.
If you read https://meaningness.com there’s probably some stuff in there about ‘partial control’. (Though I haven’t double checked what is where since https://metarationality.com got broken off into a separate website.)
It’s critical of a lot of stuff on LW in a particular, reasoned fashion.
LessWrong isn’t exactly founded on the map-territory model of truth, but it’s definitely pretty core to the LessWrong worldview. The map-territory model implies a correspondence theory of truth. But I’d like to convince you that the map-territory model creates confusion and that the correspondence theory of truth, while appealing, makes unnecessary claims that infect your thinking with extraneous metaphysical assumptions. Instead we can see what’s appealing about the map-territory metaphor but drop most of it in favor of a more nuanced and less confused model of how we know about the world.
Thanks. I'll check out the book. "Partial control" seems to be exactly what I'm referring to. Although the book does seem to be on a slightly different topic, and I haven't heard of the author. Do you by any chance have a link to a summary or review?
I'm not criticising anything on LW btw (not here at least). It's just that, even if you assume the naturalist compatibilist stance that LW assumes, you still need a phrase to refer to things that feel like they're in or not in our control; you still need to talk about the first-person experience.
Do you by any chance have a link to a summary or review?
There are some summaries in the (hyper text) book, but they’re probably too short to give an overview.
I could write a review, but I’d probably want to PM rather than post that.
One reason I haven't written a review is that the book is easy to read, and short enough that a review probably wouldn't save you much time, at least not without some real summarizing work.
It is, however, not as short as things could be with skimming. I figure the first two sections are more prerequisites than things which go off somewhere else.
Something rather fast, which might fail, is just reading the page: https://meaningness.com/control. Maybe it has what you’re looking for there, maybe it doesn’t. If you have any questions feel free to PM me.
I found that page using this search: site:meaningness.com partial control
The next two hits from that search each contain only one match for a page-search for "partial", at the very end. Beyond that, the hits seem to come from a repeated phrase, which is just a short summary of a section in the table of contents.
I don't have a map of what is a prerequisite of what. But, assuming that that's handled if you read it in order: 'partial control' is addressed in https://meaningness.com/control. I'd guess you can read section one (Why meaningness? and its 4 sub-pages), then section 2 (Stances and its roughly 14 sub-pages). The page on control is late in section 3. The preceding sub-pages of section 3 don't have sub-pages of their own, but eternalism has 3 sub-sub-pages of some kind or another before it.
(Things that might change: https://meaningness.com/meaningness-practice, which comes late in part 2, contains a subsection that is currently 100 words but could become more relevant to what you're looking for if it's updated, or if pages are added after that page. That possibility is mentioned in a note from July 2014 though, so if you read this soon, I'm guessing it won't have happened by then.)
Here's how long the first two sections are (using https://wordcount.com, copying the text in, adding a return, a "-", and another two returns each time, and not the table of contents on the left because it's repeated. Using the URL directly led to a massive overcount by at least an order of magnitude, possibly from some built-in recursive counting of the links, so I didn't use that):
This does not include the comments (a separate page, which you don’t have to read, but might be useful if you’re confused or have questions, though such things might also be answered later on in the book).
12,154 Words
80,013 Characters
67,310 Characters without space
21,985 Syllables
827 Sentences
487 Paragraphs*
The paragraph count includes every "-" I added (all 18 of them), so it's actually:
469 Paragraphs
If you figure section 3 is 75% the length of all the stuff before it, then, including the page on control, that's roughly 20,000 words in total.
Assume you could use magic that could progressively increase the intelligence of exactly one human in a “natural way”. * You need to pick one person (not yourself) who you trust a lot and give them some amount of intelligence. Your hope is they use this capability to solve useful problems that will increase human wellbeing broadly.
What is the maximum amount of intelligence you’d trust them with?
*when I say natural way I mean their neural circuits grow, maintain and operate in a manner similar to how they already naturally work for humans, biologically. Maybe the person just has more neurons that make them more intelligent, instead of a fundamentally different structure.
I guess the two key considerations are:
whether there exists any natural form of cognitive enhancement that doesn't also cause significant value drift
whether you’d trust a superpowerful human even if their values seem mostly good and they don’t drift
Any web dev here wanna host a tool that lets you export your account data from this site? I’ve mostly figured it out, I’m just being lazy. Need to use graphQL queries, then write to files, then I guess upload the files to db and zip, and let the user download the zip.
For posts, need a graphQL query, then dump each htmlBody into a separate file. No parsing required, I hope.
{
posts(input: {
terms: {
view: "userPosts"
userId: "nmk3nLpQE89dMRzzN"
limit: 50
meta: null # this seems to get both meta and non-meta posts
}
}) {
results {
_id
title
pageUrl
postedAt
htmlBody
voteCount
baseScore
slug
}
}
}
For comments, need a graphQL query, then dump each html body into an individual file. (Although I'm not entirely sure what one will do with thousands of comment files.)
Plus some loops to iterate if the limit is too large. And handle errors. Plus some way to share the credentials securely—or make it into a browser plugin.
Has anyone ever published a comprehensive piece of the form:
"I assign X probability to AI alignment being a solvable problem (in theory), and here's the set of intuitions / models / etc. that I base this estimate on"
--
Cause my intuitions point in the opposite direction, but maybe that’s just because I’m missing intuitions of people here.
Logical uncertainty is hard. But the intuition that I have is that humans exist, so there’s at least a proof of concept for a sort of aligned AGI (although admittedly not a proof of concept for an ASI)
But if your definition of alignment is "an AI that does things in a way such that all humans agree on its ethical choices" I think you're doomed from the start, so this counterintuition proves too much. I don't think there is an action an AI could take or a recommendation it could make that would satisfy that criterion (in fact, many people would say that the AI by its nature shouldn't be taking actions or making recommendations).
Okay. I’d be keen on your definition of alignment.
P.S. This discussion which we’re having right now is exactly what I’d be keen on, in a compressed fashion, written by an alignment researcher who has anticipated lots of intuitions and counterintuitions.
It seems like something like “An AI that acts and reasons in a way that most people who are broadly considered moral consider moral” would be a pretty good outcome.
Assume AI is sufficiently capable it can establish a new world order all by itself. I wouldn’t trust most people I otherwise consider moral with such power.
Epistemic status: I haven’t spent too much time on this, I could easily be missing stuff here.
Problem
Whenever a new post is made on a website (reddit / stackexchange / lesswrong etc.), upvotes act as an initial measure of how engaging and high-quality the post is. This initial vetting by a few users allows the remaining users to access a feed of already high-quality posts.
This model, however, still requires a few users to willingly go through unvetted posts and upvote the good-quality ones. It would be useful to reduce the amount of input required to get a measure of quality.
Solution
Users could stake money on how many upvotes they think a post will receive. This money acts as an initial signal to show the post to more users. If more users see the post and upvote it, the money is returned to whoever staked it. If they don't, the money could be taken by the website (or distributed to all the users on the website, or at least the users who saw the low-quality post).
The reverse can also happen: users can stake money claiming a post will not receive upvotes.
This isn't a prediction market in the traditional sense, where people who bet higher votes bet only against those who bet lower votes. Instead you're betting against the website directly. Why? Because the website benefits from good-quality content and is hurt by bad-quality content.
This differs from the standard advertisement model, because in that model you pay whether the content is engaging for users (good for the website) or not engaging (bad for the website). This automatically means advertisers will more often use that channel to push content that does not engage users much: blatant ads.
Considerations
You don't want the users staking money to gatekeep the content of the website too much. More specifically, if there's a dynamic relationship between [money staked, impressions and upvotes], you don't want money staked alone to impact impressions so much that you no longer get a good signal from upvotes; you still need that signal. The model goes both ways: more impressions can mean more upvotes, but more upvotes or more money staked can also mean more impressions. Maybe instead of betting on the raw number of upvotes it'll make sense to bet on the upvote-to-impression ratio or something. People who study ad models probably have a better sense of how to model this mathematically.
- bot accounts—Can use captcha and/or karma and/or KYC to detect them. (I wonder if KYC will have to be mandatory though.)
- bribing real users—Users will need to coordinate the bribe on a different website, trust that the bribe will actually pay through, and it’s hard to scale this without someone posting evidence back to the mods of original site.
- driving genuine traffic to the site in the hopes that people will upvote—hence need to track vote-to-impression ratio or similar rather than just votes. And can again privilege the votes of longstanding community members (like lesswrong karma does)
Is there an LW post on: "Why it is a good idea to google / discuss concrete examples instead of thinking / talking purely in the abstract"?
Like, I was reading up on the number of ICBMs in different countries and military policies and stuff, but it helps to pull up a youtube video of an actual missile test launch, just to get a sense of how big the thing is and what a launch looks like in practice.
MESSY INTUITIONS ABOUT AGI, MIGHT TYPE THEM OUT PROPERLY LATER
OR NOT
I’m sure we’re a finite number of voluntary neurosurgeries away from worshipping paperclip maximisers. I tend to feel we’re a hodge-podge of quick heuristic modules and deep strategic modules, and until you delete the heuristic modules via neurosurgery our notion of alignment will always be confused. Our notion of superintelligence / super-rationality is an agent that doesn’t use the bad heuristics we do, people have even tried formalising this with Solomonoff / Turing machines / AIXI. But when actually coming face to face with one:
- Either we are informed of the consequences of the agent’s thinking and we dislike those, because those don’t match our heuristics
- Or the AGI can convince us to become more like it, to the point we can actually agree with its values. The fastest way to get there is neurosurgery, but if we initially feel neurosurgery is too invasive, I'm sure there exists another, much more subtle path that the AGI can take: namely one where we want our values to be influenced in ways that eventually end up with us getting closer to the neurosurgery table.
- Or of course the AGI doesn't bother to even get our approval (the default case), but I'm ignoring that and considering far more favourable situations.
We don't actually have "values" in an absolute sense, we have behaviours. Plenty of Turing machines have no notion of "values", they just have behaviour given a certain input. "Values" are this fake variable we create when trying to model ourselves and each other. In other words the Turing machine has a model of itself inside itself; that's how we think about ourselves (metacognition). So a mini-Turing machine inside a Turing machine. Of course the mini-machine has some portions deleted, it is a model. First of all this is physically necessitated. But more importantly, you need a simple model to do high-level reasoning on it in short amounts of time. So we create this singular variable called "values" to point to what is essentially a cluster in thingspace. Let's say the Turing machine tends to increment its 58th register on only 0.1% of all possible 24-bit-string inputs, and otherwise tends to decrement a lot more. The mini-Turing machine inside the machine modelling itself will just have some equivalent of the 58th register never incrementing at all, and decrementing instead. So now the Turing machine incorrectly thinks its 58th register never increments. So it thinks that decrementing the 58th register is a "value" of the machine.
[Meta note: When I say "value" here, I'd like to still stick to a viewpoint where concepts like "free will", "choice", "desire" and "consciousness" are taboo. Basically I have put on my reductionist hat. If you believe free will and determinism are compatible, you should be okay with this, as I'm just consciously restricting the number of tools/concepts/intuitions I wish to use for this particular discussion, not adopting any incorrect ones. You can certainly reintroduce your intuitions in a different discussion, but in a compatibilist world, both our discussions should generally lead to true statements.
Hence in this case, when the machine thinks of "decrementing its 58th register" as its own value, I'm not referring to concepts like "I am driven to decrement my 58th register" or "I desire to decrement my 58th register" but rather "Decrementing the 58th register is something I do a lot." And since "value" is a fake variable that the Turing machine has full liberty to define, it says ""Value" is defined by the things I tend to do." When I say "fake" I mean it exists in the Turing machine's model of itself, the mini-machine.
"Why do I do the things I do?" or "Can I choose what I actually do?" are not questions I'm considering, and for now let's assume the machine doesn't bother itself with such questions (although in practice it certainly may end up asking itself such terribly confused questions, if it is anything like human beings; this doesn't really matter right now).
End note]
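A toy sketch of the register-58 story above, in case it helps. The 0.1% rule and the sample size are just made-up numbers standing in for the example; nothing here is meant as a real model of metacognition.

import random

def true_machine(input_bits: int) -> int:
    # The "real" behaviour: increment register 58 on roughly 0.1% of inputs,
    # decrement otherwise. (An arbitrary rule standing in for the example.)
    return +1 if input_bits % 1000 == 0 else -1

def self_model(sampled_inputs) -> str:
    # The mini-machine: a lossy summary of the true behaviour, built from a
    # small sample, rounding rare behaviour down to "never".
    increments = sum(1 for x in sampled_inputs if true_machine(x) == +1)
    if increments == 0:
        return "my value: I always decrement register 58"
    return "my value: I sometimes increment register 58"

sample = random.sample(range(2**24), 200)  # a small sample usually misses the 0.1%
print(self_model(sample))                  # so the self-ascribed "value" is usually wrong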
I’m gonna assume a single scale called “intelligence” along which all Turing machines can be graded. I’m not sure this scale actually even exists, but I’m gonna assume it anyway. On this scale:
Humans <<< AGI <<< AIXI-like ideal
<<< means “much less intelligent than” or “much further away from reasoning like the AIXI-like ideal”, these two are the same thing for now, by definition.
An AGI trying to model human beings won’t use such a simple fake variable called “values”, it’ll be able to build a far richer model of human behaviour. It’ll know all about the bad human heuristics that prevent humans from becoming like an AGI or AIXI-like.
Even if the AGI wants us to be aligned, it’s just going to do the stuff in the first para. There are different notions of aligned:
Notion 1: "I look at another agent superficially and feel we want the same things." In other words, my fake variable called "my values" is sufficiently close to my fake variable called "this agent's values". I will necessarily be creating such simple fake variables if I'm stupid, i.e., if I'm human, cause all humans are stupid relative to the AIXI-like ideal.
An AGI that optimises to satisfy notion 1 can hide what it plans to do to humans and maintain a good appearance until it kills us without telling us.
Notion 2: “I get a full picture of what the agent intends to do and want the same things”
This is very hard, because my heuristics tell me all the consequences the AGI plans to bring about are bad. The problem is my heuristics. If I didn't have those heuristics, if I was closer to the AIXI-like ideal, I wouldn't mind. Again, "I wouldn't mind" is from the perspective of machines, not consciousness or desires, so translate it to "the interaction of my outputs will be favourable towards the AGI's outputs in the real world".
So the AGI will find ways to convince us to want our values neurosurgically altered. Eventually we will both be clones and hence perfectly aligned.
Now let’s bring stuff like “consciousness” and “desires” and “free will” back into the picture. All this stuff interacts very strongly with the heuristics, exactly the things that make us further away from the AIXI-like ideal.
Simply stated, we don't naturally want to be ideal rational agents. We can't want it, in the sense that we can't get ourselves to truly, consistently want to be ideal rational agents by sheer willpower; we need physical intervention like neurosurgery. Even if free will exists and is a useful concept, it has finite power. I can't delete sections of my brain using free will alone.
So now if intelligence is defined as closer to AIXI-like ideal in internal structure, then intelligence by definition leads to misalignment.
P.S. I should probably also throw some colour on what kinds of "bad heuristics" I am referring to here. In simplest terms, "low Kolmogorov complexity behaviours that are very not AIXI-like".
1/ For starters, the entirety of System 1 and sensory processing (see Daniel Kahneman, Thinking, Fast and Slow). We aren't designed to maximise our intelligence, we just happen to have an intelligence module (aka somewhat AIXI-like). Things we care about sufficiently strongly are designed to override System 2, which is more AIXI-like, insofar as evolution has any design for us. So maybe it's not even "bad heuristics" here, it's entire modules in our brain that are not meant for thinking in the first place. It's just neurochemicals firing and one side winning and the other side losing; this system looks nothing like AIXI. And it's how we deal with most life-and-death situations.
This stuff is beyond the reach of free will. I can't stop reacting to snakes out of sheer will, or hate chocolate, or love to eat shit. Maybe I can train myself on snakes, but I can't train myself to love to eat shit. The closer you are to the sensory apparatus and the further away from the brain, the less the system looks like AIXI. And simultaneously the less free will you seem to have.
(P.S. Is that a coincidence or is free will / consciousness really an emergent property of being AIXI-like? I have no clue, it’s again the messy free will debates that might get nowhere)
2/ Then of course, at the place where System 1 and System 2 interact, you can actually observe behaviour that makes us further away from AIXI-like. Things like why we find it difficult to have independent thoughts that go against the crowd. Even if we do have independent thoughts, we need to spend a lot more energy actually developing them further (versus thinking of hypothetical arguments to defend those ideas in society).
This stuff is barely within the reach of our "free will". Large portions of LessWrong are an attempt at training us to be more AIXI-like, by reducing so-called cognitive biases and socially-induced biases.
3/ Then we have proper deep thinking (System 2 and such-like) which seems a lot closer to AIXI. This is where we move beyond “bad heuristics” aka “heuristics that AIXI-like agents won’t use”. But maybe an AGI will find these modules of ours horribly flawed too, who knows.
Anyone proposing that building AGI should be banned by governments?
Cause it seems like even if AGI alignment is possible (I'm skeptical), there's no guarantee the person who happens to create AGI also happens to want to follow this perfect solution. Or, even if they want to, that they should have a moral right to decide on behalf of their country or humanity as a whole. Nation states pre-committing to "building AGI is evil" seems a better solution. It might also slow down the rate of progress in AI capabilities, which I'm guessing is also desirable to some alignment theorists.
I haven’t seen any government, let alone the set of governments, demonstrate any capability of commitment on this kind of topic. States (especially semi-representative ones like modern democracies) just don’t operate with a model that makes this effective.
I don’t know if it is or not. Human cloning seems both less useful and less harmful (just less impactful overall), so simultaneously easier to implement and not a good comparison to AGI.
I see cloning-based research as very impactful, it’s also a route to getting more intelligent beings to exist. I’d be hard-pressed to find something as impactful as AGI though.
Also I'm not sure about "less useful". Given a world where AI researchers know that alignment is hard or impossible, they might see human cloning as more useful than AGI. Unless you mean AGI's perceived usefulness is higher, which may be true today but maybe not in the future.
I’m not following the connection between human cloning and AGI. Are you talking about something different from https://en.wikipedia.org/wiki/Human_cloning , where a baby is created with only one parent’s genetic material?
To me, human cloning is just an expensive way to make normal babies.
Yep, referring to exactly that. You can keep cloning the most intelligent people. At enough scale you'll be increasing the collective intelligence of mankind, and its scientific output. Since these clones will hopefully retain basic human values, you now have more intelligence with alignment.
Do you have any reason to believe that this is happening AT ALL? I’d think the selection of who gets cloned (especially when it’s illicit, but probably even if it were common) would follow wealth more than intelligence.
Selective embryo implantation based on genetic examination of two-parent IVF would seem more effective, and even that’s not likely to do much unless it becomes a whole lot more common, and if intelligence were valued more highly in the general population.
Since these clones will hopefully retain basic human values
Huh? Why these any more than the general population? The range of values and behaviors found in humans is very wide, and “basic human values” is a pretty thin set.
Most importantly, a 25-year improvement cycle, with a mandatory 15-20 year socialization among many many humans of each new instance is just not as scary as an AGI with an improvement cycle under a year (perhaps much much faster), and with direct transmission of models/beliefs from previous generations. Just not comparable.
Do you have any reason to believe that this is happening AT ALL?
Wasn’t talking about today, just an arbitrary point in the future.
I’d think the selection of who gets cloned (especially when it’s illicit, but probably even if it were common) would follow wealth more than intelligence.
I was commenting that it has a lot of power and potential benefits if groups of people wield it; whether they actually will is a different question.
On the latter question, you're right of course: different groups of people will select for different traits. I would assume there will exist at least some groups of intelligent people who will want to select for intelligence further. There is a competitive advantage for nations that legalise this.
re: last 2 paras, I’m not sure we understood each other. I’ll try again. Intelligence is valuable towards building stable paths to survival, happiness, prosperity. AGI will be much more intelligent than selected humans. However, AGI will almost certainly kill us because of lack of alignment. (Assume a world where AGI researchers have accepted this as fact.) This makes AGI not very useful, on the balance of it.
Humans selected for intelligence are also valuable. They will be a lot less intelligent than AGI, of course. But they will (hopefully) be aligned enough with the rest of humanity to work for its welfare and prosperity. This makes selected humans very useful.
Hence selected humans could be more useful than AGI.
That’s a valid intuition—I’d be happy to learn why you feel that if you have time (no worries if not).
Would a non-democratic state like China or Russia fare better in this regard then? If one of them takes the issue seriously enough they could force other states to also take it seriously via carrot and stick.
Consider any innovation so world-changing that governments are not willing to let the creator have complete control over how it is used. For instance, a new super-cheap energy source such as fusion reactors.
Maximising profit from fusion reactors for instance could mean selling electricity at a price slightly lower than the current market price, waiting to monopolise the global power grid, waiting for all other power companies to shut down, then raising prices again, thereby not letting anyone actually reap the benefits of super-cheap electricity. It is unlikely that govts however will let the creator do this.
As someone funding early-stage fusion research, would you have to account for this in your investment thesis? That is, that there is some upper limit on how large a company can legally grow. So far it seems the cap is higher than $2.5 trillion at least, looking at Apple's market cap. Although it is possible that even a company smaller than $2.5 trillion monopolises a sector and is then prevented from price-fixing by the government.
I don’t think, at this scale, that “the government” is a useful model. There are MANY governments, and many non-government coalitions that will impact any large-scale system. The trick is delivering incremental value at each stage of your path, to enough of the selectorate of each group who can destroy you.
I get the article on How an algorithm feels from the inside—if you assume a deterministic universe and consciousness as something that emerges out but has no causal implications on the universe.
Now if I try drawing the causal arrows
Outside view:
Brain ← → Human body ← → Environment
In the outside view, “you” don’t exist as a coherent singular force of causation. Questions about free will and choice cease to exist.
Inside view (CDT):
Coherent singular “soul” (using CDT) ← → Brain ← → Human body ← → Environment
Notably, Yudkowsky would call this singular soul something that only exists in the inside view itself, and not in the outside view.
Now we replace this with …
Inside view (LDT):
Coherent singular “soul” (using CDT) ← → All instantiations of this cognitive algorithm across space and time ← → Brain(s) ← → Human body(s) ← → Environment
The new decision theories don't seem to eliminate the illusion of the soul—they just now assert that the soul not only interacts with this particular instantiation of the algorithm the brain is running, but with all instantiations of this algorithm across space (and time?). Why is this more reasonable than assuming the soul only interacts with this particular instantiation? Note that the soul is a fake property here, one that only exists in the inside view. And, to our best understanding of physical laws, the universe is made of atoms* in its causal chain, not algorithms. Two soulless algorithms don't causally interact by virtue of the fact that they're similar algorithms, they interact by virtue of the atoms they causally impact, and the causal impacts of those atoms on each other. Why do algorithms with the fake property of a soul suddenly get causally bound through space and time?
*well technically it's QM waves or strings or whatever, but that doesn't matter to this discussion
This doesn’t help. In a counterfactual, atoms are not where they are in actuality. Worse, they are not even where the physical laws say they must be in the counterfactual, the intervention makes the future contradict the past before the intervention.
Do I assume "counterfactual" is just the English word as used here?
If so, it should only exist in the inside view, right? (If I understand you.)
The sentence I wrote on soulless algorithms is about the outside view. Say two robots are playing football. The outside view is: one kicks the football, the other sees the football (light emitted by the football), then kicks it. So the only causal interaction between the two robots is via atoms. This is independent of what decision theory either robot is using (if any), and it is independent of whether the robots are capable of creating an internal mental model of themselves or the other robot. So it applies both to robots with dumb microcontrollers like those in a refrigerator and to smart robots that could even be AGI or have some ideal decision theory. At least assuming the universe follows the deterministic physical laws we know about.
The point is that the weirdness with counterfactuals breaking physical laws is the same for controlling the world through one agent (as in orthodox CDT) and for doing the same through multiple copies of an agent in concert (as in FDT). Similarly, in actuality neither one-agent intervention nor coordinated many-agent intervention breaks physical laws. So this doesn’t seem relevant for comparing the two, that’s what I meant by “doesn’t help”.
By “outside view” you seem to be referring to actuality. I don’t know what you mean by “inside view”. Counterfactuals are not actuality as normally presented, though to the extent they can be constructed out of data that also defines actuality, they can aspire to be found in some nonstandard semantics of actuality.
Do you mean the counterfactual may require more time to compute than the situation playing out in real time? If so, yep, makes a ton of sense; they should probably focus on algorithms or decision theories that can (at least in theory) be implemented in real life on physical hardware. But please confirm.
Could you please define “actuality” just so I know we’re on the same page? I’m happy to read any material if it’ll help.
Inside view and outside view I'm just borrowing from Yudkowsky's How an algorithm feels from the inside. Basically it assumes a deterministic universe following elegant physical laws, and tries to dissolve questions of free will / choice / consciousness. The outside view is just a state of the universe, or a state of the Turing machine. This object doesn't get to "choose" what computation it is going to do or what decision theory it is going to execute; that is already determined by its current state. So the future states of the object are calculable*.
*by an oracle that can observe the universe without interacting, with sufficient but finite time.
Only in the inside view does a question like “Which decision theory should I pick?” even make sense. In the inside view, free will and choice are difficult to reason about (as humans have observed over centuries) - if you really wanna reason about those you can go to the outside view where they cease to exist.
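A toy sketch of that last point, if it helps; the update rule below is an arbitrary made-up function, not anything physical. The point is just that a fixed rule plus a current state leaves nothing to "choose".

def step(state: int) -> int:
    # Hypothetical fixed transition rule; stands in for "the physical laws".
    return (3 * state + 1) % 17

def future_state(initial: int, t: int) -> int:
    # The "oracle" computation: just iterate the rule t times.
    s = initial
    for _ in range(t):
        s = step(s)
    return s

print(future_state(5, 10))  # fully determined by (initial state, t); no choosing anywhere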
Are there people who have read these four posts and still self-identify as either consequentialists or utilitarians? If yes, why so?
Because the impression I got from these posts (which also matches my independent explorations) is that humans have deontological rules wired into them due to evolution—this is just observable fact. And that you don’t really get to change that even if you want to.
Consider the trivial example: would you kill one person to save two? No one gets to know your decision, so there are no future implications for anyone else, no precedent being set. Even on this site there is a ton of deflection from honestly answering the question as is. Either "Yes, I will murder one person in cold blood" or "No, I will let the two people die". Assume the LCPW.
I believe that in the LCPW it would be the right decision to kill one person to save two, and I also predict that I wouldn’t do it anyway, mainly because I couldn’t bring myself to do it.
In general, I understood the Complexity of Value sequence to be saying “The right way to look at ethics is consequentialism, but utilitarianism specifically is too narrow, and we want to find a more complex utility function that matches our values better.”
Why do you feel it would be the right decision to kill one? Who defines “right”?
I personally understood it differently. Thou Art Godshatter says (to me) that evolution depends on consequences, but evolving consequentialism into a brain is hard, and therefore human desires are not wired consequentially. Also that evolution only cares about consequences that actually happen, not the ones it predicts will happen—because it cannot predict.
Why do you feel it would be the right decision to kill one? Who defines “right”?
I define “right” to be what I want, or, more exactly, what I would want if I knew more, thought faster and was more the person I wish I could be. This is of course mediated by considerations on ethical injunctions, when I know that the computations my brain carries out are not the ones I would consciously endorse, and refrain from acting since I’m running on corrupted hardware. (You asked about the LCPW, so I didn’t take these into account and assumed that I could know that I was being rational enough).
It’s been a while since I read Thou Art Godshatter and the related posts, so maybe I’m conflating the message in there with things I took from other LW sources.
Just FYI, I’ve become convinced that most online communication through comments with a lot of context are much better settled through conversations, so if you want, we could also talk about this over audio call.
I read some of the articles. Happy to get on a voice call if you prefer. My thoughts so far boil down to:
- Corrupted hardware seems to imply a clear distinction between goals (/ends/terminal goals) and actions towards goals (/means/instrumental goals), and that only actions are computed imperfectly. I say firstly we don’t have as sharp a distinction between the two in our brain’s wiring. (Instrumental goals often become terminal if you focus on them hard enough.) Secondly that it’s not actions but terminal goals themselves that are in conflict.
- We have multiple conflicting values. There’s no “rational” way to always decide what trumps what—sometimes it’s just two sections of the brain firing neurochemicals and one side winning, that’s it. System-2 is somewhat rational, System-1 not so much, and System-1 has more powerful rewards and penalties. System-1 preferences admit circular preferences, and there’s nothing you can do about it.
- "What I would want if I knew more, thought faster, etc." doesn't necessarily lead to one coherent place. You have multiple conflicting values, and which of those you end up deleting if you had the brain of a supercomputer could be arbitrary. You could become Murder Gandhi or some extreme happiness utilitarian; I don't see either of these as necessarily desirable places to be relative to my current state. Basically, I want to run on corrupted hardware. I don't want my irrational System-1 module deleted.
I’ve found people generally find it harder to answer open-ended questions. Not just in terms of giving a good answer but giving any answer at all. It’s almost as if they lack the cognitive module needed for such search.
Has anyone else noticed this? Is there any research on it? Any post on LessWrong or elsewhere?
Post-note: Now that I’ve finished writing, a lot of this post feels kinda “stupid”—or more accurately, not written using a reasoning process I personally find appealing. Nevertheless I’m going to post it just in case someone finds it valuable.
-----
I don't see a lot of shortform posts here, so I'm unsure of the format. But I'm in general thinking a lot about how you cannot entirely use reasoning to reason about the usefulness of various reasoning processes relative to each other. In other words, rationality is not closed.
Consider a theory in first-order logic: a specific set of axioms plus a set of deductive rules. For instance, take a first-order theory with the axioms "Socrates is mortal" and "Socrates is immortal". A first-order theory which is inconsistent, like this one, is obviously a bad reasoning process. A first-order theory which is consistent but whose axioms don't map to real-world assumptions is also a bad reasoning process, for different reasons. Lastly, someone can argue that first-order logic is a bad reasoning process no matter what axioms it is instantiated with, because axioms + rigid conclusions is a bad way of reasoning about the world, and that humans are not wired to do pure FOL and are instead capable of reaching meaningful conclusions without resorting to FOL.
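(As an aside, here is the standard worked derivation of why an inconsistent theory is so useless, writing "Socrates is immortal" as ¬Mortal(socrates); this is ordinary natural deduction, nothing specific to the example:
1. Mortal(socrates)         [axiom]
2. ¬Mortal(socrates)        [axiom]
3. Mortal(socrates) ∨ Q     [from 1, or-introduction, for any sentence Q whatsoever]
4. Q                        [from 2 and 3, disjunctive syllogism]
So every sentence is a theorem, which is the formal sense in which the system's proofs stop telling you anything.)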
All these three ways of calling a particular FOL theory bad are different, but none of them are expressible in FOL itself. You can't prove why inconsistent FOL theories are "bad" inside of the very same FOL theory (although of course you can prove it inside of a different theory, perhaps one that has "Inconsistent FOL theories are bad" as an axiom). You can't prove that the axioms don't map to real world conditions (let alone prove why axioms not mapping to real world conditions makes the axioms "bad"). You can't prove that the deductive rules don't map to the real-world reasoning capacities of the human mind. If an agent was rigidly coded with this FOL theory, you'd never get anywhere with them on these topics; there'd be a communication failure between the two agents – you and them.
All these three arguments, however, can be framed as appeals to reality and what is observable. A statement and its negation being simultaneously provable is bad because such phenomena are not typically observed in practice, and because such a system proves every statement, so its proofs give you no useful guidance for achieving objectives in the real world. Axioms not mapping to the physical world is obviously an appeal to the observable. FOL being a bad framework for human reasoning is also an appeal to observation, in this case an observation you've made after observing yourself, and one you're hoping the other person has also made.
It seems intuitive that someone who uses "what maps to the observable is correct" will not admit any other axioms, if they wish to be consistent – because most such axioms will conflict with this appeal to the observable. But in a world with multiple agents, we can't take this as an axiom, lest we get stuck in our own communication bubble. We need to be able to reason about the superiority of "what maps to the observable is correct" as a reasoning process, using some other reasoning process. And in fact, I have seemingly been using this post so far to do exactly that – use some reasoning process to argue in favour of "what maps to the observable" reasoning processes over FOL theories instantiated with simple axioms such as "Socrates is mortal".
And if you notice further, my argument for why “what maps to observable is good” in this post doesn’t seem very logical. I still seemingly am appealing to “what maps to observable is good” in order to prove “what maps to observable is good” – which is obviously a no-go when using FOL. But to your human mind, the first half of this post still sounded like it was saying something useful, despite not being written in FOL, nor having a clear separation between axioms and deduced statements. You could at this point appeal to Wittgensteinian “word clouds” or “language games” and say that some sequences of words referring to each other are perceived to be more meaningful than other sequences of words, and that I have hit upon one of the more meaningful sequences of words.
But how will you justify Wittgensteinian discourse as a meta-reasoning process for understanding reasoning processes? More specifically, what reasoning process is Wittgensteinian discourse using to prove that "Wittgensteinian reasoning processes are good"? I could at this point self-close it and say that Wittgensteinian reasoning processes are being used to reason and reach the conclusion that Wittgensteinian reasoning processes are good. But do you see the problem here?
Firstly, this kind of self-closure can be easily done by most systems. An FOL theory can assert that axiom A is being used to prove axiom A, because A=A. A system based on "what is observable is correct" can appeal to observation to argue the superiority of "what is observable is correct".
And secondly, this self-closure reasoning only tends to look meaningful inside of the system itself. An FOL prover will say that the empiricist and the Wittgensteinian have not done anything meaningful when trying to analyse themselves (the empiricist and the Wittgensteinian respectively); they have just applied A=A. The empiricist will say that the FOL prover and the Wittgensteinian have not done anything meaningful to analyse themselves (the FOL prover and the Wittgensteinian); they have just observed their own thoughts and realised what they think is true. And similarly the Wittgensteinian will assert that everyone else is using Wittgensteinian reasoning to (wrongly) argue the superiority of their non-Wittgensteinian process.
So if someone else uses their reasoning process to prove their own reasoning process as superior, you’ll not only easily disagree with the conclusion – you might also disagree about whether they actually even used their own reasoning process to do it.
If you define bad = inconsistent as an axiom, then yes, trivial proof. If you don't define "bad", you can't prove anything. You can't capture the intuitive notion of "bad" using FOL.
If A: it is possible to define “intelligence” such that all Turing machines can be graded on a scale in terms of intelligence irrespective of their “values”, and B: some Turing machines have no values,
is it possible that the theoretical max intelligent Turing machine has no values?
(And can replace Turing machines with configurations of atoms or quarks or whatever)
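A toy restatement of the question, just to make its shape concrete. The scores and value lists below are obviously made up, and the real question is about the maximum over all Turing machines, which a finite list can't capture; this is only meant to show what's being asked.

machines = [
    {"name": "M1", "intelligence": 3.2, "values": ["paperclips"]},
    {"name": "M2", "intelligence": 7.9, "values": []},            # a machine with no values
    {"name": "M3", "intelligence": 5.1, "values": ["survival"]},
]

# Assumption A: the grading exists and ignores values entirely.
most_intelligent = max(machines, key=lambda m: m["intelligence"])

# Assumption B: some machines have empty value sets.
# The question: can the top-ranked machine be one of those?
print(most_intelligent["name"], "has values?", bool(most_intelligent["values"]))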
I think that the link from micro to macro is too weak for this to be a useful line of inquiry. “intelligence” applies on a level of abstraction that is difficult (perhaps impossible for human-level understanding) to predict/define in terms of neural configuration, let alone Turing-machine or quantum descriptions.
Okay, but my question is more like: could "the max intelligent neural configuration has no values" be true in a version of reality that makes sense to you? I'm not actively trying to assert that it is true. Basically I'm trying to deconfuse concepts and definitions.
I’m not sure what you’re asking. A lot of reality doesn’t make sense to me, so that’s pretty weak evidence either way. And it does seem believable that, since there is a very wide range of consistency and dimensionality to human values that don’t seem well-correlated to intelligence, the same could be true of AIs.
Fair, but abstractions like “aligned”, “values” and “intelligence” are created by humans, so it can make sense to formalise them before asking a question like “align an intelligent agent”, else the question becomes poorly defined.
And it does seem believable that, since there is a very wide range of consistency and dimensionality to human values that don’t seem well-correlated to intelligence, the same could be true of AIs.
True, but I’m asking not just about AI that doesn’t have human values, but any values at all.
I think this could reasonably be true for some definitions of “intelligence”, but that’s mostly because I have no idea how intelligence would be formalized anyways?
Got it. I think formalising definitions of "intelligence" and "values" is worth doing. Even if the original definitions don't map perfectly to your intuitive understanding of the concepts, at least you'll be asking a well-formed question when you ask "align an intelligent agent".
I think asking well-formed questions is useful, but we shouldn't confuse our well-formed question with what we actually care about unless we are sure it is in fact what we care about.
Does anyone here have good posts/articles on proliferation risks attached to embracing nuclear as a source of energy? Either on this site or otherwise.
I've read a few articles but haven't formed a strong opinion for or against nuclear on this point. There's definitely a pro-nuclear slant on this site, hence I wished to know people's reasoning.
Is there an LW post on "things in our brain we can edit using choice/free will, versus things we can't", either as two distinct categories or as a spectrum?
For instance I feel like: I can choose to lift my leg instantly, I can't choose to fall out of love with someone instantly, I can choose to do this over a period of time, I can't choose to enjoy eating shit. This seems unambiguous irrespective of your stance on free will, and I often find myself having to refer to these two categories.
If you read https://meaningness.com there’s probably some stuff in there about ‘partial control’. (Though I haven’t double checked what is where since https://metarationality.com got broken off into a separate website.)
It’s critical of a lot of stuff on LW in a particular, reasoned fashion.
Although you might see some criticism on here as well, like this post today: https://www.lesswrong.com/posts/cBH9FT7AWNNhJycaG/the-map-territory-distinction-creates-confusion
Thanks. I'll check out the book. "Partial control" seems to be exactly what I'm referring to. Although the book does seem to be on a slightly different topic, and I haven't heard of the author. Do you by any chance have a link to a summary or review?
I'm not criticising anything on LW btw (not here at least). It's just that, even if you assume the naturalist compatibilist stance that LW assumes, you still need a phrase to refer to things that feel like they're in or not in our control; you still need to talk about the first-person experience.
There are some summaries in the (hyper text) book, but they’re probably too short to give an overview.
I could write a review, but I’d probably want to PM rather than post that.
One reason I haven't written a review is that the book is easy to read, and short enough that a review probably wouldn't save you much time, at least not without some real summarizing work.
I could try to summarize anything you have questions about after or while reading this: https://meaningness.com/control
A dialogue might be more (immediately) constructive and save time relative to trying to cover everything.
It is, however, not as short as things could be with skimming. I figure the first two sections are more prerequisites than things which go off somewhere else.
Something rather fast, which might fail, is just reading the page: https://meaningness.com/control. Maybe it has what you’re looking for there, maybe it doesn’t. If you have any questions feel free to PM me.
I found that page using this search: site:meaningness.com partial control
The next two hits from that search each contain only one match for a page-search for "partial", at the very end. Beyond that, the hits seem to come from a repeated phrase, which is just a short summary of a section in the table of contents.
I don't have a map of what is a prerequisite of what. But, assuming that that's handled if you read it in order: 'partial control' is addressed in https://meaningness.com/control. I'd guess you can read section one (Why meaningness? and its 4 sub-pages), then section 2 (Stances and its roughly 14 sub-pages). The page on control is late in section 3. The preceding sub-pages of section 3 don't have sub-pages of their own, but eternalism has 3 sub-sub-pages of some kind or another before it.
(Things that might change: https://meaningness.com/meaningness-practice, which comes late in part 2, contains a subsection that is currently 100 words but could become more relevant to what you're looking for if it's updated, or if pages are added after that page. That possibility is mentioned in a note from July 2014 though, so if you read this soon, I'm guessing it won't have happened by then.)
Not counting: https://meaningness.com/all-dimensions-schematic-overview towards the word count or anything because it’s a bunch of charts. (Which might help summarize if you have a little bit of the necessary background.)
Here's how long the first two sections are (using https://wordcount.com, copying the text in, adding a return, a "-", and another two returns each time, and not the table of contents on the left because it's repeated. Using the URL directly led to a massive overcount by at least an order of magnitude, possibly from some built-in recursive counting of the links, so I didn't use that):
This does not include the comments (a separate page, which you don’t have to read, but might be useful if you’re confused or have questions, though such things might also be answered later on in the book).
12,154 Words
80,013 Characters
67,310 Characters without space
21,985 Syllables
827 Sentences
487 Paragraphs*
The paragraph count includes every "-" I added (all 18 of them), so it's actually:
469 Paragraphs
If you figure section 3 is 75% the length of all the stuff before it, then, including the page on control, that’s:
estimated numbers:
8,000 Words
60,000 Characters
50,250 Characters without space
16,500 Syllables
620.25 Sentences
351.75 Paragraphs
That comes out to an estimated 20,000 words.
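If anyone wants to redo this count without the copy-paste step, here's a rough sketch. The page list and the CSS selector for the main text column are guesses on my part; you'd have to check them against the actual pages, which is roughly why pasting by hand was easier.

import requests
from bs4 import BeautifulSoup

PAGES = [
    "https://meaningness.com/control",
    # ...plus whichever other pages you want counted
]

def word_count(url: str, selector: str = "article") -> int:
    # Count words in the main text of one page. "article" is a guess at the
    # right container; counting the whole page would also count the repeated
    # table of contents, which is the overcount problem described above.
    html = requests.get(url, timeout=30).text
    soup = BeautifulSoup(html, "html.parser")
    node = soup.select_one(selector) or soup
    return len(node.get_text(separator=" ").split())

print(sum(word_count(u) for u in PAGES))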
HYPOTHETICAL (possibly relevant to AI safety)
Assume you could use magic that could progressively increase the intelligence of exactly one human in a “natural way”. * You need to pick one person (not yourself) who you trust a lot and give them some amount of intelligence. Your hope is they use this capability to solve useful problems that will increase human wellbeing broadly.
What is the maximum amount of intelligence you’d trust them with?
*when I say natural way I mean their neural circuits grow, maintain and operate in a manner similar to how they already naturally work for humans, biologically. Maybe the person just has more neurons that make them more intelligent, instead of a fundamentally different structure.
I guess the two key considerations are:
whether there exists any natural form of cognitive enhancement that doesn't also cause significant value drift
whether you’d trust a superpowerful human even if their values seem mostly good and they don’t drift
Any web dev here wanna host a tool that lets you export your account data from this site? I’ve mostly figured it out, I’m just being lazy. Need to use graphQL queries, then write to files, then I guess upload the files to db and zip, and let the user download the zip.
(graphQL tutorial: https://www.lesswrong.com/posts/LJiGhpq8w4Badr5KJ/graphql-tutorial-for-lesswrong-and-effective-altruism-forum)
First graphQL query to get the user’s id from the slug
{
user(input: {selector: {slug: "eliezer_yudkowsky"}}) {
result {
_id
slug
}
}
}
For posts, need graphQL query, then dump each htmlBody into a separate file. No parsing required, I hope.
{
posts(input: {
terms: {
view: "userPosts"
userId: "nmk3nLpQE89dMRzzN"
limit: 50
meta: null # this seems to get both meta and non-meta posts
}
}) {
results {
_id
title
pageUrl
postedAt
htmlBody
voteCount
baseScore
slug
}
}
}
For comments, need a graphQL query, then dump each html body into an individual file. (Although I'm not entirely sure what one will do with thousands of comment files.)
{
comments(input: {
terms: {
view: "userComments",
userId: "KPEajTss7fsccBEgJ",
limit: 500,
}
}) {
results {
_id
post {
title
slug
}
user {
username
slug
displayName
}
userId
postId
postedAt
pageUrl
htmlBody
baseScore
voteCount
}
}
}
Plus some loops to iterate if the limit is too large. And handle errors. Plus some way to share the credentials securely—or make it into a browser plugin.
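A rough Python sketch of the glue code described above, to save the next person some typing. I haven't tested this against the live API; the endpoint URL, the offset-based paging, and the output layout are my assumptions, and the query is just the posts query from above. Comments would work the same way with the userComments query.

import zipfile
import requests

ENDPOINT = "https://www.lesswrong.com/graphql"  # assumed endpoint

POSTS_QUERY = """
{
  posts(input: {terms: {view: "userPosts", userId: "%s", limit: %d, offset: %d, meta: null}}) {
    results { _id title postedAt htmlBody slug }
  }
}
"""

def fetch_posts(user_id, batch=50):
    # Page through a user's posts; assumes the terms accept an offset field.
    offset = 0
    while True:
        query = POSTS_QUERY % (user_id, batch, offset)
        resp = requests.post(ENDPOINT, json={"query": query})
        resp.raise_for_status()
        results = resp.json()["data"]["posts"]["results"]
        if not results:
            break
        yield from results
        offset += batch

def export(user_id, out_zip="lw_export.zip"):
    # Dump each htmlBody into its own file inside a zip the user can download.
    with zipfile.ZipFile(out_zip, "w") as zf:
        for post in fetch_posts(user_id):
            name = post.get("slug") or post["_id"]
            zf.writestr("posts/%s.html" % name, post.get("htmlBody") or "")
    return out_zip

if __name__ == "__main__":
    export("nmk3nLpQE89dMRzzN")  # the example userId from the query above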
Why doesn't Scott Alexander work on alignment? Has he ever mentioned this? I feel like he could make a non-trivial contribution.
Has anyone ever published a comprehensive piece of the form:
"I assign X probability to AI alignment being a solvable problem (in theory), and here's the set of intuitions / models / etc. that I base this estimate on"
--
Cause my intuitions point in the opposite direction, but maybe that’s just because I’m missing intuitions of people here.
Logical uncertainty is hard. But the intuition that I have is that humans exist, so there’s at least a proof of concept for a sort of aligned AGI (although admittedly not a proof of concept for an ASI)
That’s weak though, I’m hoping alignment researchers have stronger intuitions than that.
I don’t think it’s that weak?
Here's a countering intuition (which also seems weak to me, but it shows why stronger intuitions are needed):
Humans have disagreements on ethics, and have done so for millennia, so they're not 100% aligned.
But if your definition of alignment is "an AI that does things in a way such that all humans agree on its ethical choices" I think you're doomed from the start, so this counterintuition proves too much. I don't think there is an action an AI could take or a recommendation it could make that would satisfy that criterion (in fact, many people would say that the AI by its nature shouldn't be taking actions or making recommendations).
Okay. I’d be keen on your definition of alignment.
P.S. This discussion which we’re having right now is exactly what I’d be keen on, in a compressed fashion, written by an alignment researcher who has anticipated lots of intuitions and counterintuitions.
It seems like something like “An AI that acts and reasons in a way that most people who are broadly considered moral consider moral” would be a pretty good outcome.
Fair.
Then one more intuition:
Assume AI is sufficiently capable it can establish a new world order all by itself. I wouldn’t trust most people I otherwise consider moral with such power.
Upvote prediction markets
Epistemic status: I haven’t spent too much time on this, I could easily be missing stuff here.
Problem
Whenever a new post is made on a website (reddit / stackexchange / lesswrong etc.), upvotes act as an initial measure of how engaging and high-quality the post is. This initial vetting by a few users allows the remaining users to access a feed of already high-quality posts.
This model, however, still requires a few users to willingly go through unvetted posts and upvote the good-quality ones. It would be useful to reduce the amount of input required to get a measure of quality.
Solution
Users could stake money on how many upvotes they think a post will receive. This money acts as an initial signal to show the post to more users. If more users see the post and upvote it, the money is returned to whoever staked it. If they don't, the money could be taken by the website (or distributed to all the users on the website, or at least the users who saw the low-quality post).
The reverse can also happen: users can stake money claiming a post will not receive upvotes.
This isn't a prediction market in the traditional sense, where people who bet higher votes bet only against those who bet lower votes. Instead you're betting against the website directly. Why? Because the website benefits from good-quality content and is hurt by bad-quality content.
This differs from the standard advertisement model, because in that model you pay whether the content is engaging for users (good for the website) or not engaging (bad for the website). This automatically means advertisers will more often use that channel to push content that does not engage users much: blatant ads.
Considerations
You don't want the users staking money to gatekeep the content of the website too much. More specifically, if there's a dynamic relationship between [money staked, impressions and upvotes], you don't want money staked alone to impact impressions so much that you no longer get a good signal from upvotes; you still need that signal. The model goes both ways: more impressions can mean more upvotes, but more upvotes or more money staked can also mean more impressions. Maybe instead of betting on the raw number of upvotes it'll make sense to bet on the upvote-to-impression ratio or something. People who study ad models probably have a better sense of how to model this mathematically.
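To make the staking rule concrete, a rough sketch; the cutoff ratio, the payout rule, and the "losing stakes go to the website" choice are all placeholder assumptions rather than a worked-out mechanism.

from dataclasses import dataclass

@dataclass
class Stake:
    user: str
    amount: float
    bet_on_success: bool  # True = "this post will do well", False = the reverse bet

def resolve_stakes(stakes, upvotes, impressions, ratio_cutoff=0.05):
    # The post "succeeds" if its upvote-to-impression ratio clears a cutoff
    # (per the consideration above, ratio rather than raw upvotes).
    # Winning stakes are simply refunded; losing stakes are forfeited to the site.
    success = impressions > 0 and (upvotes / impressions) >= ratio_cutoff
    payouts, forfeited = {}, 0.0
    for s in stakes:
        if s.bet_on_success == success:
            payouts[s.user] = s.amount   # stake returned
        else:
            payouts[s.user] = 0.0        # stake kept by the website
            forfeited += s.amount
    return payouts, forfeited

# toy usage
stakes = [Stake("alice", 5.0, True), Stake("bob", 2.0, False)]
print(resolve_stakes(stakes, upvotes=12, impressions=150))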
The problem is that this sets a lot of incentives for corruption.
What kinds of corruption?
Ad models can be gamed too, is there a reason my model is more vulnerable?
People will try to drive votes to get their predictions to come true and thus the vote count becomes a worse signal for quality.
How? I can think of:
- Bot accounts: can use CAPTCHAs and/or karma and/or KYC to detect them. (I wonder if KYC would have to be mandatory, though.)
- Bribing real users: users would need to coordinate the bribe on a different website, trust that the bribe will actually pay out, and it’s hard to scale this without someone posting evidence back to the mods of the original site.
- Driving genuine traffic to the site in the hope that people will upvote: hence the need to track a vote-to-impression ratio (or similar) rather than just votes, and to again privilege the votes of longstanding community members (as LessWrong karma does); see the sketch below.
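As a rough illustration of that last point, here’s what a karma-weighted vote-to-impression signal could look like. The karma cap and weighting are arbitrary assumptions of mine, purely to sketch the idea:

```python
def quality_signal(votes: list[tuple[int, int]], impressions: int) -> float:
    """Karma-weighted upvote-to-impression ratio.

    votes: (vote, voter_karma) pairs, where vote is +1 or -1.
    Capping karma at 1000 (an arbitrary choice) limits how much any one account
    counts; dividing by impressions means that simply driving raw traffic to the
    post dilutes the signal unless those viewers also vote.
    """
    if impressions == 0:
        return 0.0
    weighted = sum(vote * min(karma, 1000) for vote, karma in votes)
    return weighted / impressions

# Example: 3 upvotes from established accounts, 1 downvote, 500 impressions.
print(quality_signal([(1, 800), (1, 1200), (1, 50), (-1, 300)], 500))  # 3.1
```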
Do lmk if I’m missing something.
In the case of LessWrong, comments that argue against the OP or for its value can affect voting.
Damn, that’s valid. So for now my system can only work if there’s no commenting. I’ll try thinking of a way to get around this.
Is there an LW post on:
“Why it is a good idea to google / discuss concrete examples instead of thinking / talking purely in the abstract” ?
Like, I was reading up on the number of ICBMs in different countries and military policies and such, but it helps to pull up a YouTube video of an actual missile test launch, just to get a sense of how big the thing is and what a launch looks like in practice.
Does anyone have good resources on linguistic analysis used to doxx people online?
Both automated and manual, although I’m more keen on learning about automated.
What is the state-of-the-art in such capabilities today? Are there forecasts on future capabilities?
Trying to figure out whether defending online anonymity is worth doing or a lost cause.
found this: https://oar.princeton.edu/bitstream/88435/pr13z6v/1/FeasibilityInternetScaleAuthorID.pdf
MESSY INTUITIONS ABOUT AGI, MIGHT TYPE THEM OUT PROPERLY LATER
OR NOT
I’m sure we’re a finite number of voluntary neurosurgeries away from worshipping paperclip maximisers. I tend to feel we’re a hodge-podge of quick heuristic modules and deep strategic modules, and until you delete the heuristic modules via neurosurgery our notion of alignment will always be confused. Our notion of superintelligence / super-rationality is an agent that doesn’t use the bad heuristics we do; people have even tried formalising this with Solomonoff induction / Turing machines / AIXI. But when actually coming face to face with one:
- Either we are informed of the consequences of the agent’s thinking and we dislike those, because they don’t match our heuristics.
- Or the AGI can convince us to become more like it, to the point where we can actually agree with its values. The fastest way to get there is neurosurgery, but if we initially feel neurosurgery is too invasive, I’m sure there exists another, much more subtle path the AGI can take: namely, one where we want our values to be influenced in ways that eventually end with us getting closer to the neurosurgery table.
- Or of course the AGI doesn’t bother to even get our approval (the default case), but I’m ignoring that and considering far more favourable situations.
We don’t actually have “values” in an absolute sense, we have behaviours. Plenty of Turing machines have no notion of “values”; they just have behaviour given a certain input. “Values” are this fake variable we create when trying to model ourselves and each other. In other words, the Turing machine has a model of itself inside itself; that’s how we think about ourselves (metacognition). So there is a mini-Turing machine inside the Turing machine. Of course the mini-machine has some portions deleted; it is a model. First of all this is physically necessitated, but more importantly, you need a simple model to do high-level reasoning on it in short amounts of time. So we create this singular variable called “values” to point to what is essentially a cluster in thingspace. Let’s say the Turing machine tends to increment its 58th register on only 0.1% of all possible 24-bit inputs, and otherwise tends to decrement it. The mini-Turing machine inside the machine, modelling itself, will just have some equivalent of the 58th register never incrementing at all and only decrementing. So now the Turing machine incorrectly thinks its 58th register never increments, and therefore thinks that decrementing the 58th register is a “value” of the machine.
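Here’s a toy version of that 58th-register example in code. The specific increment rule is made up; the only thing that matters is that it fires on roughly 0.1% of inputs, which the compressed self-model then drops entirely:

```python
import random

def true_step(input_bits: int) -> int:
    """The outside view: on roughly 0.1% of 24-bit inputs the machine
    increments register 58 (+1); otherwise it decrements it (-1)."""
    return +1 if input_bits < int(0.001 * 2**24) else -1

def self_model_step(input_bits: int) -> int:
    """The mini-machine: a compressed self-model that drops the rare case,
    so it predicts the register only ever decrements."""
    return -1

random.seed(0)
samples = [random.randrange(2**24) for _ in range(100_000)]
true_increments = sum(true_step(x) == +1 for x in samples)
model_increments = sum(self_model_step(x) == +1 for x in samples)
print(true_increments, model_increments)
# ~100 vs 0: the self-model reports "I never increment"
# and so labels "decrementing register 58" as a value of the machine.
```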
[Meta note: When I say “value” here, I’d like to still stick to a viewpoint where concepts like “free will”, “choice”, “desire” and “consciousness” are taboo. Basically I have put on my reductionist hat. If you believe free will and determinism are compatible, you should be okay with this, as I’m just consciously restricting the number of tools/concepts/intuitions I wish to use for this particular discussion, not adopting any incorrect ones. You can certainly reintroduce your intuitions in a different discussion, but in a compatibilist world, both our discussions should generally lead to true statements.
Hence in this case, when the machine thinks of “decrementing its 58th register” as its own value, I’m not referring to concepts like “I am driven to decrement my 58th register” or “I desire to decrement my 58th register” but rather “Decrementing my 58th register is something I do a lot.” And since “value” is a fake variable that the Turing machine has full liberty to define, it says “‘value’ is defined by the things I tend to do.” When I say “fake” I mean it exists in the Turing machine’s model of itself, the mini-machine.
“Why do I do the things I do?” or “Can I choose what I actually do?” are not questions I’m considering, and for now let’s assume the machine doesn’t bother itself with such questions (although in practice it certainly may end up asking itself such terribly confused questions, if it is anything like human beings; this doesn’t really matter right now).
End note]
I’m gonna assume a single scale called “intelligence” along which all Turing machines can be graded. I’m not sure this scale actually even exists, but I’m gonna assume it anyway. On this scale:
Humans <<< AGI <<< AIXI-like ideal
<<< means “much less intelligent than” or “much further away from reasoning like the AIXI-like ideal”; these two are the same thing for now, by definition.
An AGI trying to model human beings won’t use such a simple fake variable called “values”, it’ll be able to build a far richer model of human behaviour. It’ll know all about the bad human heuristics that prevent humans from becoming like an AGI or AIXI-like.
Even if the AGI wants us to be aligned, it’s just going to do the stuff in the first para. There are different notions of aligned:
Notion 1: “I look at another agent superficially and feel we want the same things.” In other words, my fake variable called “my values” is sufficiently close to my fake variable called “this agent’s values”. I will necessarily be creating such simple fake variables if I’m stupid, i.e. if I’m human, because all humans are stupid relative to the AIXI-like ideal.
An AGI that optimises to satisfy notion 1 can hide what it plans to do to humans and maintain a good appearance until it kills us without telling us.
Notion 2: “I get a full picture of what the agent intends to do and want the same things”
This is very hard, because my heuristics tell me all the consequences of what the AGI plans to do are bad. The problem is my heuristics. If I didn’t have those heuristics, if I was closer to the AIXI-like ideal, I wouldn’t mind. Again, “I wouldn’t mind” is from the perspective of machines, not consciousness or desires, so translate it to “the interaction of my outputs will be favourable towards the AGI’s outputs in the real world”.
So the AGI will find ways to convince us to want our values neurosurgically altered. Eventually we will both be clones and hence perfectly aligned.
Now let’s bring stuff like “consciousness” and “desires” and “free will” back into the picture. All this stuff interacts very strongly with the heuristics, exactly the things that make us further away from the AIXI-like ideal.
Simply stated, we don’t naturally want to be ideal rational agents. We can’t want to be, in the sense that we can’t get ourselves to truly, consistently want to be ideal rational agents by sheer willpower; we need physical intervention like neurosurgery. Even if free will exists and is a useful concept, it has finite power. I can’t delete sections of my brain using free will alone.
So now, if intelligence is defined as being closer to the AIXI-like ideal in internal structure, then intelligence by definition leads to misalignment.
P.S. I should probably also add some colour on what kinds of “bad heuristics” I am referring to here. In simplest terms: “low-Kolmogorov-complexity behaviours that are very far from AIXI-like”.
1/ For starters, the entirety of System 1 and sensory processing (see Daniel Kahneman, Thinking, Fast and Slow). We aren’t designed to maximise our intelligence; we just happen to have an intelligence module (i.e. something somewhat AIXI-like). Things we care about sufficiently strongly are designed to override System 2, which is more AIXI-like, insofar as evolution has any design for us. So maybe it’s not even “bad heuristics” here; it’s entire modules in our brain that are not meant for thinking in the first place. It’s just neurochemicals firing, one side winning and the other side losing; this system looks nothing like AIXI. And it’s how we deal with most life-and-death situations.
This stuff is beyond the reach of free will: I can’t stop reacting to snakes out of sheer will, or hate chocolate, or love to eat shit. Maybe I can train myself on snakes, but I can’t train myself to love to eat shit. The closer you are to the sensory apparatus and the further away from the brain, the less the system looks like AIXI, and simultaneously the less free will you seem to have.
(P.S. Is that a coincidence, or is free will / consciousness really an emergent property of being AIXI-like? I have no clue; it’s again the messy free-will debates that might get nowhere.)
2/ Then of course, at the place where System 1 and System 2 interact, you can actually observe behaviour that moves us further away from the AIXI-like ideal. Things like why we find it difficult to have independent thoughts that go against the crowd. Even if we do have independent thoughts, we need to spend a lot more energy actually developing them further (versus thinking of hypothetical arguments to defend those ideas in society).
This stuff is barely within the reach of our “free will”. Large portions of LessWrong are an attempt at training ourselves to be more AIXI-like, by reducing so-called cognitive biases and socially induced biases.
3/ Then we have proper deep thinking (System 2 and the like), which seems a lot closer to AIXI. This is where we move beyond “bad heuristics”, i.e. “heuristics that AIXI-like agents won’t use”. But maybe an AGI will find these modules of ours horribly flawed too, who knows.
Is anyone proposing that building AGI should be banned by governments?
Because it seems like even if AGI alignment is possible (I’m skeptical), there’s no guarantee the person who happens to create AGI also happens to want to follow this perfect solution. Or, even if they want to, that they should have a moral right to decide on behalf of their country or humanity as a whole. Nation states pre-committing to “building AGI is evil” seems a better solution. It might also slow down the rate of progress in AI capabilities, which I’m guessing is also desirable to some alignment theorists.
I haven’t seen any government, let alone the set of governments, demonstrate any capability of commitment on this kind of topic. States (especially semi-representative ones like modern democracies) just don’t operate with a model that makes this effective.
I also wonder what you feel about the ban on human cloning. Is it effectively implemented?
I don’t know if it is or not. Human cloning seems both less useful and less harmful (just less impactful overall), so simultaneously easier to implement and not a good comparison to AGI.
I see cloning-based research as very impactful, it’s also a route to getting more intelligent beings to exist. I’d be hard-pressed to find something as impactful as AGI though.
Also I’m not sure about “less useful”. Given a world where AI researchers know that alignment is hard or impossible, they might see human cloning as more useful than AGI. Unless you mean AGI’s perceived usefulness is higher, which may be true today but maybe not in the future.
I’m not following the connection between human cloning and AGI. Are you talking about something different from https://en.wikipedia.org/wiki/Human_cloning , where a baby is created with only one parent’s genetic material?
To me, human cloning is just an expensive way to make normal babies.
Yep, referring to that. You can keep cloning the most intelligent people. At enough scale you’ll be increasing the collective intelligence of mankind, and scientific output. Since these clones will hopefully retain basic human values, you now have more intelligence with alignment.
Do you have any reason to believe that this is happening AT ALL? I’d think the selection of who gets cloned (especially when it’s illicit, but probably even if it were common) would follow wealth more than intelligence.
Selective embryo implantation based on genetic examination of two-parent IVF would seem more effective, and even that’s not likely to do much unless it becomes a whole lot more common, and if intelligence were valued more highly in the general population.
Huh? Why these any more than the general population? The range of values and behaviors found in humans is very wide, and “basic human values” is a pretty thin set.
Most importantly, a 25-year improvement cycle, with a mandatory 15-20 year socialization among many many humans of each new instance is just not as scary as an AGI with an improvement cycle under a year (perhaps much much faster), and with direct transmission of models/beliefs from previous generations. Just not comparable.
Wasn’t talking about today, just an arbitrary point in the future.
I was commenting that it has a lot of power and potential benefits if groups of people wield it; whether they actually do is a different question.
On the latter question, you’re right of course: different groups of people will select for different traits. I would assume there will exist at least some groups of intelligent people who will want to select further for intelligence. There is a competitive advantage for nations that legalise this.
Re: the last two paragraphs, I’m not sure we understood each other. I’ll try again. Intelligence is valuable for building stable paths to survival, happiness, and prosperity. AGI will be much more intelligent than selected humans. However, AGI will almost certainly kill us because of lack of alignment. (Assume a world where AGI researchers have accepted this as fact.) This makes AGI not very useful, on balance.
Humans selected for intelligence are also valuable. They will be a lot less intelligent than AGI, of course. But they will (hopefully) be aligned enough with the rest of humanity to work for its welfare and prosperity. This makes selected humans very useful.
Hence selected humans could be more useful than AGI.
That’s a valid intuition—I’d be happy to learn why you feel that if you have time (no worries if not).
Would a non-democratic state like China or Russia fare better in this regard, then? If one of them takes the issue seriously enough, it could force other states to also take it seriously via carrot and stick.
On innovations too big for capitalism
Consider any innovation so world-changing that governments are not willing to let the creator have complete control over how it is used. For instance, a new super-cheap energy source such as fusion reactors.
Maximising profit from fusion reactors, for instance, could mean selling electricity at a price slightly lower than the current market price, waiting to monopolise the global power grid, waiting for all other power companies to shut down, then raising prices again, thereby not letting anyone actually reap the benefits of super-cheap electricity. It is unlikely, however, that governments will let the creator do this.
As someone funding early-stage fusion research, would you have to account for this in your investment thesis? That is, that there is some upper limit on how large a company can legally grow. So far it seems the cap is higher than $2.5 trillion at least, looking at Apple’s market cap. Although it is possible that even a company smaller than $2.5 trillion monopolises a sector and is then prevented from price-fixing by the government.
I don’t think, at this scale, that “the government” is a useful model. There are MANY governments, and many non-government coalitions that will impact any large-scale system. The trick is delivering incremental value at each stage of your path, to enough of the selectorate of each group who can destroy you.
Thanks, this makes sense.
I still don’t get stuff like TDT, EDT, FDT, LDT
Article on LDT: https://arbital.com/p/logical_dt/?l=58f
I get the article on How an Algorithm Feels From the Inside, if you assume a deterministic universe and consciousness as something that emerges out of it but has no causal influence on the universe.
Now if I try drawing the causal arrows
Outside view:
Brain ← → Human body ← → Environment
In the outside view, “you” don’t exist as a coherent singular force of causation. Questions about free will and choice cease to exist.
Inside view (CDT):
Coherent singular “soul” (using CDT) ← → Brain ← → Human body ← → Environment
Notably, Yudkowsky would call this singular soul something that only exists in the inside view itself, and not in the outside view.
Now we replace this with …
Inside view (LDT):
Coherent singular “soul” (using LDT) ← → All instantiations of this cognitive algorithm across space and time ← → Brain(s) ← → Human body(s) ← → Environment
The new decision theories don’t seem to eliminate the illusion of the soul; they just now assert that the soul interacts not only with this particular instantiation of the algorithm the brain is running, but with all instantiations of this algorithm across space (and time?). Why is this more reasonable than assuming the soul only interacts with this particular instantiation? Note that the soul is a fake property here, one that only exists in the inside view. And, to our best understanding of physical laws, the universe is made of atoms* in its causal chain, not algorithms. Two soulless algorithms don’t causally interact by virtue of the fact that they’re similar algorithms; they interact by virtue of the atoms they causally impact, and the causal impacts of those atoms on each other. Why do algorithms with the fake property of a soul suddenly get causally bound through space and time?
*Well, technically it’s QM waves or strings or whatever, but that doesn’t matter to this discussion.
This doesn’t help. In a counterfactual, atoms are not where they are in actuality. Worse, they are not even where the physical laws say they must be in the counterfactual, the intervention makes the future contradict the past before the intervention.
Do I assume “counterfactual” is just the English word as used here?
If so, it should only exist in the inside view, right? (If I understand you.)
The sentence I wrote on soulless algorithms is about the outside view. Say two robots are playing football. The outside view is: one kicks the football, the other sees the football (light emitted by the football), then kicks it. So the only causal interaction between the two robots is via atoms. This is independent of what decision theory either robot is using (if any), and it is independent of whether the robots are capable of creating an internal mental model of themselves or the other robot. So it applies both to robots with dumb microcontrollers like those in a refrigerator and to smart robots that could even be AGIs or have some ideal decision theory. At least assuming the universe follows the deterministic physical laws we know about.
The point is that the weirdness with counterfactuals breaking physical laws is the same for controlling the world through one agent (as in orthodox CDT) and for doing the same through multiple copies of an agent in concert (as in FDT). Similarly, in actuality neither one-agent intervention nor coordinated many-agent intervention breaks physical laws. So this doesn’t seem relevant for comparing the two, that’s what I meant by “doesn’t help”.
By “outside view” you seem to be referring to actuality. I don’t know what you mean by “inside view”. Counterfactuals are not actuality as normally presented, though to the extent they can be constructed out of data that also defines actuality, they can aspire to be found in some nonstandard semantics of actuality.
Do you mean the counterfactual may require more time to compute than the situation playing out in real time? If so, yep, that makes a ton of sense; they should probably focus on algorithms or decision theories that can (at least in theory) be implemented in real life on physical hardware. But please confirm.
Could you please define “actuality” just so I know we’re on the same page? I’m happy to read any material if it’ll help.
Inside view and outside view I’m just borrowing from Yudkowsky’s How an Algorithm Feels From the Inside. It basically assumes a deterministic universe following elegant physical laws, and tries to dissolve questions of free will / choice / consciousness. So the outside view is just a state of the universe or a state of the Turing machine. This object doesn’t get to “choose” what computation it is going to do or what decision theory it is going to execute; that is already determined by its current state. So the future states of the object are calculable*.
*by an oracle that can observe the universe without interacting, with sufficient but finite time.
Only in the inside view does a question like “Which decision theory should I pick?” even make sense. In the inside view, free will and choice are difficult to reason about (as humans have observed over centuries); if you really want to reason about those, you can go to the outside view, where they cease to exist.
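A toy way to picture the outside view (my own framing, with a made-up transition rule):

```python
def step(state: tuple[int, int]) -> tuple[int, int]:
    """One tick of a deterministic 'universe': the next state is a pure
    function of the current state. Nothing in here 'chooses' anything."""
    x, v = state           # toy state: a position and a velocity
    return (x + v, v)      # arbitrary transition rule, just for illustration

def future(state: tuple[int, int], n: int) -> tuple[int, int]:
    """What the observe-only oracle computes: apply the transition n times."""
    for _ in range(n):
        state = step(state)
    return state

print(future((0, 1), 5))  # (5, 1): fully determined by the initial state
```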
Am completely in love with Yudkowsky’s posts in the Complexity of Value sequence.
https://www.lesswrong.com/tag/complexity-of-value
Would recommend the four major posts to everyone.
Are there people who have read these four posts and still self-identify as either consequentialists or utilitarians? If yes, why so?
Because the impression I got from these posts (which also matches my independent explorations) is that humans have deontological rules wired into them due to evolution—this is just observable fact. And that you don’t really get to change that even if you want to.
Consider the trivial example: would you kill one person to save two? No one gets to know your decision, so there are no future implications for anyone else, no precedent being set. Even on this site there is a ton of deflection from honestly answering the question as is. Either “Yes, I will murder one person in cold blood” or “No, I will let the two people die”. Assume the LCPW.
I believe that in the LCPW it would be the right decision to kill one person to save two, and I also predict that I wouldn’t do it anyway, mainly because I couldn’t bring myself to do it.
In general, I understood the Complexity of Value sequence to be saying “The right way to look at ethics is consequentialism, but utilitarianism specifically is too narrow, and we want to find a more complex utility function that matches our values better.”
Thanks for replying.
Why do you feel it would be the right decision to kill one? Who defines “right”?
I personally understood it differently. Thou Art Godshatter says (to me) that evolution depends on consequences, but evolving consequentialism into a brain is hard, and therefore human desires are not wired consequentially. Also that evolution only cares about consequences that actually happen, not the ones it predicts will happen, because it cannot predict.
I define “right” to be what I want, or, more exactly, what I would want if I knew more, thought faster and was more the person I wish I could be. This is of course mediated by considerations on ethical injunctions, when I know that the computations my brain carries out are not the ones I would consciously endorse, and refrain from acting since I’m running on corrupted hardware. (You asked about the LCPW, so I didn’t take these into account and assumed that I could know that I was being rational enough).
It’s been a while since I read Thou Art Godshatter and the related posts, so maybe I’m conflating the message in there with things I took from other LW sources.
The sequence on ethical injunctions looks cool. I’ll read it first before properly replying.
Just FYI, I’ve become convinced that most online communication through comments with a lot of context are much better settled through conversations, so if you want, we could also talk about this over audio call.
Thanks, I will let you know!
I read some of the articles. Happy to get on a voice call if you prefer. My thoughts so far boil down to:
- Corrupted hardware seems to imply a clear distinction between goals (ends / terminal goals) and actions towards goals (means / instrumental goals), and that only actions are computed imperfectly. I say, firstly, that we don’t have as sharp a distinction between the two in our brain’s wiring. (Instrumental goals often become terminal if you focus on them hard enough.) Secondly, that it’s not actions but terminal goals themselves that are in conflict.
- We have multiple conflicting values. There’s no “rational” way to always decide what trumps what—sometimes it’s just two sections of the brain firing neurochemicals and one side winning, that’s it. System-2 is somewhat rational, System-1 not so much, and System-1 has more powerful rewards and penalties. System-1 preferences admit circular preferences, and there’s nothing you can do about it.
- “What I would want if I knew more, thought faster etc.” doesn’t necessarily lead to one coherent place. You have multiple conflicting values, and which of those you end up deleting if you had the brain of a supercomputer could be arbitrary. You could become Murder Gandhi or some extreme happiness utilitarian; I don’t see either of these as necessarily desirable places to be relative to my current state. Basically, I want to run on corrupted hardware. I don’t want my irrational System-1 module deleted.
Sorry if I’m going off-topic but yeah.
Open-ended versus close-ended questions
I’ve found people generally find it harder to answer open-ended questions. Not just in terms of giving a good answer but giving any answer at all. It’s almost as if they lack the cognitive module needed for such search.
Has anyone else noticed this? Is there any research on it? Any post on LessWrong or elsewhere?
Post-note: Now that I’ve finished writing, a lot of this post feels kinda “stupid”—or more accurately, not written using a reasoning process I personally find appealing. Nevertheless I’m going to post it just in case someone finds it valuable.
-----
I don’t see a lot of shortform posts here so I’m unsure of the format. But I’m in general thinking a lot about how you cannot entirely use reasoning to reason about the usefulness of various reasoning processes relative to each other. In other words, rationality is not closed.
Consider a theory in first-order logic. It has a specific set of axioms and a set of deductive rules. For instance, consider a first-order theory with the axioms “Socrates is mortal” and “Socrates is immortal”. A first-order theory which is inconsistent is obviously a bad reasoning process. A first-order theory which is consistent but whose axioms don’t map to real-world assumptions is also a bad reasoning process, for different reasons. Lastly, someone can argue that first-order logic is a bad reasoning process no matter what axioms it is instantiated with, because axioms plus rigid deduction is a bad way of reasoning about the world, and humans are not wired to do pure FOL and are instead capable of reaching meaningful conclusions without resorting to FOL.
All three of these ways of calling a particular FOL theory bad are different, but none of them are expressible in FOL itself. You can’t prove why inconsistent FOL theories are “bad” inside the very same FOL theory (although of course you can prove it inside a different theory, perhaps one that has “Inconsistent FOL theories are bad” as an axiom). You can’t prove that the axioms don’t map to real-world conditions (let alone prove why axioms not mapping to real-world conditions makes the axioms “bad”). You can’t prove that the deductive rules don’t map to the real-world reasoning capacities of the human mind. If an agent was rigidly coded with this FOL theory, you’d never get anywhere with them on these topics; there’d be a communication failure between the two agents, you and them.
All three of these arguments, however, can be framed as appeals to reality and what is observable. A statement and its negation being simultaneously provable is bad because such phenomena are not typically observed in practice, and because there are no statements provable from such a system that are useful for achieving objectives in the real world. Axioms not mapping to the physical world is obviously an appeal to the observable. FOL being a bad framework for human reasoning is also an appeal to observation, in this case an observation you’ve made after observing yourself, and one you’re hoping the other person has also made.
It seems intuitive that someone who uses “what maps to the observable is correct” will not admit any other axioms, if they wish to be consistent, because most such axioms will conflict with this appeal to the observable. But in a world with multiple agents, we can’t take this as an axiom, lest we get stuck in our own communication bubble. We need to be able to reason about the superiority of “what maps to the observable is correct” as a reasoning process, using some other reasoning process. And in fact, I have seemingly been using this post so far to do exactly that: use some reasoning process to argue in favour of “what maps to the observable” reasoning processes over FOL theories instantiated with simple axioms such as “Socrates is mortal”.
And if you look further, my argument in this post for why “what maps to the observable is good” doesn’t seem very logical. I am still seemingly appealing to “what maps to the observable is good” in order to prove “what maps to the observable is good”, which is obviously a no-go when using FOL. But to your human mind, the first half of this post still sounded like it was saying something useful, despite not being written in FOL, and despite not having a clear separation between axioms and deduced statements. You could at this point appeal to Wittgensteinian “word clouds” or “language games” and say that some sequences of words referring to each other are perceived to be more meaningful than other sequences of words, and that I have hit upon one of the more meaningful sequences of words.
But how will you justify Wittgensteinian discourse as a meta-reasoning process for understanding reasoning processes? More specifically, what reasoning process is Wittgensteinian discourse using to prove that “Wittgensteinian reasoning processes are good”? I could at this point self-close it and say that Wittgensteinian reasoning processes are being used to reason towards the conclusion that Wittgensteinian reasoning processes are good. But do you see the problem here?
Firstly, this kind of self-closure can easily be done by most systems. An FOL theory can assert that axiom A is being used to prove axiom A, because A=A. A system based on “what is observable is correct” can appeal to observation to argue the superiority of “what is observable is correct”.
And secondly, this self-closure only tends to look meaningful inside the system itself. An FOL prover will say that the empiricist and the Wittgensteinian have not done anything meaningful when trying to analyse themselves (the empiricist and the Wittgensteinian respectively); they have just applied A=A. The empiricist will say that the FOL prover and the Wittgensteinian have not done anything meaningful to analyse themselves (the FOL prover and the Wittgensteinian); they have just observed their own thoughts and realised what they think is true. And similarly, the Wittgensteinian will assert that everyone else is using Wittgensteinian reasoning to (wrongly) argue the superiority of their non-Wittgensteinian process.
So if someone else uses their reasoning process to prove that their own reasoning process is superior, you’ll not only easily disagree with the conclusion; you might also disagree about whether they actually even used their own reasoning process to do it.
If the theory is inconsistent, you can prove anything in it, can’t you? So you should also be able to prove that inconsistent theories are “bad”.
If you define bad = inconsistent as an axiom, then yes, trivial proof. If you don’t define bad, you can’t prove anything about it. You can’t capture the intuitive notion of bad using FOL.
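For what it’s worth, the “prove anything in it” part is the principle of explosion. A minimal sketch in Lean, where I’m treating “Socrates is immortal” as the negation of “Socrates is mortal” (an assumption of mine; the two could also be modelled as separate predicates):

```lean
-- From a pair of contradictory axioms, any proposition Q follows.
-- `Mortal` stands in for "Socrates is mortal"; `¬Mortal` for "Socrates is immortal".
example (Mortal Q : Prop) (h : Mortal) (hn : ¬Mortal) : Q :=
  absurd h hn
```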
If
A: it is possible to define “intelligence” such that all Turing machines can be graded on a scale in terms of intelligence irrespective of their “values”, and
B: some Turing machines have no values,
is it possible that the theoretical max intelligent Turing machine has no values?
(And can replace Turing machines with configurations of atoms or quarks or whatever)
I think that the link from micro to macro is too weak for this to be a useful line of inquiry. “intelligence” applies on a level of abstraction that is difficult (perhaps impossible for human-level understanding) to predict/define in terms of neural configuration, let alone Turing-machine or quantum descriptions.
Okay, but my question is more like: could “the maximally intelligent neural configuration has no values” be true in a version of reality that makes sense to you? I’m not actively trying to assert that it is true. Basically I’m trying to deconfuse concepts and definitions.
I’m not sure what you’re asking. A lot of reality doesn’t make sense to me, so that’s pretty weak evidence either way. And it does seem believable that, since there is a very wide range of consistency and dimensionality to human values that don’t seem well-correlated to intelligence, the same could be true of AIs.
Fair, but abstractions like “aligned”, “values” and “intelligence” are created by humans, so it can make sense to formalise them before asking a question like “align an intelligent agent”, else the question becomes poorly defined.
True, but I’m asking not just about AI that doesn’t have human values, but any values at all.
I think this could reasonably be true for some definitions of “intelligence”, but that’s mostly because I have no idea how intelligence would be formalized anyway.
Got it. I think formalising definitions of “intelligence” and “values” is worth doing. Even if the original definitions don’t map perfectly to your intuitive understanding of the concepts, at least you’ll be asking a well-formed question when you ask “align an intelligent agent”.
I think asking well-formed questions is useful, but we shouldn’t confuse our well-formed question with what we actually care about unless we are sure it is in fact what we care about.
Yup agreed.