Ricardo Meneghin

Karma: 199

Ricardo Meneghin 27 Jul 2020 16:32 UTC
LW: 41 AF: 6
AF
on: Are we in an AI overhang?
One thing that’s bothering me is… Google/DeepMind aren’t stupid. The transformer model was invented at Google. What has stopped them from having *already* trained such large models privately? GPT-3 isn’t that large an evidence for the effectiveness of scaling transformer models; GPT-2 was already a shock and caused huge public commotion. And in fact, if you were close to building an AGI, it would make sense for you not to announce this to the world, specially as open research that anyone could copy/reproduce, for obvious safety and economic reasons.
Maybe there are technical issues keeping us from doing large jumps in scale (i.e. , we only learn how to train a 1 trillion parameter model after we’ve trained a 100 billion one)?

Ricardo Meneghin 10 Apr 2022 15:55 UTC
24 points
in reply to: Not Relevant’s comment on: A concrete bet offer to those with short AI timelines
Is it really constructive? This post presents no arguments for why they believe what they believe which should serve very little to convince others of long timelines. Moreover it proposes a bet from an assymetric position that is very undesirable for short-timeliners to take, since money is worth nothing to the dead, and even in the weird world where they win the bet and are still alive to settle it, they have locked their money for 8 years for a measly 33% return—less than expected by simply say, putting it in index funds. Believing in longer timelines gives you the privilege of signalling epistemic virtue by offering bets like this from a calm, unbothered position, while people sounding the alarm sound desperate and hasty, but there is no point in being calm when a meteor is coming towards you, and we are much better served by using our money to do something now rather than locking it in a long term bet.

Not only that, the decision from mods to push this to the frontpage is questionable since it served as a karma boost to this post that the other didn’t have, possibly giving the impression of higher support than it actually has.

Ricardo Meneghin 20 Oct 2023 9:27 UTC
23 points
13
on: AGI and the EMH: markets are not expecting aligned or unaligned AI in the next 30 years
Really interesting to go back to this today. Rates are at their highest level in 16 years, and TTT is up 60%+.

Ricardo Meneghin 12 Nov 2023 22:10 UTC
19 points
2
on: Don’t Donate A Kidney To A Stranger
A simple evolutionary argument is enough to justify a very strong prior that kidney donation is significantly harmful for health: we have two of them, they aren’t on an evolutionary path to disappearing, and modern conditions have changed almost nothing about the usage or availability of kidneys.

I think the whole situation with kidney donations reflects quite poorly on the epistemic rigor of the community. Scott Alexander probably paid more than $5k merely in the opportunity cost of the time he spent researching the topic, given the positive externalities of his work.

Ricardo Meneghin 22 Apr 2022 21:10 UTC
14 points
0
in reply to: anonymousaisafety’s comment on: “Pivotal Act” Intentions: Negative Consequences and Fallacious Arguments
You should stop thinking about AI designed nanotechnology like human technology and start thinking about it like actual nanotechnology, i.e. life. There is no reason to believe you can’t come up with a design for self-replicating nanorobots that can also self-assemble into larger useful machines, all from very simple and abundant ingredients—life does exactly that.

[Question] What will happen with real estate prices during a slow takeoff?

Ricardo Meneghin8 Nov 2023 11:58 UTC

8 points

1 comment1 min readLW link

Ricardo Meneghin 7 Apr 2022 13:03 UTC
8 points
on: What I Was Thinking About Before Alignment
I don’t think we have hope of developing such tools, at least not in a way that looks like anything we had in the past. In the past we have been able to analyse large systems by throwing away an immense amount of detail—it turns out that you don’t need the specific position of atoms to predict the movement of the planets, and you don’t need the details to predict all of the other things we have successfully predicted with traditional math.
With the systems you are describing, this is simply impossible. Changing a single bit in a computer can change its output completely, so you can’t build a simple abstraction that predicts it, you need to simulate it completely.
We already have a way of taking immense amounts of complicated data and finding patterns in it, it’s machine learning itself. If you want to translate what it learned into human readable descriptions, you just have to incorporate language in it—humans after all can describe their reasoning steps and why they believe what they believe (maybe not easily).
Google throws tremendous amounts of data and computational resources into training neural networks, but decoding the internal models used by those networks? We lack the mathematical tools to even know where to start.
I predict this will be done in the coming years by using large multimodal models to analyse neural network parameters, or to explain their own workings.

Ricardo Meneghin 26 Jun 2022 12:49 UTC
6 points
4
in reply to: TurnTrout’s comment on: Where I agree and disagree with Eliezer
I think the focus on “inclusive genetic fitness” as evolution’s “goal” is weird. I’m not even sure it makes sense to talk about evolution’s “goals”, but if you want to call it an optimization process, the choice of “inclusive genetic fitness” as its target is arbitrary as there are many other boundaries one could trace. Evolution is acting at all levels, e.g. gene, cell, organism, species, the entirety of life on Earth. For example, it is not selecting adaptations which increase the genetic fitness of an individual but lead to the extinction of the species later. In the most basic sense evolution is selecting for “things that expand”, in the entire universe, and humans definitely seem partially aligned with that—the ways in which they aren’t seem non-competitive with this goal.

Ricardo Meneghin 12 Apr 2022 12:03 UTC
6 points
on: We should stop being so confident that AI coordination is unlikely
If you change the analogy to developing nuclear weapons instead of launching them, the picture becomes much grimmer.

Ricardo Meneghin 25 Mar 2022 2:08 UTC
6 points
on: Rational and irrational infinite integers
In computers, signed integers are actually represented quite similar to this, as two’s complements, as a trick to reuse the exact same logical components to perform sums of both positive and negative numbers.

Ricardo Meneghin 2 Nov 2023 15:28 UTC
5 points
4
in reply to: Rana Dexsin’s comment on: Public Weights?
The vast majority of the risk seems to lie on following through with synthesizing and releasing the pathogen, not learning how to do it, and I think open-source LLMs change little about that.

Ricardo Meneghin 9 Aug 2020 11:48 UTC
5 points
in reply to: CronoDAS’s comment on: Craniofacial dystrophy: A possible syndrome relating malocclusion, sleep-disordered breathing, allergies, depression and a range of other diseases
Honestly, that whole comment section felt pretty emotional and low quality. I haven’t touched things like myofunctional therapy or wearable appliances in my post because those really maybe are “controversial at best”, but the effects of RPE on SDB, especially in children, have been widely replicated by multiple independent research groups.

Calling something controversial is also an easy way to undermine credibility without actually making any concrete explanations as to whether it is true or not. Are there any specific points in my post that you disagree with?

Ricardo Meneghin 28 Jul 2020 12:19 UTC
5 points
in reply to: ChristianKl’s comment on: Are we in an AI overhang?
I’m not sure what model is used in production, but the SOTA reached 600 billion parameters recently.

Ricardo Meneghin 10 Apr 2022 19:59 UTC
4 points
in reply to: Jotto999’s comment on: A concrete bet offer to those with short AI timelines
Well, if OP is willing then I’d love to take a high-interest loan from him to be paid back in 2030.

Ricardo Meneghin 6 Apr 2022 20:41 UTC
4 points
in reply to: johnswentworth’s comment on: [Link] A minimal viable product for alignment
A model which is just predicting the next word isn’t optimizing for strategies which look good to a human reviewer, it’s optimizing for truth itself (as contained in it’s training data). If you begin re-feeding its outputs as training inputs then there could be a feedback loop leading to such incentives, but if the model is general and sufficient intelligent, you don’t need to do that. You can train it in a different domain and it will generalize to your domain of interest.
Even if you that, you can try to make the new data grounded in reality in some way, like including experiment results. And the model won’t just absorb the new data as truth, it will include it in it’s world model to make better predictions. If it’s fed a bunch of new alignment forum posts that are bad ideas which look good to humans, it will just predict that alignment forum produces that kind of post, but that doesn’t mean there isn’t some prompt that can make it output what it actually thinks is correct.

Ricardo Meneghin 27 Jul 2020 16:46 UTC
4 points
in reply to: Mitchell_Porter’s comment on: Open & Welcome Thread—July 2020
I think there’s the more pressing question of how to position yourself in a way that you can influence the outcomes of AI development. Having the right ideas won’t matter if your voice isn’t heard by the major players in the field, big tech companies.

Ricardo Meneghin 7 Nov 2023 16:54 UTC
3 points
1
in reply to: O O’s comment on: How to (hopefully ethically) make money off of AGI
High growth rates means there is a higher opportunity cost in lending money, since you could invest it elsewhere and get a higher return, reducing the supply of loans, and more demand for loans, since if interests are low, people will borrow to buy assets that appreciate more than the interest rate.

Ricardo Meneghin 25 Jun 2022 13:39 UTC
3 points
in reply to: johnswentworth’s comment on: Air Conditioner Test Results & Discussion
Since the outdoor temperature was lower in the control, ignoring it will inflate how much the two-hose unit outperforms by bringing the effect of both units closer to zero. If we assume the temperature difference the units and the control produce are approximately constant in this outdoor temperature range, then the difference to control would be 3.1ºC for the one hose unit and 5ºC for the two hose unit if the control outdoor temperature was the same, meaning two-hose only outperforms by ~60% with the fan on high, and merely ~30% with the fan on low.

Ricardo Meneghin 13 Nov 2021 14:30 UTC
3 points
on: Discussion with Eliezer Yudkowsky on AGI interventions
Because it’s too technically hard to align some cognitive process that is powerful enough, and operating in a sufficiently dangerous domain, to stop the next group from building an unaligned AGI in 3 months or 2 years. Like, they can’t coordinate to build an AGI that builds a nanosystem because it is too technically hard to align their AGI technology in the 2 years before the world ends.
I’m not totally convinced by this argument because of the quote below:
The flip side of this is that I can imagine a system being scaled up to interesting human+ levels, without “recursive self-improvement” or other of the old tricks that I thought would be necessary, and argued to Robin would make fast capability gain possible. You could have fast capability gain well before anything like a FOOM started. Which in turn makes it more plausible to me that we could hang out at interesting not-superintelligent levels of AGI capability for a while before a FOOM started. It’s not clear that this helps anything, but it does seem more plausible.
It seems to me this does hugely change things. I think we are underestimating the amount of change humans will be able to make in the short timeframe after we get human level AI and before recursive self improvement gets developed. Human level AI + huge amounts of compute would allow you to take over the world through much more conventional means, like massively hacking computer systems to render your opponents powerless (and other easy-to-imagine more gruesome ways). So the first group to develop near-human level AI wouldn’t need to align it in 2 years, because it would have the chance to shut down everyone else. It may not even come down to the first group who develops it, but the first people who have access to some powerful system, since they could use that to hack the group itself and do what they wish without requiring the buy-in from others—this would depend on a lot of factors like how controlled is the access to the AI and how quickly a single person can use AI to take control over physical stuff. I’m not saying this would be easy to do, but certainly seems within the realm of plausibility.

Ricardo Meneghin 28 Aug 2020 15:45 UTC
3 points
in reply to: Rafael Harth’s comment on: Rafael Harth’s Shortform
I think that the way to not get frustrated about this is to know your public and know when spending your time arguing something will have a positive outcome or not. You don’t need to be right or honest all the time, you just need to say things that are going to have the best outcome. If lying or omitting your opinions is the way of making people understand/not fight you, so be it. Failure to do this isn’t superior rationality, it’s just poor social skills.

Ricardo Meneghin

[Question] What will hap­pen with real es­tate prices dur­ing a slow take­off?

[Question] What will happen with real estate prices during a slow takeoff?