RobertM

Karma: 2,664

LessWrong dev & admin as of July 5th, 2022.

RobertM 18 May 2024 1:32 UTC
4 points
0
on: RobertM’s Shortform
Unfortunately, it looks like non-disparagement clauses aren’t unheard of in general releases:
http://www.shpclaw.com/Schwartz-Resources/severance-and-release-agreements-six-6-common-traps-and-a-rhetorical-question
Release Agreements commonly include a “non-disparagement” clause – in which the employee agrees not to disparage “the Company.”
https://joshmcguirelaw.com/civil-litigation/adventures-in-lazy-lawyering-the-broad-general-release
The release had a very broad definition of the company (including officers, directors, shareholders, etc.), but a fairly reasonable scope of the claims I was releasing. So far, so good. But then it included a general non-disparagement provision, which basically said I couldn’t say anything bad about the company, which, by itself, is also fairly typical and reasonable.
Given the way the contract is worded it might be worth checking whether executing your own “general release” (without a non-disparagement agreement in it) would be sufficient, but I’m not a lawyer and maybe you need the counterparty to agree to it for it to count.
And as a matter of industry practice, this is of course an extremely non-standard requirement for retaining vested equity (or equity-like instruments), whereas it’s pretty common when receiving an additional severance package. (Though even in those cases I haven’t heard of any such non-disparagement agreement that was itself covered by a non-disclosure agreement… but would I have?)

RobertM 17 May 2024 4:47 UTC
5 points
0
in reply to: RussellThor’s comment on: Against “argument from overhang risk”
This seems to be arguing that the big labs are doing some obviously-inefficient R&D in terms of advancing capabilities, and that government intervention risks accidentally redirecting them towards much more effective R&D directions. I am skeptical.
1. If such training runs are not dangerous then the AI safety group loses credibility.
2. It could give a false sense of security when a different arch requiring much less training appears and is much more dangerous than the largest LLM.
3. It removes the chance to learn alignment and safety details from such large LLM
1. I’m not here for credibility. (Also, this seems like it only happens, if it happens, after the pause ends. Seems fine.)
2. I’m generally unconvinced by arguments of the form “don’t do [otherwise good thing x]; it might cause people to let their guard down and get hurt by [bad thing y]” that don’t explain why they aren’t a fully-general counterargument.
3. If you think LLMs are hitting a wall and aren’t likely to ever lead to dangerous capabilities then I don’t know why you expect to learn anything particularly useful from the much larger LLMs that we don’t have yet, but not from those we do have now.

RobertM 17 May 2024 4:38 UTC
3 points
0
in reply to: RussellThor’s comment on: Against “argument from overhang risk”
This seems non-reponsive to arguments already in my post:
If we institute a pause, we should expect to see (counterfactually) reduced R&D investment in improving hardware capabilities, reduced investment in scaling hardware production, reduced hardware production, reduced investment in research, reduced investment in supporting infrastructure, and fewer people entering the field.

RobertM 17 May 2024 4:37 UTC
5 points
1
in reply to: Logan Zoellner’s comment on: Against “argument from overhang risk”
We ran into a hardware shortage during a period of time where there was no pause, which is evidence that the hardware manufacturer was behaving conservatively. If they’re behaving conservatively during a boom period like this, it’s not crazy to think they might be even more conservative in terms of novel R&D investment & ramping up manufacturing capacity if they suddenly saw dramatically reduced demand from their largest customers.
For example, suppose we pause now for 3 years and during that time NVIDIA releases the RTX5090,6090,7090 which are produced using TSMC’s 3nm, 2nm and 10a processes.
This and the rest of your comment seems to have ignored the rest of my post (see: multiple inputs to progress, all of which seem sensitive to “demand” from e.g. AGI labs), so I’m not sure how to respond. Do you think NVIDIA’s planning is totally decoupled from anticipated demand for their products? That seems kind of crazy, but that’s the scenario you seem to be describing. Big labs are just going to continue to increase their willingness-to-spend along a smooth exponential for as a long as the pause lasts? What if the pause lasts 10 years?
If you think my model of how inputs to capabilities progress are sensitive to demand for those inputs from AGI labs is wrong, then please argue so directly, or explain how your proposed scenario is compatible with it.

Against “argument from overhang risk”

RobertM16 May 2024 4:44 UTC

28 points

9 comments5 min readLW link

RobertM 14 May 2024 21:05 UTC
2 points
0
in reply to: Aaron_Scher’s comment on: RobertM’s Shortform
Yeah, “they’re following their stated release strategy for the reasons they said motivated that strategy” also seems likely to share some responsibility. (I might not think those reasons justify that release strategy, but that’s a different argument.)

RobertM 14 May 2024 0:50 UTC
12 points
2
in reply to: ryan_greenblatt’s comment on: RobertM’s Shortform
Yeah, I agree that it’s too early to call it re: hitting a wall. I also just realized that releasing 4o for free might be some evidence in favor of 4.5/5 dropping soon-ish.

RobertM 13 May 2024 20:35 UTC
46 points
21
on: RobertM’s Shortform
Vaguely feeling like OpenAI might be moving away from GPT-N+1 release model, for some combination of “political/frog-boiling” reasons and “scaling actually hitting a wall” reasons. Seems relevant to note, since in the worlds where they hadn’t been drip-feeding people incremental releases of slight improvements over the original GPT-4 capabilities, and instead just dropped GPT-5 (and it was as much of an improvement over 4 as 4 was over 3, or close), that might have prompted people to do an explicit orientation step. As it is, I expect less of that kind of orientation to happen. (Though maybe I’m speaking too soon and they will drop GPT-5 on us at some point, and it’ll still manage to be a step-function improvement over whatever the latest GPT-4* model is at that point.)

RobertM 12 May 2024 20:40 UTC
7 points
3
on: RobertM’s Shortform
It’s not obvious to me why training LLMs on synthetic data produced by other LLMs wouldn’t work (up to a point). Under the model where LLMs are gradient-descending their way into learning algorithms that predict tokens that are generated by various expressions of causal structure in the universe, tokens produced by other LLMs don’t seem redundant with respect to the data used to train those LLMs. LLMs seem pretty different from most other things in the universe, including the data used to train them! It would surprise me if the algorithms that LLMs developed to predict non-LLM tokens were perfectly suited for predicting other LLM tokens “for free”.

RobertM 9 May 2024 21:37 UTC
3 points
0
in reply to: Nevin Wetherill’s comment on: Open Thread Spring 2024
EDIT: looks like habryka got there earlier and I didn’t see it.
https://www.lesswrong.com/posts/zXJfH7oZ62Xojnrqs/#sLay9Tv65zeXaQzR4
Intercom is indeed hidden on mobile (since it’d be pretty intrusive at that screen size).

RobertM 9 May 2024 19:22 UTC
5 points
15
in reply to: aogara’s comment on: RobertM’s Shortform
Ah, does look like Zach beat me to the punch :)
I’m also still moderately confused, though I’m not that confused about labs not speaking up—if you’re playing politics, then not throwing the PM under the bus seems like a reasonable thing to do. Maybe there’s a way to thread the needle of truthfully rebutting the accusations without calling the PM out, but idk. Seems like it’d be difficult if you weren’t either writing your own press release or working with a very friendly journalist.

RobertM 9 May 2024 1:06 UTC
17 points
13
in reply to: jeffreycaruso’s comment on: RobertM’s Shortform
I hadn’t, but I just did and nothing in the article seems to be responsive to what I wrote.
Amusingly, not a single news source I found reporting on the subject has managed to link to the “plan” that the involved parties (countries, companies, etc) agreed to.
Nothing in that summary affirmatively indicates that companies agreed to submit their future models to pre-deployment testing by the UK AISI. One might even say that it seems carefully worded to avoid explicitly pinning the companies down like that.

RobertM 8 May 2024 3:00 UTC
71 points
48
on: RobertM’s Shortform
EDIT: I believe I’ve found the “plan” that Politico (and other news sources) managed to fail to link to, maybe because it doesn’t seem to contain any affirmative commitments by the named companies to submit future models to pre-deployment testing by UK AISI.
I’ve seen a lot of takes (on Twitter) recently suggesting that OpenAI and Anthropic (and maybe some other companies) violated commitments they made to the UK’s AISI about granting them access for e.g. predeployment testing of frontier models. Is there any concrete evidence about what commitment was made, if any? The only thing I’ve seen so far is a pretty ambiguous statement by Rishi Sunak, who might have had some incentive to claim more success than was warranted at the time. If people are going to breathe down the necks of AGI labs about keeping to their commitments, they should be careful to only do it for commitments they’ve actually made, lest they weaken the relevant incentives. (This is not meant to endorse AGI labs behaving in ways which cause strategic ambiguity about what commitments they’ve made; that is also bad.)
What links here?
- MIRI’s May 2024 Newsletter by Harlan (15 May 2024 0:13 UTC; 79 points)

RobertM 8 May 2024 2:45 UTC
5 points
2
in reply to: Decaeneus’s comment on: Decaeneus’s Shortform
Huh, that went somewhere other than where I was expecting. I thought you were going to say that ignoring letter-of-the-rule violations is fine when they’re not spirit-of-the-rule violations, as a way of communicating the actual boundaries.

RobertM 7 May 2024 4:08 UTC
2 points
0
in reply to: jessicata’s comment on: How do open AI models affect incentive to race?
Yeah, there needs to be something like a nonlinearity somewhere. (Or just preference inconsistency, which humans are known for, to say nothing of larger organizations.)

RobertM 7 May 2024 3:11 UTC
8 points
0
on: How do open AI models affect incentive to race?
I’m not sure I personally endorse the model I’m proposing, but imagine a slightly less spherical AGI lab which has more than one incentive (profit maximization) driving its behavior. Maybe they care at least a little bit about not advancing the capabilities frontier as fast as possible. This can cause a preference ordering like:
1. don’t argmax capabilities, because there’s no open-source competition making it impossible to profit from current-gen models
2. argmax capabilities, since you need to stay ahead of open-source models nipping at your heels
3. don’t argmax capabilities; go bankrupt because open-source catches up to you (or gets “close enough” for enough of your customers)
ETA: But in practice most of my concerns around open-source AI development are elsewhere.

LW Frontpage Experiments! (aka “Take the wheel, Shoggoth!”)

Ruby and RobertM

23 Apr 2024 3:58 UTC

65 points

25 comments4 min readLW link

RobertM 18 Apr 2024 20:43 UTC
2 points
0
in reply to: ChristianKl’s comment on: RobertM’s Shortform
I think there might be many local improvements, but I’m pretty uncertain about important factors like elasticity of “demand” (for robbery) with respect to how much of a medication is available on demand. i.e. how many fewer robberies do you get if you can get at most a single prescriptions’ worth of some kind of controlled substance (and not necessarily any specific one), compared to “none” (the current situation) or “whatever the pharmacy has in stock” (not actually sure if this was the previous situation—maybe they had time delay safes for storing medication that wasn’t filling a prescription, and just didn’t store the filled prescriptions in the safes as well)?

RobertM 18 Apr 2024 3:49 UTC
10 points
0
on: RobertM’s Shortform
Headline claim: time delay safes are probably much too expensive in human time costs to justify their benefits.
The largest pharmacy chains in the US, accounting for more than 50% of the prescription drug market^[1]^[2], have been rolling out time delay safes (to prevent theft)^[3]. Although I haven’t confirmed that this is true across all chains and individual pharmacy locations, I believe these safes are used for all controlled substances. These safes open ~5-10 minutes after being prompted.
There were >41 million prescriptions dispensed for adderall in the US in 2021^[4]. (Note that likely means ~12x fewer people were prescribed adderall that year.) Multiply that by 5 minutes and you get >200 million minutes, or >390 person-years, wasted. Now, surely some of that time is partially recaptured by e.g. people doing their shopping while waiting, or by various other substitution effects. But that’s also just adderall!
Seems quite unlikely that this is on the efficient frontier of crime-prevention mechanisms, but alas, the stores aren’t the ones (mostly) paying the costs imposed by their choices, here.

RobertM 24 Mar 2024 0:29 UTC
12 points
6
on: General Thoughts on Secular Solstice
use spaces that your community already has (Lighthaven?), even if they’re not quite set up the right way for them
Not set up the right way would be an understatement, I think. Lighthaven doesn’t have an indoor space which can seat several hundred people, and trying to do it outdoors seems like it’d require solving maybe-intractable logistical problems (weather, acoustics, etc). (Also Lighthaven was booked, and it’s not obvious to me to what degree we’d want to subsidize the solstice celebration. It’d also require committing a year ahead of time, since most other suitable venues are booked up for the holidays quite far in advance.)
I don’t think there are other community venues that could host the solstice celebration for free, but there might be opportunities for cheaper (or free) venues outside the community (with various trade-offs).

RobertM

Against “ar­gu­ment from over­hang risk”

LW Front­page Ex­per­i­ments! (aka “Take the wheel, Shog­goth!”)

Against “argument from overhang risk”

LW Frontpage Experiments! (aka “Take the wheel, Shoggoth!”)