Thank you for the elaboration; I appreciate it a lot, and have upvoted for the effort. Here are your clearest points paraphrased as I understand them (sometimes just using your words), each followed by my reply:
1. The FDA is net negative for health; therefore creating an FDA-for-AI would likely be net negative for the AI challenges.
I don’t think you can reach this conclusion, even if I agree with the premise, because the counterfactuals are very different. With drugs, the counterfactual of no FDA might be: some people get more treatments; some die but many don’t; they were sick anyway, so something had to be done; and maybe fewer die than do with the FDA around, so maybe the existence of the FDA is net bad compared to the counterfactual. I won’t dispute this; I don’t know enough about it. However, the counterfactual in AI is different. If unregulated, AI progress steams on ahead, competition over the high rewards is fierce, and if we don’t have a good safety plan (which we don’t) then maybe we all die at some point, who knows when. Whereas if an FDA-for-AI creates bad regulation (as long as it’s not bad enough to cause an AI-regulation winter), it at least starts slowing that progress down. Maybe the slowdown is bad for, idk, the diseases that could have been cured during the ten years between when AI would otherwise have solved cancer and when it actually does, and that kind of thing, but that is nowhere near as bad as the counterfactual! The two scenarios are not comparable, because the counterfactual of no FDA is not as bad as the counterfactual of no AI regulator.
2. Enough errors would almost certainly occur in AI regulation to make it net negative.
You gave a bunch of examples of bad regulation from outside AI (I am not going to bother working out whether I agree that they are bad regulation, as it’s not cruxy), but you didn’t explain how exactly those errors would make AI regulation net negative. As with the previous claim, I think the counterfactuals likely make this not hold.
3. “...a field where there is bound to be vastly more misunderstanding should be at least as prone to regulation backfiring”
That is an interesting claim, but I am not sure what makes you think it’s obviously true, as it depends on what your goal is. My understanding of the OP is that the goal of the type of regulation they advocate is simply to slow down AI development, nothing more, nothing less. If the goal is to do good regulation of AI, that’s totally different. Is there a specific way in which you imagine it backfiring for the goal of simply slowing down AI progress?
4. “...an [oppressive] regime gaining controllable AI would produce an astronomical suffering risk.”
I am unsure what point you were making in the paragraph about evil. Was it about another regime, one that might not do safety, getting there first? For a response, see Objection 4 in the OP, which I share and to which I added an additional reason why that is not a real worry in this world.
5. “...unwise to think that people who take blatant actions to kill innocents for political convenience would be safe custodians of AI...”
I don’t think it’s fair to say regulators would be custodians. They have one particular lever, called “slow things down”, and that lever does not mean they can, for example, seize the AI and start operating it. It is not in their power to do that legally, nor do they have the capability to do anything with it. We are talking here about slowing things down before AGI, not post-AGI.
6. “the electorate does not understand AI”
My answer is the same as my answer to 3. above, and also similar to Objection 1 in the OP.
And finally, to reply to this: “hopefully this should clarify to a degree why I anticipate both severe X risks and S risks from most attempts at AI regulation”
Basically, no, it doesn’t really clarify it. You started from a premise I agree with, or at least don’t know enough to refute (that the FDA may be net negative), then drew a conclusion I disagree with (see my reply to 1. above), and then all your other points assumed that conclusion, so I couldn’t really follow. I tried to pick out the bits that seemed like possible key points and reply to them, but yeah, I think you’re pretty confused.
What do you think of my reply to 1., about the counterfactuals being different? I think that’s the best way to move the conversation forward.
I visited the EA Hotel last year for a few days, enjoyed my stay, and think the project is net positive and worth funding. But I think it could be better; in particular, I disagree with the vetting policy being so open if the Hotel aims to be an incubator:
The fact that it is possible to randomly get some good outcomes despite low vetting standards does not make this a cost-effective way to get good outcomes. The Hotel taking a hits-based approach does not preclude a better vetting policy.
IMO, the acceptance policy should be: by default all rooms cost money, and people working on impact projects, or preparing for future impact work, can apply for a free room. If they meet the minimum standard, I take them. If more applications meet the minimum standard than there are rooms, I prioritise among them. I would have no fixed split of rooms between paying and non-paying guests. My minimum standard would vary with the financial situation, and I would revise it as enough data builds up. (A sketch of this logic is below.)
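To make that concrete, here is a minimal sketch of the allocation logic I have in mind. Everything in it (the function name, the single “score” per applicant, the numbers) is my hypothetical illustration, not anything the Hotel actually does:

```python
# Hypothetical sketch of the proposed acceptance policy; the scoring
# scheme and names are illustrative assumptions, not the Hotel's process.

def allocate_free_rooms(applicants, rooms_available, minimum_standard):
    """Accept every free-room applicant who meets the minimum standard;
    if demand exceeds supply, prioritise the strongest applicants."""
    qualifying = [a for a in applicants if a["score"] >= minimum_standard]
    if len(qualifying) <= rooms_available:
        return qualifying  # no fixed quota: take everyone above the bar
    # More qualifying applicants than rooms: prioritise by score.
    qualifying.sort(key=lambda a: a["score"], reverse=True)
    return qualifying[:rooms_available]

# Example: a bar of 6/10 (to be revised as data builds up), 2 rooms free.
applicants = [
    {"name": "A", "score": 7},
    {"name": "B", "score": 5},
    {"name": "C", "score": 9},
]
print(allocate_free_rooms(applicants, rooms_available=2, minimum_standard=6))
# -> C and A get free rooms; B could still take a paying room
```

The key property is that the bar and the quota are decoupled: the minimum standard filters, and rooms only become scarce for non-paying guests when demand genuinely exceeds supply.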
You said, “strong vetting is nice, but there’s no replacement for simply trying many things and seeing what works”. These are not mutually exclusive: when you do strong vetting, you typically have criteria (priors) that you update, and you also update the process itself as you learn what is working.
About the plans to vet post hoc: I predict a bias towards keeping people, because of sunk costs on both sides.
Projects are not independent: interesting projects happening there will attract even more interesting projects (in particular, complementary ones). They could be put on the website under Current Residents, creating positive feedback loops. This matters especially because people who are planning to drop things and move to Blackpool are going to want some confidence in where they’re going.
I agree that tapping into the thought diversity of the larger community is good; I just think you need some vetting. What I would like to see is a plan for a vetting process that both gets this diversity and maximises quality. I don’t think you need a totally open-doors policy to get thought diversity, although I acknowledge the trade-off.
The comparison to https://www.cityyear.org/boston is interesting. If the goal is to create something similar to that, then I take back all my points about vetting; I just think that is quite a different goal from being an incubator for high-impact projects. Why do I think this? Because the set of people who need lots of support to “stay on track” and the set of people whose incubated projects are going to make a very high positive impact in the world rarely overlap. The project aims to target these rare people, but I think this particular rare group is exceptional by definition and has already learnt how to bootstrap by itself.
However, maybe there are people in the tails of a slightly different but similar distribution: people who will not make very high-impact things but might do medium-impact things, and who do not yet know how to bootstrap. This population is somehow hard for me to model; at least its boundaries are fuzzy. The EA Hotel’s approach makes more sense to me if this is the goal. In their position I might test that hypothesis with a very low bar, though I would still set the bar a little higher than it currently is, if only for the feedback loops I talked about.
Basically, I think that as an incubator the model doesn’t work without raising the bar; as a refuge it works and could be very valuable, but then it shouldn’t be portrayed as an incubator. If it aims to be both, then this medium-potential-impact target audience is really hard to model, and some work should be done to identify who is and who is not in it; ultimately there should still be some raising of the bar. I also think the Hotel is valuable for visitors and for ad hoc events like retreats and unconferences. All round, a truly, uniquely good idea.
An irrelevant aside: I don’t like the pyramid. In particular, the distinction drawn between someone who self-identifies strongly as an EA and someone who self-identifies weakly seems irrelevant. Do you believe strength of self-identification correlates with anything of interest with respect to getting a project funded? When I look at the most interesting things, some are done by people who self-identify strongly and others by people who keep their identity small.