Yes please
Do you have the transcript from this?
I like it—interesting how much is to do with the specific vulnerabilities of humans, and how humans exploiting other humans’ vulnerabilities was what enabled and exacerbated the situation.
There’s also a romantic theme ;-)
Whilst we’re sharing stories... I’ll shamelessly promote one of my (very) short stories on human manipulation by AI. In this case the AI is being deliberate, at least in achieving its instrumental goals. https://docs.google.com/document/d/1Z1laGUEci9rf_aaDjQKS_IIOAn6D0VtAOZMSqZQlqVM/edit
Is it a coincidence that your handle is blaked? (It’s a little similar to Blake) Just curious.
Ha! I meant the former, but I like your second interpretation too!
I like, ‘do the impossible—listen’.
Recruitment: in my experience, often a weeks-long process from start to finish, well oiled and systematic, using all the tips on selection from the organizational behaviour handbook, and often with feedback given too. By comparison, some tech companies can take several months to hire, with lots of ad hoc decision-making, no processes around biases or conflicts of interest, and no feedback.
Happy to give more examples if you want by DM.
I should say my sample size is tiny here: I know one gov dept in depth, one tech company in depth, and a handful of other tech companies and gov depts not fully from the inside, just from talking with friends who work there, etc.
What exactly is the trust problem you’re referring to?
Is it that you think people are, in general, not as trusting as they should be?
I also interpreted it this way and was confused for a while. I think your suggested title is clearer, Neel.
Thank you for writing this. On your section ‘Obstruction doesn’t need discernment’, see also this post that went up on LW a while back called The Regulatory Option: A response to near 0% survival odds. I thought it was an excellent post, and it didn’t get anywhere near the attention it deserved, in my view.
I think the two camps are less orthogonal than your examples of privacy and compute reg portray. There’s room for plenty of excellent policy interventions that both camps could work together to support. For instance, increasing regulatory requirements for transparency on algorithmic decision-making (and, crucially, building capacity both in regulators and in the market supporting them to enforce this) is something I think both camps would get behind (the x-risk camp because it creates demand for interpretability and more, the other because, e.g., it’s easier to show fairness issues) and could productively work on together. I think there are subculture-clash reasons the two camps don’t always get on, but these can be overcome, particularly given there’s a common enemy (misaligned powerful AI). See also this paper: Beyond Near- and Long-Term: Towards a Clearer Account of Research Priorities in AI Ethics and Society. I know lots of people who are uncertain about how big the risks are, care about both problems, and work on both (I am one of these: I care more about AGI risk, but I think the best things I can do to help avert it involve working with the people you think aren’t helpful).
To build on the benefit you noted here:
better citability (e.g. if somebody writes an ML paper to be published in ML venues, it gives more credibility to cite arXiv papers than Alignment Forum/LessWrong posts).
There are some areas of work where it’s useful not to implicitly signal affiliation with a somewhat weird group like LW or AF folks, so that the content is read at face value when you share it with people coming from different subcultures and perspectives. I think this would be hugely valuable for the people sharing such work.
This seems solvable and very much worth solving!
Agree.
Human values are very complex, and most recommender systems don’t even try to model them. Instead, most optimise for things like ‘engagement’, which they claim is aligned with a user’s ‘revealed preference’. This notion of ‘revealed preference’ is a far cry from true preferences (which are very complex), let alone human values (which are also very complex). I recommend this article for an introduction to some of the issues here: https://medium.com/understanding-recommenders/what-does-it-mean-to-give-someone-what-they-want-the-nature-of-preferences-in-recommender-systems-82b5a1559157
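To make the gap concrete, here is a minimal toy sketch in Python (all item names and scores are hypothetical, invented for illustration, not drawn from any real system): ranking the same items by a predicted-engagement proxy versus by the user’s true preferences yields different orderings.

```python
# Toy illustration: an "engagement" proxy (e.g. predicted clicks) can diverge
# from a user's true preferences. All items and scores below are made up.

items = [
    # (title, predicted_engagement, true_preference)
    ("outrage-bait headline",  0.9, 0.2),  # very clicky, later regretted
    ("long-form explainer",    0.3, 0.9),  # rarely clicked, highly valued
    ("celebrity gossip",       0.7, 0.3),
    ("practical how-to guide", 0.4, 0.8),
]

# A recommender optimising the proxy ranks by predicted engagement;
# one modelling the user's actual preferences would rank differently.
by_engagement = sorted(items, key=lambda item: item[1], reverse=True)
by_preference = sorted(items, key=lambda item: item[2], reverse=True)

print("Ranked by engagement proxy:", [title for title, *_ in by_engagement])
print("Ranked by true preference: ", [title for title, *_ in by_preference])
```

The point is only that optimising the click-based ‘revealed preference’ proxy systematically surfaces different content than the preferences the user would actually endorse.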
Support.
I would add to this that The Alignment Problem by Brian Christian is a fantastic general-audience book that shows how immediate and long-term AI policy really are facing the same problem and will work better if we all work together.
If you know of any more such analyses, could you share them?
I would be interested in seeing a list of any existing work in this area. I think determining the red lines well is going to be very useful for policymakers in the next few years.
Not a textbook (more for a general audience), but The Alignment Problem by Brian Christian is a pretty good introduction that I reckon most people interested in this would get behind.