Announcing the Alignment Research Center

paulfchristiano26 Apr 2021 23:30 UTC

LW: 178 AF: 58

I’m now working full-time on the Alignment Research Center (ARC), a new non-profit focused on intent alignment research.

I left OpenAI at the end of January and I’ve spent the last few months planning, doing some theoretical research, doing some logistical set-up, and taking time off.

For now it’s just me, focusing on theoretical research. I’m currently feeling pretty optimistic about this work: I think there’s a good chance that it will yield big alignment improvements within the next few years, and a good chance that those improvements will be integrated into practice at leading ML labs.

My current goal is to build a small team working productively on theory. I’m not yet sure how we’ll approach hiring, but if you’re potentially interested in joining you can fill out this tiny form to get notified when we’re ready.

Over the medium term (and maybe starting quite soon) I also expect to implement and study techniques that emerge from theoretical work, to help ML labs adopt alignment techniques, and to work on alignment forecasting and strategy.

What links here?

paulfchristiano26 Apr 2021 23:30 UTC

LW: 178 AF: 58

6 comments1 min readLW link

AI Organization Updates

Ben Pace 26 Apr 2021 23:51 UTC
LW: 31 AF: 13
0
AF
Best of skill to you.
- adamShimi 27 Apr 2021 9:07 UTC
  LW: 7 AF: 3
  0
  AF Parent
  This is so great! I always hate wishing people luck when I trust in their competence to mostly deal with bad luck and leverage good luck. I’ll use that one now.
adamShimi 27 Apr 2021 9:06 UTC
LW: 10 AF: 5
0
AF
Sounds really exciting! I’m wondering which kind of theoretical computer science do you have in mind specifically? Like which part of that do you think has the most uses for alignment? (Still trying to find a way to use my PhD in the theory of distributed computing for something alignment related ^^)
What links here?
- adamShimi's comment on AMA: Paul Christiano, alignment researcher by paulfchristiano (28 Apr 2021 20:45 UTC; 18 points)
Lu_Tong 29 Apr 2021 21:24 UTC
4 points
0
What do you see as central examples of work on “alignment forecasting”? I’m unclear on what that means.
Charlie Steiner 27 Apr 2021 0:05 UTC
LW: 2 AF: 1
0
AF
Congrats!

I’ll procrastinate from thesis-writing to fill out the form :)
- Ben Pace 27 Apr 2021 0:08 UTC
  LW: 17 AF: 9
  0
  AF Parent
  You’re gonna get back to thesis writing quickly, it’s a very short form.