
Center for Human-Compatible AI (CHAI)

Last edit: 5 Feb 2023 21:04 UTC by _will_

The Center for Human-Compatible AI (CHAI) is a research institute at UC Berkeley, founded and led by Stuart Russell. Its stated objective is to prevent the creation of unfriendly AI by focusing research on provably beneficial behaviour.

External links: humancompatible.ai (CHAI's website)

Posts tagged Center for Human-Compatible AI (CHAI):

Announcing the Alignment Newsletter
Rohin Shah, 9 Apr 2018 21:16 UTC
29 points, 3 comments, 1 min read, LW link

Our take on CHAI’s research agenda in under 1500 words
Alex Flint, 17 Jun 2020 12:24 UTC
112 points, 18 comments, 5 min read, LW link

CHAI Internship applications are due by 12/15
martinfukui, 13 Dec 2019 9:19 UTC
24 points, 0 comments, 1 min read, LW link

[AN #69] Stuart Russell’s new book on why we need to replace the standard model of AI
Rohin Shah, 19 Oct 2019 0:30 UTC
60 points, 12 comments, 15 min read, LW link (mailchi.mp)

Stuart Russell: AI value alignment problem must be an “intrinsic part” of the field’s mainstream agenda
Rob Bensinger, 26 Nov 2014 11:02 UTC
55 points, 38 comments, 3 min read, LW link

Rebuttal piece by Stuart Russell and FHI Research Associate Allan Dafoe: “Yes, the experts are worried about the existential risk of artificial intelligence.”
crmflynn, 3 Nov 2016 17:54 UTC
14 points, 0 comments, 1 min read, LW link (www.technologyreview.com)

Stuart Russell’s Center for Human Compatible AI is looking for an Assistant Director
crmflynn, 25 Apr 2017 10:21 UTC
4 points, 0 comments, 1 min read, LW link (humancompatible.ai)

Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More
Ben Pace, 4 Oct 2019 4:08 UTC
221 points, 61 comments, 15 min read, LW link, 2 reviews

Human beats SOTA Go AI by learning an adversarial policy
Vanessa Kosoy, 19 Feb 2023 9:38 UTC
57 points, 32 comments, 1 min read, LW link (goattack.far.ai)

Research internship position at CHAI
DanielFilan, 16 Jan 2018 6:25 UTC
10 points, 3 comments, 1 min read, LW link (humancompatible.ai)

2019 AI Alignment Literature Review and Charity Comparison
Larks, 19 Dec 2019 3:00 UTC
130 points, 18 comments, 62 min read, LW link

2018 AI Alignment Literature Review and Charity Comparison
Larks, 18 Dec 2018 4:46 UTC
190 points, 26 comments, 62 min read, LW link, 1 review

AXRP Episode 8 - Assistance Games with Dylan Hadfield-Menell
DanielFilan, 8 Jun 2021 23:20 UTC
22 points, 1 comment, 72 min read, LW link

CHAI internship applications are open (due Nov 13)
Erik Jenner, 26 Oct 2023 0:53 UTC
34 points, 0 comments, 3 min read, LW link

Learning preferences by looking at the world
Rohin Shah, 12 Feb 2019 22:25 UTC
43 points, 10 comments, 7 min read, LW link (bair.berkeley.edu)

The Alignment Problem Needs More Positive Fiction
Netcentrica, 21 Aug 2022 22:01 UTC
5 points, 2 comments, 5 min read, LW link

UC Berkeley launches Center for Human-Compatible Artificial Intelligence
ignoranceprior, 29 Aug 2016 22:43 UTC
15 points, 1 comment, 2 min read, LW link

Tensor Trust: An online game to uncover prompt injection vulnerabilities
1 Sep 2023 19:31 UTC
30 points, 0 comments, 5 min read, LW link (tensortrust.ai)

Current AI Safety Roles for Software Engineers
ozziegooen, 9 Nov 2018 20:57 UTC
70 points, 9 comments, 4 min read, LW link

2017 AI Safety Literature Review and Charity Comparison
Larks, 24 Dec 2017 18:52 UTC
41 points, 5 comments, 23 min read, LW link

CHAI Internship Application
martinfukui, 11 Nov 2020 21:10 UTC
31 points, 0 comments, 1 min read, LW link

Swimming Upstream: A Case Study in Instrumental Rationality
TurnTrout, 3 Jun 2018 3:16 UTC
76 points, 7 comments, 8 min read, LW link

Rohin Shah on reasons for AI optimism
abergal, 31 Oct 2019 12:10 UTC
40 points, 58 comments, 1 min read, LW link (aiimpacts.org)