RSS

xrchz(Ramana Kumar)

Karma: 318

Speci­fi­ca­tion gam­ing: the flip side of AI ingenuity

6 May 2020 23:51 UTC
41 points
3 comments6 min readLW link

Clas­sify­ing speci­fi­ca­tion prob­lems as var­i­ants of Good­hart’s Law

19 Aug 2019 20:40 UTC
71 points
2 comments5 min readLW link

Model­ing AGI Safety Frame­works with Causal In­fluence Diagrams

xrchz
21 Jun 2019 12:50 UTC
47 points
6 comments1 min readLW link
(arxiv.org)

Thoughts on Hu­man Models

21 Feb 2019 9:10 UTC
125 points
22 comments10 min readLW link