Vika

Karma: 3,041

Victoria Krakovna. Research scientist at DeepMind working on AI safety, and cofounder of the Future of Life Institute. Website and blog: vkrakovna.wordpress.com

DeepMind alignment team opinions on AGI ruin arguments

Vika · 12 Aug 2022 21:06 UTC
376 points
37 comments · 14 min read · LW link · 1 review

Possible takeaways from the coronavirus pandemic for slow AI takeoff

Vika · 31 May 2020 17:51 UTC
135 points
36 comments · 3 min read · LW link · 1 review

[Linkpost] Some high-level thoughts on the DeepMind alignment team’s strategy

7 Mar 2023 11:55 UTC
128 points
13 comments · 5 min read · LW link
(drive.google.com)

Strategic choice of identity

Vika · 8 Mar 2014 16:27 UTC
119 points
58 comments · 2 min read · LW link

When discussing AI risks, talk about capabilities, not intelligence

Vika · 11 Aug 2023 13:38 UTC
116 points
7 comments · 3 min read · LW link
(vkrakovna.wordpress.com)

Refining the Sharp Left Turn threat model, part 1: claims and mechanisms

12 Aug 2022 15:17 UTC
85 points
4 comments · 3 min read · LW link · 1 review
(vkrakovna.wordpress.com)

Optimization Concepts in the Game of Life

16 Oct 2021 20:51 UTC
74 points
16 comments · 11 min read · LW link

Classifying specification problems as variants of Goodhart’s Law

Vika · 19 Aug 2019 20:40 UTC
72 points
5 comments · 5 min read · LW link · 1 review

New organization - Future of Life Institute (FLI)

Vika · 14 Jun 2014 23:00 UTC
70 points
35 comments · 1 min read · LW link

Specification gaming: the flip side of AI ingenuity

6 May 2020 23:51 UTC
65 points
9 comments · 6 min read · LW link

Power-seeking can be probable and predictive for trained agents

28 Feb 2023 21:10 UTC
56 points
22 comments · 9 min read · LW link
(arxiv.org)

Paradigms of AI alignment: components and enablers

Vika · 2 Jun 2022 6:19 UTC
53 points
4 comments · 8 min read · LW link

Moving on from community living

Vika · 17 Apr 2024 17:02 UTC
49 points
7 comments · 3 min read · LW link
(vkrakovna.wordpress.com)

Specification gaming examples in AI

Vika · 3 Apr 2018 12:30 UTC
45 points
9 comments · 1 min read · LW link · 2 reviews

New DeepMind AI Safety Research Blog

Vika · 27 Sep 2018 16:28 UTC
43 points
0 comments · 1 min read · LW link
(medium.com)

Refining the Sharp Left Turn threat model, part 2: applying alignment techniques

25 Nov 2022 14:36 UTC
39 points
9 comments · 6 min read · LW link
(vkrakovna.wordpress.com)

To contribute to AI safety, consider doing AI research

Vika · 16 Jan 2016 20:42 UTC
39 points
39 comments · 2 min read · LW link

Tradeoff between desirable properties for baseline choices in impact measures

Vika · 4 Jul 2020 11:56 UTC
37 points
24 comments · 5 min read · LW link

Future of Life Institute existential risk news site

Vika · 19 Mar 2015 14:33 UTC
35 points
2 comments · 1 min read · LW link