Michaël Trazzi

Karma: 1,684

theinsideview.ai

Connor Leahy on Dying with Dignity, EleutherAI and Conjecture

Michaël Trazzi22 Jul 2022 18:44 UTC

194 points

29 comments14 min readLW link

(theinsideview.ai)

OpenAI Solves (Some) Formal Math Olympiad Problems

Michaël Trazzi2 Feb 2022 21:49 UTC

78 points

27 comments2 min readLW link

A Gym Gridworld Environment for the Treacherous Turn

Michaël Trazzi28 Jul 2018 21:27 UTC

74 points

9 comments3 min readLW link

(github.com)

Ethan Caballero on Private Scaling Progress

Michaël Trazzi5 May 2022 18:32 UTC

63 points

2 comments2 min readLW link

(theinsideview.github.io)

An Increasingly Manipulative Newsfeed

Michaël Trazzi1 Jul 2019 15:26 UTC

62 points

16 comments5 min readLW link

Book Review: AI Safety and Security

Michaël Trazzi21 Aug 2018 10:23 UTC

51 points

2 comments11 min readLW link

The Codex Skeptic FAQ

Michaël Trazzi24 Aug 2021 16:01 UTC

49 points

24 comments2 min readLW link

Jesse Hoogland on Developmental Interpretability and Singular Learning Theory

Michaël Trazzi6 Jul 2023 15:46 UTC

42 points

2 comments4 min readLW link

(theinsideview.ai)

Blake Richards on Why he is Skeptical of Existential Risk from AI

Michaël Trazzi14 Jun 2022 19:09 UTC

41 points

12 comments4 min readLW link

(theinsideview.ai)

Victoria Krakovna on AGI Ruin, The Sharp Left Turn and Paradigms of AI Alignment

Michaël Trazzi12 Jan 2023 17:09 UTC

40 points

3 comments4 min readLW link

(www.theinsideview.ai)

Katja Grace on Slowing Down AI, AI Expert Surveys And Estimating AI Risk

Michaël Trazzi16 Sep 2022 17:45 UTC

40 points

2 comments3 min readLW link

(theinsideview.ai)

Human-Aligned AI Summer School: A Summary

Michaël Trazzi11 Aug 2018 8:11 UTC

39 points

5 comments4 min readLW link

Neel Nanda on the Mechanistic Interpretability Researcher Mindset

Michaël Trazzi21 Sep 2023 19:47 UTC

36 points

1 comment3 min readLW link

(theinsideview.ai)

Why Copilot Accelerates Timelines

Michaël Trazzi26 Apr 2022 22:06 UTC

35 points

14 comments7 min readLW link

[Question] What will GPT-4 be incapable of?

Michaël Trazzi6 Apr 2021 19:57 UTC

34 points

33 comments1 min readLW link

Shahar Avin On How To Regulate Advanced AI Systems

Michaël Trazzi23 Sep 2022 15:46 UTC

31 points

0 comments4 min readLW link

(theinsideview.ai)

Evan Hubinger on Homogeneity in Takeoff Speeds, Learned Optimization and Interpretability

Michaël Trazzi8 Jun 2021 19:20 UTC

28 points

0 comments55 min readLW link

Robert Long On Why Artificial Sentience Might Matter

Michaël Trazzi28 Aug 2022 17:30 UTC

26 points

5 comments5 min readLW link

(theinsideview.ai)

Ethan Perez on the Inverse Scaling Prize, Language Feedback and Red Teaming

Michaël Trazzi24 Aug 2022 16:35 UTC

26 points

0 comments3 min readLW link

(theinsideview.ai)

Collin Burns on Alignment Research And Discovering Latent Knowledge Without Supervision

Michaël Trazzi17 Jan 2023 17:21 UTC

25 points

5 comments4 min readLW link

(theinsideview.ai)