Michaël Trazzi
Karma: 1,684
theinsideview.ai
Connor Leahy on Dying with Dignity, EleutherAI and Conjecture
Michaël Trazzi
22 Jul 2022 18:44 UTC
194
points
29
comments
14
min read
LW
link
(theinsideview.ai)
OpenAI Solves (Some) Formal Math Olympiad Problems
Michaël Trazzi
2 Feb 2022 21:49 UTC
78
points
27
comments
2
min read
LW
link
A Gym Gridworld Environment for the Treacherous Turn
Michaël Trazzi
28 Jul 2018 21:27 UTC
74
points
9
comments
3
min read
LW
link
(github.com)
Ethan Caballero on Private Scaling Progress
Michaël Trazzi
5 May 2022 18:32 UTC
63
points
2
comments
2
min read
LW
link
(theinsideview.github.io)
An Increasingly Manipulative Newsfeed
Michaël Trazzi
1 Jul 2019 15:26 UTC
62
points
16
comments
5
min read
LW
link
Book Review: AI Safety and Security
Michaël Trazzi
21 Aug 2018 10:23 UTC
51
points
2
comments
11
min read
LW
link
The Codex Skeptic FAQ
Michaël Trazzi
24 Aug 2021 16:01 UTC
49
points
24
comments
2
min read
LW
link
Jesse Hoogland on Developmental Interpretability and Singular Learning Theory
Michaël Trazzi
6 Jul 2023 15:46 UTC
42
points
2
comments
4
min read
LW
link
(theinsideview.ai)
Blake Richards on Why he is Skeptical of Existential Risk from AI
Michaël Trazzi
14 Jun 2022 19:09 UTC
41
points
12
comments
4
min read
LW
link
(theinsideview.ai)
Victoria Krakovna on AGI Ruin, The Sharp Left Turn and Paradigms of AI Alignment
Michaël Trazzi
12 Jan 2023 17:09 UTC
40
points
3
comments
4
min read
LW
link
(www.theinsideview.ai)
Katja Grace on Slowing Down AI, AI Expert Surveys And Estimating AI Risk
Michaël Trazzi
16 Sep 2022 17:45 UTC
40
points
2
comments
3
min read
LW
link
(theinsideview.ai)
Human-Aligned AI Summer School: A Summary
Michaël Trazzi
11 Aug 2018 8:11 UTC
39
points
5
comments
4
min read
LW
link
Neel Nanda on the Mechanistic Interpretability Researcher Mindset
Michaël Trazzi
21 Sep 2023 19:47 UTC
36
points
1
comment
3
min read
LW
link
(theinsideview.ai)
Why Copilot Accelerates Timelines
Michaël Trazzi
26 Apr 2022 22:06 UTC
35
points
14
comments
7
min read
LW
link
[Question]
What will GPT-4 be incapable of?
Michaël Trazzi
6 Apr 2021 19:57 UTC
34
points
33
comments
1
min read
LW
link
Shahar Avin On How To Regulate Advanced AI Systems
Michaël Trazzi
23 Sep 2022 15:46 UTC
31
points
0
comments
4
min read
LW
link
(theinsideview.ai)
Evan Hubinger on Homogeneity in Takeoff Speeds, Learned Optimization and Interpretability
Michaël Trazzi
8 Jun 2021 19:20 UTC
28
points
0
comments
55
min read
LW
link
Robert Long On Why Artificial Sentience Might Matter
Michaël Trazzi
28 Aug 2022 17:30 UTC
26
points
5
comments
5
min read
LW
link
(theinsideview.ai)
Ethan Perez on the Inverse Scaling Prize, Language Feedback and Red Teaming
Michaël Trazzi
24 Aug 2022 16:35 UTC
26
points
0
comments
3
min read
LW
link
(theinsideview.ai)
Collin Burns on Alignment Research And Discovering Latent Knowledge Without Supervision
Michaël Trazzi
17 Jan 2023 17:21 UTC
25
points
5
comments
4
min read
LW
link
(theinsideview.ai)