Charbel-Raphaël

Karma: 3,363

Charbel-Raphael Segerie

https://crsegerie.github.io/

Living in Paris

OpenAI’s red line for AI self-improvement is fundamentally flawed

Charbel-Raphaël2 May 2026 14:44 UTC

35 points

3 comments3 min readLW link

Basics of How Not to Die

Camille B. , Jérémy Andréoletti, elisareine, Charbel-Raphaël, Lucie Philippon, RationalHippy and T-bo🔸

31 Jan 2026 19:04 UTC

111 points

20 comments4 min readLW link

AI Red Lines: A Research Agenda

Charbel-Raphaël22 Nov 2025 8:41 UTC

30 points

1 comment5 min readLW link

A Call for Better Risk Modelling

Jan Wehner and Charbel-Raphaël

18 Nov 2025 9:08 UTC

20 points

0 comments4 min readLW link

Global Call for AI Red Lines—Signed by Nobel Laureates, Former Heads of State, and 200+ Prominent Figures

Charbel-Raphaël22 Sep 2025 18:22 UTC

339 points

27 comments6 min readLW link

Dissolving moral philosophy: from pain to meta-ethics

Charbel-Raphaël4 Aug 2025 20:20 UTC

9 points

3 comments2 min readLW link

The bitter lesson of misuse detection

Hadrien and Charbel-Raphaël

10 Jul 2025 14:50 UTC

37 points

6 comments7 min readLW link

The 80/20 playbook for mitigating AI scheming in 2025

Charbel-Raphaël31 May 2025 21:17 UTC

40 points

2 comments4 min readLW link

[Paper] Safety by Measurement: A Systematic Literature Review of AI Safety Evaluation Methods

markov and Charbel-Raphaël

19 May 2025 10:38 UTC

23 points

0 comments1 min readLW link

Charbel-Raphaël’s Shortform

Charbel-Raphaël21 Apr 2025 20:49 UTC

6 points

36 comments1 min readLW link

🇫🇷 Announcing CeSIA: The French Center for AI Safety

Charbel-Raphaël20 Dec 2024 14:17 UTC

102 points

2 comments8 min readLW link

Are we dropping the ball on Recommendation AIs?

Charbel-Raphaël23 Oct 2024 17:48 UTC

53 points

17 comments6 min readLW link

[Question] We might be dropping the ball on Autonomous Replication and Adaptation.

Charbel-Raphaël and Épiphanie Gédéon

31 May 2024 13:49 UTC

63 points

30 comments4 min readLW link

AI Safety Strategies Landscape

Charbel-Raphaël9 May 2024 17:33 UTC

41 points

1 comment42 min readLW link

Constructability: Plainly-coded AGIs may be feasible in the near future

Épiphanie Gédéon and Charbel-Raphaël

27 Apr 2024 16:04 UTC

91 points

15 comments13 min readLW link

[Question] What convincing warning shot could help prevent extinction from AI?

Charbel-Raphaël and cozyfractal

13 Apr 2024 18:09 UTC

109 points

22 comments2 min readLW link

My intellectual journey to (dis)solve the hard problem of consciousness

Charbel-Raphaël6 Apr 2024 9:32 UTC

48 points

44 comments30 min readLW link

AI Safety 101 : Capabilities—Human Level AI, What? How? and When?

markov and Charbel-Raphaël

7 Mar 2024 17:29 UTC

46 points

8 comments54 min readLW link

The case for training frontier AIs on Sumerian-only corpus

Alexandre Variengien, Charbel-Raphaël and Jonathan Claybrough

15 Jan 2024 16:40 UTC

143 points

16 comments3 min readLW link

aisafety.info, the Table of Content

Charbel-Raphaël31 Dec 2023 13:57 UTC

23 points

1 comment11 min readLW link