Lukas Fluri

Karma: 41

Zurich AI Safety is looking for (Co-)Directors - EOI

3 Sep 2025 17:40 UTC
12 points
0 comments · 4 min read · LW link

The Perils of Optimizing Learned Reward Functions

Lukas Fluri · 11 Jul 2025 16:06 UTC
17 points
1 comment · 21 min read · LW link

Evaluating Superhuman Models with Consistency Checks

1 Aug 2023 7:51 UTC
21 points
2 comments · 9 min read · LW link
(arxiv.org)

Open Problems in Negative Side Effect Minimization

6 May 2022 9:37 UTC
12 points
6 comments · 17 min read · LW link