
Mateusz Bagiński

Karma: 3,669

I endorse and operate by Crocker’s rules.

I have not signed any agreements whose existence I cannot mention.

Applications open for the Online wing of the AFFINE Superintelligence Alignment Seminar

15 Apr 2026 16:10 UTC
20 points
0 comments · 1 min read · LW link

Slack in Cells, Slack in Brains

Mateusz Bagiński · 31 Mar 2026 0:35 UTC
43 points
3 comments · 6 min read · LW link

Don’t Overdose Locally Beneficial Changes

Mateusz Bagiński · 28 Mar 2026 18:24 UTC
79 points
12 comments · 4 min read · LW link

Scaffolded Reproducers, Scaffolded Agents

Mateusz Bagiński · 26 Mar 2026 23:47 UTC
37 points
1 comment · 3 min read · LW link

Superintelligence Alignment Seminar (1 month focused upskilling)

Mateusz Bagiński · 17 Feb 2026 17:03 UTC
115 points
13 comments · 3 min read · LW link

Reasons to sign a statement to ban superintelligence (+ FAQ for those on the fence)

13 Oct 2025 19:00 UTC
83 points
4 comments · 13 min read · LW link

Safety researchers should take a public stance

19 Sep 2025 18:55 UTC
252 points
65 comments · 8 min read · LW link

Counter-considerations on AI arms races

15 May 2025 14:54 UTC
24 points
0 comments · 18 min read · LW link

[Question] Comprehensive up-to-date resources on the Chinese Communist Party’s AI strategy, etc?

Mateusz Bagiński · 18 Apr 2025 4:58 UTC
14 points
6 comments · 1 min read · LW link

Goodhart Typology via Structure, Function, and Randomness Distributions

25 Mar 2025 16:01 UTC
35 points
1 comment · 15 min read · LW link

Bounded AI might be viable

6 Mar 2025 12:55 UTC
24 points
4 comments · 20 min read · LW link

Less Anti-Dakka

Mateusz Bagiński · 31 May 2024 9:07 UTC
79 points
10 comments · 3 min read · LW link

Some Problems with Ordinal Optimization Frame

Mateusz Bagiński · 6 May 2024 5:28 UTC
9 points
0 comments · 7 min read · LW link

[Question] What are the weirdest things a human may want for their own sake?

Mateusz Bagiński · 20 Mar 2024 11:15 UTC
7 points
16 comments · 1 min read · LW link

Three Types of Constraints in the Space of Agents

15 Jan 2024 17:27 UTC
26 points
3 comments · 17 min read · LW link

‘Theories of Values’ and ‘Theories of Agents’: confusions, musings and desiderata

15 Nov 2023 16:00 UTC
35 points
8 comments · 24 min read · LW link

Charbel-Raphaël and Lucius discuss interpretability

30 Oct 2023 5:50 UTC
112 points
7 comments · 21 min read · LW link

“Wanting” and “liking”

Mateusz Bagiński · 30 Aug 2023 14:52 UTC
23 points
3 comments · 29 min read · LW link

GPTs’ ability to keep a secret is weirdly prompt-dependent

22 Jul 2023 12:21 UTC
31 points
0 comments · 9 min read · LW link

[Question] How do you manage your inputs?

Mateusz Bagiński · 28 Mar 2023 18:26 UTC
15 points
2 comments · 1 min read · LW link