RSS

Mateusz Bagiński

Karma: 536

~[agent foundations]

Char­bel-Raphaël and Lu­cius dis­cuss Interpretability

30 Oct 2023 5:50 UTC
104 points
7 comments21 min readLW link

‘The­o­ries of Values’ and ‘The­o­ries of Agents’: con­fu­sions, mus­ings and desiderata

15 Nov 2023 16:00 UTC
34 points
8 comments24 min readLW link

GPTs’ abil­ity to keep a se­cret is weirdly prompt-dependent

22 Jul 2023 12:21 UTC
31 points
0 comments9 min readLW link

“Want­ing” and “lik­ing”

Mateusz Bagiński30 Aug 2023 14:52 UTC
22 points
2 comments29 min readLW link

[Question] How do you man­age your in­puts?

Mateusz Bagiński28 Mar 2023 18:26 UTC
15 points
3 comments1 min readLW link

[Question] What are the weirdest things a hu­man may want for their own sake?

Mateusz Bagiński20 Mar 2024 11:15 UTC
5 points
16 comments1 min readLW link