FTPickle

Karma: 108

I think I’m just confused. Once a model exists, how do you “red-team” it to see whether it’s safe. Isn’t it already dangerous?

FTPickle18 Nov 2023 14:16 UTC

21 points

14 comments1 min readLW link

[Question] Beginner’s question about RLHF

FTPickle8 Aug 2023 15:48 UTC

1 point

4 comments1 min readLW link

Random Observation on AI goals

FTPickle8 Apr 2023 19:28 UTC

−11 points

2 comments1 min readLW link

The alien simulation meme doesn’t make sense

FTPickle24 Feb 2023 19:27 UTC

4 points

1 comment1 min readLW link

I believe some AI doomers are overconfident

FTPickle20 Dec 2022 17:09 UTC

8 points

15 comments2 min readLW link