RSS

FTPickle

Karma: 108

I think I’m just con­fused. Once a model ex­ists, how do you “red-team” it to see whether it’s safe. Isn’t it already dan­ger­ous?

FTPickle18 Nov 2023 14:16 UTC
21 points
14 comments1 min readLW link

[Question] Begin­ner’s ques­tion about RLHF

FTPickle8 Aug 2023 15:48 UTC
1 point
4 comments1 min readLW link

Ran­dom Ob­ser­va­tion on AI goals

FTPickle8 Apr 2023 19:28 UTC
−11 points
2 comments1 min readLW link

The alien simu­la­tion meme doesn’t make sense

FTPickle24 Feb 2023 19:27 UTC
4 points
1 comment1 min readLW link

I be­lieve some AI doomers are overconfident

FTPickle20 Dec 2022 17:09 UTC
8 points
15 comments2 min readLW link