RSS

TW123

Karma: 1,236

Did ChatGPT just gaslight me?

TW123Dec 1, 2022, 5:41 AM
124 points
45 comments9 min readLW link
(aiwatchtower.substack.com)

A philoso­pher’s cri­tique of RLHF

TW123Nov 7, 2022, 2:42 AM
55 points
8 comments2 min readLW link

ML Safety Schol­ars Sum­mer 2022 Retrospective

TW123Nov 1, 2022, 3:09 AM
29 points
0 comments21 min readLW link