RSS

Dhruv Trehan

Karma: 6

Say­ing “for AI safety re­search” made mod­els re­fuse more on a harm­less task

Dhruv Trehan8 Sep 2025 19:39 UTC
7 points
1 comment2 min readLW link
(lossfunk.substack.com)