Richard Ren

Karma: 80

Introducing MASK: A Benchmark for Measuring Honesty in AI Systems

5 Mar 2025 22:56 UTC
35 points
5 comments · 2 min read · LW link
(www.mask-benchmark.ai)

The Bitter Lesson for AI Safety Research

2 Aug 2024 18:39 UTC
57 points
5 comments · 3 min read · LW link