RSS

Julian H

Karma: 45

Ex­plor­ing Re­in­force­ment Learn­ing Effects on Chain-of-Thought Legibility

6 Jan 2026 3:04 UTC
41 points
3 comments21 min readLW link

In­tro­duc­ing the XLab AI Se­cu­rity Guide

27 Dec 2025 16:50 UTC
19 points
1 comment5 min readLW link