Edward Turner

Karma: 382

We Built a Tool to Protect Your Dataset From Simple Scrapers

25 Jul 2025 5:44 UTC
60 points
9 comments, 3 min read

Narrow Misalignment is Hard, Emergent Misalignment is Easy

14 Jul 2025 21:05 UTC
134 points
24 comments, 5 min read

Convergent Linear Representations of Emergent Misalignment

16 Jun 2025 15:47 UTC
76 points
1 comment, 8 min read

Model Organisms for Emergent Misalignment

16 Jun 2025 15:46 UTC
118 points
19 comments, 5 min read