RSS

Sam Marks

Karma: 5,366

Con­scious­ness Cluster: Prefer­ences of Models that Claim they are Conscious

18 Mar 2026 16:06 UTC
86 points
28 comments5 min readLW link

Cen­sored LLMs as a Nat­u­ral Testbed for Se­cret Knowl­edge Elicitation

9 Mar 2026 18:50 UTC
30 points
2 comments5 min readLW link

The per­sona se­lec­tion model

Sam Marks23 Feb 2026 22:56 UTC
168 points
52 comments43 min readLW link
(alignment.anthropic.com)