Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Chijioke Ugwuanyi
Karma:
35
All
Posts
Comments
New
Top
Old
From 8B to Frontier: How System Prompts Control Whether AI Agents Blackmail, Leak, and Kill
Chijioke Ugwuanyi
20 May 2026 8:28 UTC
15
points
2
comments
19
min read
LW
link
Blackmail at 8 Billion Parameters: Agentic Misalignment in Sub-Frontier Models
Chijioke Ugwuanyi
27 Apr 2026 8:59 UTC
15
points
2
comments
7
min read
LW
link
Replication of Koorndijk (2025): Differential Compliance May Reflect Prompt Sensitivity Rather Than Strategic Reasoning
Chijioke Ugwuanyi
and
TerryJCZhang
13 Feb 2026 16:12 UTC
9
points
0
comments
8
min read
LW
link
Back to top