RSS

David Africa

Karma: 566

Research Scientist with the Alignment team at UK AISI.

A Pro­posal for TruesightBench

David Africa5 Feb 2026 14:33 UTC
14 points
0 comments4 min readLW link

Mas­sive Ac­ti­va­tions in DroPE: Ev­i­dence for At­ten­tion Reorganization

David Africa18 Jan 2026 15:05 UTC
19 points
0 comments8 min readLW link

David Africa’s Shortform

David Africa13 Jan 2026 13:13 UTC
4 points
4 comments1 min readLW link

Align­ment Pre­train­ing: AI Dis­course Causes Self-Fulfilling (Mis)alignment

21 Dec 2025 0:53 UTC
194 points
24 comments9 min readLW link

[Paper] Does Self-Eval­u­a­tion En­able Wire­head­ing in Lan­guage Models?

David Africa8 Dec 2025 16:03 UTC
25 points
2 comments2 min readLW link