RSS

kaivu

Karma: 205

AI agents and painted facades

30 Aug 2025 23:13 UTC
37 points
1 comment2 min readLW link
(fulcrumresearch.ai)

Me, My­self, and AI: the Si­tu­a­tional Aware­ness Dataset (SAD) for LLMs

8 Jul 2024 22:24 UTC
109 points
37 comments5 min readLW link

Take­aways from a Mechanis­tic In­ter­pretabil­ity pro­ject on “For­bid­den Facts”

15 Dec 2023 11:05 UTC
34 points
8 comments10 min readLW link

Up­date on Har­vard AI Safety Team and MIT AI Alignment

2 Dec 2022 0:56 UTC
60 points
4 comments8 min readLW link