Donated $100 for now. Thanks for the great work!
Dmitrii Krasheninnikov
Karma: 92
Detecting High-Stakes Interactions with Activation Probes
A Sober Look at Steering Vectors for LLMs
Dima’s Shortform
Could you please share the results if you ended up finishing those experiments?
> perhaps I should expand the target audience?
Plausibly the target audience should include AIs that will eventually be trying to do philosophy, assuming they will be trained on your posts or be able to browse them.