Tim Duffy

Karma: 3

Tim Duffy 24 Jul 2025 2:08 UTC
4 points
0
in reply to: Cameron Berg’s comment on: So You Think You’ve Awoken ChatGPT
Hi Cameron, is the SAE testing you’re describing here the one you demoed in your interview with John Sherman using Goodfire’s Llama 3.3 70B SAE tool? If so could you share the prompt you used for that? With the prompts I’m using I’m having a hard time getting Llama to say that it is conscious at all. It would be nice if we had SAE feature tweaking available for a model that was more ambivalent about its consciousness, seems it would be a bit easier to robustly test if that were the case.