If you want to go full autonomous research mode, you could even have another Claude search for adversarial parameters of the SynthSAEBench dataset (within some reasonable constraints) to see where the methods break or perform worse than the baselines.
I imagine you could find some nice robust improvements this way.
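To make the idea concrete, here is a rough sketch of what the search loop could look like. It uses plain random search as a stand-in for the Claude-driven search, and everything in it is an assumption for illustration: the parameter names and bounds in `BOUNDS` are hypothetical and not taken from SynthSAEBench, and `toy_method` / `toy_baseline` are placeholder scoring functions you would swap for real evaluations.

```python
import random

# Hypothetical parameter bounds (the "reasonable constraints").
# These names are illustrative and are NOT taken from SynthSAEBench itself.
BOUNDS = {
    "num_features": (64, 4096),
    "sparsity": (0.001, 0.2),
    "noise_std": (0.0, 1.0),
}


def sample_params(rng):
    """Draw one parameter setting uniformly from within the bounds."""
    params = {}
    for name, (lo, hi) in BOUNDS.items():
        if isinstance(lo, int) and isinstance(hi, int):
            params[name] = rng.randint(lo, hi)
        else:
            params[name] = rng.uniform(lo, hi)
    return params


def adversarial_search(score_method, score_baseline, n_trials=200, seed=0):
    """Random search for the setting where the method does worst vs. the baseline.

    Returns (worst_params, worst_gap), where gap = method score - baseline score,
    so a negative gap means the method lost to the baseline.
    """
    rng = random.Random(seed)
    worst_params, worst_gap = None, float("inf")
    for _ in range(n_trials):
        params = sample_params(rng)
        gap = score_method(params) - score_baseline(params)
        if gap < worst_gap:
            worst_params, worst_gap = params, gap
    return worst_params, worst_gap


# Toy stand-ins so the sketch runs end to end; replace with real evaluations.
def toy_method(p):
    return 1.0 - 2.0 * p["noise_std"] * p["sparsity"]


def toy_baseline(p):
    return 0.9 - 0.1 * p["noise_std"]


worst_params, worst_gap = adversarial_search(toy_method, toy_baseline)
```

Any setting that comes back with a negative `worst_gap` is a candidate failure case worth looking at by hand. A smarter searcher (another Claude, Bayesian optimization, etc.) would slot in where `sample_params` is called.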