yeah, to be clear: my experience [of prompting the models] is that it’s basically impossible to get them to have judgment that they don’t naively exhibit. no amount of “think carefully about whether its relevant” will suddenly elicit good taste; it appears that “always do this” and “never do this” are too heavy as attractors, and the boundary [for any behavior worth asking for] is fractally complex.
i think we’re in agreement about the state of the world, here. apologies for the semantic quibble.
yeah, to be clear: my experience [of prompting the models] is that it’s basically impossible to get them to have judgment that they don’t naively exhibit. no amount of “think carefully about whether its relevant” will suddenly elicit good taste; it appears that “always do this” and “never do this” are too heavy as attractors, and the boundary [for any behavior worth asking for] is fractally complex.
i think we’re in agreement about the state of the world, here. apologies for the semantic quibble.