True, or the result that few-shot prompting for multiple choice questions doesn’t require the answers in the prompt to be correct.
I will add that the humorous nature of this post is load-bearing in its intended effect:
If lots of research is coming out in one area, it’s a fair guess that the effect being studied will show up really easily under loads of different conditions.
That’s what the guano in graphene paper authors realized. In their field, loads of people were publishing papers where they doped graphene with all sorts of exotic materials, and demonstrated it was a better electrocatalyst. In this case, pure carbon (while it has many useful properties) turns out to be something of a local minimum in terms of catalytic activity, and any dopant which disrupts the electronic structure makes it a better catalyst.
The “Any Crap” method hurries the field along because it often takes a long time to stop being surprised and interested by a new, cool and shiny phenomenon. Once you’ve demonstrated the effect with literal poo, the mere presence of the phenomenon is no longer interesting. This lets the field move on.
For example, I think a post that was a strict superset of this post, which contained the same scatological dataset alongside several similar ones, and which was called something like “Testing the limits of emergent misalignment” would do worse at the intended job of this post. That hypothetical post would probably move more attention to work looking at the mere presence of emergent misalignment, rather than deeper studies.
True, or the result that few-shot prompting for multiple choice questions doesn’t require the answers in the prompt to be correct.
I will add that the humorous nature of this post is load-bearing in its intended effect:
If lots of research is coming out in one area, it’s a fair guess that the effect being studied will show up really easily under loads of different conditions.
That’s what the guano in graphene paper authors realized. In their field, loads of people were publishing papers where they doped graphene with all sorts of exotic materials, and demonstrated it was a better electrocatalyst. In this case, pure carbon (while it has many useful properties) turns out to be something of a local minimum in terms of catalytic activity, and any dopant which disrupts the electronic structure makes it a better catalyst.
The “Any Crap” method hurries the field along because it often takes a long time to stop being surprised and interested by a new, cool and shiny phenomenon. Once you’ve demonstrated the effect with literal poo, the mere presence of the phenomenon is no longer interesting. This lets the field move on.
For example, I think a post that was a strict superset of this post, which contained the same scatological dataset alongside several similar ones, and which was called something like “Testing the limits of emergent misalignment” would do worse at the intended job of this post. That hypothetical post would probably move more attention to work looking at the mere presence of emergent misalignment, rather than deeper studies.
I like your framing. Very cool to see as a junior researcher.