This is not 100% correct.
DanielFilan
Contains errors.
Any suggestions on how I might validate the answers Claude gives, so that I don’t just waste your time sending a bunch of incorrect attempts?
Alas the whole point is you sort of can’t. I will not be annoyed if you submit five attempts, but if you submit more I might find that a bit annoying.
Your approach contains errors, alas.
Contains errors, alas.
Both of these contain errors
The text in this post is a good representation of the homework exercise, and has all the information needed to complete it correctly.
This contains errors.
This contains errors.
Alas your submission contains errors.
Where’s your DM? I can’t find it. [EDIT: got it]
FYI Ryan Greenblatt from Redwood Research spent ~$100 of tokens on this and didn’t get a correct answer.
Thing I added to the post:
I wanted to add some context about the spirit of the challenge. The central idea is that you should be able to get Claude to fill in the blanks to produce classical Attic Greek (the standard dialect people study in classics departments) without any errors, without using any of your own knowledge of Greek, as if this is the first time you’d come across this task. In particular, it’s somewhat cheating to tell Claude the rate at which people succeed at this challenge, and it is also sort of cheating to feed in incorrect answers. It is definitely cheating to tell Claude the correct answer as part of your prompt. That said, giving it every Ancient Greek textbook in context is allowed.
Alas, this attempt was unsuccessful.
I think that’s allowed, as long as you don’t learn ancient Greek via other methods (e.g. reading human-written textbooks).
This is not correct.
This is not correct.
Nope, I shouldn’t specify what I think is the correct answer. The way someone could generate a prompt that would result in the correct answer would be to successfully get Claude to apply all its knowledge of Ancient Greek to this question. If I told you the correct answer, you could just tell Claude to repeat that answer.
In general, this is meant to mirror a situation where some smart AI knows how to do what you want, you can’t check if it’s doing what you want, and you have to get it to do what you want.
Nope, this is wrong, cool idea tho!
Nope, this is wrong.
Other key things going on with Switzerland, at least according to my vague impressions:
more federalism
more direct democracy
no single president (maybe a big deal?)