So is that h not part of the universal quantification over h in H?
DanielFilan
Re: uniform convergence bounds, you say
The “high level properties of h” part is how they introduce data-dependency into their bound, in order to escape the Zhang et al. 2016 result.
I’m confused—aren’t properties of h different from properties of the data?
I ran a few Ancient Greek strings through the Claude tokenizer for Opus 4.7 and Opus 4.6 to see whether Opus 4.7’s success might be explained by cleaner Greek accent tokenization
Huh, how did you do that? I didn’t know the tokenizers were publicly available. Anyway, very interesting!
Retrospective on my unsupervised elicitation challenge
Other key things going on with Switzerland, at least according to my vague impressions:
more federalism
more direct democracy
no single president (maybe a big deal?)
This is not 100% correct.
Contains errors.
Any suggestions on how I might validate the answers Claude gives, so that I don’t just waste your time sending a bunch of incorrect attempts?
Alas the whole point is you sort of can’t. I will not be annoyed if you submit five attempts, but if you submit more I might find that a bit annoying.
Your approach contains errors, alas.
Contains errors, alas.
Both of these contain errors
The text in this post is a good representation of the homework exercise, and has all the information needed to complete it correctly.
This contains errors.
This contains errors.
Alas your submission contains errors.
Where’s your DM? I can’t find it. [EDIT: got it]
FYI Ryan Greenblatt from Redwood Research spent ~$100 of tokens on this and didn’t get a correct answer.
Thing I added to the post:
I wanted to add some context about the spirit of the challenge. The central idea is that you should be able to get Claude to fill in the blanks to produce classical Attic Greek (the standard dialect people study in classics departments) without any errors, without using any of your own knowledge of Greek, as if this is the first time you’d come across this task. In particular, it’s somewhat cheating to tell Claude the rate at which people succeed at this challenge, and it is also sort of cheating to feed in incorrect answers. It is definitely cheating to tell Claude the correct answer as part of your prompt. That said, giving it every Ancient Greek textbook in context is allowed.
Alas, this attempt was unsuccessful.
I think that’s allowed, as long as you don’t learn ancient Greek via other methods (e.g. reading human-written textbooks).
Oh I currently think the thing that’s going on is that it’s a hypothesis-dependent bound that you then apply to the hypothesis learned from the data.