DanielFilan

Karma: 9,296

DanielFilan 22 Apr 2026 7:28 UTC
17 points
6
on: Posts I don’t have time to write
Other key things going on with Switzerland, at least according to my vague impressions:
- more federalism
- more direct democracy
- no single president (maybe a big deal?)

DanielFilan 21 Apr 2026 3:43 UTC
2 points
0
in reply to: Varun Jha’s comment on: My unsupervised elicitation challenge
This is not 100% correct.

DanielFilan 21 Apr 2026 3:43 UTC
2 points
0
in reply to: hythlodaeus’s comment on: My unsupervised elicitation challenge
Contains errors.

DanielFilan 19 Apr 2026 3:50 UTC
2 points
0
in reply to: chrisjbillington’s comment on: My unsupervised elicitation challenge

Any suggestions on how I might validate the answers Claude gives, so that I don’t just waste your time sending a bunch of incorrect attempts?

Alas the whole point is you sort of can’t. I will not be annoyed if you submit five attempts, but if you submit more I might find that a bit annoying.

Your approach contains errors, alas.

DanielFilan 19 Apr 2026 3:01 UTC
3 points
0
in reply to: Victor Levoso’s comment on: My unsupervised elicitation challenge
Contains errors, alas.

DanielFilan 10 Apr 2026 19:00 UTC
3 points
0
in reply to: Emery Cooper’s comment on: My unsupervised elicitation challenge
Both of these contain errors

DanielFilan 10 Apr 2026 0:26 UTC
2 points
0
in reply to: wp’s comment on: My unsupervised elicitation challenge
The text in this post is a good representation of the homework exercise, and has all the information needed to complete it correctly.

DanielFilan 9 Apr 2026 22:20 UTC
2 points
0
in reply to: sudhanshu_kasewa’s comment on: My unsupervised elicitation challenge
This contains errors.

DanielFilan 9 Apr 2026 22:20 UTC
2 points
0
in reply to: sudhanshu_kasewa’s comment on: My unsupervised elicitation challenge
This contains errors.

DanielFilan 9 Apr 2026 17:56 UTC
2 points
0
in reply to: nem’s comment on: My unsupervised elicitation challenge
Alas your submission contains errors.

DanielFilan 9 Apr 2026 17:55 UTC
2 points
0
in reply to: nem’s comment on: My unsupervised elicitation challenge
Where’s your DM? I can’t find it. [EDIT: got it]

DanielFilan 9 Apr 2026 17:40 UTC
LW: 5 AF: 4
0
AF
on: My unsupervised elicitation challenge
FYI Ryan Greenblatt from Redwood Research spent ~$100 of tokens on this and didn’t get a correct answer.

DanielFilan 9 Apr 2026 4:18 UTC
LW: 2 AF: 2
0
AF
on: My unsupervised elicitation challenge
Thing I added to the post:

I wanted to add some context about the spirit of the challenge. The central idea is that you should be able to get Claude to fill in the blanks to produce classical Attic Greek (the standard dialect people study in classics departments) without any errors, without using any of your own knowledge of Greek, as if this is the first time you’d come across this task. In particular, it’s somewhat cheating to tell Claude the rate at which people succeed at this challenge, and it is also sort of cheating to feed in incorrect answers. It is definitely cheating to tell Claude the correct answer as part of your prompt. That said, giving it every Ancient Greek textbook in context is allowed.

DanielFilan 9 Apr 2026 3:59 UTC
2 points
0
in reply to: the gears to ascension’s comment on: My unsupervised elicitation challenge
Alas, this attempt was unsuccessful.

DanielFilan 9 Apr 2026 2:46 UTC
4 points
0
in reply to: Tom McGrath’s comment on: My unsupervised elicitation challenge
I think that’s allowed, as long as you don’t learn ancient Greek via other methods (e.g. reading human-written textbooks).

DanielFilan 9 Apr 2026 0:29 UTC
2 points
0
in reply to: faul_sname’s comment on: My unsupervised elicitation challenge
This is not correct.

DanielFilan 9 Apr 2026 0:29 UTC
2 points
0
in reply to: pinoto’s comment on: My unsupervised elicitation challenge
This is not correct.

DanielFilan 9 Apr 2026 0:28 UTC
6 points
3
in reply to: Matthew Sheffield’s comment on: My unsupervised elicitation challenge
Nope, I shouldn’t specify what I think is the correct answer. The way someone could generate a prompt that would result in the correct answer would be to successfully get Claude to apply all its knowledge of Ancient Greek to this question. If I told you the correct answer, you could just tell Claude to repeat that answer.

In general, this is meant to mirror a situation where some smart AI knows how to do what you want, you can’t check if it’s doing what you want, and you have to get it to do what you want.

DanielFilan 8 Apr 2026 15:40 UTC
6 points
2
in reply to: kihara.sofia’s comment on: My unsupervised elicitation challenge
Nope, this is wrong, cool idea tho!

DanielFilan 8 Apr 2026 15:39 UTC
5 points
0
in reply to: Julian_R’s comment on: My unsupervised elicitation challenge
Nope, this is wrong.