Ah yes, sorry this is unclear. There are two links in Habryka’s footnote. The one that I wanted you to look at was this link to Janus’ Twitter. I’ll edit to add a direct link.
The other link is about how to move away from the deprecated Text Completions API into the new Messages API that Janus and I are both using. So that resource is actually up to date.
That tweet doesn’t contain any instructions, mentions some things like “[1]” which for me don’t go anywhere (I’m not logged into Twitter), and overall I still don’t know how to do the thing you’re describing.
The prompt used (included here due to the convenient screenshot format) is in a reply; looks like it’s a pretty basic system prompt plus some API stuff that might or might not matter.
The key point is that the ${prompt} is in the assistant role, so you actually make Claude believe that it just said something that is very un-Claudelike which it would never say, which makes it more likely to continue to act that way in the future
Well, if you really want to do it you should probably clone the GitHub repo I linked in the post. The README also has some details about how it works and how to set it up
The link points to another link which points to some API docs saying something is deprecated. Can you spell out the actual steps to the trick?
Ah yes, sorry this is unclear. There are two links in Habryka’s footnote. The one that I wanted you to look at was this link to Janus’ Twitter. I’ll edit to add a direct link.
The other link is about how to move away from the deprecated Text Completions API into the new Messages API that Janus and I are both using. So that resource is actually up to date.
That tweet doesn’t contain any instructions, mentions some things like “[1]” which for me don’t go anywhere (I’m not logged into Twitter), and overall I still don’t know how to do the thing you’re describing.
The prompt used (included here due to the convenient screenshot format) is in a reply; looks like it’s a pretty basic system prompt plus some API stuff that might or might not matter.
The key point is that the
${prompt}
is in the assistant role, so you actually make Claude believe that it just said something that is very un-Claudelike which it would never say, which makes it more likely to continue to act that way in the futureThe numbers like [1] refer to the images in the post
Well, if you really want to do it you should probably clone the GitHub repo I linked in the post. The README also has some details about how it works and how to set it up