Jessica Rumbelow comments on SolidGoldMagikarp (plus, prompt generation)

Jessica Rumbelow 6 Feb 2023 21:18 UTC
6 points
0
Yeah! Basically we just perform gradient descent on sensibly initialised input embeddings (cluster centroids, or points close to the target output), constrain each embedding to unit length (norm 1) during the process, and penalise distance from the nearest legal token embedding. The objective is to minimise the negative log-probability of the target output token(s), i.e. to make the target output as likely as possible. Happy to have a quick call to go through the code if you like, DM me :)
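A minimal sketch of the loop described above, not the authors' actual code. It assumes a toy stand-in for the language model (a single linear layer `W` followed by a softmax), a random unit-norm vocabulary, one input embedding, and a single centroid of the whole vocabulary as the initialisation. The names `optimise_prompt_embedding`, `lam`, `lr` and the hand-written gradient are all hypothetical; a real version would backpropagate through the actual model with an autodiff framework.

```python
import math
import random


def normalise(v):
    """Rescale v to unit Euclidean length."""
    n = math.sqrt(sum(a * a for a in v))
    return [a / n for a in v]


def softmax(z):
    m = max(z)
    e = [math.exp(a - m) for a in z]
    s = sum(e)
    return [a / s for a in e]


def nearest_token(x, vocab):
    """Index of the vocabulary embedding closest to x (Euclidean distance)."""
    return min(range(len(vocab)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(x, vocab[i])))


def optimise_prompt_embedding(vocab, W, target, steps=300, lr=0.05, lam=0.1):
    dim = len(vocab[0])
    # Assumed initialisation: the centroid of all token embeddings.
    x = normalise([sum(e[j] for e in vocab) / len(vocab) for j in range(dim)])
    losses = []
    for _ in range(steps):
        # Toy "model" (assumption): logits are a linear function of x.
        probs = softmax([sum(w * a for w, a in zip(row, x)) for row in W])
        near = vocab[nearest_token(x, vocab)]
        dist2 = sum((a - b) ** 2 for a, b in zip(x, near))
        # Loss = negative log-prob of the target + penalty for leaving the vocab.
        losses.append(-math.log(probs[target]) + lam * dist2)
        # Analytic gradient: W^T (probs - onehot) + 2 * lam * (x - near).
        grad = [
            sum((probs[k] - (1.0 if k == target else 0.0)) * W[k][j]
                for k in range(len(W))) + 2.0 * lam * (x[j] - near[j])
            for j in range(dim)
        ]
        # Gradient step, then project back onto the unit sphere.
        x = normalise([a - lr * g for a, g in zip(x, grad)])
    return x, losses


rng = random.Random(0)
vocab = [normalise([rng.gauss(0, 1) for _ in range(8)]) for _ in range(20)]
W = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(5)]
x, losses = optimise_prompt_embedding(vocab, W, target=2)
token_id = nearest_token(x, vocab)  # decode: snap to the closest legal token
print(f"loss {losses[0]:.3f} -> {losses[-1]:.3f}, nearest token id {token_id}")
```

The penalty weight `lam` trades off how strongly the target is elicited against how close the optimised embedding stays to something that decodes to a real token.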
- ChrisCundy 6 Feb 2023 22:22 UTC
  3 points
  0
  Thanks for the elaboration, I’ll follow up offline