This all seems of fundamental importance if we want to actually understand what our AIs are.
I fully agree (as is probably obvious from the post :) )
I always thought of personas as created mostly by the system prompt, but I suppose RLHF can massively affect their personalities as well…
Although it’s hard to know with most of the scaling labs, to the best of my understanding the system prompt is mostly about small final tweaks to behavior. Amanda Askell gives a nice overview here. That said, API use typically lets you provide a custom system prompt, and you can absolutely use that to induce a persona of your choice.
I fully agree (as is probably obvious from the post :) )
Although it’s hard to know with most of the scaling labs, to the best of my understanding the system prompt is mostly about small final tweaks to behavior. Amanda Askell gives a nice overview here. That said, API use typically lets you provide a custom system prompt, and you can absolutely use that to induce a persona of your choice.
Quick question: your link to the Amanda Askell overview is broken. What is the correct link? Thanks!
Whoops, thanks for pointing that out. Correct link is here (also correcting it above).