One of the posts about GPT-2 (and 3) that has most stuck with me, and helped me to model what the system is doing.
One of the posts about GPT-2 (and 3) that has most stuck with me, and helped me to model what the system is doing.