IMO a good way to explain how LLMs work to a layman is to print the weights on sheets of paper and compute a forward pass by hand. Anyone wanna shoot this video and post it on YouTube?
Assuming a human can do one 4-bit multiplication per second using a lookup table,
1.5B 4-bit weights ⇒ ~1.5B multiplications ⇒ ~1.5B seconds ≈ 47.5 years (working 24/7) ≈ 133 years (working 60 hours/week)
So you'd need to hire ~130 people for 1 year.
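Here's a quick sanity check of that arithmetic, as a rough sketch. The one-multiplication-per-second rate and the 60-hour week come from the estimate above; the function name and the 365.25-day year are my own assumptions. It also ignores additions, nonlinearities and attention over the context, so treat it as a lower bound.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600  # assumption: average calendar year
HOURS_PER_WEEK_TOTAL = 7 * 24

def person_years(num_weights, mults_per_second, hours_per_week):
    """Return (years working nonstop, years at the given weekly hours).

    Hypothetical helper: assumes one multiplication per weight and
    counts nothing else (no additions, no softmax, no bookkeeping).
    """
    total_seconds = num_weights / mults_per_second
    nonstop_years = total_seconds / SECONDS_PER_YEAR
    working_years = nonstop_years * HOURS_PER_WEEK_TOTAL / hours_per_week
    return nonstop_years, working_years

nonstop, working = person_years(
    num_weights=1.5e9,      # 1.5B 4-bit weights
    mults_per_second=1.0,   # one lookup-table multiplication per second
    hours_per_week=60,
)
print(f"{nonstop:.1f} years nonstop, {working:.0f} person-years at 60 h/week")
```

Splitting the person-years across a team gives the headcount for a one-year shoot. Changing `mults_per_second` shows how sensitive the whole plan is to how fast people can actually use the table.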
You don't actually have to run the entire experiment for people to get the concept; a small fraction of it is enough. It'd be cool to run the whole thing as well, though.
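To give a feel for what a "small fraction" could look like, here's a minimal sketch of one dot product done purely by table lookup, the way a person with a printed multiplication table would do it. Everything here is illustrative: I'm assuming signed 4-bit integers (-8 to 7) for both weights and activations, and the names `TABLE` and `dot_by_lookup` are made up. Real quantized models also carry scale factors and usually keep activations at higher precision.

```python
# The "sheet of paper": every product of two signed 4-bit integers (16 x 16 entries).
VALUES = range(-8, 8)
TABLE = {(a, b): a * b for a in VALUES for b in VALUES}

def dot_by_lookup(weights, activations):
    """Dot product using only table lookups and a running sum.

    Returns the result and the number of lookups, i.e. the number of
    seconds a human would need at one lookup per second.
    """
    total = 0
    lookups = 0
    for w, x in zip(weights, activations):
        total += TABLE[(w, x)]  # look the product up instead of multiplying
        lookups += 1
    return total, lookups

# Hypothetical toy values, not taken from any real model.
weights = [3, -2, 7, 0]
activations = [1, 4, -1, 5]
result, lookups = dot_by_lookup(weights, activations)
print(result, lookups)
```

One output of one matrix row is already a few thousand lookups in a real model, so filming a single row is probably enough to get the idea across.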