Max Hawkins

Karma: 7

Max Hawkins 22 May 2026 17:00 UTC
2 points
0
in reply to: Farhan’s comment on: The case for fine-grained tracking of compute for AI
I look forward to seeing it. I’m not sure what that characterization would look like, but I personally would like to see a diagram/roadmap of computing from raw resource extraction through manufacturing as well as an organized list of computing architectures. It seems there are enough novel devices out there to warrant some categorization and listing.

Max Hawkins 21 May 2026 16:19 UTC
1 point
0
in reply to: Farhan’s comment on: The case for fine-grained tracking of compute for AI
Good questions. First, but answering your last question, my thesis is really just about proposing another way to quantify computational WORK. Currently, we use application-generic measures like Flops or application-specific measures like images/tokens processed. These definitions of work can then be normalized by unit time, energy, power, GHG emissions, water, etc, but my focus is on getting the numerator right (work) regardless of the denominator. I’m also only interested in application-generic measures of work.

You’ve got it exactly right that the amount of work done is dependent on the runtime data seen. This is a critical aspect. Mutual information is this runtime quantity that measure the actual work performed whereas channel capacity is an upper-bound on the mutual information possible and is only dependent on the hardware itself. The channel capacity is what would show up on a spec sheet, and MI is what would be plotted for a kernel on a roofline plot or similar runtime evaluation.
You’ve also got it right that if a compute operation always outputs the same value, its mutual information (and computational work) is zero. Could this operation not just be optimized away with a fixed constant? If so, what was the use of performing the operation? Either the inputs are fixed, or the inputs vary but the output is always the same. In both cases, replacing the output with a constant changes nothing. Shouldn’t this mean there was no value to performing the operation in the first place? This is where discussions about structural sparsity come into play. If we know a certain input is 0 and we’re multiplying it with another number, we already know what the outcome is, so there is no reduction of uncertainty by multiplying by a known 0.
The general intuition behind this work is that, just like communication, computation is not just about the outputs you see, its about what outputs you DIDN’T see. We use math to formalize relationships and to talk about the exactness of real numbers, but no hardware in finite space/time can operate on the infinite set of real numbers. Instead, when we get an output value, it’s simply one of 2^64 (or 2^32, 2^16, 2^8, 2^4, …) possible output states. An FP64 operation manages a larger state space than an NVFP4 operation does. Bit width approximations of Flops (where a Flops value is scaled by the bit-width of the operands) is sometimes used to capture this difference in complexity—like with the TPP export control definitions. However, this is exactly where the runtime distribution of data matters. If I know my inputs are restricted to NVFP4 range, and computing in FP64 doesn’t use any more number of states than computing in NVFP4, should such an inefficient program’s work really scale by a factor of 16? I think not, and that’s why the runtime data is critical to measuring computational work IMHO.
Essentially, how much uncertainty about the output does an operation resolve? That’s what I think we should measure. By doing so, we unify computation and communication performance measurement and are able to generalize computing measures across digital, noisy, analog, neuromorphic, and maybe even quantum regimes. Communication has had a theoretically-grounded and universal measure of throughput for ~80 years. Why doesn’t computing?
There’s more, but I’ll leave it here. Please feel free to email me anytime: mhawkins60@gatech.edu

Max Hawkins 19 May 2026 15:39 UTC
1 point
0
in reply to: Farhan’s comment on: The case for fine-grained tracking of compute for AI
I’ve re-read your proposal and thought about it more. First, to answer your question, I think SRAM architectures fit into my proposal by accounting for the operations that are performed (just as in a von-Neumann machine). The only difference is the location, amount, and speed of memory. Ideal channel capacity and operational mutual information apply to any hardware that moves and processes data. An application most suited for SRAM architectures are slightly different than those for traditional architectures, but the performance accounting stays the same. This and the other deployment parameters just affect how close the runtime mutual information is to the upper limit (channel capacity). I have follow-on, unpublished work that adapts the roofline model to the information-based measure of computing that might clarify some of these things. Overall, my goal with this measure was what this article explicitly suggests NOT to do: rely on a central, general measure of computing (which is what caught my attention).

Thinking more on your work, I do really like the wholistic approach from manufacturing through operational outcomes. However, what additional value does this broader evaluation bring? For whom? If you get more funding and more people to work on this, does the outcome look something like a more academic SemiAnalysis? I think the computing ecosystem is certainly very complex, so going for depth and breadth will quickly exhaust your given resources. Then what is the more narrowed focus of this project? I personally would like to see a continually-updated characterization of the ecosystem, but I’m not sure that’s what you all are aiming for. Regardless, I look forward to your work. Please keep me up-to-date.

Max Hawkins 15 May 2026 14:14 UTC
2 points
0
in reply to: Farhan’s comment on: The case for fine-grained tracking of compute for AI
I would love to see a poem synthesizing your ideas from this post!
What are your next steps for this work? It seems like there are many options, but what are you two planning on prioritizing?

Max Hawkins 15 May 2026 0:15 UTC
5 points
0
on: The case for fine-grained tracking of compute for AI
I also thought you might enjoy this poem I wrote about the growing issues with FLOP-based accounting (read it in a somewhat Dr. Seuss rhythm):
Isn’t it funny, the metric we flaunt?
A computer that’s better,
Faster!
More flop/s!
So what is a flop?
It’s a fine, funny thing.
Been around some years,
Well defined, so clean!
A flop is an op
On a float, not more
Than anything given
by (IEEE)-seven-five-four
You take sixty-four bits,
put them all in a line,
multiply, add them.
Flops divine!
And for 40 odd years,
This was stable and glad.
Then Dennard died,
And architects went mad!
To speed up the math
and save on power,
They chopped up the floats,
getting smaller each hour!
It meant less bits.
Less to transmit.
Less gates in hardware,
You have to commit
With less range, less precision,
they built a new vision,
For more than AI,
These were big decisions!
The number of numbers each number could hold,
Fell from a trillion,
To 2, I’m told!
So when an app demands,
More bits than you’ve got,
Emulate.
What your hardware is not.
Now the old Flop began to fail,
A metric so rigid, and dusty, and stale.
Too narrow. Too tiny. Simply too small,
To capture the breadth of computing, that’s all!
For how does a noisy, a wobbly op,
Compare to a flawless one right at the top?
Or a tiny two-bit to a big sixty-four?
Is emulation as good as before?
To answer such questions,
Think channels! Think Shannon!
His framework is canon!
Information is mutual
Determined throughout,
Measure the work:
Reduction of doubt!
The metric, I think
Is in bits per second,
How outputs from inputs
Are properly reckoned!
It’s fair to the noisy.
Fair to the small.
Fair to the emulated.
One measure for all!
Computers are more than fast, perfect math.
That stubborn old notion is stuck in the past!
So away with the flop.
Doubt drops to the floor.
Let’s go Back to Bits
To compute a bit more!

Max Hawkins 15 May 2026 0:05 UTC
2 points
0
on: The case for fine-grained tracking of compute for AI
I like this failure-mode analysis of computing performance modeling. I’m pretty ignorant about many of these topics, but I would like to add that, at least for measuring operational performance, I think mutual information and uncertainty reduction offer a more general solution. My thoughts are written up here and include comparison to historical US computing export control definitions of computing performance: https://arxiv.org/pdf/2508.05621