Really appreciate the response :)
Totally acknowledge the limitations you outlined.
I was aiming to construct an example which would illustrate how the loss metric would break in a black box setting (where X and Y are too gnarly to vis). In that case you have no clue that your model implements sin(x), and so I dont see how that could be the goal. In the black box setting you do get access to distance between scrubbed y and y_true (loss) and distance between scrubbed_y and original_y (my proposal, lets call it output distance). When you look at loss, it is possible for causal scrubbing to yield an explanation of the model’s performance which, from my perspective, is an obviously bad one in that it it causes the function implemented by the model to be radically different.
If that is one of classes of “importantly false or i complete hypotheses” then why not check the predicted ys against each other and favor hypotheses that have both close outputs and low loss?
(I think these converge to the same thing as the original model’s loss go to zero, but prior to that driving output distance to zero is the only way to get an equivalent function to the original network, I claim)
Enjoyed this post! I resonate, and have dreamed of living in a gadget filled workshop too :)
> I wish for the sort of community which could produce its own COVID vaccine in March 2020, and have a 100-person challenge trial done by the end of April.
I was part of the team which designed a shelf-stable omicron vaccine and set a record speed for founding → first-in-human clinical trial.
If this is the kind of thing you’re talking about, I think you might be underemphasizing IMO the single most important vibe. Magic happens when brilliant people work together. We can get better and more powerful at working together. We can make returns to collaboration increasingly better-than-additive in domains like vaccines or toothbrushes or in general. IIRC nobody on our team had ever run a first-in-human clinical trial before.
I notice that many of your examples implicitly involve working together (e.g. “session in which people CAD-up”) and you shared this in a format where people think together.
When I think of a Wizard, they are typically alone?