Talk of “density in thingspace” sweeps a lot of interesting problems under the rug. Thingspace is very high-dimensional. What distance metric are we supposed to use? EY writes:
I believe that in the field of statistical learning, for algorithms that actually do depend on distance metrics, the standard cheap trick is to “sphere” the space by making the standard deviation equal 1 in all directions.
An alternative (which I use at work) is to make the overall range 1 in all directions. Doubtless there are other, equally arbitrary choices available.
Talk of “density in thingspace” sweeps a lot of interesting problems under the rug. Thingspace is very high-dimensional. What distance metric are we supposed to use? EY writes:
An alternative (which I use at work) is to make the overall range 1 in all directions. Doubtless there are other, equally arbitrary choices available.