It could be fun to see how much of this is automatable: I have a camera roll that goes back to early 2012 combined with my selections for each year. That’s a decent amount of annotated data!
I don’t think phrasing it as a classification problem is necessarily the best approach. There may be many very similar images, and which one out of a cluster you pick is fairly arbitrary, so any attempt at classifying ‘picked’/‘not picked’ wouldn’t tell you much; but clustering/sorting still makes the curation much easier and more pleasant.
(Speaking of classification and images and blogs, you might find it useful to know that we just launched the InvertOrNot.com API (HN) for dark-mode images.)
If you want to make it more efficient and spend less time fast-forwarding through redundant images, you could experiment with clustering and sorting images in a NN embedding space: https://github.com/MaartenGr/Concept https://every.to/napkin-math/6-new-theories-about-ai https://gwern.net/design#sort-by-magic
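A minimal sketch of the clustering idea, assuming you already have per-photo embeddings (e.g. from CLIP or any other image model; random near-duplicate groups stand in for them here): cluster the embeddings, then review one representative per cluster instead of fast-forwarding through every shot.

```python
import numpy as np

def farthest_point_init(X, k):
    """Pick k well-separated starting centroids (farthest-point sampling)."""
    centroids = [X[0]]
    for _ in range(k - 1):
        d = np.min([np.linalg.norm(X - c, axis=1) for c in centroids], axis=0)
        centroids.append(X[d.argmax()])
    return np.array(centroids)

def kmeans(X, k, iters=50):
    """Plain NumPy k-means: returns cluster labels and centroids."""
    centroids = farthest_point_init(X, k)
    for _ in range(iters):
        # Assign each point to its nearest centroid.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Recompute centroids (keep the old one if a cluster empties).
        for c in range(k):
            if np.any(labels == c):
                centroids[c] = X[labels == c].mean(axis=0)
    return labels, centroids

# Placeholder "camera roll": three tight groups of near-duplicate shots,
# standing in for real embeddings of bursts of similar photos.
rng = np.random.default_rng(1)
base = rng.normal(size=(3, 8))
embeddings = np.vstack([b + 0.01 * rng.normal(size=(20, 8)) for b in base])

labels, centroids = kmeans(embeddings, k=3)

# Curate by reviewing one representative per cluster:
# the photo whose embedding is nearest its cluster centroid.
reps = [int(np.flatnonzero(labels == c)[
            np.linalg.norm(embeddings[labels == c] - centroids[c], axis=1).argmin()])
        for c in range(3)]
```

In practice you would swap the random vectors for real embeddings and pick `k` generously (or use a density-based method so singletons survive); the point is just that near-duplicates collapse into one reviewing decision rather than twenty.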