Existing software that finds identical files should take care of most of your storage overhead.
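For the byte-identical case, a minimal sketch of what such a tool does might look like this. It assumes SHA-256 as the content hash and groups by file size first so that only files with a matching size get read; the names `file_digest` and `find_identical_files` are made up for illustration and are not taken from any particular program.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_identical_files(root):
    """Return lists of paths under root whose contents are byte-identical."""
    # Cheap first pass: files of different sizes cannot be identical.
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path):
                by_size[os.path.getsize(path)].append(path)

    # Second pass: hash only the files that share a size with another file.
    by_digest = defaultdict(list)
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        for path in paths:
            by_digest[file_digest(path)].append(path)

    return [sorted(group) for group in by_digest.values() if len(group) > 1]
```

Calling `find_identical_files("/path/to/images")` would return one list per set of duplicates, and you keep one file from each list.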
Finding duplicates of the same image at other sizes is compute intensive. JPGs and other images get resized, cropped, memed, and quality-squished all the time. Searching an index of tiny image patches would cut the time, but would generate false positives. An area-average grid of, say, 8x8 could serve as a searchable index, with an adjustable similarity threshold to weed out the larger (or smaller) duplicates. I can't imagine that such software doesn't already exist.
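A rough sketch of the 8x8 area-average idea, under some assumptions: each image has already been decoded to grayscale and is passed in as a list of rows of 0-255 values (the decoding itself is left to whatever image library you use), the distance is the mean absolute difference between grid cells, and the default threshold of 10 is an arbitrary starting point. The function names are hypothetical. This should catch resized and recompressed copies; crops and heavy meme edits will mostly slip through.

```python
def area_average_grid(pixels, grid=8):
    """Reduce a grayscale image (list of rows of 0-255 values) to grid*grid block means."""
    height, width = len(pixels), len(pixels[0])
    if height < grid or width < grid:
        raise ValueError("image is smaller than the grid")
    cells = []
    for gy in range(grid):
        y0, y1 = gy * height // grid, (gy + 1) * height // grid
        for gx in range(grid):
            x0, x1 = gx * width // grid, (gx + 1) * width // grid
            total = sum(pixels[y][x] for y in range(y0, y1) for x in range(x0, x1))
            cells.append(total / ((y1 - y0) * (x1 - x0)))
    return cells


def grid_distance(a, b):
    """Mean absolute difference between two signatures of equal length."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def find_similar(signatures, threshold=10.0):
    """Return name pairs whose signatures differ by at most threshold.

    signatures maps an image name to its area_average_grid() result.
    Raising the threshold finds more duplicates but also more false positives.
    """
    names = sorted(signatures)
    pairs = []
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            if grid_distance(signatures[first], signatures[second]) <= threshold:
                pairs.append((first, second))
    return pairs
```

The pairwise comparison is quadratic in the number of images, so for a large collection you would want to bucket or index the signatures rather than compare everything against everything.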
This is a 4chan archive - all of the shown content originated from that site. This means that 4Archive shows their content, archived. If you need information for a Poster - contact them.