
3D porn sort and duplicate search program


Thread replies: 19
Thread images: 2

File: 3DFileViewer_2017-07-13_21-12-08.png (926KB, 2087x1558px)
sup /h/

I thought I'd share something I'd been working on for a while- I have a habit of coming to H every once in a while and just binge saving images, but never remembering to sort them. That, and even if I get them in the right folder, I forget over time what images I've already saved. Since I don't feel like spending any money trying to remedy that with a similar image program, and I'm not that good of a programmer, I needed a different way to find duplicate images.

I run a folder of images through a Python script that renames each image to its average color in hexadecimal. Then I open up a Unity program I cooked up to see each image as a point in 3D space, with the coordinates based on the R, G, and B values in the name of the image.
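For anyone curious how the renaming step could work, here is a minimal sketch, not the actual script from this post. It assumes the pixels have already been decoded into (R, G, B) tuples by some imaging library, and `average_color_hex` is a made-up name for illustration.

```python
from typing import Iterable, Tuple

Pixel = Tuple[int, int, int]


def average_color_hex(pixels: Iterable[Pixel]) -> str:
    """Return the mean color of the pixels as a 6-digit hex string.

    Assumption: each pixel is an (R, G, B) tuple with channels in 0..255.
    """
    total_r = total_g = total_b = count = 0
    for r, g, b in pixels:
        total_r += r
        total_g += g
        total_b += b
        count += 1
    if count == 0:
        raise ValueError("image has no pixels")
    # Integer division keeps every channel inside 0..255.
    return "{:02x}{:02x}{:02x}".format(
        total_r // count, total_g // count, total_b // count
    )


# One pure red and one pure blue pixel average out to a dark purple.
print(average_color_hex([(255, 0, 0), (0, 0, 255)]))  # 7f007f
```

Two different images can land on the same average color, so a real renamer would probably need a suffix or counter to avoid overwriting files.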

clicking on a point shows you the image, its size, and a list of all the closest points to it. once you find duplicate images, you can switch between them and figure out which one you wanna get rid of. you can delete (read: move to a folder created by the program) pictures from there.
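The "closest points" list could be built roughly like this. It is a sketch under assumptions: file names start with the 6-digit hex color, distance is plain Euclidean distance in RGB space, and `hex_to_rgb` and `nearest` are hypothetical helpers, not what the Unity program actually does.

```python
import math
from typing import List, Tuple


def hex_to_rgb(name: str) -> Tuple[int, int, int]:
    """Parse a name like '7f007f.png' into an (R, G, B) tuple."""
    stem = name.split(".")[0]
    return int(stem[0:2], 16), int(stem[2:4], 16), int(stem[4:6], 16)


def nearest(target: str, names: List[str], k: int = 5) -> List[Tuple[float, str]]:
    """Return the k names closest to target as (distance, name) pairs."""
    origin = hex_to_rgb(target)
    ranked = sorted(
        (math.dist(origin, hex_to_rgb(n)), n) for n in names if n != target
    )
    return ranked[:k]


names = ["7f007f.png", "80007f.png", "ffffff.png"]
print(nearest("7f007f.png", names, k=1))  # [(1.0, '80007f.png')]
```

A small distance only means the average colors are close, so the candidates still need eyeballing before anything gets deleted.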

if any one is interested in playing around with it, lemme know
>>
>>4686713
I just use VisiPics. It's free and gets the job done in a less complicated and more automated way.
I just removed 5k images (~2GB) yesterday.
>>
>>4686736

... well fuck then, thanks for the tip. Guess I still learned stuff making it.

for instance, hentai color tends to fit a line of best fit
>>
>>4686753
Haha, well, VisiPics could certainly be faster and it hasn't been updated in a couple of years, but damn, it still does a pretty good job, I've been using it for years.

And yeah, even if you don't continue the project, the knowledge could be useful for something else in the future.
I guess the color there is mostly because of skin/penis color, though.
>>
while on the software topic, is there a good program that will display selected images in a well-ordered layout across all my monitors with minimal wasted space?
>>
>>4687013

Well, it doesn't sound too hard to make if nothing else. Care to elaborate?
>>
>>4686736
I use Awesome Duplicate Photo Finder. it's pretty straightforward and works well. I've even had it pick up on artists redrawing old images. I do wish there was a way to flag images as false positives or something for future scans though

http://www.duplicate-finder.com/photo.html
>>
>>4687013
Do you mean it should just tile them next to each other so as to minimize empty pixels?
>>
I have an old ass folder full of hentai, with a lot of doubles, what software should I use to filter those doubles and clean them quickly?
>>
that moment when hentai pics and data analytics merge, I'll still fap to that hentai-graph tho!
>>
>>4689092
I use Doublekiller and Visipics. Both are free and small downloads.
>>
>>4689140
The thing about my folders is that there are a lot of raws, translated and licensed versions of the same stuff. Would Doublekiller and VisiPics work with them?
>>
>>4686713
You may want to look into some of the more classical solutions to this kind of problem.

# Removing exact duplicates #
Apply a hash function to each image file and store each hash in a database next to a reference to the image. Then you can just search the database for files with duplicate hashes.
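A minimal sketch of the hashing approach, with two assumptions that are mine rather than part of the description above: an in-memory dict stands in for the database, and SHA-256 is the hash. `file_digest` and `find_exact_duplicates` are illustrative names.

```python
import hashlib
from collections import defaultdict
from pathlib import Path
from typing import Dict, List


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file's bytes in chunks so large images are not loaded whole."""
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def find_exact_duplicates(folder: Path) -> List[List[Path]]:
    """Group files under folder whose contents are byte-for-byte identical."""
    by_hash: Dict[str, List[Path]] = defaultdict(list)
    for path in sorted(folder.rglob("*")):
        if path.is_file():
            by_hash[file_digest(path)].append(path)
    # Only hashes shared by two or more files are duplicates.
    return [paths for paths in by_hash.values() if len(paths) > 1]
```

This only catches files that are identical byte for byte. A re-saved or resized copy hashes differently, which is what the next section is about.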

# Removing non exact duplicates #
For this you want an algorithm that tells you the similarity between two images. There are a number of these, but the structural similarity index (SSIM) is probably where you want to start.
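To give a feel for what that index computes, here is a simplified sketch. It applies the SSIM formula once over the whole image, whereas the usual method averages it over small local windows, so treat it as an illustration only. It assumes both images are already grayscale, the same size, and flattened into lists of 0..255 values. `global_ssim` is a made-up name.

```python
from typing import Sequence


def global_ssim(x: Sequence[float], y: Sequence[float],
                dynamic_range: float = 255.0) -> float:
    """Single-window SSIM over two equal-length grayscale pixel lists."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("images must be non-empty and the same size")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    # Stabilizing constants with the commonly used 0.01 and 0.03 factors.
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    return ((2 * mean_x * mean_y + c1) * (2 * cov + c2)) / (
        (mean_x ** 2 + mean_y ** 2 + c1) * (var_x + var_y + c2)
    )


ramp = [0, 50, 100, 150, 200, 255]
print(global_ssim(ramp, ramp))        # identical images score (about) 1
print(global_ssim(ramp, ramp[::-1]))  # a reversed ramp scores much lower
```

Scores near 1 mean the images are nearly the same. Where to put the cutoff for "duplicate" is a judgment call.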

Additionally if the goal is to save hard drive space then you may also want to consider stripping image meta data and running them through a lossless image compressor.
>>
>>4692232
>Additionally if the goal is to save hard drive space then you may also want to consider stripping image meta data and running them through a lossless image compressor.
This would probably be better achieved via an external script. Some formats, like png, have a massive spectrum of approaches to filter selection, bitdepth reduction, palette ordering, etc. The old pngout -r, then huffmixing can still beat many hours with something like pngwolf, then zopfli defluff and deflopt.
>>
>>4686713
I want to fuck with this for the sake of fucking with it

link?
>>
>>4686713
Wow this is more useful than the shit /g/ comes up with for the most part
>>
Doublekiller matches files by hash or by other selectable criteria. Visipics lets you examine each file before deletion, and it has a slider to specify how closely files must match.
>>
>>4686713
I wanna try it
>>
>>4689148
Yep, if you relax the slider on visipics enough it'll give a ton of false "close-enoughs".


This is a 4chan archive - all of the content originated from that site.