If you download many images from the Internet, bulk collections for instance from Usenet or P2P, then you will eventually face the problem that you have duplicate images in your collection. Manually exploring and deleting duplicate images should be out of the question if the collection contains 10K+ images unless you really have lots of time at hand. The best way would be to use a software that scans the entire image collection and either automatically deletes all dupes or displays possible hits so that the user only needs to confirm if it is indeed a dupe.
Deduper can do that. This software is easy to use and scans a folder and all its subfolders for images. It gives the user the choice to automatically run and delete images or display ever duplicate that was found so that the user can finalize the process. The user has the choice to select a folder that is save so to speak which means that the duplicate of an image in that folder will be deleted and not the image in the folder specified.
One weak point of Deduper is the fact that it only recognizes binary identical images which means that it does not recognize a dupe if it finds an image in two different sizes or one with text and one without text but the same size. I guess that's why it's so fast. A test run on a folder with 18000 wallpaper images did not take longer than a few minutes and had the result that almost 50% of the images were dupes resulting in a saving of more than 700 Megabytes on my hard drive.
Deduper is a nice fast program that definitely has its uses if you regularly download images from the Internet or already have many images on your hard drives.
Update: Deduper is no longer available. I suggest you use AntiDupl instead which provides you with similar options to find duplicate images on your system.
Advertising revenue is falling fast across the Internet, and independently-run sites like Ghacks are hit hardest by it. The advertising model in its current form is coming to an end, and we have to find other ways to continue operating this site.
We are committed to keeping our content free and independent, which means no paywalls, no sponsored posts, no annoying ad formats or subscription fees.
If you like our content, and would like to help, please consider making a contribution:
Ghacks is a technology news blog that was founded in 2005 by Martin Brinkmann. It has since then become one of the most popular tech news sites on the Internet with five authors and regular contributions from freelance writers.