Skip to content

Latest commit

 

History

History
17 lines (12 loc) · 781 Bytes

File metadata and controls

17 lines (12 loc) · 781 Bytes

Duplicate Image Detector

This program searches a directory for potentially duplicate images using perceptual hashing algorithms.

Performance

In experiments, the program is able to find all 1,002 duplicate images (out of a dataset of 11,002 total images) in about 40 seconds. This efficiency is achieved through the use of BK-Trees, which allows fast hash comparison. For more information, see here.

Runtimes

Requirements:

Instructions:

Place the script in the folder containing the images.