difPy - Python package for finding duplicate or similar images within folders
Major enhancements and new features:
- The lazy parameter was added to difPy.search, which allows difPy to search more efficiently for exact duplicates (i.e. two exact file copies). By default, lazy is set to True and should only be turned off when searching for images that are not exact duplicates (i.e. having different dimensions, different file types, etc.). Read more here.
- The similarity parameter was reduced from 50 to 5.
- difPy.search now supports the rotate parameter. If set to False, images will not be rotated on comparison, which can significantly reduce comparison times. Read more here.
- search.result is now simpler, with fewer levels of depth, and search.lower_quality now comes as a list. When invoked via the CLI, the lower_quality output file will now be in .txt format.

See the difPy usage guide for more details. Happy deduplicating! 🎉
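To make the effect of lazy, rotate and the similarity threshold more concrete, here is a minimal, self-contained sketch of the kind of work these options can skip. It is not difPy's implementation. The names Image, mse, rotate90 and is_match are hypothetical, and the sketch assumes that a lazy-style check rejects pairs whose original dimensions differ, that rotation handling means trying up to four orientations, and that similarity acts as a mean squared error threshold.

```python
# Illustrative sketch only; not difPy's implementation. All names are hypothetical.
from typing import NamedTuple


class Image(NamedTuple):
    size: tuple   # original (width, height) of the file
    pixels: list  # small square grayscale thumbnail used for comparison


def mse(a, b):
    """Mean squared error between two equally sized grayscale grids."""
    total = sum((pa - pb) ** 2 for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return total / (len(a) * len(a[0]))


def rotate90(grid):
    """Rotate a grid 90 degrees clockwise."""
    return [list(row) for row in zip(*grid[::-1])]


def is_match(a, b, threshold=0, rotate=True, lazy=True):
    """Return (matched, pixel_comparisons_performed)."""
    # Assumed lazy rule: different original dimensions means "not a duplicate".
    if lazy and a.size != b.size:
        return False, 0
    candidate, comparisons = b.pixels, 0
    # With rotate enabled, try 0, 90, 180 and 270 degrees; otherwise only 0.
    for _ in range(4 if rotate else 1):
        comparisons += 1
        if mse(a.pixels, candidate) <= threshold:
            return True, comparisons
        candidate = rotate90(candidate)
    return False, comparisons


original = Image((640, 640), [[0, 50], [100, 150]])
rotated = Image((640, 640), rotate90(original.pixels))
resized = Image((320, 320), original.pixels)          # same content, other size
noisy = Image((640, 640), [[2, 52], [102, 152]])      # MSE of 4.0 vs original

print(is_match(original, rotated, rotate=True))    # (True, 4)
print(is_match(original, rotated, rotate=False))   # (False, 1)
print(is_match(original, resized, lazy=True))      # (False, 0)
print(is_match(original, resized, lazy=False))     # (True, 1)
print(is_match(original, noisy, threshold=5))      # (True, 1)
```

In this toy model, disabling rotation cuts the worst case from four comparisons per pair to one, and the lazy check avoids pixel comparisons entirely for pairs with different dimensions, at the cost of missing resized copies.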
Initial beta version release of difPy v4.1.0.
Minor bug fixes:
- Fixed a bug where search.delete() would never successfully delete the lower quality images.

difPy v4 comes with major updates, code improvements and new features 🎉
- difPy.build and difPy.search, so that multiple searches can be performed on the same image repository without having to rebuild it.
- in_folder search, which allows searching separately among directories instead of in the union of all directories. Suggested by @audiomuze in #53.

The usage of difPy v4 differs significantly from previous versions. It is therefore recommended to consult the updated difPy Usage Documentation.
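The difference between searching in the union of all directories and searching each directory separately can be sketched as follows. This is only an illustration of the in_folder idea, not difPy code: the function find_exact_duplicates is hypothetical, and it uses SHA-256 hashes of raw file bytes as a stand-in for image comparison, which is an assumption made to keep the example dependency-free.

```python
# Illustrative sketch only; not difPy's implementation. Names are hypothetical.
import hashlib
import tempfile
from collections import defaultdict
from pathlib import Path


def find_exact_duplicates(directories, in_folder=False):
    """Group byte-identical files, per directory or across all directories."""

    def group(paths):
        by_hash = defaultdict(list)
        for path in sorted(paths):
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            by_hash[digest].append(f"{path.parent.name}/{path.name}")
        # Only hashes shared by more than one file are duplicate groups.
        return [names for names in by_hash.values() if len(names) > 1]

    files = {Path(d): [p for p in Path(d).iterdir() if p.is_file()]
             for d in directories}
    if in_folder:
        # Each directory is searched on its own.
        return {d.name: group(paths) for d, paths in files.items()}
    # Default: all directories are treated as one pool of files.
    return group([p for paths in files.values() for p in paths])


with tempfile.TemporaryDirectory() as root:
    a, b = Path(root, "a"), Path(root, "b")
    a.mkdir()
    b.mkdir()
    (a / "1.png").write_bytes(b"cat")
    (a / "2.png").write_bytes(b"cat")
    (b / "3.png").write_bytes(b"cat")
    (b / "4.png").write_bytes(b"dog")

    union = find_exact_duplicates([a, b])
    per_folder = find_exact_duplicates([a, b], in_folder=True)

print(union)       # [['a/1.png', 'a/2.png', 'b/3.png']]
print(per_folder)  # {'a': [['a/1.png', 'a/2.png']], 'b': []}
```

In the union search, the copy in folder b is reported together with the two files in folder a. In the per-folder search it is not, because folder b contains no duplicate of it.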
Initial beta version release of difPy v4.
New features and bug fixes:
- The limit_extensions option has been added to limit difPy's file search to known image file extensions, leading to increased speed and performance. Suggested by @audiomuze and implemented by @UplandsDynamic.
- search.stats output related to limit_extensions. It now includes logs of which files were skipped. Implemented by @UplandsDynamic.
- ModuleNotFoundError.

Improvements and bug fixes:
- search.stats now includes the logs of the deleted_files if the logs parameter is set to True.
Announcement: 🎉 On March 24, 2023 difPy reached 20k downloads on PyPI. Thank you! 💐 To celebrate, I am happy to announce the release of the official difPy.app, a web-based app that lets you compare images with difPy right from your browser! Read more.
Improvements and new features:
- The similarity parameter now only accepts two options: 'duplicates' or 'similar'.

Minor improvements:
Minor improvements and fixes:
- Option to move search.lower_quality images to a different folder, as suggested by @ManthanRami.