Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sameera-madushan/findm
Findm is a python script to find duplicate file copies in a given directory.
https://github.com/sameera-madushan/findm
duplicate-detection duplicate-files duplicatefilefinder file-hashing python
Last synced: 6 days ago
JSON representation
Findm is a python script to find duplicate file copies in a given directory.
- Host: GitHub
- URL: https://github.com/sameera-madushan/findm
- Owner: sameera-madushan
- License: mit
- Created: 2020-04-12T10:48:53.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2020-05-13T07:32:03.000Z (over 4 years ago)
- Last Synced: 2023-08-06T03:26:12.130Z (over 1 year ago)
- Topics: duplicate-detection, duplicate-files, duplicatefilefinder, file-hashing, python
- Language: Python
- Homepage:
- Size: 7.81 KB
- Stars: 11
- Watchers: 4
- Forks: 5
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Findm - Duplicate Files Finder
Findm is a python script to find duplicate file copies in a given directory.
Findm search the duplicates by computing a hash for every file while considering several other factors. This script is able to find duplicate images, documents, videos and audio files. However, when it comes to audio files, simillar audio files with different bitrates will not be identified as duplicates from this script, because different bit-rates will produce different checksums.
#### Findm without duplicate files :point_down:
#### Findm with duplicate files :point_down:
## Git Installation
```
# clone the repo
$ git clone https://github.com/sameera-madushan/Findm.git# change the working directory to Findm
$ cd Findm
```## Usage
```
python findm.py
```## References
https://stackoverflow.com/a/36113168/13276219