https://github.com/nwtgck/same-file-finder-python
Find duplicate files by message digest (hash)
https://github.com/nwtgck/same-file-finder-python
duplicate-files duplicate-finder
Last synced: 2 months ago
JSON representation
Find duplicate files by message digest (hash)
- Host: GitHub
- URL: https://github.com/nwtgck/same-file-finder-python
- Owner: nwtgck
- License: mit
- Created: 2019-12-02T03:49:51.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2019-12-02T07:11:37.000Z (over 5 years ago)
- Last Synced: 2025-02-06T13:31:03.994Z (4 months ago)
- Topics: duplicate-files, duplicate-finder
- Language: Python
- Homepage:
- Size: 2.93 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# same-file-finder
Find duplicate files by message digest (hash)## Usage
```bash
python3 main.py my/directory/which/may/have/duplicate/files/
```## Algorithm
* Get all file sizes
* Make a group by the file size
* Find duplicate files in the same file-size group