Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/avalentino/pydups
Find duplicate files in the specified directories
https://github.com/avalentino/pydups
utility
Last synced: 13 days ago
JSON representation
Find duplicate files in the specified directories
- Host: GitHub
- URL: https://github.com/avalentino/pydups
- Owner: avalentino
- License: bsd-3-clause
- Created: 2017-11-25T18:37:20.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2019-01-18T16:38:31.000Z (about 6 years ago)
- Last Synced: 2024-12-01T16:59:58.422Z (2 months ago)
- Topics: utility
- Language: Python
- Size: 33.2 KB
- Stars: 2
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.rst
- Changelog: NEWS.rst
- License: LICENSE
Awesome Lists containing this project
README
PyDups
======Find duplicate files in the specified directories.
:copiright: 2016-2019 Antonio Valentino
.. image:: https://travis-ci.org/avalentino/pydups.svg?branch=master
:target: https://travis-ci.org/avalentino/pydupsUsage
-----::
usage: pydups [-h] [--version] [-s] [-l] [--format {json,pprint,custom}]
[-k {name,name_and_size,md5}] [--clean] [-c CACHEFILE]
[-o OUTPUT] [-v]
datarootSearch all duplicate files in the specified directories.
Symbolic links are always ignored.
By default duplicte file names are searched but the criterion can be
customized using the "-k" ("--key") parameter.positional arguments:
dataroot path to the root of the directory tree to scanoptional arguments:
-h, --help show this help message and exit
--version show program's version number and exit
-s, --compute-size compute the total size of duplicate files
(default: False)
-l, --list-files dump the entire list of duplicate files
(default: False)
--format {json,pprint,custom}
select the format of file list, implies "-l"
("--list-files"). Default: "json"
-k {name,name_and_size,md5}, --key {name,name_and_size,md5}
"name" search duplicate basenames, "name_and_size"
files are considered duplicate if they have the
same basename and size, "md5" compare MD5 sum of
files (default: "name")
--clean remove duplicate files and replace them with
symbolic links (default: False)
-c CACHEFILE, --cache CACHEFILE
enable caching and use cache file (default: None)
-o OUTPUT, --output OUTPUT
save the list of duplicate files on the specified
output file. Implies "-l" ("--list").
-v, --verbose print verbose help messages (default: False)Copyright (C) 2016-2017 Antonio Valentino
License
-------BSD 3-Clause License (see LICENSE file).