https://github.com/drorasaf/duplicatechecker
Check for infringement/common code across multiple devices
- Host: GitHub
- URL: https://github.com/drorasaf/duplicatechecker
- Owner: drorasaf
- Created: 2019-06-02T13:50:20.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2019-06-02T14:02:16.000Z (about 7 years ago)
- Last Synced: 2025-02-26T19:47:50.327Z (over 1 year ago)
- Language: Python
- Size: 1000 Bytes
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# code duplication finder
This package traverses a list of repositories and looks for similarities among them,
either to avoid copyright conflicts or to check the quality of the code.
The default language it checks is Scala, but this can be configured.
## Installation
First install pmd:
``` sh
$ wget https://github.com/pmd/pmd/releases/download/pmd_releases%2F6.15.0/pmd-bin-6.15.0.zip
$ unzip pmd-bin-6.15.0.zip
$ alias cpd="$HOME/pmd-bin-6.15.0/bin/run.sh cpd"
```
Remember to add the alias to your .bashrc. The alias path assumes the archive was unzipped in your home directory.
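A shell alias is only available in an interactive shell, so a script that wants to run CPD generally has to call `run.sh` directly. The following is a minimal sketch of that idea, not the code used by this package: the helper names `build_cpd_command` and `run_cpd`, the `pmd_home` default, and the minimum token count of 100 are all assumptions for illustration. The `--minimum-tokens`, `--files` and `--language` options are CPD command-line options in PMD 6.

```python
import os
import subprocess


def build_cpd_command(source_dir, language="scala", minimum_tokens=100,
                      pmd_home=os.path.expanduser("~/pmd-bin-6.15.0")):
    """Build the argument list for a CPD run (hypothetical helper).

    pmd_home is assumed to be the folder created by unzipping the archive.
    """
    run_sh = os.path.join(pmd_home, "bin", "run.sh")
    return [
        run_sh, "cpd",
        "--minimum-tokens", str(minimum_tokens),  # arbitrary threshold
        "--files", source_dir,
        "--language", language,
    ]


def run_cpd(source_dir, **kwargs):
    """Run CPD and return the completed process.

    check=True is deliberately not used, because CPD may exit with a
    non-zero status when it reports duplicates.
    """
    command = build_cpd_command(source_dir, **kwargs)
    return subprocess.run(command, capture_output=True, text=True)
```

Calling `build_cpd_command("clones", language="python")` returns the argument list without running anything, which makes it easy to inspect before execution.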
## Configuration
Requirements:
- GitHub access token (default: the GITHUB_ACCESS_TOKEN environment variable)
- Text file with a list of repositories to check (default: repos.txt)
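The defaults above could be resolved roughly as in the sketch below. It is an illustration only and may differ from what `dope.py` actually does: the function names `parse_args` and `read_repo_list` are hypothetical, and the one-repository-per-line file format is an assumption. The flag names and default values are the ones given in this README.

```python
import argparse
import os


def parse_args(argv=None):
    """Parse the flags described in this README (hypothetical helper)."""
    parser = argparse.ArgumentParser(description="code duplication finder")
    parser.add_argument("--repo_list", default="repos.txt",
                        help="text file with the repositories to check")
    # Fall back to the environment variable when no token is passed.
    parser.add_argument("--token",
                        default=os.environ.get("GITHUB_ACCESS_TOKEN"),
                        help="GitHub access token")
    parser.add_argument("--language", default="scala",
                        help="language to check for duplicates")
    return parser.parse_args(argv)


def read_repo_list(path):
    """Return the non-empty lines of the file.

    Assumes one repository per line; the real format may differ.
    """
    with open(path) as handle:
        return [line.strip() for line in handle if line.strip()]
```

Passing the token through the environment variable rather than `--token` keeps it out of the shell history.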
## How to Run
python dope.py
### Setting a specific repository list file
python dope.py --repo_list bla.txt
### Setting a specific access token
python dope.py --token deadbeef
### Setting a specific language
python dope.py --language python
Note: The run creates a folder named clones and deletes it when it finishes. Because the program clones every listed repository, make sure there is enough hard drive space before you start.
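The create-then-delete lifecycle of the clones folder can be sketched with a context manager, together with a free-space check before cloning. This is an assumption about one reasonable way to do it, not the package's own code: `clones_dir` and `enough_space` are hypothetical names, and the required size is an estimate the caller must supply.

```python
import os
import shutil
from contextlib import contextmanager


@contextmanager
def clones_dir(path="clones"):
    """Create the working folder and remove it afterwards, even on error."""
    os.makedirs(path, exist_ok=True)
    try:
        yield path
    finally:
        shutil.rmtree(path, ignore_errors=True)


def enough_space(path, required_bytes):
    """Return True if the disk holding path has at least required_bytes free."""
    return shutil.disk_usage(path).free >= required_bytes
```

A caller would wrap the cloning and checking steps in `with clones_dir() as workdir:` so the folder is cleaned up even if a clone fails partway through.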