https://github.com/secretsquirrel/plagiarismchecker
Quick script to check for plagiarism between two documents
- Host: GitHub
- URL: https://github.com/secretsquirrel/plagiarismchecker
- Owner: secretsquirrel
- License: mit
- Created: 2017-10-26T19:38:20.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2020-11-22T04:10:57.000Z (over 5 years ago)
- Last Synced: 2025-04-14T01:12:36.474Z (about 1 year ago)
- Language: Python
- Size: 253 KB
- Stars: 16
- Watchers: 2
- Forks: 5
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
- Support: support/LICENSE.md
README
# PlagiarismChecker
Quick script to check for plagiarism between two documents.
Supports a configurable length to check and optional synonym matching. Enabling synonyms adds significant run time (see Speed results).
Synonyms are taken from: https://github.com/FinNLP/synonyms
**Documents must be TEXT files!**
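To illustrate what length-based checking with synonyms can look like, here is a minimal sketch. It is not the code in `check.py`: it assumes that "length to check" means a run of `n` consecutive words, and that synonyms are supplied as a plain dict mapping a word to a canonical form. The names `tokenize`, `ngrams`, and `shared_ngrams` are hypothetical.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def ngrams(words, n):
    """Return the set of all runs of n consecutive words."""
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def shared_ngrams(questionable, source, n=6, synonyms=None):
    """Return the n-word runs that appear in both texts.

    `synonyms` is an assumed format: a dict mapping a word to the
    canonical word it should be treated as, e.g. {"fast": "quick"}.
    """
    mapping = synonyms or {}
    q_words = [mapping.get(w, w) for w in tokenize(questionable)]
    s_words = [mapping.get(w, w) for w in tokenize(source)]
    return ngrams(q_words, n) & ngrams(s_words, n)


if __name__ == "__main__":
    source = "The quick brown fox jumps over the lazy dog"
    suspect = "A fast brown fox jumps over a fence"
    # Without synonyms only the runs that match word for word are found.
    print(shared_ngrams(suspect, source, n=4))
    # With a synonym table, "fast" is treated as "quick" before comparing.
    print(shared_ngrams(suspect, source, n=4, synonyms={"fast": "quick"}))
```

In this sketch a larger `n` reduces accidental matches on common phrases, while a smaller `n` catches shorter copied fragments at the cost of more false positives.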
## How to use
The checker was written as a class, but it can also be run as is from the command line:
```
python3 ./check.py questionable_file.txt possible_source.txt check_synonyms(true/false) length_to_check
time python3 ../PlagiarismChecker/check.py test_output.txt input.txt true 6
[snip]
```
## Speed results
```
$ wc test_output.txt
2 202 1170 test_output.txt
$ cat data/fw/input.txt | wc
4078 214831 1230725
```
With synonym checking:
```
$ time python3 ../PlagiarismChecker/check.py test_output.txt input.txt t 6
real 8m19.381s
user 8m19.288s
sys 0m0.028s
```
Without synonym checking:
```
$ time python3 ../PlagiarismChecker/check.py test_output.txt input.txt f 6
real 0m21.929s
user 0m21.880s
sys 0m0.044s
```
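To put the two timings side by side, the short snippet below converts the `real` values reported above into seconds and computes the ratio. The helper name `parse_time` is only illustrative, and it handles just the `XmY.YYYs` format that `time` printed here.

```python
def parse_time(stamp):
    """Convert a `time`-style stamp such as '8m19.381s' to seconds."""
    minutes, seconds = stamp.rstrip("s").split("m")
    return int(minutes) * 60 + float(seconds)


# "real" wall-clock values copied from the two runs above.
with_synonyms = parse_time("8m19.381s")
without_synonyms = parse_time("0m21.929s")

slowdown = with_synonyms / without_synonyms
print(f"{with_synonyms:.3f}s vs {without_synonyms:.3f}s: {slowdown:.1f}x slower")
# prints: 499.381s vs 21.929s: 22.8x slower
```

On these inputs, enabling synonyms made the run roughly 23 times slower.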