Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sandeepkundalwal/automated-plagiarism-detector

An automated plagiarism detector that handles unzipping, generates plagiarism report and scraps the reports for threshold plagiarism.
https://github.com/sandeepkundalwal/automated-plagiarism-detector

gradle html-scraper java jsoup maven plagiarism-detector python3 teaching-assistant unzipping-files

Last synced: 27 days ago
JSON representation

An automated plagiarism detector that handles unzipping, generates plagiarism report and scraps the reports for threshold plagiarism.

Awesome Lists containing this project

README

        

# Automated Plagiarism Detector
An automated plagiarism detector that handles unzipping, generates plagiarism report and scraps the reports for minimum threshold plagiarism.

### The project consists of three modules:



  1. Staging Files: All the zip files are unzipped and the files that are present in unzipped folder are segreggated based on a particular format which is {rollno.}_{questionno.}. Makes multiple directories based on the number of questions.


  2. Plagiarism Script: Checks for plagiarism and provides the percentage of plagiarism for each file against all the other files that are present in a directory. Generates a HTML file containing plagiarism report.

    - run script: python3 plag.py {Assignment file location} {Report generation location}


  3. Scrapping Plagiarism Report: Scrapes the percentage of plagiarism from each report generated by Plagiarism Script and returns a .txt file that contains the Roll No. of all the students who have plagiarism
    above the minimum allowed threshold.