{"id":18662298,"url":"https://github.com/dimits-ts/plagiarismdetection","last_synced_at":"2025-11-06T06:30:22.901Z","repository":{"id":103795506,"uuid":"384985414","full_name":"dimits-ts/PlagiarismDetection","owner":"dimits-ts","description":"A program that scans all text files in a given directory and finds the pairs that are more likely to have copied / plagiarized text.","archived":false,"fork":false,"pushed_at":"2021-07-11T15:47:37.000Z","size":14,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-12-27T16:40:10.366Z","etag":null,"topics":["artificial-intelligence","machine-learning","tf-idf"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dimits-ts.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2021-07-11T15:34:51.000Z","updated_at":"2024-07-29T03:43:27.000Z","dependencies_parsed_at":"2024-04-22T02:45:10.674Z","dependency_job_id":null,"html_url":"https://github.com/dimits-ts/PlagiarismDetection","commit_stats":null,"previous_names":["dimits-ts/plagiarismdetection"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dimits-ts%2FPlagiarismDetection","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dimits-ts%2FPlagiarismDetection/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dimits-ts%2FPlagiarismDetection/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dim
its-ts%2FPlagiarismDetection/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dimits-ts","download_url":"https://codeload.github.com/dimits-ts/PlagiarismDetection/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239484214,"owners_count":19646429,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["artificial-intelligence","machine-learning","tf-idf"],"created_at":"2024-11-07T08:11:17.759Z","updated_at":"2025-11-06T06:30:22.874Z","avatar_url":"https://github.com/dimits-ts.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# PlagiarismDetection\n\nScans all text files in a given directory and compares each one to all the others to find the pairs most likely to contain copied / plagiarized text.\n\nUses [TF-IDF](https://en.wikipedia.org/wiki/Tf%E2%80%93idf) term weighting to compare files, so that distinctive words count for more than common ones.\n\nAutomatically ignores binary / empty files. By default it only looks for .txt documents, but it can be told to scan all file types.\n\nUsed via the console; program parameters are determined at runtime and saved to a dedicated settings file. Test files are included in the \"Tests\" folder.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdimits-ts%2Fplagiarismdetection","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdimits-ts%2Fplagiarismdetection","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdimits-ts%2Fplagiarismdetection/lists"}