https://github.com/sidmishraw/scp
A data processing pipeline for text-mining on contents extracted from PDFs using Apriori and Simplicial Complex algorithms
https://github.com/sidmishraw/scp
apriori-algorithm association-rules docpruner document-clustering pdf-processor simplicial-complex simplicialcomplex text-mining
Last synced: 4 months ago
JSON representation
A data processing pipeline for text-mining on contents extracted from PDFs using Apriori and Simplicial Complex algorithms
- Host: GitHub
- URL: https://github.com/sidmishraw/scp
- Owner: sidmishraw
- License: bsd-2-clause
- Created: 2017-10-28T00:24:00.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2017-10-28T00:48:28.000Z (over 7 years ago)
- Last Synced: 2025-01-15T06:48:16.730Z (5 months ago)
- Topics: apriori-algorithm, association-rules, docpruner, document-clustering, pdf-processor, simplicial-complex, simplicialcomplex, text-mining
- Language: C++
- Homepage:
- Size: 268 MB
- Stars: 2
- Watchers: 4
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: Readme.md
- License: LICENSE