Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/cortega26/PDF-Text-Analizer

This repository houses a script that can download PDFs from a specified URL, convert them to text, and perform text analysis. This analysis includes identifying the language, eliminating stopwords, and counting word and phrase frequency. It's worth noting that the script is capable of analyzing texts in multiple languages.
https://github.com/cortega26/PDF-Text-Analizer

nlp ocr pdf pdf-converter text-analysis text-mining text-summarization

Last synced: about 2 months ago
JSON representation

This repository houses a script that can download PDFs from a specified URL, convert them to text, and perform text analysis. This analysis includes identifying the language, eliminating stopwords, and counting word and phrase frequency. It's worth noting that the script is capable of analyzing texts in multiple languages.

Lists