Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/brianlesko/text-compression-python

Upload a PDF, extract the text, and compress the text to a binary format
https://github.com/brianlesko/text-compression-python

Last synced: about 20 hours ago
JSON representation

Upload a PDF, extract the text, and compress the text to a binary format

Awesome Lists containing this project

README

        

# Text Compression from an PDF file
This code implements text parsing from a PDF File and then text compression

 

## Dependencies

This code uses the following libraries:
- `streamlit`: for building the user interface.
- `pyobjc`: for the PDF text extraction.
- `lz4`: for compressing the encoded text.

 

## Usage

Run the following commands:
```
pip install --upgrade streamlit zipfile bs4
streamlit run app.py
```

This will start the local Streamlit server, and you can access the chatbot by opening a web browser and navigating to `http://localhost:8501`.

 

## Topics
```
Python | Streamlit | Git | Low Code UI
text scraping | textual data | books
Self taught coding | Mechanical engineer | Robotics engineer
```
 


 

╭━━╮╭━━━┳━━┳━━━┳━╮╱╭╮ ╭╮╱╱╭━━━┳━━━┳╮╭━┳━━━╮
┃╭╮┃┃╭━╮┣┫┣┫╭━╮┃┃╰╮┃┃ ┃┃╱╱┃╭━━┫╭━╮┃┃┃╭┫╭━╮┃
┃╰╯╰┫╰━╯┃┃┃┃┃╱┃┃╭╮╰╯┃ ┃┃╱╱┃╰━━┫╰━━┫╰╯╯┃┃╱┃┃
┃╭━╮┃╭╮╭╯┃┃┃╰━╯┃┃╰╮┃┃ ┃┃╱╭┫╭━━┻━━╮┃╭╮┃┃┃╱┃┃
┃╰━╯┃┃┃╰┳┫┣┫╭━╮┃┃╱┃┃┃ ┃╰━╯┃╰━━┫╰━╯┃┃┃╰┫╰━╯┃
╰━━━┻╯╰━┻━━┻╯╱╰┻╯╱╰━╯ ╰━━━┻━━━┻━━━┻╯╰━┻━━━╯

 

X Logo             GitHub             LinkedIn

follow all of these or i will kick you