Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/yjg30737/pyqt-image-to-text
PyQt GUI example of image to text using image-to-text model
https://github.com/yjg30737/pyqt-image-to-text
ai huggingface image-captioning image-recognition image-to-text pyqt pyqt-examples pyqt-gui pyqt5 pyqt5-desktop-application pyqt5-examples pyqt5-gui python python3 qt
Last synced: about 1 month ago
JSON representation
PyQt GUI example of image to text using image-to-text model
- Host: GitHub
- URL: https://github.com/yjg30737/pyqt-image-to-text
- Owner: yjg30737
- License: mit
- Created: 2023-05-28T08:45:23.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-07-25T00:57:35.000Z (over 1 year ago)
- Last Synced: 2024-12-29T08:42:11.471Z (about 1 month ago)
- Topics: ai, huggingface, image-captioning, image-recognition, image-to-text, pyqt, pyqt-examples, pyqt-gui, pyqt5, pyqt5-desktop-application, pyqt5-examples, pyqt5-gui, python, python3, qt
- Language: Python
- Homepage:
- Size: 8.34 MB
- Stars: 4
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# pyqt-image-to-text
PyQt GUI example of image to text using image-to-text modelThe model i'm using is nlpconnect/vit-gpt2-image-captioning from huggingface.
This is for saving image files' name based on its image.
## Requirements
* PyQt5 >= 5.14
* transformers## How to Install
1. git clone ~
2. cd pyqt-image-to-text
3. python setup.py install
4. pip install -r requirements.txt
5. cd pyqt_image_to_text
6. python main.pySee "How to Use" below 🙂
## How to Use
If you want to use this as CLI version only, see scripts.py and use `saveImage(src, dst)` function.src can be list of files or one directory. dst is directory you want to save images.
### GUI
![image](https://github.com/yjg30737/pyqt-image-to-text/assets/55078043/f95b5c4a-8217-4b30-bcc8-5f0bfe4597cb)
Press "Set Directory" to add image files in list (only one directory per image to text process).Support png, jpg, jpeg, bmp.
After you've done it, press "Save" button to save files to your desired directory
You can see the result of it.
For example:
This image is converted into text as:
```
a tv sitting on top of a stand in a room
```Based on the text, file will be saved to dst folder named "a_tv_sitting_on_top_of_a_stand_in_a_room".
Example image files which i was using to test are inside each src and dst, you can compare with each other :)
## Troubleshooting
If you encounter any error, check this out:
* transformers is not recent version
* required modules to transformers are not recent version or not be installed (such as sentencepiece, safetensors)
* if it is related to scikit-learn, try reinstall sckit-learn. make sure previous versions of it are all deleted## See Also
I'm working on a lot of AI-related project.
* pyqt-openai - pyqt implementation of chatgpt
* pyqt-dreamstudio - using stable diffusion in pyqt desktop app