https://github.com/yjg30737/pyqt-image-to-text

PyQt GUI example of image to text using image-to-text model
Host: GitHub
URL: https://github.com/yjg30737/pyqt-image-to-text
Owner: yjg30737
License: mit
Created: 2023-05-28T08:45:23.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2023-07-25T00:57:35.000Z (almost 2 years ago)
Last Synced: 2024-12-29T08:42:11.471Z (7 months ago)
Topics: ai, huggingface, image-captioning, image-recognition, image-to-text, pyqt, pyqt-examples, pyqt-gui, pyqt5, pyqt5-desktop-application, pyqt5-examples, pyqt5-gui, python, python3, qt
Language: Python
Homepage:
Size: 8.34 MB
Stars: 4
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

# pyqt-image-to-text
PyQt GUI example of image to text using image-to-text model

The model I'm using is `nlpconnect/vit-gpt2-image-captioning` from Hugging Face.

The app generates a caption for each image and saves the image under a file name based on that caption.
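As a rough illustration of the captioning step, here is a minimal sketch that uses the `transformers` `pipeline` API with the model named above. The helper names `load_captioner` and `caption_image` are made up for this example and are not taken from the repository; running it also assumes a backend such as PyTorch and Pillow are installed.

```python
from typing import Callable

MODEL_NAME = "nlpconnect/vit-gpt2-image-captioning"


def load_captioner(model_name: str = MODEL_NAME) -> Callable:
    """Build an image-to-text pipeline (hypothetical helper, not from the repo).

    The model weights are downloaded the first time this is called.
    """
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import pipeline

    return pipeline("image-to-text", model=model_name)


def caption_image(captioner: Callable, image_path: str) -> str:
    """Return the first generated caption for one image, with whitespace trimmed."""
    # The image-to-text pipeline returns a list of dicts with a "generated_text" key.
    results = captioner(image_path)
    return results[0]["generated_text"].strip()


# Usage (the file name is a placeholder):
# captioner = load_captioner()
# print(caption_image(captioner, "example.png"))
```

Passing the captioner in as an argument keeps model loading separate from captioning, so the model is loaded once and reused for every image.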

## Requirements
* PyQt5 >= 5.14
* transformers

## How to Install
1. git clone ~
2. cd pyqt-image-to-text
3. python setup.py install
4. pip install -r requirements.txt
5. cd pyqt_image_to_text
6. python main.py

See "How to Use" below 🙂

## How to Use
If you only want the command-line version, see `scripts.py` and use the `saveImage(src, dst)` function.

`src` can be a list of files or a single directory. `dst` is the directory where the images will be saved.
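To show what "a list of files or a single directory" can mean in practice, here is a small sketch that normalizes either form into a list of image paths. It is not the code from `scripts.py`; `collect_images` is a hypothetical helper, and the extension list is borrowed from the GUI section on the assumption that the CLI accepts the same formats.

```python
from pathlib import Path
from typing import Iterable, List, Union

# Assumption: same formats as listed for the GUI.
SUPPORTED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".bmp"}

PathLike = Union[str, Path]


def collect_images(src: Union[PathLike, Iterable[PathLike]]) -> List[Path]:
    """Return supported image files from one directory or from a list of paths."""
    if isinstance(src, (str, Path)):
        root = Path(src)
        # A directory is expanded to its direct children; a single file is kept as-is.
        candidates = sorted(root.iterdir()) if root.is_dir() else [root]
    else:
        candidates = [Path(p) for p in src]
    # Keep only existing files whose extension is supported (case-insensitive).
    return [
        p for p in candidates
        if p.is_file() and p.suffix.lower() in SUPPORTED_EXTENSIONS
    ]
```

The result could then be passed file by file to whatever captioning and saving logic you use.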

### GUI

![image](https://github.com/yjg30737/pyqt-image-to-text/assets/55078043/f95b5c4a-8217-4b30-bcc8-5f0bfe4597cb)

Press "Set Directory" to add image files to the list (only one directory per image-to-text run).

Supported formats: png, jpg, jpeg, bmp.

After that, press the "Save" button to save the files to your desired directory.

You can then see the result.

For example:

This image is converted into text as:

```
a tv sitting on top of a stand in a room
```

Based on this text, the file is saved to the `dst` folder under the name "a_tv_sitting_on_top_of_a_stand_in_a_room".

The example image files I used for testing are in the `src` and `dst` folders, so you can compare them with each other :)
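The caption-to-file-name step can be sketched as below. This is only one way to do it, not necessarily how the repository does it: `caption_to_stem` and `destination_path` are hypothetical names, and both the removal of punctuation and the numeric suffix on name collisions are assumptions added for the example.

```python
import re
from pathlib import Path


def caption_to_stem(caption: str) -> str:
    """Turn a caption into a file-name stem by joining its words with underscores."""
    # Assumption: drop characters other than word characters, whitespace, and hyphens.
    cleaned = re.sub(r"[^\w\s-]", "", caption).strip()
    return re.sub(r"\s+", "_", cleaned)


def destination_path(dst_dir: str, caption: str, suffix: str) -> Path:
    """Build a path in dst_dir, appending _1, _2, ... if the name is already taken."""
    base = Path(dst_dir)
    stem = caption_to_stem(caption)
    candidate = base / f"{stem}{suffix}"
    counter = 1
    while candidate.exists():
        candidate = base / f"{stem}_{counter}{suffix}"
        counter += 1
    return candidate


print(caption_to_stem("a tv sitting on top of a stand in a room"))
# a_tv_sitting_on_top_of_a_stand_in_a_room
```

The collision check matters because a captioning model can easily produce the same caption for two similar images.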

## Troubleshooting
If you encounter an error, check the following:
* transformers is not a recent version
* modules required by transformers (such as sentencepiece or safetensors) are outdated or not installed
* if the error is related to scikit-learn, try reinstalling scikit-learn and make sure all previous versions of it are removed
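A quick way to check the points above is to print which versions are installed. The sketch below uses only the standard library (`importlib.metadata`, Python 3.8+); the package list simply mirrors the names mentioned in this section, and `installed_versions` is a name chosen for this example.

```python
from importlib import metadata
from typing import Dict, Iterable, Optional

PACKAGES = ["transformers", "sentencepiece", "safetensors", "scikit-learn"]


def installed_versions(packages: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if missing."""
    versions: Dict[str, Optional[str]] = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(PACKAGES).items():
        print(f"{name}: {version or 'not installed'}")
```

Anything reported as "not installed" or clearly outdated is a good first candidate for `pip install --upgrade`.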

## See Also
I'm working on a lot of AI-related projects.
* pyqt-openai - pyqt implementation of chatgpt
* pyqt-dreamstudio - using stable diffusion in pyqt desktop app
