An open API service indexing awesome lists of open source software.

https://github.com/quanpersie2001/imagecaptioning

Predicting a caption for a given image using Inception Net V3, LSTM and Glove
https://github.com/quanpersie2001/imagecaptioning

cnn image-captioning imagecaptioning inception-v3 lstm rnn

Last synced: about 2 months ago
JSON representation

Predicting a caption for a given image using Inception Net V3, LSTM and Glove

Awesome Lists containing this project

README

        

# Image Captioning
## Model using Inception Net V3, LSTM and Glove (Using SSD300 to improve feature)

![image](model.png)

## How to run?
### Install lib
```
pip install -r requirements.txt
```

### Download data
```console
python data_download.py
```
> **Note** : Dataset is MS COCO 2014 and Glove . This is large dataset, long download.
### Preprocess
You **must** run
```console
python preprocess.py
```
With COCO datase this command runs for a long time you can download and coppy them to `ROOT / process_data`

### [Download here](https://drive.google.com/drive/folders/1HDgToaiFKzVNTQZI1ts2Dlfgh1sVMk3D?usp=sharing)

### Trainning
```console
python train.py --batch-size 64 --output weights --epochs 30
```
You can download pre-train model an copy them to `ROOT / weights`
### [Download here](https://drive.google.com/drive/folders/1oXVC8fVioblaRpvB-tVtQsBHwKTmfMse?usp=sharing)

### Predict
```console
python predict.py --image path/to/image --weight path/to/weight --k-beam 9
```

## Result
![image](output.png)
You can see sumary in [summary.ipynb](summary.ipynb)