https://github.com/ajlearner46/image-caption-generator
Genrate descriptive captions for images using VGG16-LSTM
https://github.com/ajlearner46/image-caption-generator
Last synced: 8 months ago
JSON representation
Genrate descriptive captions for images using VGG16-LSTM
- Host: GitHub
- URL: https://github.com/ajlearner46/image-caption-generator
- Owner: AJlearner46
- Created: 2023-06-18T05:10:57.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2024-09-07T18:49:28.000Z (almost 2 years ago)
- Last Synced: 2025-03-01T02:47:23.591Z (over 1 year ago)
- Language: Jupyter Notebook
- Homepage: https://image-captioin-generator-aj.streamlit.app/
- Size: 6.22 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Image-Caption-Generator
While humans can interpret these images without detailed captions, machines require some form of image captions for automatic understanding.
This project aims to develop an end-to-end solution for generating descriptive captions for images using deep learning techniques.
demo : https://image-captioin-generator-aj.streamlit.app/
## Dataset
- flickr dataset link :- https://www.kaggle.com/datasets/adityajn105/flickr8k
- I used the Flickr8k Dataset, which contains 8092 photographs and text descriptions. Dataset contain 5 caption for each Image
## Methodology for Image Captioning
### 1. Data Preprocessing
- Extract image features
- Text preprocessing
- Train-Test split
- Data generator
### 2. Encoder-Decoder Architecture
- Load VGG16 model
- Encoder :
Image feature layer
Sequence feature layer
- Decoder

### 3. Training & Optimization
- Training model
- Evaluation of model
### 4. Frontend
- User interface using streamlit.

## Results
- The VGG16-LSTM model was trained for 20 epochs, achieving a low training loss of 2.1828.
- I evaluated the model using the BLEU score, with a focus on BLEU-1 score (0.536631).

#### Model :- https://www.kaggle.com/code/ajr094/image-caption-generator/output?select=best_model.h5
#### Kaggle NoteBook :- https://www.kaggle.com/code/ajr094/image-caption-generator/notebook