https://github.com/aakanksha1406/image-captioning
predicts the captions for the input image
https://github.com/aakanksha1406/image-captioning
cnn-model keras lstm matplotlib nltk python tensorflow
Last synced: 3 months ago
JSON representation
predicts the captions for the input image
- Host: GitHub
- URL: https://github.com/aakanksha1406/image-captioning
- Owner: aakanksha1406
- Created: 2024-08-07T12:58:30.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-08-07T13:31:39.000Z (almost 2 years ago)
- Last Synced: 2024-12-28T07:18:45.907Z (over 1 year ago)
- Topics: cnn-model, keras, lstm, matplotlib, nltk, python, tensorflow
- Language: Python
- Homepage:
- Size: 7.94 MB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Image Caption Generator - Flickr Dataset
**Environment:** Kaggle
## Project Information
The objective of the project is to predict the captions for the input image. The dataset consists of 8k images and 5 captions for each image. The features are extracted from both the image and the text captions for input. The features will be concatenated to predict the next word of the caption. CNN is used for image and LSTM is used for text. BLEU Score is used as a metric to evaluate the performance of the trained model.
## Libraries
- numpy
- matplotlib
- keras
- tensorflow
- nltk
## Neural Network
- VGG16 Network
- CNN-LSTM Network
**BLEU-1 Score:** 0.562578
**BLEU-2 Score:** 0.374006