Projects in Awesome Lists tagged with image-caption-generator
A curated list of projects in awesome lists tagged with image-caption-generator .
https://github.com/jmisilo/clip-gpt-captioning
CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.
computer-vision cv deep-learning image-caption image-caption-generator image-captioning machine-learning nlp python pytorch
Last synced: 09 Apr 2025
https://github.com/angeligareta/image-captioning
Image Caption Generator implemented using Tensorflow and Keras in a Python Jupyter Notebook. The goal is to describe the content of an image by using a CNN and RNN.
cnn cnn-keras cnn-tensorflow image-caption-generator image-captioning jupyter-notebook keras python rnn rnn-tensorflow tensorflow
Last synced: 22 Nov 2024
https://github.com/pavansomisetty21/image-caption-generation-using-llms-gemini-
we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI
artificial-intelligence description encoder-decoder-architecture gemini gemini-api gen-ai generate-contents generative-ai generativemodel image image-caption-generator image-captioning image-descriptions image-descriptor multi-model-learning openai vision vision-language-model vision-models visual-models
Last synced: 30 Apr 2025
https://github.com/german-zarate/image-captioning-app
Deployed image captioning ML model using Flask and access via Flutter app
flask flask-backend flutter image-caption-generator image-captioning python-backend
Last synced: 01 Feb 2025
https://github.com/smile040501/image_captioning
Generates textual description of any given image. Use both Natural Language Processing (NLP) and Computer Vision to generate captions. The idea implemented is to replace the encoder (RNN layer) in an encoder-decoder architecture with a deep Convolutional Neural Network (CNN) trained to classify objects in images.
artificial-intelligence convolutional-neural-networks image-caption-generator image-captioning image-processing image-recognition machine-learning neural-networks recurrent-neural-networks
Last synced: 06 Mar 2025
https://github.com/sreeeswaran/image-captioning-transformer
This project demonstrates an image captioning model using a Transformer architecture. The model takes an image as input and generates a descriptive caption. We use the COCO dataset for training and evaluation.
coco coco-dataset image-caption-generator image-captioning model neural-networks transformer transformer-models transformers
Last synced: 16 Mar 2025
https://github.com/snigdho8869/ai-generative-models-notebooks-dcgan-vae-autoencoder
This repository contains notebooks showcasing various generative models, including DCGAN and VAE for anime face generation, an Autoencoder for converting photos to sketches, a captioning model using an attention mechanism for an image caption generator, and more.
anime-face-generation autoencoder cnn dcgan deep-convolutional-gan generative-adversarial-network generative-ai generative-art generative-model generative-models image-caption-generator image-captioning image-generation image-processing photo-to-sketch sketches tensorflow upsample vae variational-autoencoder
Last synced: 28 Mar 2025
https://github.com/neemiasbsilva/minigpt4-image-caption-generation
Streamline the creation of supervised datasets to facilitate data augmentation for deep learning architectures focused on image captioning. The core framework leverages MiniGPT-4, complemented by the pre-trained Vicuna model, which boasts 13 billion parameters.
caption image-caption-generator minigpt4
Last synced: 23 Feb 2025
https://github.com/skp-1997/image-captioning-with-mlflow
This repository trains image captioning model using CNN and Transformers.
cnn-keras dagshub generative-ai image-caption-generator jupyter-notebook mlflow-tracking nlp-machine-learning transformers
Last synced: 22 Feb 2025
https://github.com/billy-enrizky/image-caption-generator
🚀 Image Caption Generator Project 🚀 🧠Building Customized LSTM Neural Network Encoder model with Dropout, Dense, RepeatVector, and Bidirectional LSTM layers. Sequence feature layers with Embedding, Dropout, and Bidirectional LSTM layers. Attention mechanism using Dot product, Softmax attention scores,...
computer-vision deep-learning flickr8k-dataset image-caption-generator image-captioning lstm-neural-networks natural-language-processing vgg16 vgg16-model
Last synced: 25 Feb 2025
https://github.com/smile040501/image_captioning_frontend
Generates textual description of any given image. Use both Natural Language Processing (NLP) and Computer Vision to generate captions. The idea implemented is to replace the encoder (RNN layer) in an encoder-decoder architecture with a deep Convolutional Neural Network (CNN) trained to classify objects in images.
artificial-intelligence convolutional-neural-networks image-caption-generator image-captioning image-processing image-recognition machine-learning neural-networks recurrent-neural-networks
Last synced: 06 Mar 2025
https://github.com/yeshalkhan/image-captioning-model
This repository contains code for an image caption generation model built using Tensorflow and trained on flickr_8k_dataset.
cnn computer-vision image-caption-generator keras lstm rnn sacrebleu-evaluation tensorflow
Last synced: 26 Mar 2025
https://github.com/ganesh2409/imagecaptioninganddescription
Image Captioning and Description is an AI-powered web application that generates captions and detailed descriptions for images uploaded by users. The app uses the BLIP (Bootstrapping Language-Image Pretraining) model for image captioning
bootstrapping-language-image-pretrained image-caption-generator machine-learning
Last synced: 10 Apr 2025
https://github.com/somenath203/four-in-one-ai-toolkit-powered-by-google-gemini-api
Click below to checkout the website
chatbot gemini gemini-api image-caption-generator python question-answering-system streamlie-cloud streamlit text-embedding
Last synced: 14 Mar 2025
https://github.com/tuanbeba/img_captions
Image caption generation based on encoder - decoder architecture with Flickr8k dataset. Beam search used for inference
beamer encoder-decoder-architecture image-caption-generator mobilenetv3 pytorch tranformers
Last synced: 29 Mar 2025
https://github.com/allenpandas/blip-imagecaption
BLIP-ImageCaption
blip image-caption image-caption-generation image-caption-generator image-captioning image-captions imagecaptioning
Last synced: 17 Mar 2025
https://github.com/tejas-130704/image-captioning
Image Captioning: This project utilizes deep learning for image captioning, combining a CNN-based feature extractor with an LSTM-based sequence generator.
computer-vision deep-neural-networks image-caption-generator lstm-neural-networks nlp resnet-50
Last synced: 20 Mar 2025