Projects in Awesome Lists tagged with imagecaptioning
A curated list of projects in awesome lists tagged with imagecaptioning.
https://github.com/guillaumegenthial/im2latex
Image to LaTeX (Seq2seq + Attention with Beam Search) - TensorFlow
attention-seq2seq beam-search im2latex imagecaptioning seq2seq seq2seq-attn show-and-tell tensorflow
Last synced: 06 Apr 2025
https://github.com/luopeixiang/im2latex
PyTorch implementation of Deep CNN Encoder + LSTM Decoder with Attention for Image to LaTeX
encoder-decoder-model im2latex imagecaptioning pytorch seq2seq show-and-tell
Last synced: 01 May 2025
https://github.com/MrAnayDongre/Machine-Learning-Collection
Repo for Implementing Research Papers & Projects related to Machine Learning
bidirectional-lstm chatgpt cnn-classification dcgan-pytorch deep-neural-networks diffusion-models gan imagecaptioning lstm-neural-networks machine-learning object-detection ollama pytorch-implementation pytorch-lightning pytorch-lightning-tutorial rag time-series-analysis transformer wgan-gp-pytorch yolo
Last synced: 25 Nov 2024
https://github.com/mingtaoguo/rnn-tensorflow
Some interesting applications of RNNs, e.g. char-RNN (poem generation), seq2seq (machine translation), and image captioning (NIC)
imagecaptioning poem-generator seq2seq tensorflow
Last synced: 20 Nov 2024
https://github.com/mountchicken/imagecaptioning-attention-pyqt5
Image captioning improved with an attention mechanism. Also includes a PyQt5 application.
attention imagecaptioning pyqt5 pytorch
Last synced: 20 Mar 2025
https://github.com/islam-hady9/generative-ai-models
Generative AI Models is a comprehensive repository dedicated to the implementation of cutting-edge generative AI models using Python. It features various models, including those for image captioning and text-to-image generation, leveraging advanced architectures like Vision Transformers (ViT), GPT-2, and Stable Diffusion.
computervision deeplearning generativeai gpt-2 huggingface-transformers imagecaptioning nlp pytorch stablediffusion text-to-image-generation visiontransformers
Last synced: 09 Feb 2025
https://github.com/adi2334/image-caption-generator
This project implements an image captioning model using a CNN-LSTM architecture. The model takes an image as input and generates a descriptive caption using natural language processing techniques.
cnn computer-vision deep-learning imagecaptioning lstm machine-learning neural-network tensorflow
Last synced: 18 Mar 2025
https://github.com/cizodevahm/image-caption-generator
This project is an image caption generator that uses a deep learning model to generate captions for images. The model is trained using the Flickr8k dataset and leverages a pre-trained Xception model for feature extraction and an LSTM network for sequence processing.
captions deep-learning imagecaptioning lstm nueral-networks xception-model
Last synced: 10 Mar 2025
https://github.com/shlokashah/deeplearning
Implementations of a few things learned from the deeplearning.ai specialization
deep-learning deeplearning-ai facenet facerecognition imagecaptioning lstm siamese-network
Last synced: 19 Feb 2025
https://github.com/quanpersie2001/imagecaptioning
Predicting a caption for a given image using Inception Net V3, LSTM, and GloVe
cnn image-captioning imagecaptioning inception-v3 lstm rnn
Last synced: 09 Apr 2025
https://github.com/allenpandas/blip-imagecaption
BLIP-ImageCaption
blip image-caption image-caption-generation image-caption-generator image-captioning image-captions imagecaptioning
Last synced: 17 Mar 2025
https://github.com/abdelrahman-amen/gemini_image_insights
This project leverages Google's Gemini LLM to analyze uploaded images and generate detailed descriptions or insights based on user prompts. It combines advanced AI capabilities with Streamlit for an interactive and user-friendly experience.
api dotenv gemini imagecaptioning os pil python streamlit
Last synced: 22 Mar 2025
https://github.com/gagan3012/image2audio
Convert an image to audio using ViT, GPT, and FastSpeech
fastspeech2 gpt-2 image-captioning imagecaptioning pytorch speech-to-text vit
Last synced: 07 Apr 2025