Projects in Awesome Lists tagged with image-description
A curated list of projects in awesome lists tagged with image-description.
https://github.com/oztrkoguz/comfyui_kosmos2_bbox_cutter
Image identification with the Kosmos2 model: draws and cuts out bounding boxes (bbox) found by object detection.
comfyui image-description kosmos2 nodes object-detection
Last synced: 12 May 2025
https://github.com/orengrinker/gpt4oimage
A Streamlit web application that uses OpenAI's GPT-4o to generate descriptions for uploaded images.
gpt4-api gpt4o gpt4o-mini gpt4omini image image-description openai-api
Last synced: 16 Aug 2025
https://github.com/alterism/mastodon-alt-text
Experiments with a dataset of alt-text usage by mastodon.social clients.
a11y accessibility aiss-master alt-text alttext data-science datascience fediverse image-description image-descriptions mastodon mastodon-social university university-project
Last synced: 21 Mar 2025
https://github.com/mejdihaddad/ai-powered-solution-for-assisting-visually-impaired-individuals
AI-powered solution for assisting visually impaired individuals.
ai-assistance computer-vision generative-ai google-gemini image-description machine-learning natural-language-processing ocr python streamlit text-to-speech
Last synced: 27 Apr 2026
https://github.com/shaswata56/genericai
An intelligent assistant powered by the ReAct framework, leveraging LangChain for tool-based reasoning and Gradio for a user-friendly interface. Supports tasks like weather queries, PDF summarization, image descriptions, and more.
acting agent ai-assistant chatbot-toolkit cot gradio image-description langchain llm mongodb openai pdf-summarization python react reasoning
Last synced: 12 Feb 2026
https://github.com/coding-enthusiast9857/gemini_llm_application
A Large Language Model (LLM) project that serves as a platform for developing, experimenting with, and applying language models.
ai dl gemini gemini-pro generative-models image-description language-modeling llm llm-model ml nlp open-api python question-answering-system streamlit text-generation
Last synced: 07 Feb 2026
https://github.com/louis-alexandre-laguet/goldenleaf
GoldenLeaf is a Python application for building an image search system with the CLIP model. It generates descriptions for herbarium images using the LLaVA model to improve multi-modal search. The system supports automated image description generation, multi-modal data augmentation, and customizable training configurations.
ai clip data-augmentation deep-learning image-description image-search llava multi-modal python
Last synced: 17 Apr 2026