Projects in Awesome Lists tagged with multimodal-processing
A curated list of projects in awesome lists tagged with multimodal-processing .
https://github.com/abhrankan-chakrabarti/geminifusion
A versatile web application that leverages advanced AI models, including Gemini Flash, DALL-E 3, and Stable Diffusion XL, to provide three main features: Chatbot Interaction, Image Captioning, and Text-to-Image Generation.
ai-chatbot ai-integration dall-e-3 gemini-pro gemini-pro-vision generative-ai image-captioning multimodal-processing stable-diffusion-xl text-and-vision
Last synced: 16 Jul 2025