An open API service indexing awesome lists of open source software.

https://github.com/eliabdiel/advanced-multimodal-ai

Built with Gemini API, LangChain, Chainlit, and Python, this multimodal assistant explores AI tool integration driven by curiosity about multimodal capabilities. It demonstrates modern AI development by combining cutting-edge multimodal AI with practical user interface design.
https://github.com/eliabdiel/advanced-multimodal-ai

artificial-intelligence assistant backend chainlit chatbot frontend gemini google langchain langgraph langsmith microsoft-azure oauth postgresql python question-answering rag software speech-to-text weasyprint

Last synced: about 2 months ago
JSON representation

Built with Gemini API, LangChain, Chainlit, and Python, this multimodal assistant explores AI tool integration driven by curiosity about multimodal capabilities. It demonstrates modern AI development by combining cutting-edge multimodal AI with practical user interface design.

Awesome Lists containing this project

README

          

# Advanced Multimodal AI

### Feature Gallery

| Feature | Feature |
|:--------------------------------:|:--------------------------------:|
| **User Authentication** | **Dashboard Overview** |
| ![Login](public/gifs/login.gif) | ![Home Page](public/gifs/dashboard.gif) |
| **Custom Command Configuration** | **AI Image Generation** |
| ![Commands](public/gifs/command.gif) | ![Image Generation](public/gifs/img-gen.gif) |
| **Web Content Extraction** | **Document Intelligence** |
| ![Web Scraping](public/gifs/scrape.gif) | ![PDF Q&A](public/gifs/files.gif) |
| **Advanced Research** | **Report Generation** |
| ![Deep Search](public/gifs/search.gif) | ![PDF Report](public/gifs/pdf.gif) |
| **Interactive Chat** | **Media Processing** |
| ![Conversational AI](public/gifs/chat.gif) | ![Video Transcripts](public/gifs/youtube.gif) |
| **Visual Analysis** | **Audio Comprehension** |
| ![Image Understanding](public/gifs/img-understand.gif) | ![Audio Understanding](public/gifs/audio.gif) |