Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with gpt-4-vision
A curated list of projects in awesome lists tagged with gpt-4-vision .
https://github.com/lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.
ai azure-openai-api chat chatglm chatgpt claude dalle-3 function-calling gemini gpt gpt-4 gpt-4-vision knowledge-base nextjs ollama openai qwen2 rag tts
Last synced: 17 Dec 2024
https://github.com/danny-avila/LibreChat
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
ai anthropic assistant-api azure bing chatgpt chatgpt-clone claude clone dall-e-3 gemini google gpt-4-vision langchain librechat openai plugins search vision webui
Last synced: 27 Oct 2024
https://github.com/szczyglis-dev/py-gpt
Desktop AI Assistant powered by o1, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, Bielik, DALL-E, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, agents, command execution, file upload/download, speech synthesis and recognition, access to Web, memory, presets, assistants, plugins, and more. Linux, Windows, Mac.
ai ai-assistant artificial-intelligence autonomous-agent bielik chatbot claude dalle-3 desktop-app gemini gpt-4 gpt-4-vision gpt4 langchain llama-index llama3 llm o1 ollama openai
Last synced: 19 Dec 2024
https://github.com/skythinker616/gpt-assistant-android
免费的ChatGPT API的安卓语音助手,可用音量键唤起并进行语音交流,支持联网、Vision拍照识图、提问模板等功能 | A free ChatGPT API voice assistant for Android, activated via volume keys for voice interaction, supporting features such as network connectivity, Vision photo recognition, and question templates.
android assistant chatgpt free-gpt gpt-4-vision markdown
Last synced: 09 Nov 2024
https://github.com/lancedb/vectordb-recipes
High quality resources & applications for LLMs, multi-modal models and VectorDBs
agents ai deep-learning embeddings fine-tuning gpt gpt-4-vision langchain llama-index llms machine-learning multimodal openai rag vector-database
Last synced: 14 Dec 2024
https://github.com/Skythinker616/gpt-assistant-android
免费的ChatGPT API的安卓语音助手,可用音量键唤起并进行语音交流,支持联网、Vision拍照识图、提问模板等功能 | A free ChatGPT API voice assistant for Android, activated via volume keys for voice interaction, supporting features such as network connectivity, Vision photo recognition, and question templates.
android assistant chatgpt free-gpt gpt-4-vision markdown
Last synced: 28 Oct 2024
https://github.com/skalskip/sports
Cool experiments at the intersection of Computer Vision and Sports ⚽🏃
computer-vision deep-learning deep-neural-networks gpt-4 gpt-4-vision object-detection prompt-engineering pytorch sports-analytics tutorial yolov5 yolov7
Last synced: 15 Dec 2024
https://github.com/SkalskiP/sports
Cool experiments at the intersection of Computer Vision and Sports ⚽🏃
computer-vision deep-learning deep-neural-networks gpt-4 gpt-4-vision object-detection prompt-engineering pytorch sports-analytics tutorial yolov5 yolov7
Last synced: 06 Nov 2024
https://github.com/typingmind/typingmind
The most advanced Web UI for AI chat
chatgpt chatgpt-ui claude claude2 gemini gemini-pro gpt-4 gpt-4-turbo gpt-4-vision typingmind webui
Last synced: 09 Nov 2024
https://github.com/wisconsinaivision/vip-llava
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
chatbot clip cvpr2024 foundation-models gpt-4 gpt-4-vision llama llama2 llava multi-modal vision-language visual-prompting
Last synced: 15 Dec 2024
https://github.com/developersdigest/ai-devices
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
function-calling gpt-4-vision groq langchain langsmith llama3 llava llm openai serper tts whisper
Last synced: 16 Dec 2024
https://github.com/vdutts7/gpt4v-scraper
AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.
ai-agents browser-automation gpt-4-vision puppeteer web-scraping
Last synced: 11 Nov 2024
https://github.com/vdutts7/gpt4V-scraper
AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.
ai-agents browser-automation gpt-4-vision puppeteer web-scraping
Last synced: 05 Nov 2024
https://github.com/tbckr/sgpt
SGPT is a command-line tool that provides a convenient way to interact with OpenAI models, enabling users to run queries, generate shell commands and produce code directly from the terminal.
bash cli go gpt-3 gpt-4 gpt-4-vision gpt-4-vision-preview gpt-4o openai shell
Last synced: 06 Nov 2024
https://github.com/mountaineerbr/shellchatgpt
Shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Mistral, Groq, and Anthropic integration.
awesome-chatgpt-prompts awesome-chatgpt-prompts-zh bash chat-completions chatbot claude-3 davinci gemini-api gemini-pro gpt-4-vision gpt-4o groq llama3 localai mistral-api o1-preview ollama terminal text-completions tts
Last synced: 18 Dec 2024
https://github.com/mountaineerbr/shellChatGPT
Shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Mistral, Groq, and Anthropic integration.
awesome-chatgpt-prompts awesome-chatgpt-prompts-zh bash chat-completions chatbot claude-3 davinci gemini-api gemini-pro gpt-4-vision gpt-4o groq llama3 localai mistral-api o1-preview ollama terminal text-completions tts
Last synced: 07 Nov 2024
https://github.com/nateraw/openai-vision-api-for-videos
Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦
chatgpt colab-notebook gpt-4 gpt-4-vision machine-learning openai python
Last synced: 17 Nov 2024
https://github.com/lazauk/aoai-gpt4vision-streamlit-sdkv1
Using Azure OpenAI deployment of GPT-4 Turbo with Vision to analyse out-of-stock situation in a fictitious retail shop.
ai azure gpt gpt-4-vision openai out-of-stock streamlit
Last synced: 13 Nov 2024
https://github.com/LazaUK/AOAI-GPT4Vision-Streamlit-SDKv1
Using Azure OpenAI deployment of GPT-4 Turbo with Vision to analyse out-of-stock situation in a fictitious retail shop.
ai azure gpt gpt-4-vision openai out-of-stock streamlit
Last synced: 06 Nov 2024
https://github.com/mickymultani/GPT-4-Vision-Architecture-Scanner
A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.
architecture-visualization computer-vision flask flask-api flask-application gpt-4 gpt-4-turbo gpt-4-vision gpt-4-vision-preview gpt-vision llm llms openai openai-chatgpt openapi
Last synced: 05 Nov 2024
https://github.com/neka-nat/mylangrobot
Language instructions to mycobot using GPT-4V
chatgpt gpt-4-vision gpt-4-vision-preview gpt4v mycobot segment-anything whisper
Last synced: 14 Oct 2024
https://github.com/kornia/pixie
Pixie: Computer Vision AI Engineer assistant
artificial-intelligence chatgpt computer-vision deep-learning geometry gpt-4-vision machine-learning robotics
Last synced: 13 Nov 2024
https://github.com/reidbarber/gen-ui
Use text or image prompts to generate components and apps built with React.
assistants-api codesandbox gpt-4 gpt-4-vision openai react sandpack
Last synced: 28 Oct 2024
https://github.com/mapluisch/gpt-4-vision-for-hololens
Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision)
gpt-4 gpt-4-vision gpt-4-vision-preview gpt4vision hololens hololens-applications hololens2 openai openai-api unity3d
Last synced: 13 Nov 2024
https://github.com/niawjunior/vision-speak
CameraVision: Capture, Analyze - Seamlessly integrate image analysis using GPT-4 Vision API and convert text to speech with Whisper AI
Last synced: 02 Dec 2024
https://github.com/c0mm4nd/command-windows
CommandWindows is a desktop opeating system copilot based on multi-modal large language model, supporting all-platforms which have application window
ai chatgpt copilot gemini gemini-pro-vision gpt gpt-4-vision
Last synced: 04 Dec 2024
https://github.com/philfung/awesome-computer-use
Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.
anthropic anthropic-claude computer-use computer-vision gpt-4-vision gui-agents llm rpa rpa-robotic-process-automation tool-use vision
Last synced: 25 Nov 2024
https://github.com/9vult/raiha
Raiha Discord Accessibility Bot
accessibility alt-text audio-transcription azure-cognitive-services discord discord-bot discord-moderation gpt-4-vision gpt-4o video-transcription whisper-ai
Last synced: 13 Nov 2024
https://github.com/corbindavenport/alt-text-creator
Browser extension that generates image alternate text, using GPT-4o or an LM Studio server.
chrome-extension chrome-extensions gpt-4 gpt-4-api gpt-4-vision gpt-4-vision-preview gpt-4o gpt-4o-m lm-studio lmstudio webextension webextensions
Last synced: 24 Oct 2024
https://github.com/paul-borisov/react-azure-open-ai-chat-web-part-spfx
Azure OpenAI SPFx web part for SharePoint Online offering user experience familiar to ChatGPT users. Supports Azure & Native OpenAI endpoints published via Azure API Management, Private & Shared Chats, Storage Encryption, Event Streaming, Code Highlighting, Full-screen mode, optional internet & data Integrations, PDF & Image analysis, Dalle3 Images
api-management azure azure-openai bing-search-api dalle-3 dalle3 function-calling google-search-api gpt-4-api gpt-4-vision gpt-4o gpt-4o-mini microsoft-api openai-api openai-chatgpt sharepoint-framework sharepoint-online sharepoint-webpart spfx spfx-webpart
Last synced: 15 Dec 2024
https://github.com/kwishna/openai-smart-vision
AI apps using OpenAI Vision model.
ai gpt-4-vision gpt-4o gpt-4omni openai
Last synced: 07 Nov 2024
https://github.com/cailailai/-chatgpt
安全可用ChatGPT国内中文版镜像网站整理(2024/10/09)
chatgpt chatgpt-4o chatgpt-4o-mini gpt-3-5-turbo gpt-4 gpt-4-vision gpt-4o openai
Last synced: 07 Dec 2024
https://github.com/aelew/ocr-api
A simple Golang API that detects and extracts text from images using OpenAI's GPT-4o-mini model.
api go golang gpt-4-vision gpt-4o gpt-4o-mini openai
Last synced: 17 Nov 2024
https://github.com/cailailai/chatgpt-cn
国内可用ChatGPT 国内国外镜像中文网站汇总(11/05更新)
ai chatgpt gpt gpt-35-turbo gpt-4-vision gpt-4o gpt-4o-mini midjourney openai
Last synced: 07 Nov 2024
https://github.com/ks6088ts-labs/extractor-python
A data extract tool written in Python
fitz gpt-4-vision openai pymupdf
Last synced: 09 Nov 2024
https://github.com/benderscript/netvision
Network Topology Image Analsysis
cisco gpt-4-vision images networking topology vision
Last synced: 14 Dec 2024
https://github.com/sacred-g/ai
PDF Chatbot, Image Chatbot, Web-Site Chatbot with a Knowledge base. OpenAI , Memory, PostgreSQL
assistant-chat-bots assistants autonomous docker embeddings gpt-4 gpt-4-vision image-recognition memory openai postgresql rag vector-database
Last synced: 15 Dec 2024