Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists tagged with gpt-4-vision

A curated list of projects in awesome lists tagged with gpt-4-vision.

https://github.com/lobehub/lobe-chat

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), a knowledge base (file upload / knowledge management / RAG), multi-modal features (Vision/TTS), and a plugin system. One-click FREE deployment of your private ChatGPT/Claude application.

ai azure-openai-api chat chatglm chatgpt claude dalle-3 function-calling gemini gpt gpt-4 gpt-4-vision knowledge-base nextjs ollama openai qwen2 rag tts

Last synced: 17 Dec 2024

https://github.com/danny-avila/LibreChat

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development

ai anthropic assistant-api azure bing chatgpt chatgpt-clone claude clone dall-e-3 gemini google gpt-4-vision langchain librechat openai plugins search vision webui

Last synced: 27 Oct 2024

https://github.com/szczyglis-dev/py-gpt

Desktop AI Assistant powered by o1, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, Bielik, DALL-E, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, agents, command execution, file upload/download, speech synthesis and recognition, access to Web, memory, presets, assistants, plugins, and more. Linux, Windows, Mac.

ai ai-assistant artificial-intelligence autonomous-agent bielik chatbot claude dalle-3 desktop-app gemini gpt-4 gpt-4-vision gpt4 langchain llama-index llama3 llm o1 ollama openai

Last synced: 19 Dec 2024

https://github.com/skythinker616/gpt-assistant-android

A free Android voice assistant built on the ChatGPT API. It is activated via the volume keys for voice interaction and supports web access, Vision photo recognition, and question templates.

android assistant chatgpt free-gpt gpt-4-vision markdown

Last synced: 09 Nov 2024

https://github.com/lancedb/vectordb-recipes

High quality resources & applications for LLMs, multi-modal models and VectorDBs

agents ai deep-learning embeddings fine-tuning gpt gpt-4-vision langchain llama-index llms machine-learning multimodal openai rag vector-database

Last synced: 14 Dec 2024

https://github.com/Skythinker616/gpt-assistant-android

A free Android voice assistant built on the ChatGPT API. It is activated via the volume keys for voice interaction and supports web access, Vision photo recognition, and question templates.

android assistant chatgpt free-gpt gpt-4-vision markdown

Last synced: 28 Oct 2024

https://github.com/wisconsinaivision/vip-llava

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

chatbot clip cvpr2024 foundation-models gpt-4 gpt-4-vision llama llama2 llava multi-modal vision-language visual-prompting

Last synced: 15 Dec 2024

https://github.com/developersdigest/ai-devices

AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more

function-calling gpt-4-vision groq langchain langsmith llama3 llava llm openai serper tts whisper

Last synced: 16 Dec 2024

https://github.com/vdutts7/gpt4v-scraper

AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.

ai-agents browser-automation gpt-4-vision puppeteer web-scraping

Last synced: 11 Nov 2024

https://github.com/vdutts7/gpt4V-scraper

AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.

ai-agents browser-automation gpt-4-vision puppeteer web-scraping

Last synced: 05 Nov 2024

https://github.com/tbckr/sgpt

SGPT is a command-line tool that provides a convenient way to interact with OpenAI models, enabling users to run queries, generate shell commands and produce code directly from the terminal.

bash cli go gpt-3 gpt-4 gpt-4-vision gpt-4-vision-preview gpt-4o openai shell

Last synced: 06 Nov 2024

https://github.com/mountaineerbr/shellchatgpt

Shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Mistral, Groq, and Anthropic integration.

awesome-chatgpt-prompts awesome-chatgpt-prompts-zh bash chat-completions chatbot claude-3 davinci gemini-api gemini-pro gpt-4-vision gpt-4o groq llama3 localai mistral-api o1-preview ollama terminal text-completions tts

Last synced: 18 Dec 2024

https://github.com/mountaineerbr/shellChatGPT

Shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Mistral, Groq, and Anthropic integration.

awesome-chatgpt-prompts awesome-chatgpt-prompts-zh bash chat-completions chatbot claude-3 davinci gemini-api gemini-pro gpt-4-vision gpt-4o groq llama3 localai mistral-api o1-preview ollama terminal text-completions tts

Last synced: 07 Nov 2024

https://github.com/nateraw/openai-vision-api-for-videos

Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦

chatgpt colab-notebook gpt-4 gpt-4-vision machine-learning openai python

Last synced: 17 Nov 2024

https://github.com/lazauk/aoai-gpt4vision-streamlit-sdkv1

Using Azure OpenAI deployment of GPT-4 Turbo with Vision to analyse out-of-stock situation in a fictitious retail shop.

ai azure gpt gpt-4-vision openai out-of-stock streamlit

Last synced: 13 Nov 2024

https://github.com/LazaUK/AOAI-GPT4Vision-Streamlit-SDKv1

Using Azure OpenAI deployment of GPT-4 Turbo with Vision to analyse out-of-stock situation in a fictitious retail shop.

ai azure gpt gpt-4-vision openai out-of-stock streamlit

Last synced: 06 Nov 2024

https://github.com/mickymultani/GPT-4-Vision-Architecture-Scanner

A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.

architecture-visualization computer-vision flask flask-api flask-application gpt-4 gpt-4-turbo gpt-4-vision gpt-4-vision-preview gpt-vision llm llms openai openai-chatgpt openapi

Last synced: 05 Nov 2024

https://github.com/neka-nat/mylangrobot

Language instructions to mycobot using GPT-4V

chatgpt gpt-4-vision gpt-4-vision-preview gpt4v mycobot segment-anything whisper

Last synced: 14 Oct 2024

https://github.com/reidbarber/gen-ui

Use text or image prompts to generate components and apps built with React.

assistants-api codesandbox gpt-4 gpt-4-vision openai react sandpack

Last synced: 28 Oct 2024

https://github.com/mapluisch/gpt-4-vision-for-hololens

Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision)

gpt-4 gpt-4-vision gpt-4-vision-preview gpt4vision hololens hololens-applications hololens2 openai openai-api unity3d

Last synced: 13 Nov 2024

https://github.com/niawjunior/vision-speak

CameraVision: Capture, Analyze - Seamlessly integrate image analysis using GPT-4 Vision API and convert text to speech with Whisper AI

camera gpt-4-vision whisper

Last synced: 02 Dec 2024

https://github.com/c0mm4nd/command-windows

CommandWindows is a desktop operating system copilot based on a multi-modal large language model, supporting all platforms that have application windows.

ai chatgpt copilot gemini gemini-pro-vision gpt gpt-4-vision

Last synced: 04 Dec 2024

https://github.com/philfung/awesome-computer-use

Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.

anthropic anthropic-claude computer-use computer-vision gpt-4-vision gui-agents llm rpa rpa-robotic-process-automation tool-use vision

Last synced: 25 Nov 2024

https://github.com/corbindavenport/alt-text-creator

Browser extension that generates image alternate text, using GPT-4o or an LM Studio server.

chrome-extension chrome-extensions gpt-4 gpt-4-api gpt-4-vision gpt-4-vision-preview gpt-4o gpt-4o-m lm-studio lmstudio webextension webextensions

Last synced: 24 Oct 2024

https://github.com/paul-borisov/react-azure-open-ai-chat-web-part-spfx

Azure OpenAI SPFx web part for SharePoint Online offering user experience familiar to ChatGPT users. Supports Azure & Native OpenAI endpoints published via Azure API Management, Private & Shared Chats, Storage Encryption, Event Streaming, Code Highlighting, Full-screen mode, optional internet & data Integrations, PDF & Image analysis, Dalle3 Images

api-management azure azure-openai bing-search-api dalle-3 dalle3 function-calling google-search-api gpt-4-api gpt-4-vision gpt-4o gpt-4o-mini microsoft-api openai-api openai-chatgpt sharepoint-framework sharepoint-online sharepoint-webpart spfx spfx-webpart

Last synced: 15 Dec 2024

https://github.com/kwishna/openai-smart-vision

AI apps using OpenAI Vision model.

ai gpt-4-vision gpt-4o gpt-4omni openai

Last synced: 07 Nov 2024

https://github.com/cailailai/-chatgpt

A curated list of safe, working Chinese-language ChatGPT mirror sites for users in mainland China (2024/10/09)

chatgpt chatgpt-4o chatgpt-4o-mini gpt-3-5-turbo gpt-4 gpt-4-vision gpt-4o openai

Last synced: 07 Dec 2024

https://github.com/aelew/ocr-api

A simple Golang API that detects and extracts text from images using OpenAI's GPT-4o-mini model.

api go golang gpt-4-vision gpt-4o gpt-4o-mini openai

Last synced: 17 Nov 2024

https://github.com/cailailai/chatgpt-cn

A roundup of Chinese-language ChatGPT mirror sites, domestic and overseas, that are usable in mainland China (updated 11/05)

ai chatgpt gpt gpt-35-turbo gpt-4-vision gpt-4o gpt-4o-mini midjourney openai

Last synced: 07 Nov 2024

https://github.com/ks6088ts-labs/extractor-python

A data extract tool written in Python

fitz gpt-4-vision openai pymupdf

Last synced: 09 Nov 2024

https://github.com/benderscript/netvision

Network Topology Image Analysis

cisco gpt-4-vision images networking topology vision

Last synced: 14 Dec 2024

https://github.com/sacred-g/ai

PDF Chatbot, Image Chatbot, Web-Site Chatbot with a Knowledge base. OpenAI, Memory, PostgreSQL

assistant-chat-bots assistants autonomous docker embeddings gpt-4 gpt-4-vision image-recognition memory openai postgresql rag vector-database

Last synced: 15 Dec 2024