An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with gpt-vision

A curated list of projects in awesome lists tagged with gpt-vision.

https://github.com/lambdua/openai4j

Java client library for the OpenAI API. Full support for all OpenAI API models, including Completions, Chat, Edits, Embeddings, Audio, Files, Assistants-v2, Images, Moderations, Batch, and Fine-tuning.

assistants-api chatgpt chatgpt4 gpt-4o gpt-4o-api gpt-vision gpt4 openai openai-api openai-api-chatbot openai-assistant-api openai-images openai-whisper

Last synced: 26 Nov 2024

https://github.com/libraryofcelsus/Aetherius_AI_Assistant

A completely private, locally operated AI Assistant/Chatbot/Sub-Agent Framework with realistic Long Term Memory and thought formation using Open Source LLMs. Qdrant is used for the Vector DB.

agent ai assistant autonomous chatbot companion customizable discord gpt gpt-4 gpt-vision long-term-memory multi-agent oobabooga qdrant research research-tool voice-assistant webscrape

Last synced: 24 Mar 2025

https://github.com/fingerthief/minimal-chat

MinimalChat is a lightweight, open-source chat application that allows you to interact with various large language models.

artificial-intelligence chat chat-application chatbot chatgpt chatgpt-api claude-3 gpt gpt-vision javascript llama llm meta-llama self-hosted selfhosted vue vue3 vuejs webapplication

Last synced: 04 Apr 2025

https://github.com/andraxdev/speak-gpt-web

Web version of SpeakGPT created using ReactJS and Google Material Design 3.

api chatbot chatgpt chatgpt-client claude gemini gpt gpt-4 gpt-4o gpt-vision jsx llama material-design openai openai-api perplexity react reactjs

Last synced: 07 May 2025

https://github.com/arshad-yaseen/pictocode

Convert Screenshots 📸 into Code 🧑‍💻

gpt-vision screenshot-to-code

Last synced: 10 Apr 2025

https://github.com/mickymultani/GPT-4-Vision-Architecture-Scanner

A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.

architecture-visualization computer-vision flask flask-api flask-application gpt-4 gpt-4-turbo gpt-4-vision gpt-4-vision-preview gpt-vision llm llms openai openai-chatgpt openapi

Last synced: 06 Apr 2025

https://github.com/zhudotexe/kani-vision

Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.

extension gpt-vision kani large-language-models llava multimodal-llm vision-language-model

Last synced: 20 Mar 2025

https://github.com/adityathakurxd/make-real-polls

Create interactive polls directly from whiteboard content. Built on top of the tldraw make-real template, with live audio-video by 100ms, it uses OpenAI's GPT Vision to generate an appropriate question with options, so a poll can be launched instantly to engage the audience.

100ms 100mslive gpt gpt-vision make-real openai-api tldraw

Last synced: 02 Mar 2025

https://github.com/concaption/school-info-parser

Extract structured data about courses, accommodations, and pricing from school prospectuses.

gpt-vision ocr pdf-parsing

Last synced: 22 Apr 2025