An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with vision-api

A curated list of projects in awesome lists tagged with vision-api.

https://github.com/KimDarren/FaceCropper

✂️ Crop faces inside your image with the iOS 11 Vision API.

face face-detection face-recognition ios ios11 swift vision vision-api

Last synced: 02 Aug 2025

https://github.com/Feghal/ImageDetect

✂️ Detect and crop faces, barcodes, and text in images with the iOS 11 Vision API.

barcode detector face face-detection face-recognition ios ios11 recognition swift vision vision-api

Last synced: 06 Aug 2025

https://github.com/harishdeivanayagam/rowfill

Open-source platform for processing unstructured data (PDFs, images, audio files), built for knowledge workers

document document-extraction document-parsing image-ocr langgraph llama llm nextjs ocr ocr-javascript ollama openai pdf pdfs unstructured unstructured-data vision vision-api

Last synced: 13 Apr 2025

https://github.com/cubist38/mlx-openai-server

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.

apple-silicon fastapi flux image-generation mlx mlx-lm mlx-vlm openai-compatible queue speech-recognition structured-outputs tool-calling vision-api whisper

Last synced: 12 Feb 2026

https://github.com/kyegomez/vit-rgts

Open source implementation of "Vision Transformers Need Registers"

attention-mechanism gpt4 vision-api vision-transformer vit

Last synced: 12 Apr 2025

https://github.com/xavimondev/vdbs

Convert your database diagrams into SQL Schema

cli hackaton-supabase nextjs openai shadcn-ui supabase tailwindcss typescript vision-api

Last synced: 09 May 2025

https://github.com/hugoqnc/Split

Split! is an app for iOS and iPadOS that allows a group of people to easily share expenses from a common receipt. Winner of the WWDC22 Swift Student Challenge.

ios swift swift-student-challenge swiftui vision-api wwdc

Last synced: 23 Jul 2025

https://github.com/wax911/vision-barcode-scanner

Customized Google Vision API Barcode Scanner

barcode-scanner qrcode-scanner vision-api

Last synced: 06 Jul 2025

https://github.com/amirandalibi/perception

A WordPress plugin that detects broad sets of objects in your media library images, from flowers, animals, and transportation to thousands of other object categories commonly found in images.

google-cloud-platform vision-api wordpress-plugin

Last synced: 17 Apr 2025

https://github.com/alfianlosari/xcawastickermaker

A WhatsApp Sticker Maker iOS App with Vision API Subject Lifting / Background Removal

background-removal ios ios-app swiftui vision-api whatsapp-sticker-maker

Last synced: 04 Oct 2025

https://github.com/simformsolutionspvtltd/ssbubblepop-visionos

This game showcases the immersive capabilities of VisionOS, where players pop floating bubbles 🫧 within a virtual space. It’s simple, addictive, and perfect for demonstrating the power of augmented reality gaming!

swift swiftui swiftui-components swiftui-demo swiftui-example swiftui-learning vision vision-api visionos visionos-app visionos2

Last synced: 10 Apr 2025

https://github.com/maxidonkey/delphigroqcloud

The GroqCloud API wrapper for Delphi provides access to models from Meta, OpenAI, MistralAI and Google on Groq’s LPUs, offering chat, text generation, image analysis, audio transcription, JSON output, tool integration, and content moderation capabilities.

api-wrapper audio-transcription delphi gpt groq groq-ai groq-api image-prompting llama llama3-1 llama3-2 lpu vision vision-api

Last synced: 26 Sep 2025

https://github.com/anubhavitis/peeksy

Screenshot naming automation tool for macOS

macos openapi rust screenshots vision-api

Last synced: 16 Aug 2025

https://github.com/sayakpaul/gcp-ml-api-demos

Contains Colab notebooks showing cool use cases of different GCP ML APIs.

cloud-ml gcp texttospeech video-intelligence-api vision-api

Last synced: 07 May 2025

https://github.com/williamtsoi1/vision-api-product-search-demo

Demo repository for Google Cloud Vision API Product Search

gcloud vision-api

Last synced: 15 Apr 2025

https://github.com/GINK03/keras-distillation

keras google-vision's distillation

keras keras-distillation tensorflow-basics vision-api

Last synced: 05 Apr 2025

https://github.com/amahanna/picsumvision

an interactive graph-based image repository

arangodb typescript vision-api

Last synced: 12 Apr 2025

https://github.com/anthonyjdella/summarize-text

📖 A Python app that uses text recognition on photos, then texts you a summary.

openai python text-recognition twilio vision-api

Last synced: 11 Apr 2025

https://github.com/shamspias/langchain-gemini-api

An innovative AI conversation API leveraging Google's Gemini for multimodal understanding. Combines FastAPI, Langchain, and Redis for robust, scalable, and privacy-conscious text and image-based interactions

fastapi gemini gemini-ai gemini-api gemini-pro gemini-pro-api gemini-pro-bot gemini-pro-vision google-gemini google-generative-ai langchain langchain-python large-language-models python vision-api

Last synced: 30 Jul 2025

https://github.com/iprajwaal/enhanced-vision-assistant

An AI-powered vision assistant for real-time navigation and awareness.

gemini-pro opencv vertex-ai vertex-ai-gemini-api vertexaisprint vision-api

Last synced: 14 May 2025

https://github.com/paolorotolo/happily

A simple Android Experiment you can control with a smile

andorid vision-api

Last synced: 13 May 2025

https://github.com/betiol/koa-google-vision-api

Vision API KoaJS Wrapper

google koajs vision-api

Last synced: 12 Apr 2025

https://github.com/fleker/food-foto-fitness

An app for quick-adding nutritional data to Google Fit using LLMs

gemini-api google-fit-api large-language-models nutrition vision-api

Last synced: 15 Oct 2025

https://github.com/fmxexpress/googlevisionapi

Detect labels for images with Google Cloud Vision API on Windows, Android, iOS, macOS, Linux https://cloud.google.com/vision/docs/labels

cloud delphi vision-api windows

Last synced: 30 Jul 2025

https://github.com/webmobi59/expo.io-google-cloud-ocr

React Native Google Vision Sample

expo google-cloud react-native vision-api

Last synced: 26 Apr 2025

https://github.com/fitomad/bicimad-bike-identifier

Keras, Vision, and CoreML used to build an app that identifies the serial numbers of bikes in the BiciMAD service

convolutional-neural-networks coreml coremltools image-recognition ios keras python swift swift4 vision vision-api

Last synced: 20 Mar 2025

https://github.com/ergonomech/ollama-model-interaction

A simple Gradio-based app for interacting with Ollama models, supporting image analysis, text completion, and model pulling

gradio llava ollama ollama-api vision-api

Last synced: 04 Sep 2025

https://github.com/anilpatidar/ai-appium-lens-plugin

This plugin was developed by Anil Patidar. The purpose of this repository is to welcome all QA folks to use this AI-enabled plugin in their mobile automation scripting.

appium appium-android appium-framework appium-ios appium-tests artificial-intelligence google-cloud googlevisionapi selenium vision vision-api

Last synced: 22 Aug 2025

https://github.com/emadehsan/food-classifier

Experimenting with Cloud Vision API. DEMO:

cloud computer-vision deep-learning gcp machine-learning vision-api

Last synced: 01 May 2025

https://github.com/adityanandanx/dental-conditions-detection

An AI-powered platform for detecting dental conditions in X-ray images using computer vision and DICOM processing

ai computer-vision dental fastapi inference medical-imaging ml nextjs opencv roboflow vision vision-api

Last synced: 12 Aug 2025

https://github.com/mecitsem/Arf-HashtagBot

Send a photo and automatically generate hashtags for it with this bot on Facebook, Telegram, and Skype channels.

bot cognitive-services facebook microsoft microsoft-bot-framework telegram vision-api

Last synced: 12 Apr 2025

https://github.com/marcominerva/visionplayground

A simple ASP.NET Core Application that takes a photo and calls Azure AI Vision to get a description

alpinejs azure azure-vision-api c-sharp dotnet minimal-api vision-api visual-studio

Last synced: 02 Jul 2025

https://github.com/souravdey777/readio

An audiobook 📚 🔊 generator for PDFs and a social platform for discussing books, research papers, and more.

audio-generator books google-cloud-platform pdf-converter social-media vision-api

Last synced: 04 Feb 2026

https://github.com/armaggheddon/gstgeminivision

Let your GStreamer pipelines describe what they see! 👁️‍🗨️ GstGeminiVision brings Google's Gemini Vision AI to your media streams for some serious (and fun!) video analysis. 🎥🤖✨

c-programming computer-vision docker gemini-api google-gemini gstreamer gstreamer-plugins python video-analysis vision-api

Last synced: 09 Oct 2025

https://github.com/starwit/vision-api

Data model for Starwit's Awareness Engine

sae vision-api

Last synced: 29 Jan 2026

https://github.com/cincibrainlab/autocleaneeg-icvision

Automated ICA artifact classification and removal for EEG data using OpenAI Vision API. Generates component visualizations, classifies artifacts, and produces cleaned datasets with detailed reports.

artifact-removal automation eeg eeglab ica machine-learning mne-python reproducibility vision-api

Last synced: 10 Mar 2026

https://github.com/meinside/telegram-bot-kakao-vision

A Telegram Bot using Kakao Vision API

golang kakao-api telegram-bot vision-api

Last synced: 14 Jan 2026

https://github.com/martijnvos/textvision

Android application which recognizes written text and pastes it into the clipboard

android-application clipboard machine-learning vision-api

Last synced: 29 Mar 2025

https://github.com/sivatsk26/development-of-an-automated-text-to-speech-device-for-visually-impaired

Developing a text-to-speech device that can capture an image (containing text), extract the text from image using OCR, translate the text to a desired language using G-Translator, and generate audio for the translated text using Google Cloud TTS

artificial-intelligence google-cloud google-translate-api machine-learning ocr-recognition picamera3 python raspberry-pi-4 text-to-speech vision-api

Last synced: 27 Jan 2026

https://github.com/jmbl1685/not-hot-dog

🌭 Not Hot Dog app using Node.js + Express, Vision API (Microsoft Azure), and VanillaJS

expressjs javascript microsoft-azure nodejs not-hotdog vision-api

Last synced: 30 Dec 2025

https://github.com/ghostrick/vision-api-hobby

🐕 Played around with the Vision API

elixir google-cloud-platform vision-api

Last synced: 11 Mar 2025

https://github.com/amahanna/rbwiki

querying Wikipedia content through photo recognition

typescript vision-api wikipedia-api

Last synced: 09 Nov 2025

https://github.com/expecho/imageobjectrecognizer

.NET Core console application that leverages the Azure Cognitive Services Vision SDK to recognize objects in a JPG image file.

azure azure-cognitive-services backpressure cloud computer-vision csharp dotnet-core multithreading net-core vision-api

Last synced: 06 Mar 2025

https://github.com/joseconstela/pi-node-cam-labels

Take pictures using a webcam and read through labels using the Google Vision Labels API

google raspberry-pi raspberry-pi-camera text-to-speech vision-api webcam webcam-capture

Last synced: 22 Mar 2025

https://github.com/sanmeet007/visionate

Chrome extension that automatically generates captions for images lacking alt text.

ai blip chrome-extension vision-api

Last synced: 10 Jun 2025

https://github.com/celluloid-camp/vision

Video detection and analysis application, features person and object detection and tracking used in celluloid.me

ai machine-learning mediapipe python3 vision-api vision-language-models

Last synced: 16 Feb 2026

https://github.com/jocelynvelarde/kiavision

Project for GPT-4 Powered App Creation and Evals Hackathon

assistant-chat-bots gpt-4 openai streamlit trulens vision-api

Last synced: 25 Feb 2025

https://github.com/mchmarny/logo-identifier

Simple logo identification service demonstrating use of Cloud SQL and Google Vision API in Cloud Run.

cloudrun cloudsql custom-metrics golang sql stackdriver vision-api

Last synced: 21 Feb 2025

https://github.com/michal-skoula/calorie-tracker

Laravel app for tracking calories using OpenAI's vision

ai calories-tracker laravel openai-api php tailwindcss vision-api

Last synced: 04 Mar 2025

https://github.com/wale254/software-engineering-at-university

🐲 University-level software engineering courses, from theory to practical implementation. This repo bridges the gap between academia and real-world development: Software Development Life Cycle, Agile, UML, design patterns, testing and maintenance. Includes notes, assignments, studies, and project templates covering analysis, software design, and teamwork.

build-automation computer-vision database-structure javascript jenkins mysql python rc-car software thymeleaf uml university university-project vision-api

Last synced: 28 Apr 2025

https://github.com/cedws/finch

A proof-of-concept for enhancing and organising an image collection using Google's Vision API

google-api image-processing vision-api

Last synced: 02 Mar 2025

https://github.com/en10/visionapi

Node.js API based on Google Cloud Vision API

javascript nodejs tensorflow vision vision-api

Last synced: 27 Mar 2025

https://github.com/malloxpb/lamo

An app that uses the Google Vision API on iOS

google ios swift vision-api

Last synced: 25 Oct 2025

https://github.com/sapuri/vision

Label detection camera app using Google Cloud Vision API

android ios javascript react-native vision-api

Last synced: 23 Sep 2025

https://github.com/xavimondev/visql

Supabase Launch Week X Hackathon - Platform to convert your database diagrams into functional SQL

cli nextjs openai shadcn-ui supabase-auth supabase-db supabase-storage tailwindcss typescript vision-api

Last synced: 17 Mar 2025

https://github.com/anubhavitis/homebrew-peeksy

Screenshot naming automation tool for macOS

homebrew open-source openai screenshot screenshot-utility vision-api

Last synced: 23 Aug 2025

https://github.com/getteli/bytetrash

Byte Trash: your electronic waste is worth Trashy Coin. A photo-visual intelligence application that identifies objects for smart cities.

ecology hackathon javascript jquery mysql smart-cities vision-api

Last synced: 24 Jun 2025

https://github.com/fapulito/build2gether

Project Repository for Build2Gether 2.0 Hackathon

accessibility adaptive-technology arduino makerspace vision-api

Last synced: 04 Jan 2026

https://github.com/starwit/vision-lib

Tools and functions to work with vision-api

sae vision-api

Last synced: 29 Jan 2026

https://github.com/riccardogiorato/together-ai-vision-examples

Together AI SDK Vision and OCR examples in Typescript and Python

ocr ocr-python together-ai togetherai vision-api

Last synced: 29 Jun 2025

https://github.com/terrylica/crypto-kline-vision-data

High-performance Python 3.13 market data via Failover Control Protocol (FCP): Cache > Vision API (S3) > REST. Binance OHLCV: Spot, USDT-M & Coin-M Futures. Polars internals, pandas/Polars API. Arrow IPC mmap cache (sub-ms), gap detection, rate-limit-aware retry, checksums. uv add crypto-kline-vision-data | from ckvd import CryptoKlineVisionData

arrow binance binance-api cache candlestick crypto data-pipeline failover futures market-data ohlcv pandas polars pypi python rate-limiting s3 vision-api

Last synced: 03 Mar 2026

https://github.com/edipo2s/gist-comment

Tech test app created for a recruitment process

android dagger2 gist-client glidev4 kotlin retrofit2 vision-api

Last synced: 07 Mar 2025

https://github.com/screenshotone/examples

Code examples and applications to demonstrate integration with the ScreenshotOne API

screenshot screenshotone screenshots vision-api

Last synced: 24 Feb 2025

https://github.com/drankush/keychat

KeyChat: A Multimodal Chat Client with Streamlit

chatgpt client dalle-2 llm multimodal openai python streamlit vision-api

Last synced: 16 Mar 2025

https://github.com/starwit/sae-database-writer

A Redis/Valkey client that stores all SAE data in a database.

sae vision-api

Last synced: 29 Jan 2026