Projects in Awesome Lists tagged with vision-api
A curated list of projects in awesome lists tagged with vision-api .
https://github.com/KimDarren/FaceCropper
:scissors: Crop faces, inside of your image, with iOS 11 Vision api.
face face-detection face-recognition ios ios11 swift vision vision-api
Last synced: 02 Aug 2025
https://github.com/Feghal/ImageDetect
✂️ Detect and crop faces, barcodes and texts in image with iOS 11 Vision api.
barcode detector face face-detection face-recognition ios ios11 recognition swift vision vision-api
Last synced: 06 Aug 2025
https://github.com/harishdeivanayagam/rowfill
Open-source unstructured data (PDFs, Images, Audiofiles) processing platform built for knowledge workers
document document-extraction document-parsing image-ocr langgraph llama llm nextjs ocr ocr-javascript ollama openai pdf pdfs unstructured unstructured-data vision vision-api
Last synced: 13 Apr 2025
https://github.com/cubist38/mlx-openai-server
A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.
apple-silicon fastapi flux image-generation mlx mlx-lm mlx-vlm openai-compatible queue speech-recognition structured-outputs tool-calling vision-api whisper
Last synced: 12 Feb 2026
https://github.com/kyegomez/vit-rgts
Open source implementation of "Vision Transformers Need Registers"
attention-mechanism gpt4 vision-api vision-transformer vit
Last synced: 12 Apr 2025
https://github.com/sanjujosh/auto-image-renamer
Rename images using deep learning
artificial-intelligence microsoft-api python vision-api
Last synced: 11 Aug 2025
https://github.com/xavimondev/vdbs
Convert your database diagrams into SQL Schema
cli hackaton-supabase nextjs openai shadcn-ui supabase tailwindcss typescript vision-api
Last synced: 09 May 2025
https://github.com/hironsan/google-vision-sampler
Code examples for Google Vision API.
face-detection google-vision label-detection logo-detection python text-detection vision-api
Last synced: 11 Aug 2025
https://github.com/hugoqnc/Split
Split! is an app for iOS and iPadOS, that allows a group of people to easily share expenses from a common receipt — Winner of WWDC22 Swift Student Challenge
ios swift swift-student-challenge swiftui vision-api wwdc
Last synced: 23 Jul 2025
https://github.com/wax911/vision-barcode-scanner
Customized Google Vision API Barcode Scanner
barcode-scanner qrcode-scanner vision-api
Last synced: 06 Jul 2025
https://github.com/amirandalibi/perception
A Wordpress plugin to detect broad sets of objects in your media library images, from flowers, animals, or transportation to thousands of other object categories commonly found within images.
google-cloud-platform vision-api wordpress-plugin
Last synced: 17 Apr 2025
https://github.com/alfianlosari/xcawastickermaker
A WhatsApp Sticker Maker iOS App with Vision API Subject Lifting / Background Removal
background-removal ios ios-app swiftui vision-api whatsapp-sticker-maker
Last synced: 04 Oct 2025
https://github.com/simformsolutionspvtltd/ssbubblepop-visionos
This game showcases the immersive capabilities of VisionOS, where players pop floating bubbles 🫧 within a virtual space. It’s simple, addictive, and perfect for demonstrating the power of augmented reality gaming!
swift swiftui swiftui-components swiftui-demo swiftui-example swiftui-learning vision vision-api visionos visionos-app visionos2
Last synced: 10 Apr 2025
https://github.com/maxidonkey/delphigroqcloud
The GroqCloud API wrapper for Delphi provides access to models from Meta, OpenAI, MistralAI and Google on Groq’s LPUs, offering chat, text generation, image analysis, audio transcription, JSON output, tool integration, and content moderation capabilities.
api-wrapper audio-transcription delphi gpt groq groq-ai groq-api image-prompting llama llama3-1 llama3-2 lpu vision vision-api
Last synced: 26 Sep 2025
https://github.com/boudhayan-dev/automatic-parking-system
An automatic parking system solution for the modern work spaces.
google-cloud-platform mit-appinventor ocr-recognition python raspberry-pi-3 thingspeak vision-api
Last synced: 28 Aug 2025
https://github.com/boudhayan-dev/blind-reader-project
A low cost reading device for blind people.
google-cloud ocr pygame python3 raspberry-pi-3 text-to-speech vision-api
Last synced: 13 Apr 2025
https://github.com/xevion/phototag
Rich tagging in the Terminal via Google Vision API
click google-vision-api iptc iptc-metadata jpeg label labeling metadata photo-tagging python python-click python3 raw raw-image vision-api
Last synced: 21 Jul 2025
https://github.com/anubhavitis/peeksy
screenshot name automation tool for macos
macos openapi rust screenshots vision-api
Last synced: 16 Aug 2025
https://github.com/sayakpaul/gcp-ml-api-demos
Contains Colab Notebooks show cool use-cases of different GCP ML APIs.
cloud-ml gcp texttospeech video-intelligence-api vision-api
Last synced: 07 May 2025
https://github.com/williamtsoi1/vision-api-product-search-demo
Demo repository for Google Cloud Vision API Product Search
Last synced: 15 Apr 2025
https://github.com/GINK03/keras-distillation
keras google-vision's distillation
keras keras-distillation tensorflow-basics vision-api
Last synced: 05 Apr 2025
https://github.com/amahanna/picsumvision
an interactive graph-based image repository
arangodb typescript vision-api
Last synced: 12 Apr 2025
https://github.com/anthonyjdella/summarize-text
📖 A Python app that uses text recognition on photos, then texts you a summary.
openai python text-recognition twilio vision-api
Last synced: 11 Apr 2025
https://github.com/shamspias/langchain-gemini-api
An innovative AI conversation API leveraging Google's Gemini for multimodal understanding. Combines FastAPI, Langchain, and Redis for robust, scalable, and privacy-conscious text and image-based interactions
fastapi gemini gemini-ai gemini-api gemini-pro gemini-pro-api gemini-pro-bot gemini-pro-vision google-gemini google-generative-ai langchain langchain-python large-language-models python vision-api
Last synced: 30 Jul 2025
https://github.com/iprajwaal/enhanced-vision-assistant
An AI-powered vision assistant for real-time navigation and awareness.
gemini-pro opencv vertex-ai vertex-ai-gemini-api vertexaisprint vision-api
Last synced: 14 May 2025
https://github.com/paolorotolo/happily
A simple Android Experiment you can control with smile
Last synced: 13 May 2025
https://github.com/yonet/ayscontent
Talks, Videos and workshops and abstracts.
accessibility ai babylonjs javascript mixed-reality speech talks text-to-speech typescript vision-api webxr workshop workshop-series
Last synced: 04 Nov 2025
https://github.com/fleker/food-foto-fitness
An app for quick-adding nutritional data to Google Fit using LLMs
gemini-api google-fit-api large-language-models nutrition vision-api
Last synced: 15 Oct 2025
https://github.com/kiritigowda/pi-vx
OpenVX for Raspberry Pi
api computer-vision conformance embedded-systems hobbyists machine-learning machine-vision open-source openvx openvx-conformant openvx-graph raspberry-pi real-time royalty-free samples standard vision-api
Last synced: 27 Feb 2026
https://github.com/fmxexpress/googlevisionapi
Detect labels for images with Google Cloud Vision API on Windows, Android, iOS, macOS, Linux https://cloud.google.com/vision/docs/labels
cloud delphi vision-api windows
Last synced: 30 Jul 2025
https://github.com/webmobi59/expo.io-google-cloud-ocr
React Native Google Vision Sample
expo google-cloud react-native vision-api
Last synced: 26 Apr 2025
https://github.com/fitomad/bicimad-bike-identifier
Keras, Vision y CoreML para desarrollar una app que identifica los números de serie de las bicicletas del servicio BiciMAD
convolutional-neural-networks coreml coremltools image-recognition ios keras python swift swift4 vision vision-api
Last synced: 20 Mar 2025
https://github.com/ergonomech/ollama-model-interaction
A simple Gradio-based app for interacting with Ollama models, supporting image analysis, text completion, and model pullin
gradio llava ollama ollama-api vision-api
Last synced: 04 Sep 2025
https://github.com/anilpatidar/ai-appium-lens-plugin
This plugin has been developed by Anil Patidar, and the purpose of this repository is to welcome all the QA folks. to make use of this AI enabled plugin in their mobile automation scripting.
appium appium-android appium-framework appium-ios appium-tests artificial-intelligence google-cloud googlevisionapi selenium vision vision-api
Last synced: 22 Aug 2025
https://github.com/emadehsan/food-classifier
Experimenting with Cloud Vision API. DEMO:
cloud computer-vision deep-learning gcp machine-learning vision-api
Last synced: 01 May 2025
https://github.com/adityanandanx/dental-conditions-detection
An AI-powered platform for detecting dental conditions in X-ray images using computer vision and DICOM processing
ai computer-vision dental fastapi inference medical-imaging ml nextjs opencv roboflow vision vision-api
Last synced: 12 Aug 2025
https://github.com/xbaysal11/face-tracker
:space_invader: Android Face-Tracking Playground
android cloud-vision-api eye-detection eye-tracking face-detection face-tracker face-tracking mobile-app mobile-development vision-api
Last synced: 02 Mar 2025
https://github.com/mecitsem/Arf-HashtagBot
Send a photo and generate auto hashtag for your photos with this bot on Facebook, Telegram and Skype channels.
bot cognitive-services facebook microsoft microsoft-bot-framework telegram vision-api
Last synced: 12 Apr 2025
https://github.com/marcominerva/visionplayground
A simple ASP.NET Core Application that takes a photo and calls Azure AI Vision to get a description
alpinejs azure azure-vision-api c-sharp dotnet minimal-api vision-api visual-studio
Last synced: 02 Jul 2025
https://github.com/souravdey777/readio
An Audiobook 📚 🔊 generator from pdf and a Socializing platform to discuss about Books, Research Papers, and etc.
audio-generator books google-cloud-platform pdf-converter social-media vision-api
Last synced: 04 Feb 2026
https://github.com/armaggheddon/gstgeminivision
Let your GStreamer pipelines describe what they see! 👁️🗨️ GstGeminiVision brings Google's Gemini Vision AI to your media streams for some serious (and fun!) video analysis. 🎥🤖✨
c-programming computer-vision docker gemini-api google-gemini gstreamer gstreamer-plugins python video-analysis vision-api
Last synced: 09 Oct 2025
https://github.com/starwit/vision-api
Data model for Starwit's Awareness Engine
Last synced: 29 Jan 2026
https://github.com/cincibrainlab/autocleaneeg-icvision
Automated ICA artifact classification and removal for EEG data using OpenAI Vision API. Generates component visualizations, classifies artifacts, and produces cleaned datasets with detailed reports.
artifact-removal automation eeg eeglab ica machine-learning mne-python reproducibility vision-api
Last synced: 10 Mar 2026
https://github.com/meinside/telegram-bot-kakao-vision
A Telegram Bot using Kakao Vision API
golang kakao-api telegram-bot vision-api
Last synced: 14 Jan 2026
https://github.com/martijnvos/textvision
Android application which recognizes written text and pastes it into the clipboard
android-application clipboard machine-learning vision-api
Last synced: 29 Mar 2025
https://github.com/mushfiqur-rahman/barcodereader
Google Mobile Vision API Barcode
android android-application barcode-reader google java vision-api
Last synced: 30 Mar 2025
https://github.com/sivatsk26/development-of-an-automated-text-to-speech-device-for-visually-impaired
Developing a text-to-speech device that can capture an image (containing text), extract the text from image using OCR, translate the text to a desired language using G-Translator, and generate audio for the translated text using Google Cloud TTS
artificial-intelligence google-cloud google-translate-api machine-learning ocr-recognition picamera3 python raspberry-pi-4 text-to-speech vision-api
Last synced: 27 Jan 2026
https://github.com/jmbl1685/not-hot-dog
:hotdog: Not Hot Dog App using Nodejs + Express, Vision API (Microsoft Azure), VanillaJS
expressjs javascript microsoft-azure nodejs not-hotdog vision-api
Last synced: 30 Dec 2025
https://github.com/ghostrick/vision-api-hobby
🐕 VisionAPIで遊んだ
elixir google-cloud-platform vision-api
Last synced: 11 Mar 2025
https://github.com/amahanna/rbwiki
querying Wikipedia content through photo recognition
typescript vision-api wikipedia-api
Last synced: 09 Nov 2025
https://github.com/expecho/imageobjectrecognizer
.Net Core Console Application that leverages the Azure Cognitive Services Vision SDK to recognize objects in a jpg image file.
azure azure-cognitive-services backpressure cloud computer-vision csharp dotnet-core multithreading net-core vision-api
Last synced: 06 Mar 2025
https://github.com/joseconstela/pi-node-cam-labels
Take pictures using webcam and read though labels using Google Vision Labels API
google raspberry-pi raspberry-pi-camera text-to-speech vision-api webcam webcam-capture
Last synced: 22 Mar 2025
https://github.com/sanmeet007/visionate
Chrome extension that automatically generates captions for images lacking alt text.
ai blip chrome-extension vision-api
Last synced: 10 Jun 2025
https://github.com/mainakverse/handwriting-to-text
Convert the most illegible handwriting to comfortably readable text
conversion deep-learning handwriting-recognition neural-network optical-character-recognition text vision-api vision-transformer
Last synced: 25 Feb 2025
https://github.com/23tae/instagrammable-garden-research
23-2 조경미학
computer-vision data-visualization image-classification instagram university vision-api wordcloud
Last synced: 06 Mar 2025
https://github.com/celluloid-camp/vision
Video detection and analysis application, features person and object detection and tracking used in celluloid.me
ai machine-learning mediapipe python3 vision-api vision-language-models
Last synced: 16 Feb 2026
https://github.com/jocelynvelarde/kiavision
Project for GPT-4 Powered App Creation and Evals Hackathon
assistant-chat-bots gpt-4 openai streamlit trulens vision-api
Last synced: 25 Feb 2025
https://github.com/mchmarny/logo-identifier
Simple logo identification service demonstrating use of Cloud SQL and Google Vision API in Cloud Run.
cloudrun cloudsql custom-metrics golang sql stackdriver vision-api
Last synced: 21 Feb 2025
https://github.com/gnaneshkunal/checkoutlogger
Save your bills online.
google-cloud googlecloudplatform nodejs vision-api
Last synced: 20 Oct 2025
https://github.com/michal-skoula/calorie-tracker
Laravel app for tracking calories using OpenAI's vision
ai calories-tracker laravel openai-api php tailwindcss vision-api
Last synced: 04 Mar 2025
https://github.com/wale254/software-engineering-at-university
🐲 University-level Software 🐳 Engineering courses 🦁 theory to practical 🍅 implementation this 🫑 repo bridges the 🍔 gap between academia 🚂 and real-world 🚞 development Software 🚒 Development Life Cycle Agile 🚞 UML Design Patterns 🚘 Testing and Maintenance 🚁 notes assignments studies ✈ and project templates analysis software design team🚤
build-automation computer-vision database-structure javascript jenkins mysql python rc-car software thymeleaf uml university university-project vision-api
Last synced: 28 Apr 2025
https://github.com/cedws/finch
A proof-of-concept for enhancing and organising an image collection using Google's Vision API
google-api image-processing vision-api
Last synced: 02 Mar 2025
https://github.com/en10/visionapi
Node.js API based on Google Cloud Vision API
javascript nodejs tensorflow vision vision-api
Last synced: 27 Mar 2025
https://github.com/georgiydemo/googlevisionexample
Google Cloud Vision API example
google-cloud opencv-python tensorflow-experiments vision-api
Last synced: 20 Jan 2026
https://github.com/sapuri/vision
Label detection camera app using Google Cloud Vision API
android ios javascript react-native vision-api
Last synced: 23 Sep 2025
https://github.com/pipeless-ai/home-assistant-custom-component
Custom component to integrate Pipeless Agents with home assistant
automation computer-vision home-assistant home-automation vision vision-api visionprocessing webhooks
Last synced: 23 Jul 2025
https://github.com/xavimondev/visql
Supabase Launch Week X Hackathon - Platform to convert your database diagrams into functional SQL
cli nextjs openai shadcn-ui supabase-auth supabase-db supabase-storage tailwindcss typescript vision-api
Last synced: 17 Mar 2025
https://github.com/anubhavitis/homebrew-peeksy
screenshot name automation tool for macos
homebrew open-source openai screenshot screenshot-utility vision-api
Last synced: 23 Aug 2025
https://github.com/getteli/bytetrash
Byte Trash - seu lixo eletrônico vale Trashy Coin. Aplicação de inteligencia fotovisual, identificacao de objetos para cidade inteligente.
ecology hackathon javascript jquery mysql smart-cities vision-api
Last synced: 24 Jun 2025
https://github.com/fapulito/build2gether
Project Repository for Build2Gether 2.0 Hackathon
accessibility adaptive-technology arduino makerspace vision-api
Last synced: 04 Jan 2026
https://github.com/starwit/vision-lib
Tools and functions to work vision-api
Last synced: 29 Jan 2026
https://github.com/riccardogiorato/together-ai-vision-examples
Together AI SDK Vision and OCR examples in Typescript and Python
ocr ocr-python together-ai togetherai vision-api
Last synced: 29 Jun 2025
https://github.com/terrylica/crypto-kline-vision-data
High-performance Python 3.13 market data via Failover Control Protocol (FCP): Cache > Vision API (S3) > REST. Binance OHLCV: Spot, USDT-M & Coin-M Futures. Polars internals, pandas/Polars API. Arrow IPC mmap cache (sub-ms), gap detection, rate-limit-aware retry, checksums. uv add crypto-kline-vision-data | from ckvd import CryptoKlineVisionData
arrow binance binance-api cache candlestick crypto data-pipeline failover futures market-data ohlcv pandas polars pypi python rate-limiting s3 vision-api
Last synced: 03 Mar 2026
https://github.com/edipo2s/gist-comment
Tech test app created for a recruitment process
android dagger2 gist-client glidev4 kotlin retrofit2 vision-api
Last synced: 07 Mar 2025
https://github.com/screenshotone/examples
Code examples and applications to demonstrate integration with the ScreenshotOne API
screenshot screenshotone screenshots vision-api
Last synced: 24 Feb 2025
https://github.com/drankush/keychat
KeyChat: A Multimodal Chat Client with Streamlit
chatgpt client dalle-2 llm multimodal openai python streamlit vision-api
Last synced: 16 Mar 2025
https://github.com/starwit/sae-database-writer
A redis/valkey client, that stores all SAE data into a database.
Last synced: 29 Jan 2026