Projects in Awesome Lists tagged with vision-framework
A curated list of projects in awesome lists tagged with vision-framework.
https://github.com/salesforce/lavis
LAVIS - A One-stop Library for Language-Vision Intelligence
deep-learning deep-learning-library image-captioning multimodal-datasets multimodal-deep-learning salesforce vision-and-language vision-framework vision-language-pretraining vision-language-transformer visual-question-anwsering
Last synced: 12 May 2025
https://github.com/salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
deep-learning deep-learning-library image-captioning multimodal-datasets multimodal-deep-learning salesforce vision-and-language vision-framework vision-language-pretraining vision-language-transformer visual-question-anwsering
Last synced: 14 Mar 2025
https://github.com/symisc/sod
An Embedded Computer Vision & Machine Learning Library (CPU Optimized & IoT Capable)
c computer-vision convolutional-neural-networks cpu deep-learning detection embedded face-detection facial-landmarks image-analysis image-processing image-recognition iot iot-device library machine-learning-algorithms object-detection real-time vision-framework webassembly
Last synced: 08 Apr 2025
https://github.com/Blaizzy/mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
apple-silicon florence2 idefics llava llm local-ai mlx molmo paligemma pixtral vision-framework vision-language-model vision-transformer
Last synced: 18 Jul 2025
https://github.com/blaizzy/mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
apple-silicon florence2 idefics llava llm local-ai mlx molmo paligemma pixtral vision-framework vision-language-model vision-transformer
Last synced: 03 Apr 2026
https://github.com/anupamchugh/iowncode
A curated collection of iOS, ML, AR resources sprinkled with some UI additions
alamofire arkit computer-vision coreml coremltools ios keras ml-kit natural-language-processing nlp realitykit swift swiftui vision vision-framework
Last synced: 22 Jul 2025
https://github.com/pipeless-ai/pipeless
An open-source computer vision framework to build and deploy apps in minutes
artificial-intelligence cloud computer-vision deep-learning ffmpeg gstreamer inference inference-server machine-learning multimedia multimedia-applications object-detection perception pipeline-framework python stream-processing video video-processing vision-framework yolo
Last synced: 09 Apr 2026
https://github.com/overeasy-sh/overeasy
Orchestrate zero-shot computer vision models
agent agents artificial-intelligence computer-vision llms open-source vision-framework
Last synced: 25 Sep 2025
https://github.com/keplerlab/katna
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
artificial-intelligence crops frame-extraction image-cropping image-processing image-resizing python video video-compression vision-framework
Last synced: 11 Apr 2026
https://github.com/DroidsOnRoids/VisionFaceDetection
An example of using the Vision framework for face landmark detection in iOS 11
ios11 landmark-detection landmarks vision vision-framework xcode9
Last synced: 10 Jun 2026
https://github.com/appcoda/textdetection
Vision Framework Demo on Text Detection
demo-app image-recognition ios11 swift swift4 text-detection tutorial-code vision-framework xcode9
Last synced: 21 Aug 2025
https://github.com/thomasgalliker/camerascanner.maui
Camera preview and barcode scanner for .NET MAUI apps
barcode barcode-reader barcode-scanner camera camera2 maui mlkit mlkit-barcode vision-framework
Last synced: 29 Jan 2026
https://github.com/lingdong-/visionosc
PoseOSC + FaceOSC + HandOSC + OcrOSC + CatOSC + DogOSC
computer-vision facial-landmarks-detection hand-tracking ocr osc pose-estimation vision-framework
Last synced: 29 Dec 2025
https://github.com/npna/CoreMLPlayer
Try CoreML models on multiple images and videos easily and quickly
coreml coreml-vision machine-learning macos object-detection swift swiftui vision-framework
Last synced: 14 Mar 2025
https://github.com/StanDimitroff/DocumentScanner
Simple document scanner built with Apple's Vision framework
carthage cocoapods document-scanner ios objective-c swift-5 vision-framework
Last synced: 21 Jul 2025
https://github.com/vikramparimi/vision-object-tracking
Object Tracking using Apple's VISION Framework
computer-vision image-recognition ios ios-swift ios-vision ios11 machine-learning object-tracking rectangle-detection vision-framework
Last synced: 07 May 2025
https://github.com/onl1ner/Hands
🖐 Memory game with hand gesture recognition that will keep your brain in a good shape!
avfoundation camera gesture-detection gesture-recognition hand-gesture hand-gesture-recognition hand-recognition handgesture-recognition just-dance playgrounds simon-game swift swift-student-challenge swiftplaygrounds uikit vision vision-framework wwdc wwdc-scholarship wwdc-scholarship-submissions
Last synced: 05 May 2025
https://github.com/thomasgalliker/CameraScanner.Maui
Camera preview and barcode scanner for .NET MAUI apps
barcode barcode-reader barcode-scanner camera camera2 maui mlkit mlkit-barcode vision-framework
Last synced: 02 May 2025
https://github.com/ankityddv/docscan-ios
Designing and developing an open-source PDF scanner from scratch. DocScan lets you scan and share your PDF documents.
camscanner docscanner ios scanner vision-framework visionkit
Last synced: 25 Jul 2025
https://github.com/shingt/beerclassifier
Sample app to classify beer bottle using Keras / Turi Create and Core ML.
coreml ios keras python swift vision-framework
Last synced: 23 Apr 2025
https://github.com/rxswiftcommunity/rxvision
RxVision (based on RxSwift)
ios reactive-programming rxswift vision vision-framework
Last synced: 05 Oct 2025
https://github.com/karami-mehdi/AISightQuest
Utilizing AI and machine learning, the project extracts text from images via Apple's Vision Framework and offers instant answers to questions in documents through the BERT model.
artificial-intelligence bert-model ios-app machine-learning scan-documents tipki vision-framework
Last synced: 23 Apr 2025
https://github.com/muhittincamdali/swiftintelligence
Privacy-first modular AI toolkit for Apple developers with Vision, NaturalLanguage, Speech, benchmarks, and release proof.
ai-toolkit apple apple-developer-tools benchmarking coreml ios macos natural-language on-device-ml privacy privacy-first speech swift swift-ai swift-package-manager tvos vision-framework visionos watchos
Last synced: 15 Apr 2026
https://github.com/ywake/unified_apple_vision
A plugin for using Apple Vision Framework with Flutter, designed to integrate multiple APIs into one plugin and process multiple analysis requests at once.
Last synced: 20 Feb 2026
https://github.com/mertozseven/qrreader
A powerful xcframework that provides a simple and customizable QR code and barcode scanning experience using the VisionKit framework.
barcode-scanner ios-swift qrcode qrcode-scanner swift uikit vision-framework xcframework
Last synced: 25 Feb 2026
https://github.com/wiltodelta/sleep-timer-app
A menu bar application for macOS that allows you to set a sleep timer to automatically put your Mac to sleep. Features both manual timer mode and intelligent camera-based sleep detection.
face-detection macos menubar-app pmset sleep-timer swift swiftui vision-framework
Last synced: 04 Jun 2026
https://github.com/anujdutt9/ios-machinelearning
Machine Learning iOS applications.
coreml coreml-framework coreml-models coreml-vision ios-app machine-learning swift3 vision-framework
Last synced: 17 Mar 2026
https://github.com/motemen/macos-obs-websocket-ocr
A proxy for obs-websocket that adds Optical Character Recognition (OCR) capabilities.
Last synced: 05 May 2026
https://github.com/knightbenax/Falk
macOS utility tool to copy text from videos and images on screen
macos macos-app osx swift swiftui vision-framework
Last synced: 03 Oct 2025
https://github.com/kayoslab/one-o-one
one-o-one can be used as an example implementation for the Apple StoreKit API. This project was an approach to work with the MNIST dataset to implement a children's learning game that teaches handwriting of numbers as well as basic arithmetic.
handwriting handwriting-recognition image-recognition mnist mnist-classification storekit tensorflow vision vision-framework
Last synced: 09 Mar 2026
https://github.com/davepoon/mlx-vlm-smolvlm-realtime-webcam
Real-time webcam demo with SmolVLM (mlx-community/SmolVLM-Instruct-4bit) and MLX-VLM
apple-silicon idefics llms mlx mlx-vlm vision-framework vision-transformer
Last synced: 03 Sep 2025
https://github.com/gergo225/smartcamera
Vision framework (on iOS) practice
apple ios practice swiftui vision-framework
Last synced: 15 May 2026