An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/lukexyz/flightvision

๐ŸŽฅ๐ŸŽฏ Tracking dart coordinates with fastai v2

computer-vision dart

Last synced: 19 Mar 2025

https://github.com/dayyass/synthetic-health-data-hackathon-2020

Synthetic Health Data Hackathon 2020 Alzheimer's MRI Analysis solution.

bioinformatics computer-vision image-classification mri-images pytorch synthetic-data

Last synced: 03 Feb 2026

https://github.com/victorqribeiro/naivecorners

A naive algorithm to identify corners on a image

computer-vision corner-detection image-processing naive python

Last synced: 05 May 2025

https://github.com/mlsanigeria/naija-nutri-hub-frontend

GenAI and Image-based food classifier that returns nutritional estimates, recipe suggestions, and nearby restaurants โ€” built with Azure AI Services.

azure computer-vision generative-ai hacktoberfest open-source

Last synced: 18 May 2026

https://github.com/cuixing158/cocoapi

Simple, efficient and convenient!

api coco computer-vision deep-learning deep-learning-datasets

Last synced: 02 May 2025

https://github.com/jgfranco17/torchcam

Applying PyTorch MiDaS model on live CV capture

computer-vision monocular-depth-estimation opencv-python pytorch

Last synced: 03 Feb 2026

https://github.com/colasgael/deepclimb

A CNN approach to automatically assess bouldering routes difficulty levels

climbing cnn computer-vision naive-bayes-classifier pytorch softmax-classification

Last synced: 14 Apr 2025

https://github.com/mrtj/yolox-panorama-tutorial

A step-by-step tutorial with example code on deploying a custom YOLOX object detector model on the AWS Panorama appliance.

aws-panorama computer-vision tutorial yolox

Last synced: 04 Mar 2026

https://github.com/neelanjan00/parkinsons-detection

An AI-based mobile application that is able to diagnose Parkinson's Disease using two independent tests that require only a pencil and a paper. Based on the 2017 research paper Distinguishing Different Stages of Parkinson's Disease Using Composite Index of Speed and Pen-Pressure of Sketching a Spiral by Zham et. al. The trained models were deployed using a Flask backend server, along with a Flutter based frontend mobile application frontend to interact with the REST API.

computer-vision flask machine-learning parkinsons-disease python

Last synced: 23 Mar 2025

https://github.com/abhi-abhi86/disease-predictor

AI disease detection and prediction for humans, plants, and animals. Complete ML project with custom training, offline operation, no API keys. Detect diseases from images using deep learning and computer vision. Open-source disease detection system for healthcare, agriculture, and veterinary applications. Full code and deployment guides.

agriculture-data ai-agent animals cnn computer-vision custom-training customtraining deeplearning diagnosis diseases healthcare-application imageprocessing machine-learning medical-application offline opensource plants prediction predictor veterinary

Last synced: 04 Mar 2026

https://github.com/balavenkatesh3322/video_analysis

This will read data from webcam or video and capture object names from video

aws-rekognition computer-vision json opencv python3 video-analysis video-processing yolov3

Last synced: 22 Apr 2025

https://github.com/jinglescode/meditorch

A PyTorch package for biomedical datasets processing, development and testing.

computer-vision deep-learning image-processing medical-imaging pytorch

Last synced: 15 Apr 2025

https://github.com/aimaster-dev/ocr-recaptcha

An OCR-based Python system to automatically solve some Bank CAPTCHA images. Includes scraping, training a CNN with PyTorch, and serving predictions via a Flask API. Ideal for CAPTCHA automation and dataset generation.

automation captcha character-recognition classification cnn computer-vision deep-learning flask-application image-recognition inference ocr pytorch-implementation recaptcha scraping training

Last synced: 18 May 2026

https://github.com/manciuszz/AHK-OpenCV

OpenCV 2.4 support for AutoHotkey.

autohotkey computer-vision opencv2

Last synced: 08 Apr 2025

https://github.com/abhinav-26/mlops-internship

In this repository I have put down all the Project which I did during my MLOps Internship from LinuxWorld Informatics Pvt Ltd under the mentorship of World Record Holder Mr. Vimal Daga sir.

autodeployments automation computer-vision deep-learning devops-tools docker face-mask-detection jenkins machine-learning ml-with-devops mlops opencv transfer-learning

Last synced: 03 Mar 2026

https://github.com/maheshkkumar/adacrowd

AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting (TMM 2021)

computer-vision crowd-counting few-shot-learning pytorch

Last synced: 15 Feb 2026

https://github.com/mc-cat-tty/DoorbellCamDaemon

Part of DoorbellCam project: daemon for people recognition with YOLO from a RTSP video stream.

cicd computer-vision container cpp docker doorbell ezviz mqtt mqtt-client opencv rtsp-client rtsp-stream smart-door smart-doorbell yolo yolov5

Last synced: 21 Apr 2025

https://github.com/akhdanfadh/efficient-capsnet-pytorch

PyTorch implementation of "Efficient-CapsNet: Capsule Network with Self-Attention Routing" (Mazzia et al., 2021).

capsule-network computer-vision deep-learning pytorch

Last synced: 06 Jul 2025

https://github.com/antoinehabis/torch_contour

python torch library for polygon / contour to image (mask, distance map) processing and more

computer-vision contour differentiable distance-map mask polygon toolbox torch

Last synced: 13 Jun 2026

https://github.com/PeterWang512/AttributeByUnlearning

Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."

computer-vision deeplearning diffusion generative-ai responsible-ai

Last synced: 08 Sep 2025

https://github.com/jaykef/min-patchnizer

Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.

computer-vision nlp patchnization tokenization transformer

Last synced: 25 Mar 2025

https://github.com/supernik17/demos

Robotics demos: ROS, SLAM, robust control, mobile robotics.

arduino computer-vision control mobile-robots multi-agent robotics ros ros2 slam

Last synced: 24 Jul 2025

https://github.com/codebydant/pcl_visualizer

Visualizer for 3D point cloud using PCL Library 1.9.1

c-plus-plus clang-format cmake computer-vision cpp pcd pcl pcl-library pcl-viewer ply point-cloud xyz

Last synced: 11 Mar 2025

https://github.com/ashad001/ml-dl-projects

This repository contains a compilation of my Python projects and notebooks focusing on Machine Learning and Deep Learning.

ai classifier cnn computer-vision deeplearning machinelearning nlp regression rnn

Last synced: 13 Sep 2025

https://github.com/Tritonix711/Gesture-Scroll-Control

Gesture Scroll Control is a Python application that lets you control scrolling on your desktop using hand gestures. With a simple hand movement, you can navigate through web pages and apps without needing to touch your mouse or keyboard. It's designed for ease of use and hands-free navigation.

automation computer-vision gesture-recognition hand-tracking mediapipe opencv python scrolling

Last synced: 24 Apr 2025

https://github.com/kesa2773/uiimagecolorpalette

UIImageColorPalette is a versatile utility for extracting the prominent colors from images in iOS. It efficiently identifies and provides the three most prevalent colors in a UIImage.

clustering-algorithms color-analysis color-palette computer-vision graphics-programming image-processing image-rendering ios-development objective-c uicolor uiimage

Last synced: 10 Apr 2025

https://github.com/sqmah/plate-reading-network

Plate reading network based on the LPRNet network written in Tensorflow 2.0

alpr computer-vision keras license-plate-recognition lprnet tensorflow tensorflow-models tensorflow2 tensorflow2-models

Last synced: 26 Apr 2025

https://github.com/natmlx/mobilenet-v2-unity

Realtime image classification in Unity Engine.

classification computer-vision edge-machine-learning machine-learning unity

Last synced: 25 Jul 2025

https://github.com/jeanchilger/image-editor

A simple image editor implemented in order to apply some cv algorithms and techniques.

computer-vision

Last synced: 15 Jul 2025

https://github.com/fancyboi999/ai-engineering-from-scratch-zh

ไปŽ้›ถ็ฒพ้€š AI ๅทฅ็จ‹ ยท 20 ้˜ถๆฎต 468 ่ฏพ ยท ไธญๆ–‡ๅ…จ้‡็ฟป่ฏ‘ + ้…ๅฅ—็ซ™็‚น ๅฆ‚ไฝ•ๆˆไธบไธ€ๅAgent ๅทฅ็จ‹ๅธˆ ไฟฎๆˆๆŒ‡ๅ—

agents ai ai-agents ai-engineering chinese chinese-translation computer-vision course deep-learning from-scratch generative-ai learn-ai llm machine-learning mcp nlp python reinforcement-learning transformers tutorial

Last synced: 01 Jun 2026

https://github.com/jacobmarks/image-deduplication-plugin

Remove exact and approximate duplicates from your dataset in FiftyOne!

computer-vision data-cleaning deduplication fiftyone image-processing plugin python similarity

Last synced: 31 Oct 2025

https://github.com/compphoto/intrinsichdr

Repo for the paper "Intrinsic Single-Image HDR Reconstruction" (ECCV 2024)

computational-photography computer-graphics computer-vision hdr machine-learning

Last synced: 28 Feb 2025

https://github.com/paulgavrikov/cvpr22w_robustnessthroughthelens

Official repository of our submission "Adversarial Robustness through the Lens of Convolutional Filters" for the CVPR2022 Workshop "The Art of Robustness: Devil and Angel in Adversarial Machine Learning Workshop"

adversarial-attacks cnn computer-vision convolution-filter convolutional-neural-networks cvpr2022 deep-learning machine-learning robustness robustness-analysis

Last synced: 17 Jan 2026

https://github.com/willyfh/farl-face-segmentation

A face segmentation implementation of FarRL model (CVPR 2022) using Facer, a face analysis toolkit for modern research.

artificial-intelligence computer-vision deep-learning face-analysis face-parsing face-segmentation facer farl maching-learning

Last synced: 10 Apr 2025

https://github.com/sooperset/boss

Bayesian Optimization Meets Self-Distillation, ICCV 2023

computer-vision deep-learning hyperparameter-optimization iccv2023 self-distillation

Last synced: 18 Jul 2025

https://github.com/jacobmarks/pytesseract-ocr-plugin

Run optical character recognition with PyTesseract from the FiftyOne App!

computer-vision document-understanding fiftyone nlp ocr plugin python tesseract tesseract-ocr

Last synced: 31 Oct 2025

https://github.com/bentoml/pneumonia-detection-demo

Pneumonia Detection - Healthcare Imaging Application built with BentoML and fine-tuned Vision Transformer (ViT) model

computer-vision health-care-application healthcare healthcare-imaging transformer

Last synced: 04 May 2025

https://github.com/retkowsky/swimmingpooldetection

Swimming pool detection with Azure Custom Vision

azure azure-custom-vision azurecognitiveservice computer-vision

Last synced: 22 Apr 2025

https://github.com/tritonix711/gesture-scroll-control

Gesture Scroll Control is a Python application that lets you control scrolling on your desktop using hand gestures. With a simple hand movement, you can navigate through web pages and apps without needing to touch your mouse or keyboard. It's designed for ease of use and hands-free navigation.

automation computer-vision gesture-recognition hand-tracking mediapipe opencv python scrolling

Last synced: 30 Apr 2025

https://github.com/johnbean393/text-scraper

A macOS app that can surface text on your displays and allow easy copying. Useful in apps and websites where text isn't copyable.

accessibility computer-vision copy copy-paste ocr swift swiftui vision

Last synced: 12 Apr 2025

https://github.com/mc-cat-tty/doorbellcamdaemon

Part of DoorbellCam project: daemon for people recognition with YOLO from a RTSP video stream.

cicd computer-vision container cpp docker doorbell ezviz mqtt mqtt-client opencv rtsp-client rtsp-stream smart-door smart-doorbell yolo yolov5

Last synced: 20 Mar 2025

https://github.com/smkim7-kr/AdaMatch-pytorch

Unofficial implementation of "AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation"

computer-vision deep-learning domain-adaptation semi-supervised-domain-adaptation semi-supervised-learning unsupervised-domain-adaptation

Last synced: 08 May 2025

https://github.com/wzh99/RealGes

SJTU CS386 Project: Real-time Dynamic Hand Gesture Recognition

computer-vision convolutional-neural-networks deep-learning gesture-recognition keras-tensorflow real-time

Last synced: 12 Apr 2025

https://github.com/lukexyz/FlightVision

๐ŸŽฅ๐ŸŽฏ Tracking dart coordinates with fastai v2

computer-vision dart

Last synced: 08 May 2025

https://github.com/anishlearnstocode/computer-vision-basics

Solutions Repository ๐Ÿ“• for Computer Vision Basics course on Coursera ๐ŸŽ“ offered by University of Buffalo ๐Ÿƒ and The State University of New York ๐Ÿ—ฝ

computer-vision coursera cv image-processing matlab matlab-gui nyu solutions solutions-repo solutions-repository

Last synced: 10 Apr 2025

https://github.com/Aavache/mlx-resnet

ResNet implementation with the MLX, Apple's DL framework.

apple computer-vision deep-learning machine-learning mlx mnist resnet

Last synced: 10 Apr 2025

https://github.com/priyanshudutta04/yoga-pose-detection

Real-time AI based yoga pose detection using CNN and YOLO

cnn computer-vision deep-learning tensorflow yoga-detection yolov8

Last synced: 07 Mar 2026

https://github.com/NJUyued/SoC4SS-FGVC

"Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning" by Yue Duan (AAAI 2024)

aaai2024 computer-vision fine-grained-classification fine-grained-visual-categorization machine-learning semi-supervised-classification semi-supervised-learning torch

Last synced: 05 Apr 2025

https://github.com/mrl-amrl/preimutils

All you need to preprocess and prepare your annotated dataset

computer-vision dataset-labels image-augmentation label-checker neural-network pascal-voc-format

Last synced: 10 Apr 2025

https://github.com/amilworks/3d-xception

Facebook DeepFake Detection Challenge: PyTorch 3D Xception Network for Video Classification

computer-vision docker facebook image-processing machine-learning pytorch video-classification video-processing

Last synced: 10 Apr 2025

https://github.com/di37/multiclass-image-classification-using-multimodal-llms

A comprehensive comparison of multimodal models - llama3.2-vision, minicpm-v, llava-llama3, llava, llava13:b and closed source models for animal classification tasks. This project evaluates various models' performance in classifying 10 different animal species, ranging from common to rare animals.

artificial-intelligence computer-vision gemini google-generative-ai large-language-models machine-learning natural-language-processing ollama openai python

Last synced: 10 Apr 2025

https://github.com/ilijamihajlovic/core-ml-and-vision-object-classifier-lightweight-version

Core ML and Vision object classifier with a lightweight trained model. The model is trained and tested with Create ML straight from Xcode Playgrounds with the dataset I provided.

computer-vision coreml coreml-framework coreml-image-classifier coreml-model coreml-models coreml-vision image-classification image-recognition machine-learning object-detection swift vison

Last synced: 11 Jul 2025

https://github.com/kyegomez/clipq

A simple implementation of a CLIP that splits up an image into quandrants and then gets the embeddings for each quandrant

artificial-intelligence clip computer-vision gpt4 multimodal vision-transformer vit

Last synced: 07 May 2025

https://github.com/aavache/mlx-resnet

ResNet implementation with the MLX, Apple's DL framework.

apple computer-vision deep-learning machine-learning mlx mnist resnet

Last synced: 09 Oct 2025

https://github.com/xverse-engine/xvrfacetracking

A VRCFaceTracking(VRCFT) based plugin aiming to provide real-time lower-face tracking solution for any VR headsets.

computer-vision vrcfacetracking vrchat

Last synced: 25 Jul 2025

https://github.com/varadbhogayata/visual-question-answering

The Intelligent system which gives answer to an Image and a Natural Language Question related to image

cnn-model computer-vision lstm-neural-networks natural-language-processing

Last synced: 15 Apr 2025

https://github.com/alvinwan/flowlets

converts KITTI tracklets into vectors denoting flow, for object tracking

computer-vision kitti-dataset object-tracking

Last synced: 24 Jul 2025

https://github.com/hlydecker/gis2coco

Convert geospatial datasets of shapefiles and rasters into COCO format datasets of images and JSON.

ai computer-vision deep-learning geospatial gis segmentation

Last synced: 17 Aug 2025

https://github.com/trishume/smartheadtracker

WIP OpenCV webcam head tracker based on coloured markers

computer-vision halide head-tracking opencv

Last synced: 14 Apr 2025

https://github.com/adithya-s-k/eyeontask

An OpenCV and Mediapipe-based eye-tracking and attention detection system that provides real-time feedback to help improve focus and productivity.

computer-vision flask mediapipe opencv

Last synced: 25 Oct 2025

https://github.com/michaeltroger/feature-matching-android

Augmented Reality Template Matching (Feature Matching) using OpenCV 4 for >= Android 5

android augmented-reality augmented-reality-applications computer-vision feature-matching opencv template-matching

Last synced: 17 Aug 2025

https://github.com/mli0603/tatoo

TAToo ("Vision-based Joint Tracking of Anatomy and Tool for Skull-base Surgery"), IPCAI 2023.

computer-vision deep-learning object-tracking

Last synced: 20 Aug 2025

https://github.com/microsoft/hams

Code for Automated License Testing from the HAMS group @ Microsoft Research, India

aruco computer-vision license-testing

Last synced: 11 May 2026

https://github.com/choyingw/gais-net

CVPR 2020 Workshop on Scalability in Autonomous Driving: GAIS-Net: Geometry-Aware Instance Segmentation with Disparity Maps

3d autonomous-vehicles computer-vision cvpr2020 deep-neural-networks geometry-processing instance-segmentation mask-rcnn multimodal-deep-learning scalability stereo-vision

Last synced: 22 Aug 2025

https://github.com/technexion-vision/vizionsdk

The cross-platform SDK for controlling and managing TechNexion Cameras

camera-api computer-vision toolkit

Last synced: 13 Mar 2026

https://github.com/matlab-deep-learning/classification-of-sars-covid-19-and-other-lung-infections-from-chest-x-ray-scan-images-with-densenet

This example shows how to train a deep neural network to classify SARS COVID-19 and other lung infections using chest X-ray (CXR) images.

chest-xray-images computer-vision covid19 deep-learning densenet matlab matlab-deep-learning medical-image-processing

Last synced: 06 Oct 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-state-of-depth-completion 71 awesome-robotics-datasets 79 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97