An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/sethuiyer/mlhub

Machine Learning Experiments - Practical Learning

computer-vision learning machine-learning neural-network tidbits

Last synced: 30 Apr 2025

https://github.com/azad-academy/personalized-diffusion

A Tutorial on Customized Image Generation by Fine-Tuning the Stable Diffusion Models

computer-vision deep-learning diffusion-models machine-learning

Last synced: 11 May 2025

https://github.com/matlab-deep-learning/automate-labeling-in-image-labeler-using-a-pretrained-tensorflow-object-detector

This example shows how to automate object labeling in the Image Labeler app using a TensorFlow object detector model trained in Python.

co-execution computer-vision deep-learning image-labeling matlab object-detection python

Last synced: 07 May 2025

https://github.com/suutari/meterelf

Water meter reading with Computer Vision

computer-vision opencv python python3

Last synced: 23 Jun 2025

https://github.com/realorangeone/zoloto

A fiducial marker system powered by OpenCV - Supports ArUco and April

computer-vision fiducial-markers hacktoberfest opencv python

Last synced: 19 Jun 2025

https://github.com/Powercube7/YOLOv5-GUI

A simple GUI made for creating jobs in YOLOv5

computer-vision gui yolov5

Last synced: 21 Apr 2025

https://github.com/rexledesma/haze-removal

An implementation of "Single Image Haze Removal Using Dark Channel Prior" by He et al. 2009

computer-vision haze-removal opencv-python python

Last synced: 26 Jun 2025

https://github.com/folhascadentes/mamao-app

An open-source platform designed for the crowdsourcing of sign language datasets. The project aims to foster a diverse and inclusive dataset, collected from a broad user base, to train and improve AI-driven sign language translation systems

accessibility artificial-intelligence computer-vision crowdsourcing dataset inclusive-design machine-learning open-source sign-language translation

Last synced: 12 Aug 2025

https://github.com/mickymultani/GPT-4-Vision-Architecture-Scanner

A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.

architecture-visualization computer-vision flask flask-api flask-application gpt-4 gpt-4-turbo gpt-4-vision gpt-4-vision-preview gpt-vision llm llms openai openai-chatgpt openapi

Last synced: 06 Apr 2025

https://github.com/yizhe-ang/adamatch-pytorch

Unofficial PyTorch Implementation of AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation

computer-vision deep-learning domain-adaptation pytorch semi-supervised-learning unsupervised-learning

Last synced: 14 Mar 2025

https://github.com/nengelmann/meetingcam

Run your AI and CV algorithms in meetings such as Zoom, Meets or Teams! ๐Ÿš€

aritificial-intelligence computer-vision meeting-tools meetings meets online-meetings teams webcam webcam-hacking zoom

Last synced: 07 Apr 2025

https://github.com/nengelmann/MeetingCam

Run your AI and CV algorithms in meetings such as Zoom, Meets or Teams! ๐Ÿš€

aritificial-intelligence computer-vision meeting-tools meetings meets online-meetings teams webcam webcam-hacking zoom

Last synced: 14 Apr 2025

https://github.com/akrielz/vision_models_playground

Playground for testing and implementing various Vision Models

artificial-intelligence attention-mechanism computer-vision deep-learning pytorch transformer

Last synced: 26 Oct 2025

https://github.com/jamarshon/chesspdftofen

Annotates PDF with comments containing FEN of detected chessboards

chess computer-vision machine-learning python

Last synced: 12 Jul 2025

https://github.com/addleonel/anpr-arduino

Automatic Number Plate Recognition (ANPR ) using Arduino

anpr arduino-uno computer-vision

Last synced: 22 Feb 2026

https://github.com/branch-bunch/facedex

PokรฉDex for people! Gotta tag em all!

camera computer-vision ios pokedex swift tag-em

Last synced: 18 Apr 2025

https://github.com/ieee8023/blindtool

BlindTool โ€“ A mobile app that gives a "sense of vision" to the blind with deep learning

blind computer-vision deep-learning

Last synced: 02 May 2025

https://github.com/intel-retail/rtsf-at-checkout-reference-design

Detect loss at self-checkout by seamlessly connecting different sensor devices, including weight scale sensors, cameras, and RFIDs.

computer-vision deep-learning edge edge-ai edge-computing image-recognition inference intel live-demo machine-learning object-detection openvino pretrained-models real-time reference-implementation video

Last synced: 18 Aug 2025

https://github.com/jishanshaikh4/alpha-net

Alpha-Net: Architecture, Models, and Applications (https://arxiv.org/abs/2007.07221)

alpha-net computer-vision machine-learning

Last synced: 28 Jan 2026

https://github.com/fabprezja/deep-fast-vision

A Python library for rapid prototyping of deep transfer learning vision models.

auto-ml computer-vision deep-learning machine-learning python tensorflow-keras

Last synced: 05 Apr 2025

https://github.com/roboticslab-uc3m/textiles

Research on algorithms for garment perception, manipulation...

computer-vision manipulation research robotics textile

Last synced: 19 Jan 2026

https://github.com/lingyzhu0101/low-light-video-enhancement

[IJCV'24] Temporally Consistent Enhancement of Low-light Videos via Spatial-Temporal Compatible Learning

computer-vision deep-learning low-light video-enhancement

Last synced: 12 Jul 2025

https://github.com/rafaelpadilla/tcf-lmo

TCF-LMO is a network made with dedicated modules to process videos and identify the presence of anomalies in frames. It is composed by: dissimilarity model; a differentiable morphology module; temporal consistency; and classification module.

anomaly-detection computer-vision differentiable-morphology signal-processing

Last synced: 26 Mar 2025

https://github.com/octo-technology/vio

Visual Inspection Orchestrator is a modular framework made to ease the deployment of VI usecases

ai computer-vision inspection mlops

Last synced: 07 Apr 2025

https://github.com/amrrs/tinyyolo_in_r

Image Detection in R using image.darknet and Tiny YOLO

computer-vision r tinyyolo yolov3

Last synced: 25 Feb 2026

https://github.com/amathislab/wildclip

Scene and animal attribute retrieval from camera trap data with domain-adapted vision-language models

behavior camera-trap clip computer-vision computervision visual-language-models

Last synced: 03 Feb 2026

https://github.com/benjaminjellis/repnetpytorch

PyTorch port of RepNet: Counting Out Time - Class Agnostic Video Repetition Counting in the Wild

computer-vision deep-learning neural-network python pytorch

Last synced: 19 Apr 2025

https://github.com/globalwetlands/luderick-seagrass

This dataset comprises of annotated footage of Girella tricuspidata in two estuary systems in South East Queensland, Australia. This data is suitable for a range of classification and object detection research in unconstrained underwater environments.

computer-vision dataset deep-learning environmental-monitoring object-detection underwater-images

Last synced: 21 Nov 2025

https://github.com/autodistill/autodistill-paligemma

Use PaliGemma to auto-label data for use in training fine-tuned vision models.

autodistill computer-vision fine-tuning-computer-vision paligemma zero-shot-object-detection

Last synced: 14 Apr 2025

https://github.com/fepegar/resseg-ijcars

Code, data and model for Pรฉrez-Garcรญa et al. 2021, "A self-supervised learning strategy for postoperative brain cavity segmentation simulating resections"

computer-vision convolutional-neural-networks epilepsy medical-image-computing mri segmentation tumor

Last synced: 25 Feb 2025

https://github.com/heppu/hand-recognition

Python script to recognize human hand in video feed.

computer-vision drone hand-recognition python

Last synced: 21 Apr 2025

https://github.com/maximecharriere/gxipy

Python API to use with the Daheng Imaging cameras

api camera computer-vision daheng daheng-imaging python sdk

Last synced: 13 Apr 2025

https://github.com/ilmedova/ml-fun

Machine Learning and AI projects for fun

chatbot computer-vision langchain llama llms ml opencv pandas

Last synced: 16 Jan 2026

https://github.com/iamgideonidoko/bio-attendance-sys

Biometric Attendance System built with computer vision (Python OpenCV), Flask and the MERN stack

biometric-attendance-system biometrics biometrics-authentication computer-vision fingerprint mern opencv-python

Last synced: 13 Apr 2026

https://github.com/devbruce/cmask2polygons

Convert rgb color mask to polygons per class

computer-vision contour-detection mask opencv-python polygon python

Last synced: 11 Jul 2025

https://github.com/amhsirak/human-monitoring-system

Real-time human detection, tracking and counting using MobileNet SSD

computer-vision deep-learning footfall-analysis image-processing mobilenet-ssd opencv python

Last synced: 13 Apr 2025

https://github.com/tzolov/spring-boot-tensorflow-demo

Spring Boot and Tensorflow demos for Image-Recognition, Pose-Estimation, Object-Detection, Instance-Segmentation

computer-vision image-recognition instance-segmentation object-detection pose-estimation spring-boot spring-cloud-dataflow tensorflow

Last synced: 21 Apr 2025

https://github.com/tencentarc/common_trainer

Common template for pytorch project. Easy to extent and modify for new project.

computer-vision deep-learning machine-learning pytorch

Last synced: 05 Apr 2025

https://github.com/mabdh/mot-pf

๐Ÿ”Ž Multiple Object Tracking with Particle Filter

computer-vision object-tracking

Last synced: 03 Jan 2026

https://github.com/natmlx/movenet-unity

Realtime pose detection in Unity Engine with NatML.

computer-vision machine-learning pose-estimation pose-tracking unity3d

Last synced: 10 Apr 2025

https://github.com/shubhamai/seismic-facies-identification

This repo is based on the Shubhamai solutions of AICrowd Facies Identification Challenge in which we need to create a ML/DL Model to do pixel classification by taking 3D images.

computer-vision deep-learning

Last synced: 13 May 2025

https://github.com/dendenxu/panorama-stitching-101

PyTorch feature detection -> feature matching -> multi-image homography -> panorama stitching algorithm

computer-vision graph-shortest-path homography multi-image-homography panorama-stitching

Last synced: 09 Jul 2025

https://github.com/ajaysub110/a-neural-compositional-paradigm-for-image-captioning

Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S.Fidler, D. Lin

computer-vision deep-learning image-captioning natural-language-processing paper-implementations

Last synced: 01 Aug 2025

https://github.com/slashtechno/wyzely-detect

Recognize faces/objects in a video stream (from a webcam or a security camera) and send notifications to your devices

computer-vision hacktoberfest object-detection opencv wyze wyzecam

Last synced: 18 Mar 2025

https://github.com/roboflow/magic-scissors

Synthetic data for object detection and segmentation

augmentation computer-vision roboflow synthetic-data

Last synced: 24 Jun 2025

https://github.com/cpbridge/rifeatures

A small C++ library for efficient calculation of rotation invariant features in 2D images using OpenCV.

2d-images computer-vision image-processing machine-learning openmp orientation rotation-invariant-features

Last synced: 07 May 2025

https://github.com/wx-chevalier/3d-aigc-examples

ๅŸบไบŽ YOLOV5 ็š„ 3D ๆ‰“ๅฐไปถ่‡ชๅŠจๅˆ†ๆ‹ฃ๏ผŒๅŒ…ๅซ้›ถไปถๆ‹“ๆ‰‘็‰นๅพๅˆ†ๆž

computer-vision pytorc tensor-bo wx-ai-kits yolo yolov5

Last synced: 17 Jun 2025

https://github.com/Pushkar1853/Cover-Generator

Application of OpenAI tools such as Whisper, DALL-E, and ChatGPT to generate album covers from audio

chat-gpt computer-vision dall-e music nlp openai stable-diffusion whisper-ai

Last synced: 17 Mar 2025

https://github.com/jasmcaus/canaro

A Python library including support for Deep Learning models built using the Keras framework

computer-vision convolutional-neural-networks deep-learning keras keras-tensorflow matplotlib opencv tensorflow

Last synced: 13 Apr 2025

https://github.com/michaeltroger/feature-matching-native-android

Augmented Reality Template Matching (Feature Matching) using OpenCV 4 for >= Android 5 using the NDK and an async approach (Coroutines)

android augmented-reality augmented-reality-applications computer-vision cpp feature-matching ndk ndk-sample opencv template-matching

Last synced: 03 Mar 2026

https://github.com/ashok-arjun/cscct

Official Implementation of the ECCV 2022 Paper "Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer"

class-incremental-learning computer-vision continual-learning deep-learning eccv eccv2022 incremental-learning pytorch

Last synced: 28 Jul 2025

https://github.com/nyandwi/deep_learning_with_tensorflow

Deep Learning with TensorFlow for basic neural networks tasks, computer vision and natural language processing.

computer-vision deep-learning machine-learning nlp tensorflow

Last synced: 12 Jun 2025

https://github.com/rese1f/uniap

[AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning

2d-pose-estimation animal classification computer-vision semantic-segmentation

Last synced: 10 Jul 2025

https://github.com/mendez-luisjose/car-detection-and-speed-calculation-with-yolov8-sort-and-opencv

Car Detection and Speed Calculation with YOLOv8, Sort and OpenCV

computer-vision opencv sort yolov8

Last synced: 13 Oct 2025

https://github.com/andreaconti/depth-on-demand

ECCV24 Official Code for Depth on Demand: Streaming Dense Depth from a Low Frame-Rate Active Sensor

computer-vision depth-completion depth-estimation lidar multi-view multi-view-stereo pattern-recognition tof

Last synced: 29 Jun 2025

https://github.com/ata-turhan/mnist-digit-classification-pytorch

A complete solution for the MNIST handwritten digit classification challenge using PyTorch, including data exploration, model training, and Kaggle submission generation.

computer-vision deep-learning image-classification python pytorch

Last synced: 24 Oct 2025

https://github.com/shengyuh/3d-potion

We reproduce 2D PoTion and extend this to 3D PoTion for action recognition task.

3d-vision action-recogntion computer-vision

Last synced: 10 Apr 2025

https://github.com/kimianoorbakhsh/LPF-Defense

Code and Data for the paper "LPF-Defense: 3D Adversarial Defense based on Frequency Analysis", PLoS ONE

3d-attack adversarial-attacks computer-vision deep-learning dgcnn frequency-analysis point-cloud pointnet pointnet2 spherical-harmonics

Last synced: 21 Nov 2025

https://github.com/vlm-run/vlmrun-python-sdk

Official Python SDK for VLM Run

computer-vision genai llm ocr python vlm

Last synced: 22 Jan 2026

https://github.com/bhimrazy/diabetic-retinopathy-detection

Diabetic Retinopathy Detection: Utilizing Multiprocessing for Processing Large Datasets and Transfer Learning to Fine-Tune Deep Learning Models | PyTorch

computer-vision deep-learning diabetic-retinopathy-detection eyepacs healthcare

Last synced: 19 Apr 2025

https://github.com/krshrimali/panorama-image-stitching-using-opencv

Program to generate Panorama of multiple images. Image Stitching, Panorama.

computer-vision image image-stitching machine-learning opencv panorama stitching

Last synced: 25 Jun 2025

https://github.com/fpt-thaituan/detect-yoga-poses-and-correction-in-real-time-using-machine-learning-algorithms

Compare SVM mode yoga movement classification accuracy with Linear kernel, Polynomial kernel, RBF (Radial Basis Function) kernel, LSTM with accuracy up to 98%. In addition, it also supports adjusting the practitioner's movements according to standard movements.

blazepose computer-vision deep-learning long-short-term-memory lsmt machine-learning pose-estimation support-vector-machines svc-model yoga-classfication

Last synced: 22 Aug 2025

https://github.com/rykovv/pills_counter

Machine Learning and Computer Vision at the Edge for pills counting using ESP32.

computer-vision edge-computing esp32 machine-learning random-forest svm

Last synced: 03 Jun 2026

https://github.com/kuohaozeng/visual_reaction

Visual Reaction: Learning to Play Catch with Your Drone

ai2-thor computer-vision drone forecasting reinforcement-learning visual-reaction

Last synced: 14 Oct 2025

https://github.com/fdesjardins/surface-estimation-and-image-relighting

Illumination registration, surface estimation, and physically-based image relighting

computer-vision machine-learning opengl ray-tracing rendering svm-classifier

Last synced: 03 Apr 2025

https://github.com/cilabuniba/i-dream-my-painting

[WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting

computer-vision diffusion inpainting llava mllm multimodal prompt-generation stable-diffusion text-to-image

Last synced: 06 Jul 2025

https://github.com/lddl/goparking

This project is targeted to detect which parking lot (actually any user defined polygon) are occupied by any object.

computer-vision gocv opencv parking-lot vehicle-detection

Last synced: 29 Jul 2025

https://github.com/mfaizanse/kinectfusion

Re-implementation of the paper KinectFusion: Real-Time Dense Surface Mapping and Tracking

3d-reconstruction computer-vision

Last synced: 14 Apr 2025

https://github.com/rizwanmunawar/visionusecases

Explore a wide range of computer vision projects and documentation covering everything from object detection, image segmentation, and tracking to pose estimation, object counting, and automated annotation. These resources highlight real-world AI applications built with modern models like Ultralytics YOLO, Meta SAM 2, and other vlms

artificial-intelligence computer-vision deep-learning instance-segmentation object-detection object-tracking pose-estimation real-time-inference ultralytics

Last synced: 23 Jan 2026

https://github.com/sachinl0har/vision-python

Python Detection Programs using Mediapipe

computer-vision mediapipe python vision

Last synced: 28 Jul 2025

https://github.com/amitkumarj441/identify_pneumonia

Mask-RCNN based model to automatically identify potential pneumonia cases

computer-vision mask-rcnn pneumonia-detection pytorch resnet tensorflow

Last synced: 13 Aug 2025

https://github.com/ddayto21/self-driving-car-

This Python repository implements lane detection techniques using OpenCV which is the open-source library used for computer vision, machine learning and image processing.

computer-vision opencv python self-driving-car

Last synced: 13 Oct 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-Skeleton-based-Action-Recognition 104 Awesome-3D-Object-Detection 169 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97