An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/intel-retail/rtsf-at-checkout-reference-design

Detect loss at self-checkout by seamlessly connecting different sensor devices, including weight scale sensors, cameras, and RFIDs.

computer-vision deep-learning edge edge-ai edge-computing image-recognition inference intel live-demo machine-learning object-detection openvino pretrained-models real-time reference-implementation video

Last synced: 18 Aug 2025

https://github.com/fabprezja/deep-fast-vision

A Python library for rapid prototyping of deep transfer learning vision models.

auto-ml computer-vision deep-learning machine-learning python tensorflow-keras

Last synced: 05 Apr 2025

https://github.com/Powercube7/YOLOv5-GUI

A simple GUI made for creating jobs in YOLOv5

computer-vision gui yolov5

Last synced: 21 Apr 2025

https://github.com/ieee8023/blindtool

BlindTool โ€“ A mobile app that gives a "sense of vision" to the blind with deep learning

blind computer-vision deep-learning

Last synced: 02 May 2025

https://github.com/jishanshaikh4/alpha-net

Alpha-Net: Architecture, Models, and Applications (https://arxiv.org/abs/2007.07221)

alpha-net computer-vision machine-learning

Last synced: 28 Jan 2026

https://github.com/azad-academy/personalized-diffusion

A Tutorial on Customized Image Generation by Fine-Tuning the Stable Diffusion Models

computer-vision deep-learning diffusion-models machine-learning

Last synced: 11 May 2025

https://github.com/yizhe-ang/AdaMatch-PyTorch

Unofficial PyTorch Implementation of AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation

computer-vision deep-learning domain-adaptation pytorch semi-supervised-learning unsupervised-learning

Last synced: 05 Apr 2025

https://github.com/matlab-deep-learning/automate-labeling-in-image-labeler-using-a-pretrained-tensorflow-object-detector

This example shows how to automate object labeling in the Image Labeler app using a TensorFlow object detector model trained in Python.

co-execution computer-vision deep-learning image-labeling matlab object-detection python

Last synced: 07 May 2025

https://github.com/meetps/ee-702

Project codes for EE702 Computer Vision.

computer-vision shape-descriptor stereo-vision

Last synced: 07 May 2025

https://github.com/realorangeone/zoloto

A fiducial marker system powered by OpenCV - Supports ArUco and April

computer-vision fiducial-markers hacktoberfest opencv python

Last synced: 19 Jun 2025

https://github.com/sethuiyer/mlhub

Machine Learning Experiments - Practical Learning

computer-vision learning machine-learning neural-network tidbits

Last synced: 30 Apr 2025

https://github.com/joshuabillson/waterbody-detection-via-deep-learning

Source code for the paper, "Water Body Extraction from Sentinel-2 Imagery with Deep Convolutional Networks and Pixelwise Category Transplantation".

computer-vision deep-learning landcover-classification machine-learning python waterbody-detection

Last synced: 08 May 2025

https://github.com/suutari/meterelf

Water meter reading with Computer Vision

computer-vision opencv python python3

Last synced: 23 Jun 2025

https://github.com/soheil-mp/histogram-of-oriented-gradients-hog

This algorithm counts occurrences of gradient orientation in localized portions of an image and visualize it in an image.

computer-vision feature-descriptors histogram-of-oriented-gradients hog localization

Last synced: 12 Apr 2025

https://github.com/folhascadentes/mamao-app

An open-source platform designed for the crowdsourcing of sign language datasets. The project aims to foster a diverse and inclusive dataset, collected from a broad user base, to train and improve AI-driven sign language translation systems

accessibility artificial-intelligence computer-vision crowdsourcing dataset inclusive-design machine-learning open-source sign-language translation

Last synced: 12 Aug 2025

https://github.com/nengelmann/meetingcam

Run your AI and CV algorithms in meetings such as Zoom, Meets or Teams! ๐Ÿš€

aritificial-intelligence computer-vision meeting-tools meetings meets online-meetings teams webcam webcam-hacking zoom

Last synced: 07 Apr 2025

https://github.com/branch-bunch/facedex

PokรฉDex for people! Gotta tag em all!

camera computer-vision ios pokedex swift tag-em

Last synced: 18 Apr 2025

https://github.com/mickymultani/GPT-4-Vision-Architecture-Scanner

A web-based tool that utilizes GPT-4's vision capabilities to analyze and describe system architecture diagrams, providing instant insights and detailed breakdowns in an interactive chat interface.

architecture-visualization computer-vision flask flask-api flask-application gpt-4 gpt-4-turbo gpt-4-vision gpt-4-vision-preview gpt-vision llm llms openai openai-chatgpt openapi

Last synced: 06 Apr 2025

https://github.com/rexledesma/haze-removal

An implementation of "Single Image Haze Removal Using Dark Channel Prior" by He et al. 2009

computer-vision haze-removal opencv-python python

Last synced: 26 Jun 2025

https://github.com/yizhe-ang/adamatch-pytorch

Unofficial PyTorch Implementation of AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation

computer-vision deep-learning domain-adaptation pytorch semi-supervised-learning unsupervised-learning

Last synced: 14 Mar 2025

https://github.com/nengelmann/MeetingCam

Run your AI and CV algorithms in meetings such as Zoom, Meets or Teams! ๐Ÿš€

aritificial-intelligence computer-vision meeting-tools meetings meets online-meetings teams webcam webcam-hacking zoom

Last synced: 14 Apr 2025

https://github.com/jamarshon/chesspdftofen

Annotates PDF with comments containing FEN of detected chessboards

chess computer-vision machine-learning python

Last synced: 12 Jul 2025

https://github.com/tencentarc/common_trainer

Common template for pytorch project. Easy to extent and modify for new project.

computer-vision deep-learning machine-learning pytorch

Last synced: 05 Apr 2025

https://github.com/shengyuh/3d-potion

We reproduce 2D PoTion and extend this to 3D PoTion for action recognition task.

3d-vision action-recogntion computer-vision

Last synced: 10 Apr 2025

https://github.com/andreaconti/depth-on-demand

ECCV24 Official Code for Depth on Demand: Streaming Dense Depth from a Low Frame-Rate Active Sensor

computer-vision depth-completion depth-estimation lidar multi-view multi-view-stereo pattern-recognition tof

Last synced: 29 Jun 2025

https://github.com/michaeltroger/feature-matching-native-android

Augmented Reality Template Matching (Feature Matching) using OpenCV 4 for >= Android 5 using the NDK and an async approach (Coroutines)

android augmented-reality augmented-reality-applications computer-vision cpp feature-matching ndk ndk-sample opencv template-matching

Last synced: 03 Mar 2026

https://github.com/autodistill/autodistill-paligemma

Use PaliGemma to auto-label data for use in training fine-tuned vision models.

autodistill computer-vision fine-tuning-computer-vision paligemma zero-shot-object-detection

Last synced: 14 Apr 2025

https://github.com/globalwetlands/luderick-seagrass

This dataset comprises of annotated footage of Girella tricuspidata in two estuary systems in South East Queensland, Australia. This data is suitable for a range of classification and object detection research in unconstrained underwater environments.

computer-vision dataset deep-learning environmental-monitoring object-detection underwater-images

Last synced: 21 Nov 2025

https://github.com/shubhamai/seismic-facies-identification

This repo is based on the Shubhamai solutions of AICrowd Facies Identification Challenge in which we need to create a ML/DL Model to do pixel classification by taking 3D images.

computer-vision deep-learning

Last synced: 13 May 2025

https://github.com/fepegar/resseg-ijcars

Code, data and model for Pรฉrez-Garcรญa et al. 2021, "A self-supervised learning strategy for postoperative brain cavity segmentation simulating resections"

computer-vision convolutional-neural-networks epilepsy medical-image-computing mri segmentation tumor

Last synced: 25 Feb 2025

https://github.com/jasmcaus/canaro

A Python library including support for Deep Learning models built using the Keras framework

computer-vision convolutional-neural-networks deep-learning keras keras-tensorflow matplotlib opencv tensorflow

Last synced: 13 Apr 2025

https://github.com/natmlx/movenet-unity

Realtime pose detection in Unity Engine with NatML.

computer-vision machine-learning pose-estimation pose-tracking unity3d

Last synced: 10 Apr 2025

https://github.com/devbruce/cmask2polygons

Convert rgb color mask to polygons per class

computer-vision contour-detection mask opencv-python polygon python

Last synced: 11 Jul 2025

https://github.com/roboflow/magic-scissors

Synthetic data for object detection and segmentation

augmentation computer-vision roboflow synthetic-data

Last synced: 24 Jun 2025

https://github.com/octo-technology/vio

Visual Inspection Orchestrator is a modular framework made to ease the deployment of VI usecases

ai computer-vision inspection mlops

Last synced: 07 Apr 2025

https://github.com/amathislab/wildclip

Scene and animal attribute retrieval from camera trap data with domain-adapted vision-language models

behavior camera-trap clip computer-vision computervision visual-language-models

Last synced: 03 Feb 2026

https://github.com/roboticslab-uc3m/textiles

Research on algorithms for garment perception, manipulation...

computer-vision manipulation research robotics textile

Last synced: 19 Jan 2026

https://github.com/ata-turhan/mnist-digit-classification-pytorch

A complete solution for the MNIST handwritten digit classification challenge using PyTorch, including data exploration, model training, and Kaggle submission generation.

computer-vision deep-learning image-classification python pytorch

Last synced: 24 Oct 2025

https://github.com/maximecharriere/gxipy

Python API to use with the Daheng Imaging cameras

api camera computer-vision daheng daheng-imaging python sdk

Last synced: 13 Apr 2025

https://github.com/iamgideonidoko/bio-attendance-sys

Biometric Attendance System built with computer vision (Python OpenCV), Flask and the MERN stack

biometric-attendance-system biometrics biometrics-authentication computer-vision fingerprint mern opencv-python

Last synced: 13 Apr 2026

https://github.com/ashok-arjun/cscct

Official Implementation of the ECCV 2022 Paper "Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer"

class-incremental-learning computer-vision continual-learning deep-learning eccv eccv2022 incremental-learning pytorch

Last synced: 28 Jul 2025

https://github.com/nyandwi/deep_learning_with_tensorflow

Deep Learning with TensorFlow for basic neural networks tasks, computer vision and natural language processing.

computer-vision deep-learning machine-learning nlp tensorflow

Last synced: 12 Jun 2025

https://github.com/benjaminjellis/repnetpytorch

PyTorch port of RepNet: Counting Out Time - Class Agnostic Video Repetition Counting in the Wild

computer-vision deep-learning neural-network python pytorch

Last synced: 19 Apr 2025

https://github.com/amrrs/tinyyolo_in_r

Image Detection in R using image.darknet and Tiny YOLO

computer-vision r tinyyolo yolov3

Last synced: 25 Feb 2026

https://github.com/ajaysub110/a-neural-compositional-paradigm-for-image-captioning

Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S.Fidler, D. Lin

computer-vision deep-learning image-captioning natural-language-processing paper-implementations

Last synced: 01 Aug 2025

https://github.com/Pushkar1853/Cover-Generator

Application of OpenAI tools such as Whisper, DALL-E, and ChatGPT to generate album covers from audio

chat-gpt computer-vision dall-e music nlp openai stable-diffusion whisper-ai

Last synced: 17 Mar 2025

https://github.com/dendenxu/panorama-stitching-101

PyTorch feature detection -> feature matching -> multi-image homography -> panorama stitching algorithm

computer-vision graph-shortest-path homography multi-image-homography panorama-stitching

Last synced: 09 Jul 2025

https://github.com/ilmedova/ml-fun

Machine Learning and AI projects for fun

chatbot computer-vision langchain llama llms ml opencv pandas

Last synced: 16 Jan 2026

https://github.com/amhsirak/human-monitoring-system

Real-time human detection, tracking and counting using MobileNet SSD

computer-vision deep-learning footfall-analysis image-processing mobilenet-ssd opencv python

Last synced: 13 Apr 2025

https://github.com/cpbridge/rifeatures

A small C++ library for efficient calculation of rotation invariant features in 2D images using OpenCV.

2d-images computer-vision image-processing machine-learning openmp orientation rotation-invariant-features

Last synced: 07 May 2025

https://github.com/slashtechno/wyzely-detect

Recognize faces/objects in a video stream (from a webcam or a security camera) and send notifications to your devices

computer-vision hacktoberfest object-detection opencv wyze wyzecam

Last synced: 18 Mar 2025

https://github.com/wx-chevalier/3d-aigc-examples

ๅŸบไบŽ YOLOV5 ็š„ 3D ๆ‰“ๅฐไปถ่‡ชๅŠจๅˆ†ๆ‹ฃ๏ผŒๅŒ…ๅซ้›ถไปถๆ‹“ๆ‰‘็‰นๅพๅˆ†ๆž

computer-vision pytorc tensor-bo wx-ai-kits yolo yolov5

Last synced: 17 Jun 2025

https://github.com/mabdh/mot-pf

๐Ÿ”Ž Multiple Object Tracking with Particle Filter

computer-vision object-tracking

Last synced: 03 Jan 2026

https://github.com/mendez-luisjose/car-detection-and-speed-calculation-with-yolov8-sort-and-opencv

Car Detection and Speed Calculation with YOLOv8, Sort and OpenCV

computer-vision opencv sort yolov8

Last synced: 13 Oct 2025

https://github.com/tzolov/spring-boot-tensorflow-demo

Spring Boot and Tensorflow demos for Image-Recognition, Pose-Estimation, Object-Detection, Instance-Segmentation

computer-vision image-recognition instance-segmentation object-detection pose-estimation spring-boot spring-cloud-dataflow tensorflow

Last synced: 21 Apr 2025

https://github.com/rafaelpadilla/tcf-lmo

TCF-LMO is a network made with dedicated modules to process videos and identify the presence of anomalies in frames. It is composed by: dissimilarity model; a differentiable morphology module; temporal consistency; and classification module.

anomaly-detection computer-vision differentiable-morphology signal-processing

Last synced: 26 Mar 2025

https://github.com/heppu/hand-recognition

Python script to recognize human hand in video feed.

computer-vision drone hand-recognition python

Last synced: 21 Apr 2025

https://github.com/lingyzhu0101/low-light-video-enhancement

[IJCV'24] Temporally Consistent Enhancement of Low-light Videos via Spatial-Temporal Compatible Learning

computer-vision deep-learning low-light video-enhancement

Last synced: 12 Jul 2025

https://github.com/kimianoorbakhsh/LPF-Defense

Code and Data for the paper "LPF-Defense: 3D Adversarial Defense based on Frequency Analysis", PLoS ONE

3d-attack adversarial-attacks computer-vision deep-learning dgcnn frequency-analysis point-cloud pointnet pointnet2 spherical-harmonics

Last synced: 21 Nov 2025

https://github.com/rese1f/uniap

[AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning

2d-pose-estimation animal classification computer-vision semantic-segmentation

Last synced: 10 Jul 2025

https://github.com/bhimrazy/diabetic-retinopathy-detection

Diabetic Retinopathy Detection: Utilizing Multiprocessing for Processing Large Datasets and Transfer Learning to Fine-Tune Deep Learning Models | PyTorch

computer-vision deep-learning diabetic-retinopathy-detection eyepacs healthcare

Last synced: 19 Apr 2025

https://github.com/vlm-run/vlmrun-python-sdk

Official Python SDK for VLM Run

computer-vision genai llm ocr python vlm

Last synced: 22 Jan 2026

https://github.com/amitkumarj441/identify_pneumonia

Mask-RCNN based model to automatically identify potential pneumonia cases

computer-vision mask-rcnn pneumonia-detection pytorch resnet tensorflow

Last synced: 13 Aug 2025

https://github.com/lynnporu/rukopys-dataset

Dataset of Ukrainian handwritted letters with plenty of variations

ai computer-vision cyrillic cyrillic-characters dataset handwriting-recognition ukrainian

Last synced: 05 Feb 2026

https://github.com/sachinl0har/vision-python

Python Detection Programs using Mediapipe

computer-vision mediapipe python vision

Last synced: 28 Jul 2025

https://github.com/HenryNdubuaku/halo

A Library That Uses Quantized Diffusion Model With Clustered Weights For Efficiently Generating More Image Datasets On-Device.

computer-vision data-augmentation diffusion-models generative-model keras machine-learning pyhton tensorflow

Last synced: 03 Aug 2025

https://github.com/ronlek/fastv2c-handnet

Repository for the implementation of "FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks"

3d-convolutional-network 3d-hand-pose 3d-pose-estimation computer-vision deep-learning depth-images fastv2c-handnet hand-pose-estimation

Last synced: 10 Apr 2025

https://github.com/cpvrlab/slproject4

SLProject is a platform independent 3D computer graphics scene graph library.

c-plus-plus computer-graphics computer-vision framework opengl realtime rendering

Last synced: 23 Apr 2025

https://github.com/cilabuniba/i-dream-my-painting

[WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting

computer-vision diffusion inpainting llava mllm multimodal prompt-generation stable-diffusion text-to-image

Last synced: 06 Jul 2025

https://github.com/ddayto21/self-driving-car-

This Python repository implements lane detection techniques using OpenCV which is the open-source library used for computer vision, machine learning and image processing.

computer-vision opencv python self-driving-car

Last synced: 13 Oct 2025

https://github.com/sarthakjshetty/fracture

Vision Based Inspection tool comprising of retrained Inception V3 network and OpenCV Filters for fracture detection. Published at A2IC 2019.

computer-vision image-processing inceptionv3 opencv python

Last synced: 31 Jul 2025

https://github.com/alexander-soare/torchvision_feature_extraction_walkthrough

Walkthrough code for YouTube video https://www.youtube.com/watch?v=QRQBTkCLpFY

computer-vision deep-neural-networks pytorch torchvision

Last synced: 08 Mar 2026

https://github.com/kuohaozeng/visual_reaction

Visual Reaction: Learning to Play Catch with Your Drone

ai2-thor computer-vision drone forecasting reinforcement-learning visual-reaction

Last synced: 14 Oct 2025

https://github.com/maxinexiong/imageai-flask-apps

This project offers two Flask applications that utilize ImageAI's image prediction algorithms and object detection models. These apps enable users to upload images and videos for object recognition, detection and analysis, providing accurate prediction results, confidence scores, raw data of detected objects at frame-level, and object insights.

ai-project computer-vision css3 data-visualization flask flask-application html5 image-recognition imageai matplotlib-pyplot moviepy pandas pandas-dataframe python video-object-detection

Last synced: 11 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-Skeleton-based-Action-Recognition 104 Awesome-3D-Object-Detection 169 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97