An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/modanesh/differential_ig

Source code for the differential saliency method used in "Re-understanding Finite-State Representations of Recurrent Policy Networks"

computer-vision explainable-ai pytorch reinforcement-learning saliency

Last synced: 10 Apr 2025

https://github.com/madhawav/playwithautoencoders

Image Compression on the COCO Dataset using Convolutional Autoencoders

autoencoder computer-graphics computer-vision convolutional-autoencoder image-compression pytorch

Last synced: 18 Mar 2025

https://github.com/ekote/ai-on-microsoft-azure

Microsoft is building the Polish Digital Valley. As part of this initiative, we have taken on the challenge of building cloud competencies among 150,000 people in Poland. One element of this initiative is a dedicated course, offered in the engineering and master's degree programs at the Warsaw University of Technology, devoted to cloud computing and artificial intelligence.

artificial-intelligence azure azure-cognitive-services azure-functions azure-machine azure-machine-learning cloud cloudcomputing cognitive-services computer-vision machine-learning nlp

Last synced: 29 Oct 2025

https://github.com/anhtuan284/rebuild-zone

OLP 2024 - Open-Source Software

computer-vision icpc low-code open-source vfossa

Last synced: 12 Apr 2025

https://github.com/souvikmajumder26/any-face-clustering

📸 Face Clustering Engine developed using OpenCV & DBSCAN, deployed as a Streamlit Web App to deliver uploaded images grouped according to the individual unique faces in them.

any-face-clustering clustering computer-vision dbscan dbscan-clustering deep-learning dlib face-clustering face-detection face-recognition google-colab machine-learning opencv python python3 streamlit streamlit-webapp unlabeled-data unsupervised-learning webapp

Last synced: 10 Jul 2025

https://github.com/bwconrad/video-content-search

Search the content of a video with a text or image query

computer-vision deep-learning video-search

Last synced: 20 Jun 2025

https://github.com/bluebrain/atlas-alignment

Blue Brain multi-modal registration and alignment toolbox

computer-vision deep-learning image-registration machine-learning

Last synced: 16 Jul 2025

https://github.com/lynkos/algae-detection

Detect and identify different species of harmful algae within natural water in real-time with AI and a camera (i.e., ESP32-CAM, smartphone, or webcam).

ai arduino artificial-intelligence c cnn computer-vision cpp deep-learning esp32 espressif html iot machine-learning neural-network opencv opencv-python python tinyml ultralytics yolov8

Last synced: 13 Apr 2025

https://github.com/LuJiangTHU/DMIL-WDDS

Deep multiple instance learning based wheat diseases diagnosis system

agriculture-research computer-vision deep-learning machine-learning

Last synced: 05 Apr 2025

https://github.com/adilshamim8/100-ai-machine-learning-deep-learnin-projects

100 AI Machine Learning Deep Learning Projects is a curated repository showcasing innovative, production-ready solutions across computer vision, NLP, and more.

ai artificial-intelligence computer-vision computer-vision-projects data-science deep-learning deep-learning-projects machine-learning machine-learning-projects nlp nlp-projects python

Last synced: 20 Apr 2026

https://github.com/njroussel/roadsegmentation

Segmenting satellite images of Earth: determining which parts are roads.

computer-vision convolutional-neural-networks jupyter-notebook python road-segmentation

Last synced: 25 Jun 2025

https://github.com/chinefed/convolutional-set-transformer

Official implementation of the Convolutional Set Transformer (Chinello & Boracchi, 2025). This repository includes the source code of the cstmodels Python package, which provides reusable Keras 3 layers for constructing CST architectures, together with an interface to load and use the CST-15 model pre-trained on ImageNet.

anomaly-detection computer-vision convolutional-set-transformer deep-learning explainability image-classification set-learning transfer-learning

Last synced: 14 Jan 2026

https://github.com/pmixer/otb.plugin

Bridging the Python and MATLAB versions of OTB

benchmark computer-vision tracking

Last synced: 25 Feb 2025

https://github.com/ronlek/fastv2c-handnet

Repository for the implementation of "FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks"

3d-convolutional-network 3d-hand-pose 3d-pose-estimation computer-vision deep-learning depth-images fastv2c-handnet hand-pose-estimation

Last synced: 10 Apr 2025

https://github.com/alexander-soare/torchvision_feature_extraction_walkthrough

Walkthrough code for YouTube video https://www.youtube.com/watch?v=QRQBTkCLpFY

computer-vision deep-neural-networks pytorch torchvision

Last synced: 08 Mar 2026

https://github.com/snehil-shah/multimodal-image-search-engine

Text to Image & Reverse Image Search Engine built upon Vector Similarity Search utilizing CLIP VL-Transformer for Semantic Embeddings & Qdrant as the Vector-Store

computer-vision multimodal-transformer nlp openai-clip qdrant qdrant-vector-database vector-embeddings

Last synced: 31 Oct 2025

https://github.com/mfaizanse/kinectfusion

Re-implementation of the paper KinectFusion: Real-Time Dense Surface Mapping and Tracking

3d-reconstruction computer-vision

Last synced: 14 Apr 2025

https://github.com/jmhessel/catrank

Pretrained models for the ranking task described in Cats and Captions vs. Creators and the Clock (WWW 2017)

computer-vision machine-learning pretrained-models research

Last synced: 22 Jul 2025

https://github.com/fpt-thaituan/detect-yoga-poses-and-correction-in-real-time-using-machine-learning-algorithms

Compares yoga movement classification accuracy across SVM models with linear, polynomial, and RBF (Radial Basis Function) kernels and an LSTM, with accuracy up to 98%. It also supports correcting the practitioner's movements against standard movements.

blazepose computer-vision deep-learning long-short-term-memory lsmt machine-learning pose-estimation support-vector-machines svc-model yoga-classfication

Last synced: 22 Aug 2025

https://github.com/hartikainen/stanford-cs231n

Stanford CS231n: Convolutional Neural Networks for Visual Recognition

computer-vision cs231n deep-learning neural-networks stanford-cv

Last synced: 25 Mar 2025

https://github.com/fdesjardins/surface-estimation-and-image-relighting

Illumination registration, surface estimation, and physically-based image relighting

computer-vision machine-learning opengl ray-tracing rendering svm-classifier

Last synced: 03 Apr 2025

https://github.com/nandometzger/mlfocallengths

Estimating the Focal Length from Monocular Images

cnn computer-vision deep-learning

Last synced: 09 Jul 2025

https://github.com/kuohaozeng/visual_reaction

Visual Reaction: Learning to Play Catch with Your Drone

ai2-thor computer-vision drone forecasting reinforcement-learning visual-reaction

Last synced: 14 Oct 2025

https://github.com/maxinexiong/imageai-flask-apps

This project offers two Flask applications that utilize ImageAI's image prediction algorithms and object detection models. These apps enable users to upload images and videos for object recognition, detection and analysis, providing accurate prediction results, confidence scores, raw data of detected objects at frame-level, and object insights.

ai-project computer-vision css3 data-visualization flask flask-application html5 image-recognition imageai matplotlib-pyplot moviepy pandas pandas-dataframe python video-object-detection

Last synced: 11 Apr 2025

https://github.com/amitkumarj441/identify_pneumonia

Mask-RCNN based model to automatically identify potential pneumonia cases

computer-vision mask-rcnn pneumonia-detection pytorch resnet tensorflow

Last synced: 13 Aug 2025

https://github.com/smrfeld/dash-annotate-cv

Dash components for computer vision annotation tasks

annotations computer-vision dash data-labeling machine-learning plotly plotly-dash

Last synced: 07 Apr 2025

https://github.com/HenryNdubuaku/halo

A library that uses a quantized diffusion model with clustered weights to efficiently generate more image datasets on-device.

computer-vision data-augmentation diffusion-models generative-model keras machine-learning pyhton tensorflow

Last synced: 03 Aug 2025

https://github.com/sarthakjshetty/fracture

Vision-based inspection tool comprising a retrained Inception V3 network and OpenCV filters for fracture detection. Published at A2IC 2019.

computer-vision image-processing inceptionv3 opencv python

Last synced: 31 Jul 2025

https://github.com/lddl/goparking

This project aims to detect which parking spaces (or any user-defined polygons) are occupied by an object.

computer-vision gocv opencv parking-lot vehicle-detection

Last synced: 29 Jul 2025

https://github.com/ddayto21/self-driving-car-

This Python repository implements lane detection techniques using OpenCV, an open-source library for computer vision, machine learning, and image processing.

computer-vision opencv python self-driving-car

Last synced: 13 Oct 2025

https://github.com/machinefi/trio-core

Real-time vision intelligence engine for Apple Silicon. YOLO counting + VLM scene understanding + auto-calibration. REST API in one command.

apple-silicon computer-vision edge-ai inference mlx object-detection people-counting real-time vlm yolo

Last synced: 03 Apr 2026

https://github.com/rizwanmunawar/visionusecases

Explore a wide range of computer vision projects and documentation covering everything from object detection, image segmentation, and tracking to pose estimation, object counting, and automated annotation. These resources highlight real-world AI applications built with modern models like Ultralytics YOLO, Meta SAM 2, and other VLMs.

artificial-intelligence computer-vision deep-learning instance-segmentation object-detection object-tracking pose-estimation real-time-inference ultralytics

Last synced: 23 Jan 2026

https://github.com/cilabuniba/i-dream-my-painting

[WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting

computer-vision diffusion inpainting llava mllm multimodal prompt-generation stable-diffusion text-to-image

Last synced: 06 Jul 2025

https://github.com/neelanjan00/image-forgery-detection

A deep convolutional neural network for the detection as well as localization of the area of manipulation in forged images, bearing forgeries of simple as well as complex nature. Further along, the trained model is interfaced with a web application for users to interact with the model in a simple and effective manner, and finally, we also develop a chatbot for further easing the process of interaction with the model and effectively tackling the problem of fake news forwards in popular internet messaging platforms such as WhatsApp.

chatbot computer-vision deep-learning flask forgery-detection tensorflow

Last synced: 11 Oct 2025

https://github.com/sovit-123/german-traffic-sign-recognition-with-deep-learning

Recognizing traffic signs with deep learning and PyTorch using Spatial Transformer Convolutional Neural Networks.

computer-vision convolutional-neural-networks deep-learning image-recognition neural-networks python pytorch spatial-transformer-networks

Last synced: 18 Oct 2025

https://github.com/owen-liuyuxuan/kitti360_visualize

Visualize KITTI360 sequences on ROS with full tf support.

autonomous-driving computer-vision dataset kitti-360

Last synced: 22 Apr 2025

https://github.com/charleslf2/visionner

Visionner turns raw image data into NumPy arrays, which are more suitable for deep learning tasks.

computer-vision dataset image machine-learning python python3

Last synced: 29 Jun 2025

https://github.com/fareedkhan-dev/create-high-quality-dataset-for-computer-vision

This project focuses on generating a diverse and realistic dataset for computer vision training using ChatGPT and a realistic vision image generation model. The process involves dynamically creating prompts, utilizing ChatGPT to generate image descriptions, and generating images based on those descriptions.

chatgpt computer-vision cv grounded-sam stable-diffusion

Last synced: 02 Aug 2025

https://github.com/radiantearth/ml4gd

Machine Learning for Global Development Working Group

computer-vision machine-learning satellite-imagery

Last synced: 05 Mar 2025

https://github.com/groundlight/stream

Groundlight Stream Processor - Container for analyzing video using RTSP etc

artificial-intelligence computer-vision machine-learning

Last synced: 13 Mar 2026

https://github.com/symisc/pixlab-php-nsfw

PHP Class to classify NSFW content via the PixLab Machine Vision APIs

computer-vision machine-learning machine-learning-api nsfw nsfw-recognition php rest-api

Last synced: 30 Apr 2025

https://github.com/colasgael/cs231n-assignments-spring19

My personal solutions to the CS231n assignments (Spring 2019). CS231n, "Convolutional Neural Networks for Visual Recognition", is a computer vision class taught at Stanford.

cnn computer-vision deep-learning deep-learning-tutorial pytorch

Last synced: 14 Apr 2025

https://github.com/kleinyuan/train-crfasrnn

Detailed guide to help you understand how to train CRF as RNN

computer-vision crf crf-as-rnn crfasrnn deep-learning deeplab machine-learning rnn segmentation

Last synced: 04 Jan 2026

https://github.com/ankitsharma-007/angular-computer-vision-azure-cognitive-services

An Optical Character Recognition (OCR) using Angular and Azure Computer Vision Cognitive Services

angular ankit-sharma article azure cognitive-services computer-vision optical-character-recognition

Last synced: 30 Jul 2025

https://github.com/natmlx/meet-unity

MediaPipe selfie segmentation in Unity Engine.

computer-vision matting mediapipe meet natml selfie-segmentation unity

Last synced: 27 Jul 2025

https://github.com/sergio11/safedrive_drowsiness_detection

🚗 Driver Drowsiness Detection with Deep Learning (PoC) 🧠 A personal experiment comparing CNN + MobileNetV2 and YOLOv11 for real-time drowsiness detection. Focused on evaluating accuracy and efficiency to explore AI’s potential in preventing fatigue-related accidents. 😴⚡ A hands-on dive into safety-driven computer vision.

autonomous-driving computer-vision convolutional-neural-networks deep-learning driver-drowsiness-detection image-classification mobile-ai real-time-detection tensorflow tensorflow-lite transfer-learning yolo yolov11

Last synced: 04 Oct 2025

https://github.com/koldim2001/jupiterlabcv

Ready-to-use JupiterLab service for computer vision development

computer-vision docker docker-compose gpu-computing jupyter-lab jupyter-notebook jupyterlab

Last synced: 03 Sep 2025

https://github.com/srohit0/food_mnist

This dataset has 10 food categories, with 5,000 images. For each class, 125 manually reviewed test images are provided as well as 375 training images. All images were rescaled to have a maximum side length of 512 pixels.

cnn cnn-classification computer-vision convolutional-neural-networks dataset mnist mnist-classification mnist-dataset mnist-image-dataset

Last synced: 15 Apr 2025

https://github.com/imperialcollegelondon/rcs-pacemakers

Example of GPU-accelerated machine learning on the RCS compute cluster using Jupyter and conda

computer-vision jupyter machine-learning python pytorch

Last synced: 07 Apr 2025

https://github.com/neurobooth/neurobooth-os

Python package for digital phenotyping data synchronization and acquisition

computer-vision neurology neuroscience python wearables

Last synced: 09 Apr 2026

https://github.com/isaaccorley/making-convolutional-networks-shift-invariant-again-tensorflow

TensorFlow implementation of BlurPool, the antialiasing pooling operation from "Making Convolutional Networks Shift-Invariant Again"

antialiasing artificial-intelligence computer-vision keras tensorflow

Last synced: 08 Oct 2025

https://github.com/bhattbhavesh91/pixellib-tutorial

Learn how to change an image background efficiently and easily in just 3-4 lines of code using Python's Pixellib library, which uses Deeplab's pre-trained deep learning model.

computer-vision deep-learning deeplab deeplabv3 instance-segmentation machine-learning maskr-cnn pixellib tensorflow video-segmentation

Last synced: 17 Apr 2025

https://github.com/motapinto/computer-vision-structured-light

Project developed for the Computer Vision course unit in FEUP

computer-vision edge-detection jupyter-notebook opencv python structured-light

Last synced: 08 Oct 2025

https://github.com/elhamkhan859/cvf-scrapper-public

The CVF Open Access Downloader is a Python application designed to automate the bulk downloading of open-access papers from Computer Vision Foundation (CVF) conferences, including WACV, ICCV, and CVPR.

bulkdownload computer-vision computer-vision-tools cvf cvf-conference paper-download pdf-downloader

Last synced: 23 Apr 2025

https://github.com/idinsight/mosaiks

Parallelized encoding of satellite imagery into easy-to-use features using the MOSAIKS algorithm.

computer-vision feature-engineering satellite-imagery

Last synced: 14 Jan 2026

https://github.com/saptakbhoumik/tinyvision

TinyVision is an evolving project focused on designing ultra-lightweight image classification models with minimal parameter counts. The goal is to explore what’s actually necessary for fundamental vision tasks by combining handcrafted feature preprocessing with highly efficient CNN architectures.

computer-vision machine-learning python python3 pytorch vision

Last synced: 16 Aug 2025

https://github.com/abhroroy365/tennis-tracker

This project uses OpenCV, YOLO, and a CNN to track the position and movement of players in a video. YOLOv8 is used to track the players. YOLOv5 is used to track the position of the tennis ball in every frame of the video. ResNet34 is fine-tuned to detect the court keypoints.

computer-vision deep-learning opencv pytorch tennis-game yolov5

Last synced: 14 Apr 2025

https://github.com/davidstutz/vlfeat-slic-example

Example of using VLFeat's SLIC implementation from C++.

computer-vision image-processing opencv slic superpixel-algorithm superpixels vlfeat

Last synced: 23 Apr 2025

https://github.com/matthewspear/stoptouchingyourface

🚨🤦🏻‍♂️🦠– macOS app to help you stop touching your face when remote working or watching Netflix!

computer-vision coreml coronavirus covid-19 machine-learning macos segmentation swift

Last synced: 21 Apr 2025

https://github.com/urholaukkarinen/template-matching

GPU-accelerated template matching for Rust

computer-vision gpu image-recognition rust template-matching

Last synced: 18 Mar 2025

https://github.com/sayakpaul/emotion-detection-using-deep-learning

This project demonstrates the use of deep learning to detect emotion (sad, angry, happy, etc.) from images of faces.

computer-vision deep-learning tensorflow

Last synced: 07 May 2025

https://github.com/IBM/vision-terraform

IBM Maximo Visual Inspection sample terraform templates for deployment in IBM Cloud (formerly IBM PowerAI Vision and IBM Visual Insights)

computer-vision deep-learning ibm-cloud terraform-template vpc

Last synced: 13 May 2025

https://github.com/kalfasyan/home_surveillance_with_python

Motion detection using OpenCV (Raspberry Pi compatible), with alerts sent through Pushbullet and video served with Flask.

camera computer-vision flask-stream motion-detection pushbullet-api pushbullet-notifications python3 raspberry-pi-camera security-system surveillance-systems

Last synced: 18 Jul 2025

https://github.com/mmgalushka/hungarian-loss

Computes loss between two sets of entities using the optimal assignment based on the Hungarian algorithm.

computer-vision deep-learning object-classification object-detection object-segmentation python tensorflow

Last synced: 23 Mar 2025

https://github.com/jalajthanaki/real_time_object_detection

This repository contains the code for real-time object detection on a video stream from a webcam. MobileNet-SSD and OpenCV are used as the baseline approach. The TensorFlow Object Detection API is used in the revised approach.

computer-vision deep-learning objectdetection opencv real-time-object-detection

Last synced: 10 Jun 2025

https://github.com/mreliptik/sudokuresolver

A computer vision Python program that solves a Sudoku puzzle captured from a camera

computer-vision image-processing python python3 sudoku sudoku-solver

Last synced: 18 Aug 2025

https://github.com/tensorlayer/tlxcv

A Platform-agnostic Computer Vision Application Library

computer-vision mindspore paddlepaddle pytorch tensorflow tensorlayer tensorlayerx

Last synced: 24 Apr 2025

https://github.com/neurodata/value-of-ood-data

The value of out-of-distribution data (ICML 2023)

computer-vision machine-learning ood-generalization statistical-learning

Last synced: 25 Apr 2025

https://github.com/anishlearnstocode/image2sketch

Converts a 2D color image 🖼 into a hand-drawn sketch ✏ using a novel technique.

computer-vision cv digital-image-processing dip image-processing

Last synced: 22 Jul 2025

https://github.com/ivandonadello/semantic-pascal-part

This is a curated semantic version of the PASCAL-Part dataset for part-based object detection. Objects are aligned with WordNet and Yago concepts. The dataset is available in both PASCAL VOC and RDF formats.

computer-vision dataset knowledge-graph object-detection ontologies ontology owl owl-ontology part-whole partof-ontology pascal-part pascal-voc rdf rdf-format rdflib semantic-web wordnet yago

Last synced: 22 Aug 2025

https://github.com/wkentaro/chainer-cyclegan

Chainer implementation of "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks".

chainer computer-graphics computer-vision cyclegan deep-learning gan generative-adversarial-network image-generation image-manipulation pix2pix

Last synced: 15 Apr 2025

https://github.com/shvetsiya/carvana

My solution and pipeline for the corresponding Kaggle competition

computer-vision deep-learning image-segmentation python

Last synced: 25 Oct 2025

https://github.com/symisc/pixlab-php

Official PHP Client for the PixLab Machine Vision API

api client computer-vision machine-learning php-library pixlab

Last synced: 16 Jun 2025

https://github.com/rishit-dagli/cv-with-tf-demos

My talk about Computer Vision with TensorFlow

cnn cnn-classification cnn-keras computer-vision deep-learning tensorflow

Last synced: 07 May 2025

https://github.com/qa276390/SearchTrack

[BMVC 2022] SearchTrack: Multiple Object Tracking with Object-Customized Search and Motion-Aware Features

bmvc2022 computer-vision deep-learning kitti-dataset mots multiple-object-tracking segementation tracking

Last synced: 20 Mar 2025

https://github.com/scivision/lineclipping-python-fortran

Line clipping in clean, simple, modern Fortran and Python

coarray-fortran cohen-sutherland computer-vision line-clipping

Last synced: 05 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707
awesome-multimodal-ml 480
awesome-self-supervised-learning 463
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-industrial-anomaly-detection 1,102
awesome-hand-pose-estimation 497
awesome-image-classification 220
Awesome-Crowd-Counting 468
awesome-human-pose-estimation 89
awesome-autonomous-vehicles 296
Awesome-World-Model 725
Awesome-Federated-Learning 558
Awesome-FL 4,069
awesome-low-light-image-enhancement 219
Awesome-pytorch-list-CNVersion 692
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-Implicit-NeRF-Robotics 191
iOS_ML 39
awesome-tensorflow-lite 110
CV-pretrained-model 103
awesome-attention-mechanism-in-cv 195
Awesome-Image-Colorization 150
awesome-grounding 157
openstl 43
Awesome-Open-Vocabulary 162
awesome-ai-awesomeness 236
awesome-capsule-networks 67
awesome-autonomous-vehicle 181
awesome-6d-object 600
awesome-multi-task-learning 233
awesome-robotics-3d 111
awesome-photogrammetry 90
awesome-open-data-centric-ai 56
awesome-ai-data-guided-projects 56
Awesome-Skeleton-based-Action-Recognition 104
Awesome-3D-Object-Detection 169
awesome-optical-flow 110
awesome-holistic-3d 129
awesome-data-annotation 93
Awesome-Parameter-Efficient-Transfer-Learning 124
awesome-panoptic-segmentation 45
awesome-computer-vision-models 189
awesome-robotics-datasets 79
awesome-state-of-depth-completion 71
awesome-nerf-editing 537
arctic 34
awesome-image-alignment-and-stitching 108
Awesome-Distributed-Deep-Learning 44
Awesome-Monocular-3D-detection 97