An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/mfaizanse/kinectfusion

Re-implementation of the paper KinectFusion: Real-Time Dense Surface Mapping and Tracking

3d-reconstruction computer-vision

Last synced: 14 Apr 2025

https://github.com/jmhessel/catrank

Pretrained models for the ranking task described in Cats and Captions vs. Creators and the Clock (WWW 2017)

computer-vision machine-learning pretrained-models research

Last synced: 22 Jul 2025

https://github.com/maxinexiong/imageai-flask-apps

This project offers two Flask applications that utilize ImageAI's image prediction algorithms and object detection models. These apps enable users to upload images and videos for object recognition, detection and analysis, providing accurate prediction results, confidence scores, raw data of detected objects at frame-level, and object insights.

ai-project computer-vision css3 data-visualization flask flask-application html5 image-recognition imageai matplotlib-pyplot moviepy pandas pandas-dataframe python video-object-detection

Last synced: 11 Apr 2025

https://github.com/react-declarative/react-face-kyc

Quite useful react snippet for automatic photo capture with check of correct face position. It uses Haar-cascade in WASM (performant and can be trained instead of programmed)

computer-vision haar-cascade kyc liveness-detection mui opencv react reactive rxjs typescript wasm

Last synced: 02 Mar 2026

https://github.com/intel-retail/automated-vending

Deploy sensor fusion technology for an automated checkout that enables real-time insight about the products consumers are buying using the EdgeX Foundry* extensible framework.

computer-vision deep-learning edge edge-ai edge-computing image-recognition inference intel live-demo machine-learning object-detection openvino pretrained-models real-time reference-implementation video

Last synced: 18 Aug 2025

https://github.com/smrfeld/dash-annotate-cv

Dash components for computer vision annotation tasks

annotations computer-vision dash data-labeling machine-learning plotly plotly-dash

Last synced: 07 Apr 2025

https://github.com/chinefed/convolutional-set-transformer

Official implementation of the Convolutional Set Transformer (Chinello & Boracchi, 2025). This repository includes the source code of the cstmodels Python package, which provides reusable Keras 3 layers for constructing CST architectures, together with an interface to load and use the CST-15 model pre-trained on ImageNet.

anomaly-detection computer-vision convolutional-set-transformer deep-learning explainability image-classification set-learning transfer-learning

Last synced: 14 Jan 2026

https://github.com/ddayto21/self-driving-car-

This Python repository implements lane detection techniques using OpenCV which is the open-source library used for computer vision, machine learning and image processing.

computer-vision opencv python self-driving-car

Last synced: 13 Oct 2025

https://github.com/lynnporu/rukopys-dataset

Dataset of Ukrainian handwritted letters with plenty of variations

ai computer-vision cyrillic cyrillic-characters dataset handwriting-recognition ukrainian

Last synced: 05 Feb 2026

https://github.com/cpvrlab/slproject4

SLProject is a platform independent 3D computer graphics scene graph library.

c-plus-plus computer-graphics computer-vision framework opengl realtime rendering

Last synced: 23 Apr 2025

https://github.com/rizwanmunawar/visionusecases

Explore a wide range of computer vision projects and documentation covering everything from object detection, image segmentation, and tracking to pose estimation, object counting, and automated annotation. These resources highlight real-world AI applications built with modern models like Ultralytics YOLO, Meta SAM 2, and other vlms

artificial-intelligence computer-vision deep-learning instance-segmentation object-detection object-tracking pose-estimation real-time-inference ultralytics

Last synced: 23 Jan 2026

https://github.com/nandometzger/mlfocallengths

Estimating the Focal Length from Monocular Images

cnn computer-vision deep-learning

Last synced: 09 Jul 2025

https://github.com/cilabuniba/i-dream-my-painting

[WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting

computer-vision diffusion inpainting llava mllm multimodal prompt-generation stable-diffusion text-to-image

Last synced: 06 Jul 2025

https://github.com/hartikainen/stanford-cs231n

Stanford CS231n: Convolutional Neural Networks for Visual Recognition

computer-vision cs231n deep-learning neural-networks stanford-cv

Last synced: 25 Mar 2025

https://github.com/ronlek/fastv2c-handnet

Repository for the implementation of "FastV2C-HandNet: Fast Voxel to Coordinate Hand Pose Estimation with 3D Convolutional Neural Networks"

3d-convolutional-network 3d-hand-pose 3d-pose-estimation computer-vision deep-learning depth-images fastv2c-handnet hand-pose-estimation

Last synced: 10 Apr 2025

https://github.com/krshrimali/panorama-image-stitching-using-opencv

Program to generate Panorama of multiple images. Image Stitching, Panorama.

computer-vision image image-stitching machine-learning opencv panorama stitching

Last synced: 25 Jun 2025

https://github.com/anuj0456/kaggle_competition_solutions

This repository compiles solutions from past Kaggle competitions, shared by winners and top performers in the discussion forums. Finding specific competition solutions can be time-consuming, so I've centralized them here for easy reference.

computer-vision kaggle kaggle-competition kaggle-solution machine-learning nlp time-series top-solution

Last synced: 15 Apr 2026

https://github.com/souvikmajumder26/any-face-clustering

📸 Face Clustering Engine developed using OpenCV & DBSCAN, deployed as a Streamlit Web App to deliver uploaded images grouped according to the individual unique faces in them.

any-face-clustering clustering computer-vision dbscan dbscan-clustering deep-learning dlib face-clustering face-detection face-recognition google-colab machine-learning opencv python python3 streamlit streamlit-webapp unlabeled-data unsupervised-learning webapp

Last synced: 10 Jul 2025

https://github.com/fpt-thaituan/detect-yoga-poses-and-correction-in-real-time-using-machine-learning-algorithms

Compare SVM mode yoga movement classification accuracy with Linear kernel, Polynomial kernel, RBF (Radial Basis Function) kernel, LSTM with accuracy up to 98%. In addition, it also supports adjusting the practitioner's movements according to standard movements.

blazepose computer-vision deep-learning long-short-term-memory lsmt machine-learning pose-estimation support-vector-machines svc-model yoga-classfication

Last synced: 22 Aug 2025

https://github.com/ekote/ai-on-microsoft-azure

Microsoft buduje i tworzy Polską Dolinę Cyfrową. W ramach tej inicjatywy podjęliśmy się wyzwania zbudowania chmurowych kompetencji wśród 150tys osób w Polsce. Jednym z elementów tej inicjatywy jest dedykowany kurs na studiach inzynierskich i magisterskich na Politechnice Warszawskiej poświęcony chmurze obliczeniowej oraz sztucznej inteligencji.

artificial-intelligence azure azure-cognitive-services azure-functions azure-machine azure-machine-learning cloud cloudcomputing cognitive-services computer-vision machine-learning nlp

Last synced: 29 Oct 2025

https://github.com/bwconrad/video-content-search

Search the content of a video with a text or image query

computer-vision deep-learning video-search

Last synced: 20 Jun 2025

https://github.com/bluebrain/atlas-alignment

Blue Brain multi-modal registration and alignment toolbox

computer-vision deep-learning image-registration machine-learning

Last synced: 16 Jul 2025

https://github.com/mrtrkmn/product-search-in-supermarket-shelves

Project: Product Search in Supermarket Shelves | | COMP431:Computer Vision Project

computer-vision feature-extraction image-processing jupyter-notebook object-detection opencv-python python

Last synced: 12 Apr 2025

https://github.com/anhtuan284/rebuild-zone

OLP 2024 - Open-Source Software

computer-vision icpc low-code open-source vfossa

Last synced: 12 Apr 2025

https://github.com/madhawav/playwithautoencoders

Image Compression on COCO Dataset using Convolution AutoEncoders

autoencoder computer-graphics computer-vision convolutional-autoencoder image-compression pytorch

Last synced: 18 Mar 2025

https://github.com/pmixer/otb.plugin

Bridging python and matlab version OTB

benchmark computer-vision tracking

Last synced: 25 Feb 2025

https://github.com/modanesh/differential_ig

Source code for the differential saliency method used in "Re-understanding Finite-State Representations of Recurrent Policy Networks"

computer-vision explainable-ai pytorch reinforcement-learning saliency

Last synced: 10 Apr 2025

https://github.com/njroussel/roadsegmentation

Segmenting satellite images of earth : determining which parts are roads.

computer-vision convolutional-neural-networks jupyter-notebook python road-segmentation

Last synced: 25 Jun 2025

https://github.com/lynkos/algae-detection

Detect and identify different species of harmful algae within natural water in real-time with AI and a camera (i.e., ESP32-CAM, smartphone, or webcam).

ai arduino artificial-intelligence c cnn computer-vision cpp deep-learning esp32 espressif html iot machine-learning neural-network opencv opencv-python python tinyml ultralytics yolov8

Last synced: 13 Apr 2025

https://github.com/snehil-shah/multimodal-image-search-engine

Text to Image & Reverse Image Search Engine built upon Vector Similarity Search utilizing CLIP VL-Transformer for Semantic Embeddings & Qdrant as the Vector-Store

computer-vision multimodal-transformer nlp openai-clip qdrant qdrant-vector-database vector-embeddings

Last synced: 31 Oct 2025

https://github.com/qa276390/SearchTrack

[BMVC 2022] SearchTrack: Multiple Object Tracking with Object-Customized Search and Motion-Aware Features

bmvc2022 computer-vision deep-learning kitti-dataset mots multiple-object-tracking segementation tracking

Last synced: 20 Mar 2025

https://github.com/vra/easybox

☐ :chart_with_upwards_trend: ☐ A simple, out-of-the-box and cross-platform bbox annotation tool by Python. Try it by `pip install easybox`

computer-vision deep-learning gui linux object-detection python tkinter

Last synced: 12 Apr 2025

https://github.com/sardhendu/cifar10-object-recognition

{Python}: Mix-Match of several image Processing and Machine Learning Techniques for object recognition

classification computer-vision deep-learning deep-neural-networks machine-learning-algorithms python3 tensorflow

Last synced: 12 Apr 2025

https://github.com/IBM/vision-terraform

IBM Maximo Visual Inspection sample terraform templates for deployment in IBM Cloud (formerly IBM PowerAI Vision and IBM Visual Insights)

computer-vision deep-learning ibm-cloud terraform-template vpc

Last synced: 13 May 2025

https://github.com/manujosephv/covid-xray-imagenet

Imagenet Pretraining for Covid-19 Xray Identification

computer-vision covid-19 xray-detection

Last synced: 13 Apr 2025

https://github.com/aleksandergrzyb/mostitch

Program that automatically stitches set of retinal angiography eye images.

computer-vision cpp image-processing opencv stitched-images stitching

Last synced: 13 Apr 2025

https://github.com/nathancooperjones/super-mario-64-ds-wanted-autoplay

A computer vision model designed to autonomously play the "Wanted!" minigame from Super Mario 64 DS

computer-vision game mario nintendo-ds object-detection

Last synced: 11 Apr 2025

https://github.com/rishit-dagli/cv-with-tf-demos

My talk about Computer Vision with TensorFlow

cnn cnn-classification cnn-keras computer-vision deep-learning tensorflow

Last synced: 07 May 2025

https://github.com/anishlearnstocode/image2sketch

Converts a 2D Color Image 🖼 into a Hand drawn Sketch ✏ Using Novel Technique.

computer-vision cv digital-image-processing dip image-processing

Last synced: 22 Jul 2025

https://github.com/rodneyosodo/computer-vision-ai-saturdays

This repository contains classwork and homework for JKUAT SES Project under computer vision. Good luck in finding new bugs 😀.

computer-vision deep-learning hacktoberfest image-processing opencv

Last synced: 19 Mar 2025

https://github.com/abhroroy365/tennis-tracker

This project utilizes OpenCV, YOLO and CNN to track the position, movement of players in a video. YOLOv8 is used to track the players. YOLOv5 is used to track the position of tennis ball at every frame of the video. ResNet34 is fine-tuned to detect the court keypoints.

computer-vision deep-learning opencv pytorch tennis-game yolov5

Last synced: 14 Apr 2025

https://github.com/radiantearth/ml4gd

Machine Learning for Global Development Working Group

computer-vision machine-learning satellite-imagery

Last synced: 05 Mar 2025

https://github.com/cameron-d/openpose-ptz-control

Follow a person with a PTZ camera using openpose for people recognition

birddog computer-vision livestream nvidia-cuda production ptz-control visca

Last synced: 14 Jun 2025

https://github.com/sayakpaul/emotion-detection-using-deep-learning

This project demonstrates the use of Deep Learning to detect emotion (sad, angry, happy etc) from the images of faces.

computer-vision deep-learning tensorflow

Last synced: 07 May 2025

https://github.com/symisc/pixlab-php

Official PHP Client for the PixLab Machine Vision API

api client computer-vision machine-learning php-library pixlab

Last synced: 16 Jun 2025

https://github.com/aayusharora/aftershoot

An AI to filter technically bad photos #AndroidDevChallenge

androiddevchallenge automl computer-vision photographers

Last synced: 22 Jun 2025

https://github.com/pankajarm/fastai2-vision-app

Starter Pack for creating fast responsive Web App for FastAI Version 2 Vision models

computer-vision deep-learning fastai fastai2 machine-learning

Last synced: 06 May 2025

https://github.com/andreabac3/study-transfer-learning-covid-19

This repository contains the official code of the research paper Study on transfer learning capabilities for pneumonia classification in chest-x-rays images pubblished at the Computer Methods and Programs in Biomedicine Journal.

computer-vision covid cv explainability explainable-ai pytorch pytorch-lightning torch

Last synced: 09 Nov 2025

https://github.com/khuangaf/youtube-8m-2018

The 19th Place Solution to The 2nd YouTube-8M Video Understanding Challenge

computer-vision deep-learning machine-learning netvlad tensorflow

Last synced: 30 Apr 2025

https://github.com/mmgalushka/hungarian-loss

Computes loss between two sets of entities using the optimal assignment based on the Hungarian algorithm.

computer-vision deep-learning object-classification object-detection object-segmentation python tensorflow

Last synced: 23 Mar 2025

https://github.com/idinsight/mosaiks

Parallelized encoding of satellite imagery into easy-to-use features using the MOSAIKS algorithm.

computer-vision feature-engineering satellite-imagery

Last synced: 14 Jan 2026

https://github.com/wkentaro/chainer-cyclegan

Chainer implementation of "Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Network".

chainer computer-graphics computer-vision cyclegan deep-learning gan generative-adversarial-network image-generation image-manipulation pix2pix

Last synced: 15 Apr 2025

https://github.com/owen-liuyuxuan/kitti360_visualize

Visualize KITTI360 sequences on ROS with full tf support.

autonomous-driving computer-vision dataset kitti-360

Last synced: 22 Apr 2025

https://github.com/symisc/pixlab-php-nsfw

PHP Class to classify NSFW content via the PixLab Machine Vision APIs

computer-vision machine-learning machine-learning-api nsfw nsfw-recognition php rest-api

Last synced: 30 Apr 2025

https://github.com/neurodata/value-of-ood-data

The value of out-of-distribution data (ICML 2023)

computer-vision machine-learning ood-generalization statistical-learning

Last synced: 25 Apr 2025

https://github.com/elhamkhan859/cvf-scrapper-public

The CVF Open Access Downloader is a Python application designed to automate the bulk downloading of open-access papers from Computer Vision Foundation (CVF) conferences, including WACV, ICCV, and CVPR.

bulkdownload computer-vision computer-vision-tools cvf cvf-conference paper-download pdf-downloader

Last synced: 23 Apr 2025

https://github.com/sergio11/safedrive_drowsiness_detection

🚗 Driver Drowsiness Detection with Deep Learning (PoC) 🧠 A personal experiment comparing CNN + MobileNetV2 and YOLOv11 for real-time drowsiness detection. Focused on evaluating accuracy and efficiency to explore AI’s potential in preventing fatigue-related accidents. 😴⚡ A hands-on dive into safety-driven computer vision.

autonomous-driving computer-vision convolutional-neural-networks deep-learning driver-drowsiness-detection image-classification mobile-ai real-time-detection tensorflow tensorflow-lite transfer-learning yolo yolov11

Last synced: 04 Oct 2025

https://github.com/neurobooth/neurobooth-os

Python package for digital phenotyping data synchronization and acquisition

computer-vision neurology neuroscience python wearables

Last synced: 09 Apr 2026

https://github.com/natmlx/meet-unity

MediaPipe selfie segmentation in Unity Engine.

computer-vision matting mediapipe meet natml selfie-segmentation unity

Last synced: 27 Jul 2025

https://github.com/kalfasyan/home_surveillance_with_python

Motion detection using OpenCV (Raspberry Pi compatible), alerting through pushbullet, served with flask.

camera computer-vision flask-stream motion-detection pushbullet-api pushbullet-notifications python3 raspberry-pi-camera security-system surveillance-systems

Last synced: 18 Jul 2025

https://github.com/groundlight/stream

Groundlight Stream Processor - Container for analyzing video using RTSP etc

artificial-intelligence computer-vision machine-learning

Last synced: 13 Mar 2026

https://github.com/ivandonadello/semantic-pascal-part

This is a curated semantic version of the PASCAL-Part dataset for part-based object detection. Objects are aligned with WordNet and Yago concepts. The dataset is both in PASCAL-Voc and RDF format.

computer-vision dataset knowledge-graph object-detection ontologies ontology owl owl-ontology part-whole partof-ontology pascal-part pascal-voc rdf rdf-format rdflib semantic-web wordnet yago

Last synced: 22 Aug 2025

https://github.com/isaaccorley/making-convolutional-networks-shift-invariant-again-tensorflow

Tensorflow Implementation of BlurPool the Antialiasing Pooling operation from "Making Convolutional Networks Shift-Invariant Again"

antialiasing artificial-intelligence computer-vision keras tensorflow

Last synced: 08 Oct 2025

https://github.com/motapinto/computer-vision-structured-light

Project developed for the Computer Vision course unit in FEUP

computer-vision edge-detection jupyter-notebook opencv python structured-light

Last synced: 08 Oct 2025

https://github.com/tensorlayer/tlxcv

A Platform-agnostic Computer Vision Application Library

computer-vision mindspore paddlepaddle pytorch tensorflow tensorlayer tensorlayerx

Last synced: 24 Apr 2025

https://github.com/koldim2001/jupiterlabcv

Готовый сервис JupiterLab для разработок в сфере компьютерного зрения

computer-vision docker docker-compose gpu-computing jupyter-lab jupyter-notebook jupyterlab

Last synced: 03 Sep 2025

https://github.com/charleslf2/visionner

Visionner turn raw image data into numpy array, more suitable for deep learning task

computer-vision dataset image machine-learning python python3

Last synced: 29 Jun 2025

https://github.com/matthewspear/stoptouchingyourface

🚨🤦🏻‍♂️🦠– macOS app to help you stop touching your face when remote working or watching Netflix!

computer-vision coreml coronavirus covid-19 machine-learning macos segmentation swift

Last synced: 21 Apr 2025

https://github.com/fisal77/ArtaEye

ArtaEye is web/mobile camera eye tracking to make beautiful drawing and artwork based on eye movements. In this project. We target people with disabilities to create an artwork. We try to translate their feelings through their eyes into digital drawings.

ai computer-vision eye-detection eye-tracking eye-tracking-mouse ml opencv webcam webcam-eyetracking

Last synced: 29 Apr 2025

https://github.com/colasgael/cs231n-assignments-spring19

My personal solutions to the CS231n assignments (Spring 2019). CS231n: "CNN" is a Computer Vision class taught at Stanford.

cnn computer-vision deep-learning deep-learning-tutorial pytorch

Last synced: 14 Apr 2025

https://github.com/neelanjan00/image-forgery-detection

A deep convolutional neural network for the detection as well as localization of the area of manipulation in forged images, bearing forgeries of simple as well as complex nature. Further along, the trained model is interfaced with a web application for users to interact with the model in a simple and effective manner, and finally, we also develop a chatbot for further easing the process of interaction with the model and effectively tackling the problem of fake news forwards in popular internet messaging platforms such as WhatsApp.

chatbot computer-vision deep-learning flask forgery-detection tensorflow

Last synced: 11 Oct 2025

https://github.com/davidstutz/vlfeat-slic-example

Example of using VLFeat's SLIC implementation from C++.

computer-vision image-processing opencv slic superpixel-algorithm superpixels vlfeat

Last synced: 23 Apr 2025

https://github.com/scivision/lineclipping-python-fortran

Line clipping in clean, simple, modern Fortran and Python

coarray-fortran cohen-sutherland computer-vision line-clipping

Last synced: 05 Apr 2025

https://github.com/sovit-123/german-traffic-sign-recognition-with-deep-learning

Recognizing traffic signs with deep learning and PyTorch using Spatial Transformer Convolutional Neural Networks.

computer-vision convolutional-neural-networks deep-learning image-recognition neural-networks python pytorch spatial-transformer-networks

Last synced: 18 Oct 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-Skeleton-based-Action-Recognition 104 Awesome-3D-Object-Detection 169 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-state-of-depth-completion 71 awesome-robotics-datasets 79 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97