An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/ayushexel/trolo

An SDK for Transformers + YOLO and other SSD family models

computer-vision object-detection opencv segmentation transformers yolo

Last synced: 06 Oct 2025

https://github.com/fregu856/ebms_3dod

Official implementation of "Accurate 3D Object Detection using Energy-Based Models", CVPR Workshops 2021.

3d-object-detection computer-vision deep-learning energy-based-model machine-learning object-detection pytorch

Last synced: 20 Mar 2025

https://github.com/ahmedfgad/cifar10cnnflask

Building an HTTP-accessed convolutional neural network model using TensorFlow NN (tf.nn), the CIFAR10 dataset, Python and Flask.

cnn computer-vision convnet convolutional-neural-network css dropout flask fully-connected-network html http image-classification image-processing javascript max-pooling python relu tensorflow web-application

Last synced: 23 Apr 2025

https://github.com/2b-t/stereo-matching

Stereo-image depth reconstruction with different matching costs and matching algorithms in Python using NumPy and Numba

computer-vision docker jupyter-notebook ncc normalized-cross-correlation numba numpy python sad scipy semi-global-matching sgm ssd sum-of-squares winner-take-all wta

Last synced: 12 Apr 2025

https://github.com/tsarjak/simulate-correct-colorblindness

This script simulates how people with colorblindness would naturally perceive an image. You can also vary the degree of colorblindness being simulated. The script can also be used to correct images, making it easier for people with colorblindness to differentiate certain colors.

colorblind colorblind-people colorblindness colorblindness-simulator colour-blind-people colours computer-vision computervision converts correct-colorblindness corrector daltonization-algorithm hsv-shifting-algorithm image image-processing simulate social-cause vision

Last synced: 21 Mar 2025

https://github.com/breathingcyborg/mediapipe-face-effects

Realtime Face Effects in Browser using mediapipe and three js.

computer-vision mediapipe mediapipe-facemesh realtime try-on

Last synced: 03 Mar 2026

https://github.com/bagaturchess/chessboardscanner

Java-based chess board scanner that converts a 2D chess board image into a machine-readable format, namely Forsyth–Edwards Notation (FEN). It uses the OpenCV and Deeplearning4j frameworks, complemented by some proprietary algorithms. It currently supports the board and piece sets of the most common online chess platforms, chess.com and lichess.org.

chess chess-board chessboard chessboard-detection computer-vision deeplearning deeplearning4j image-recognition imagerecognition java opencv

Last synced: 10 Sep 2025

https://github.com/themattinthehatt/behavenet

Toolbox for analyzing behavioral videos and neural activity

behavioral-videos computer-vision neuroscience-methods

Last synced: 28 Jul 2025

https://github.com/cxhernandez/cellcount

A Convolutional Neural Network for Segmenting and Counting Cells in Microscopy Images

biology computer-vision python pytorch segmentation

Last synced: 21 Mar 2025

https://github.com/yochengliu/mlic-kd-wsd

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection (ACM MM 2018)

computer-vision deep-learning knowledge-distillation multi-label-classification weakly-supervised-detection

Last synced: 12 Jun 2025

https://github.com/dsgiitr/reading-group

Discussions on papers, frameworks, blogs and ideas every Saturday.

computer-vision deep-learning discussions machine-learning neural-networks nlp papers

Last synced: 15 Apr 2025

https://github.com/nikolasent/vehicle-detection-and-tracking

Udacity Self-Driving Car Engineer Nanodegree. Project: Vehicle Detection and Tracking

carnd classifier computer-vision opencv self-driving-car svm

Last synced: 30 Apr 2025

https://github.com/sayakpaul/swin-transformers-tf

Implementation of Swin Transformers in TensorFlow along with converted pre-trained models, code for off-the-shelf classification and fine-tuning.

attention computer-vision hierarchies image-recognition relative-attention swin-transformers tensorflow

Last synced: 30 Apr 2025

https://github.com/hxndev/virtual-mouse-using-opencv

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

ai-mouse autopy code computer-vision cv2 handtracking handtrackingmodule mediapipe opencv opencv-python virtual-mouse virtual-mouse-using-hand-gesture webcam

Last synced: 15 Jun 2025

https://github.com/cheese-roll/light-anime-face-detector

A fast and lightweight anime face detector based on LFFD.

anime computer-vision face-detection lffd mxnet

Last synced: 20 Nov 2025

https://github.com/KushajveerSingh/resize_network_cv

PyTorch implementation of the paper "Learning to Resize Images for Computer Vision Tasks" on Imagenette and Imagewoof datasets

computer-vision pytorch pytorch-implementation pytorch-lightning

Last synced: 08 May 2025

https://github.com/kelvins/lbph

Local Binary Patterns Histograms (LBPH) implementation in Go

computer-vision face-recognition face-recognizer image-processing lbph local-binary-patterns texture-classification

Last synced: 08 May 2025

https://github.com/ucdvision/gen2seg

Code for "gen2seg: Generative Models Enable Generalizable Instance Segmentation"

computer-vision generative-ai machine-learning segmentation self-supervised-learning stable-diffusion

Last synced: 23 Aug 2025

https://github.com/bailool/placerecognition-loopdetection

Light-weight place recognition and loop detection using road markings

computer-vision cpp11 loop-detection matlab place-recognition road-markings

Last synced: 26 Jun 2025

https://github.com/balavenkatesh3322/face_unlock

Lock and unlock your Ubuntu system using face recognition (currently Ubuntu only).

computer-vision face-recognition lockscreen numpy opencv os python3 security-automation ubuntu

Last synced: 22 Apr 2025

https://github.com/Masao-Taketani/FOTS_OCR

TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.

computer-vision deep-learning image-recognition ocr scene-text-recognition tensorflow

Last synced: 02 Apr 2025

https://github.com/real-stanford/umpnet

[RA-L / ICRA 2022] UMPNet: Universal Manipulation Policy Network for Articulated Objects

computer-vision robotics simulation

Last synced: 05 May 2025

https://github.com/groundlight/python-sdk

Python SDK for accessing Groundlight API

artificial-intelligence computer-vision machine-learning

Last synced: 16 Jan 2026

https://github.com/harboryuan/polyphonicformer

[ECCV 2022] 🎵PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation

computer-vision deep-learning

Last synced: 25 Oct 2025

https://github.com/eric-canas/homography.js

Lightweight, High-Performance and easy-to-use library for performing Affine, Projective or Piecewise Affine transformations over any Image or HTMLElement from only a set of reference points. In Javascript.

2d affine-transformations computer-vision easy-to-use fast homography image-manipulation javascript node-js perspective-transform piecewise-affine projective-transformations warp

Last synced: 10 Apr 2025

https://github.com/Charmve/Semantic-Segmentation-PyTorch

PyTorch implementation for semantic segmentation, including FCN, U-Net, SegNet, GCN, PSPNet, Deeplabv3, Deeplabv3+, Mask R-CNN, DUC, GoogleNet, and more datasets

charmve computer-vision deep-learning fcn image-classification image-processing mask-rcnn pytorch pytorch-implementation segnet semantic-segmentation unet-pytorch

Last synced: 13 Feb 2026

https://github.com/calciferzh/minimal-body

A very simple baseline to estimate 2D & 3D SMPL-compatible keypoints from a single color image.

3d-pose-estimation computer-vision deep-learning

Last synced: 21 Apr 2025

https://github.com/pushkalkatara/darknet_ros

Robot Operating System (ROS) package for YOLO v3 based on darknet, with optimized tracking using a Kalman filter and optical flow.

alexeyab-darknet computer-vision darknet darknet-ros deep-learning kalman-filter object-detection ros yolo

Last synced: 15 Oct 2025

https://github.com/agentmaker/webai.js

A simple Web AI model deployment tool using JavaScript based on OpenCV.js and ONNXRuntime

ai classification computer-vision detection javascript js node objectdetection onnx onnxruntime onnxruntime-web opencv segmentation web

Last synced: 30 Aug 2025

https://github.com/ibaiGorordo/ONNX-CenterSnap-6D-Pose-and-Shape-Estimation

Python scripts for performing 6D pose estimation and shape reconstruction using the CenterSnap model in ONNX

3d-object-detection 3d-object-reconstruction 6dof-pose azure-kinect computer-vision depthai kinect oak-d onnx onnxruntime onnxruntime-gpu opencv python

Last synced: 20 Mar 2025

https://github.com/asahi417/wikiart-image-dataset

We release WikiART Crawler, a Python library to download and process images from WikiART via the WikiART API, and two image datasets: `WikiART Face` and `WikiART General`.

art computer-vision dataset generative-model

Last synced: 19 Jul 2025

https://github.com/o-tawab/Video-Pixel-Networks

Video Pixel Networks in TensorFlow

computer-vision video-pixel-network vpn

Last synced: 11 Mar 2025

https://github.com/adeelh/pytorch-fpn

PyTorch implementations of some FPN-based semantic segmentation architectures: vanilla FPN, Panoptic FPN, PANet FPN; with ResNet and EfficientNet backbones.

computer-vision deep-learning deeplearning efficientnet feature-pyramid-network fpn implementation-of-research-paper machine-learning neural-network pytorch pytorch-fpn pytorch-implementation resnet semantic-segmentation semantic-segmentation-architectures

Last synced: 14 Apr 2025

https://github.com/gbourniq/bank-statement-analysis

Flask application generating interactive visualisations from bank statements PDF documents

bank-statement-documents computer-vision docker flask-application machine-learning powerbi sql-database web-application

Last synced: 05 Mar 2025

https://github.com/isarandi/metro-pose3d

Metric-Scale Truncation-Robust Heatmaps for 3D Human Pose Estimation

3d-human-pose computer-vision deep-learning human-pose-estimation python tensorflow

Last synced: 14 Apr 2025

https://github.com/alsora/ros2-tensorflow

ROS2 nodes for computer vision tasks in TensorFlow

computer-vision image-classification image-detection ros2 ros2-tensorflow tensorflow

Last synced: 06 Apr 2026

https://github.com/inkbytefo/ScreenMonitorMCP

A REVOLUTIONARY Model Context Protocol (MCP) server! Gives AI real-time vision capabilities and enhanced UI intelligence power. This isn't just screen capture - it gives AI the power to truly "see" and understand your digital world!

ai artificial-intelligence automation computer-vision mcp model-context-protocol openai predictive-ai python real-time revolutionary screen-monitoring ui-automation

Last synced: 27 Sep 2025

https://github.com/universaldatatool/coronavirus-mask-image-dataset

Image dataset from Instagram of people wearing medical masks, no mask, or a non-medical (DIY) mask

computer-vision coronavirus covid-19 covid19 dataset fastai images machine-learning

Last synced: 19 Apr 2025

https://github.com/wkentaro/chainer-mask-rcnn

Chainer Implementation of Mask R-CNN. (Training code to reproduce the original result is available.)

chainer coco computer-vision deep-learning detectron iccv-2017 instance-segmentation mask-rcnn object-detection

Last synced: 14 Jan 2026

https://github.com/stephanecharette/darkplate

License plate parsing using Darknet and YOLO

alpr anpr computer-vision darkhelp darknet neural-network opencv vlpr yolo

Last synced: 04 Oct 2025

https://github.com/nikolasent/advanced-lane-lines

Udacity Self-Driving Car Engineer Nanodegree. Project: Advanced Lane Finding

carnd computer-vision opencv self-driving-car

Last synced: 12 Mar 2026

https://github.com/hemansnation/ai-ml-14-hours-free-course

7 Day AI ML Fundamentals Workshop. The purpose of this FREE workshop is: 1. To give you a boost in getting started with AI. 2. To build a life-long community with a similar mindset. 3. To give you such a strong grip on the fundamentals that the advanced concepts will be easy to understand.

computer-vision datastructures-algorithms deep-learning generative-ai machine-learning mlops nlp numpy pandas python system-design

Last synced: 03 Sep 2025

https://github.com/prs-eth/scene-recognition-in-3d

[IROS 2020] Indoor Scene Recognition in 3D

3dvision computer-vision scene-recognition sparse-convolution

Last synced: 24 Aug 2025

https://github.com/eora-ai/torchok

Production-oriented Computer Vision models training pipeline for common tasks: classification, segmentation, detection and representation🥤

computer-vision deep-learning image-classification image-retrieval image-segmentation representation-learning

Last synced: 08 May 2025

https://github.com/fidler-lab/efficient-annotation-cookbook

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

annotations computer-vision cvpr2021 simulated-workers

Last synced: 14 Apr 2025

https://github.com/ibaigorordo/onnx-yolo-world-open-vocabulary-object-detection

Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.

computer-vision object-detection onnx onnxruntime open-vocabulary-detection python

Last synced: 19 Aug 2025

https://github.com/kdmayer/3D-PV-Locator

Dockerized Repo for "3D-PV-Locator: Large-scale detection of rooftop-mounted photovoltaic systems in 3D" based on Applied Energy publication.

ai climate-change computer-vision deep-learning deeplabv3 deepsolar inception-v3 network-planning neurips-2020 pv-systems remote-sensing renewable-energy satellite-imagery solar solar-panels

Last synced: 07 May 2025

https://github.com/achillesrasquinha/spockpy

✊ ✋ ✌️ ☝️ 🖖 A Python hand gesture recognition library for Kinetic User Interface (KUI).

computer-vision gesture gesture-recognition-toolkit gesture-recognizer hand-gestures opencv python

Last synced: 03 Mar 2026

https://github.com/nianticlabs/airplanes

[CVPR 2024] AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings

computer-vision cvpr2024 machine-learning plane-detection

Last synced: 14 Sep 2025

https://github.com/wkentaro/reorientbot

ReorientBot: Learning Object Reorientation for Specific-Posed Placement, ICRA 2022

artificial-intelligence computer-vision deep-learning machine-learning robotics ros

Last synced: 15 Apr 2025

https://github.com/axiscommunications/acap-computer-vision-sdk-examples

Example applications that provide developers with the tools and knowledge to use the Axis Camera Application Platform (ACAP) Computer Vision solution

acap analytics axis camera computer-vision edge python video

Last synced: 25 Jun 2025

https://github.com/wkentaro/safepicking

SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022

artificial-intelligence computer-vision deep-learning machine-learning reinforcement-learning robotics ros

Last synced: 15 Apr 2025

https://github.com/bwconrad/soft-moe

PyTorch implementation of "From Sparse to Soft Mixtures of Experts"

computer-vision machine-learning mixture-of-experts pytorch transformer vision-transformer

Last synced: 11 Apr 2025

https://github.com/cggos/ptam_cg

Modified version of the monocular SLAM framework PTAM

artificial-intelligence computer-vision ptam slam

Last synced: 21 Mar 2025

https://github.com/davidstutz/seeds-revised

Implementation of the superpixel algorithm called SEEDS [1].

computer-vision image-processing opencv superpixel-algorithm superpixels

Last synced: 23 Oct 2025

https://github.com/pixano/pixano

Data-centric AI building blocks for computer vision applications

computer-vision data-annotation data-visualization deep-learning machine-learning python

Last synced: 27 Aug 2025

https://github.com/erfaniaa/persian-ocr

Optical character recognition of Farsi and Arabic letters (تشخیص حروف فارسی و عربی)

computer-vision convolutional-neural-networks deep-learning delphi farsi image-processing neural-networks ocr optical-character-recognition persian persian-ocr

Last synced: 05 Jan 2026

https://github.com/iago-suarez/fsg

📏 🚀 FSG: A statistical approach to line detection via fast segments grouping

computer-vision image-processing line-detection line-segment-detector vanishing-points

Last synced: 08 May 2025

https://github.com/swarm-lab/ropencvlite

Utility package to install OpenCV in R

computer-vision opencv r

Last synced: 05 Mar 2026

https://github.com/sunsided/viola-jones-adaboost

Training a face detection cascade using Adaptive Boosting after Viola and Jones.

adaboost computer-vision face-detection haar-cascade image-processing jupyter-notebook machine-learning python viola-jones

Last synced: 18 Mar 2025

https://github.com/ybabakhin/zindi_wheat_growth

1st place solution for CGIAR Wheat Growth Stage Challenge

computer-vision deep-learning image-classification pytorch pytorch-lightning resnet

Last synced: 02 Sep 2025

https://github.com/rohit901/cooperative-foundational-models

[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

computer-vision deep-learning novel-objects object-detection open-set-object-detection open-vocabulary-detection pytorch zero-shot-object-detection

Last synced: 24 Jul 2025

https://github.com/sayakpaul/knowledge-distillation-in-keras

Demonstrates knowledge distillation for image-based models in Keras.

computer-vision keras knowledge-distillation model-optimization tensorflow

Last synced: 11 Jul 2025

https://github.com/soumik12345/kinect-vision

A computer-vision-based gesture detection system that automatically detects the number of fingers shown as a hand gesture and lets you control simple button-pressing games using your hand gestures.

computer-vision gaming gesture-recognition opencv opencv3-python python-3 sign-language-recognition-system

Last synced: 07 May 2025

https://github.com/alladinian/Visionaire

Streamlined, ergonomic APIs around Apple's Vision framework

computer-vision ios macos swift vision

Last synced: 22 Jul 2025

https://github.com/aw-junaid/computer-science

Explore a collection of resources and projects in Computer Science, covering algorithms, data structures, programming languages, and emerging technologies. Ideal for learners and enthusiasts looking to enhance their knowledge and skills in the field

algorithms assembly-language automata computer-architecture computer-networks computer-science computer-vision cpp cybersecurity data-science data-science-projects data-structures database game-development machine-learning networking operating-system python

Last synced: 26 Mar 2025

https://github.com/sap-samples/btp-ai-sustainability-bootcamp

This GitHub repository contains the sample code and exercises of btp-ai-sustainability-bootcamp, which showcases how to build Intelligence and Sustainability into Your Solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.

computer-vision condition-monitoring deep-learning defect-detection image-segmentation machine-learning predictive-maintenance sac-planning sample sample-code sap-ai-core sap-ai-launchpad sap-analytics-cloud sound-classification sustainability

Last synced: 01 Mar 2026

https://github.com/dsgiitr/adversarial_lab

Web-based Tool for visualisation and generation of adversarial examples by attacking ImageNet Models like VGG, AlexNet, ResNet etc.

adversarial-attacks computer-vision flask html-css-javascript imagenet machine-learning python pytorch visualization

Last synced: 15 Apr 2025

https://github.com/wkentaro/osam

Get up and running with SAM, EfficientSAM, YOLO-World, and other promptable vision models locally.

computer-vision deep-learning foundation-models onnx segment-anything

Last synced: 16 Jan 2026

https://github.com/csiro-robotics/TCE

This repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).

action-recognition computer-vision contrastive-learning contrastive-loss deep-learning embeddings hmdb51 kinetics-datasets metric-learning pytorch representation-learning self-supervised-learning tsne-visualisations ucf-101

Last synced: 26 Apr 2025

https://github.com/princeton-vl/packit

Code for reproducing results in ICML 2020 paper "PackIt: A Virtual Environment for Geometric Planning"

computer-vision icml-2020 open-ai-gym unity-ml-agents virtual-environments

Last synced: 08 May 2025

https://github.com/utiasstars/cat-net

Canonical Appearance Transformations

cat-net computer-vision deep-learning robotics

Last synced: 28 Jun 2025

https://github.com/bakercp/ofxdlib

An openFrameworks wrapper for dlib. http://dlib.net/

computer-vision deep-learning dlib dnn machine-learning openframeworks

Last synced: 18 Mar 2025

https://github.com/sayakpaul/ml-bootcamp-launchpad

Contains notebooks prepared for ML Bootcamp organized by Google Developers Launchpad.

ai-platform computer-vision deep-learning google-cloud-platform tensorflow2

Last synced: 30 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707
awesome-multimodal-ml 480
awesome-self-supervised-learning 463
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-industrial-anomaly-detection 1,102
awesome-hand-pose-estimation 497
awesome-image-classification 220
Awesome-Crowd-Counting 468
awesome-human-pose-estimation 89
awesome-autonomous-vehicles 296
Awesome-World-Model 725
Awesome-Federated-Learning 558
Awesome-FL 4,069
awesome-low-light-image-enhancement 219
Awesome-pytorch-list-CNVersion 692
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-Implicit-NeRF-Robotics 191
iOS_ML 39
awesome-tensorflow-lite 110
CV-pretrained-model 103
awesome-attention-mechanism-in-cv 195
Awesome-Image-Colorization 150
awesome-grounding 157
openstl 43
Awesome-Open-Vocabulary 162
awesome-ai-awesomeness 236
awesome-capsule-networks 67
awesome-autonomous-vehicle 181
awesome-6d-object 600
awesome-multi-task-learning 233
awesome-robotics-3d 111
awesome-photogrammetry 90
awesome-open-data-centric-ai 56
awesome-ai-data-guided-projects 56
Awesome-Skeleton-based-Action-Recognition 104
Awesome-3D-Object-Detection 169
awesome-optical-flow 110
awesome-holistic-3d 129
awesome-data-annotation 93
Awesome-Parameter-Efficient-Transfer-Learning 124
awesome-panoptic-segmentation 45
awesome-computer-vision-models 189
awesome-state-of-depth-completion 71
awesome-robotics-datasets 79
awesome-nerf-editing 537
arctic 34
awesome-image-alignment-and-stitching 108
Awesome-Distributed-Deep-Learning 44
Awesome-Monocular-3D-detection 97