An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/ad-si/perspectra

Automatically extract documents from images and perspectively correct them with classic computer-vision algorithms. In maintenance mode. Check out its successor at:

computer-vision deskew document-scanner perspective-correction python scanner scikit-image skimage

Last synced: 26 Oct 2025

https://github.com/arthur-qiu/FreeTraj

Code for FreeTraj, a tuning-free method for trajectory-controllable video generation

computer-vision diffusion diffusion-models trajectory-control video-generation

Last synced: 28 Mar 2025

https://github.com/astra-vision/LMSCNet

LMSCNet: Lightweight Multiscale 3D Semantic Completion. Roldão, L., de Charette, R., & Verroust-Blondet, A. International Conference on 3D Vision (3DV). 2020

3d-reconstruction computer-vision reconstruction semantickitti

Last synced: 20 Mar 2025

https://github.com/isaaccorley/pytorch-enhance

Open-source Library of Image Super-Resolution Models, Datasets, and Metrics for Benchmarking or Pretrained Use

artificial-intelligence computer-vision deep-learning pytorch super-resolution

Last synced: 12 Apr 2025

https://github.com/cvg/glace

[CVPR 2024] GLACE: Global Local Accelerated Coordinate Encoding

computer-vision deep-learning pose-estimation scene-coordinate-regression visual-localization

Last synced: 06 Sep 2025

https://github.com/bobld/yolov4mlnet

Use the YOLO v4 and v5 (ONNX) models for object detection in C# using ML.Net

computer-vision csharp dotnet dotnet-core machine-learning ml ml-net neural-network onnx v4 v5 yolo yolov4 yolov5

Last synced: 24 Oct 2025

https://github.com/valillon/FaceMorph

Code to generate a face morphing effect between two faces.

computer-vision dlib face-morphing facial-landmarks neural-networks opencv

Last synced: 09 May 2025

https://github.com/drydart/flutter_opencv

OpenCV bindings plugin for Flutter apps [work in progress]

computer-vision dart-library flutter flutter-plugin opencv

Last synced: 23 Oct 2025

https://github.com/farukalamai/top-100-computer-vision-projects-idea-for-2024

Welcome to the "Top 100 Computer Vision Projects Idea for 2024" repository! This repository contains a curated list of computer vision project ideas that you can explore, implement, and experiment with in 2024

ai-robot computer-vision deep-learning image-classification image-processing image-recognition image-segmentation object-detection object-tracking opencv pose-estimation real-time-processing text-classification yolov8

Last synced: 03 Mar 2025

https://github.com/fixstars/cuda-efficient-features

A CUDA implementation of keypoint detection and descriptor extraction

computer-vision cuda descriptors local-features robotics slam structure-from-motion

Last synced: 13 Apr 2025

https://github.com/mantasu/face-crop-plus

Face aligner and cropper with quality enhancement and attribute parsing

alignment computer-vision face face-attributes face-detection face-parsing pytorch segmentation super-resolution

Last synced: 15 Apr 2025

https://github.com/myungsub/meta-interpolation

Source code for CVPR 2020 paper "Scene-Adaptive Video Frame Interpolation via Meta-Learning"

computer-vision cvpr2020 deep-learning frame-interpolation meta-learning pytorch slow-motion video-frame-interpolation

Last synced: 15 Mar 2026

https://github.com/nvidia-ai-iot/deepstream_libraries

DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom frameworks.

computer-vision cuda cv-cuda data-processing gpu image-processing nvidia nvimagecodec pynvvideocodec pytorch

Last synced: 01 Apr 2026

https://github.com/peterl1n/backgroundmattingv2-tensorflow

TensorFlow implementation of Real-Time High-Resolution Background Matting

computer-vision machine-learning matting

Last synced: 08 Mar 2026

https://github.com/ehsanik/dogtorch

Who Let The Dogs Out? Modeling Dog Behavior From Visual Data https://arxiv.org/pdf/1803.10827.pdf

computer-vision dog dog-movements ego-centric-videos imitation-learning modeling-dog recurrent-neural-networks representation-learning

Last synced: 14 Oct 2025

https://github.com/aimagelab/art2real

Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation. CVPR 2019

art2real computer-vision cvpr2019 deep-learning image-to-image-translation pytorch

Last synced: 11 Apr 2025

https://github.com/BobLd/YOLOv4MLNet

Use the YOLO v4 and v5 (ONNX) models for object detection in C# using ML.Net

computer-vision csharp dotnet dotnet-core machine-learning ml ml-net neural-network onnx v4 v5 yolo yolov4 yolov5

Last synced: 20 Apr 2025

https://github.com/jeffheaton/mergelife

Evolve complex cellular automata with a genetic algorithm.

cellular-automaton computer-vision genetic-algorithms

Last synced: 03 Mar 2026

https://github.com/xiaoming-zhao/pointnav-vo

[ICCV 2021] Official implementation of "The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation"

computer-vision deep-learning embodied-ai iccv2021 indoor-navigation visual-odometry

Last synced: 13 Apr 2025

https://github.com/yuhui-zh15/vlmclassifier

Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)

computer-vision machine-learning natural-language-processing

Last synced: 09 Apr 2025

https://github.com/ternaus/angiodysplasia-segmentation

Wining solution and its further development for MICCAI 2017 Endoscopic Vision Challenge Angiodysplasia Detection and Localization

computer-vision deep-learning image-segmentation medical-imaging python pytorch

Last synced: 22 Aug 2025

https://github.com/naver/dune

Code repository for "DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers"

computer-vision foundation-model image-encoder knowledge-distillation vision-transformer

Last synced: 04 Apr 2026

https://github.com/Neuraxio/New-Empty-Python-Project-Base

The Perfect Python Project Template. Bored of coding anew the same thing for your new Python projects? Here is what you need. Click below on the "use this template" green button to start using it instantly. Rename the "project" folder and all references to this folder to customize your project name.

base computer-vision library nlp project-template project-templates pypi-package python-library time-series

Last synced: 07 May 2025

https://github.com/lingdong-/poseosc

📹🤸‍♂️🤾‍♀️🤺 PoseNet + OSC: send realtime human pose estimation data to your apps

computer-vision electron machine-learning open-sound-control osc pose-estimation posenet

Last synced: 26 Oct 2025

https://github.com/anttwo/macarons

(CVPR 2023) Official code of MACARONS: Mapping And Coverage Anticipation with RGB ONline Self-supervision. Also contains an updated and improved implementation of our previous work SCONE (NeurIPS 2022), on which this work is built.

3d-reconstruction computer-vision cvpr-2023 cvpr2023 deep-learning neurips-2022 neurips2022 next-best-view pytorch

Last synced: 10 Apr 2025

https://github.com/harshcasper/brihaspati

Collection of various implementations and Codes in Machine Learning, Deep Learning and Computer Vision ✨💥

artificial-intelligence artificial-neural-networks computer-vision decision-tree-classifier deep-learning linear-regression logistic-regression machine-learning neural-network notebooks

Last synced: 21 Mar 2025

https://github.com/sinclaircoder/do-research-in-ai

A repository of useful research/skill-upgrading talks or acticles in NLP/CV/AI Area (in Chinese).

ai computer-vision nlp-machine-learning reseach

Last synced: 15 Feb 2026

https://github.com/LingDong-/PoseOSC

📹🤸‍♂️🤾‍♀️🤺 PoseNet + OSC: send realtime human pose estimation data to your apps

computer-vision electron machine-learning open-sound-control osc pose-estimation posenet

Last synced: 18 Jul 2025

https://github.com/somnusochi/vlm-autoyolo

AI Auto Annotation & YOLO Training Pipeline, End-to-end object detection auto-labeling and YOLO training platform. VLM-powered annotation with NVIDIA LocateAnything-3B, manual refinement, one-click YOLO training, video keyframe extraction, and model validation. Supports image and video.

auto-labeling computer-vision data-annotation deep-learning fastapi locate-anything machine-learning nvidia object-detection pytorch react ultralytics video-annotation vlm yolo yolo-training

Last synced: 08 Jun 2026

https://github.com/escorciav/daps

This repo allocate DAPs code of our ECCV 2016 publication

action-recognition computer-vision python python27 theano video

Last synced: 09 Apr 2025

https://github.com/ekzhang/dispict

Design a growing artistic exhibit of your own making, with semantic search powered by OpenAI CLIP

aesthetics art clip computer-vision creative graphic-design machine-learning modal museum python

Last synced: 10 Apr 2025

https://github.com/gjy3035/pcc-net

PCC Net: Perspective Crowd Counting via Spatial Convolutional Network

computer-vision crowd-analysis crowd-counting multi-task-learning

Last synced: 10 Apr 2025

https://github.com/glennglennglenn/augmented-reality

Snapchat-like augmented reality filters

augmented-reality computer-vision python

Last synced: 17 Aug 2025

https://github.com/ActiveVisionLab/gaussctrl

[ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing

3d 3d-editing computer-vision diffusion-models eccv2024 gaussian-splatting pytorch

Last synced: 07 May 2025

https://github.com/intel-iot-devkit/smart-retail-analytics

Use computer vision inference in the Intel® Distribution of OpenVINO™ toolkit to provide analytics on customer engagement, store traffic, and shelf inventory.

computer-vision deep-learning edge edge-ai edge-computing image-recognition inference intel live-demo machine-learning object-detection openvino pretrained-models real-time reference-implementation video

Last synced: 04 Apr 2025

https://github.com/StrayRobots/3d-annotation-tool

A graphical user interface to annotate point clouds and 3D data.

3d-vision bgfx computer-vision cpp detection labeling-tool lidar point-cloud

Last synced: 07 May 2025

https://github.com/bmw-innovationlab/sordi-ai-evaluation-gui

This repository allows you to evaluate a trained computer vision model and get general information and evaluation metrics with little configuration.

ai bmw computer-vision dataset deeplearning docker evaluation evaluation-framework no-code python rest-api sordi synthetic-data tensorflow

Last synced: 02 Jul 2025

https://github.com/ehsanik/dogTorch

Who Let The Dogs Out? Modeling Dog Behavior From Visual Data https://arxiv.org/pdf/1803.10827.pdf

computer-vision dog dog-movements ego-centric-videos imitation-learning modeling-dog recurrent-neural-networks representation-learning

Last synced: 18 Mar 2025

https://github.com/uppulurikalyani/ml-nexus

ML Nexus is an open-source collection of machine learning projects, covering topics like neural networks, computer vision, and NLP. Whether you're a beginner or expert, contribute, collaborate, and grow together in the world of AI. Join us to shape the future of machine learning!

ann cnn computer-vision gssoc-ext gssoc24 hacktoberfest hacktoberfest-accepted keras-neural-networks keras-tensorflow machine-learning ml-frameworks natural-language-processing neural-networks opencv python pytorch transfer-learning

Last synced: 12 Apr 2025

https://github.com/una-dinosauria/local-search-quantization

State-of-the-art method for large-scale ANN search as of Oct 2016. Presented at ECCV 16.

computer-vision cuda eccv-16 gpu julia multi-codebook quantization

Last synced: 02 Aug 2025

https://github.com/wrenfold/wrenfold

Toolkit for generating code from symbolic math expressions.

code-generation computer-vision mathematics robotics symbolic-math

Last synced: 01 Apr 2026

https://github.com/margaretmz/selfie2anime-with-tflite

How to create Selfie2Anime from tflite model to Android.

android computer-vision gans selfie2anime tensorflow-lite tflite

Last synced: 07 Apr 2025

https://github.com/princeton-vl/spatialsense

An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition

computer-vision dataset iccv-2019 spatial-relation-recognition

Last synced: 08 May 2025

https://github.com/likyoo/BAN

The pytorch implementation for "A New Learning Paradigm for Foundation Model-based Remote Sensing Change Detection"

change-detection computer-vision deep-learning remote-sensing-image

Last synced: 11 May 2025

https://github.com/IbrahimSobh/Segmentation

In this tutorial, you will perform inference across 10 well-known pre-trained semantic segmentors and fine-tune on a custom dataset. Design and train your own segmentor.

computer-vision deep-learning semantic-segmentation

Last synced: 04 May 2025

https://github.com/hasibzunair/boss-detector

Change monitor screen when your boss is coming towards you!

computer-vision face-recognition opencv python

Last synced: 09 Mar 2026

https://github.com/vinay0410/pedestrian_detection

Detects Pedestrians in images using HOG as a feature extractor and SVM for classification

computer-vision detects-pedestrians hog-features human-detection human-detection-algorithm pedestrian-detection person-detection svm

Last synced: 11 Mar 2026

https://github.com/h33p/ofps

Optical Flow Processing Stack

computer-vision motion-estimation optical-flow

Last synced: 28 Oct 2025

https://github.com/zekeriyyaa/maskrcnn-modanet-fashion-segmentation-and-classification

Using modanet fashion dataset, the clothes images were classified under 5 season (summer,winter,spring,autumn,all).

annotati classification computer-vision deep-learning fashion kera maskrcnn modanet model python pytorch season season-classification segmentation

Last synced: 30 Jun 2025

https://github.com/evgenykashin/srmnet

PyTorch implementation of "SRM : A Style-based Recalibration Module for Convolutional Neural Networks"

computer-vision deep-learning pytorch srm

Last synced: 11 Apr 2025

https://github.com/ternaus/midv-500-models

Model for document segmentation trained on the midv-500-models dataset.

computer-vision deep-learning document-scanner image-segmentation python pytorch semantic-segmentation

Last synced: 18 Sep 2025

https://github.com/bmw-innovationlab/bmw-classification-training-gui

This repository allows you to get started with training a State-of-the-art Deep Learning model with little to no configuration needed! You provide your labeled dataset and you can start the training right away. You can even test your model with our built-in Inference REST API. Training classification models with GluonCV has never been so easy.

classification computer-vision deep-learning gluoncv inference-api training

Last synced: 02 Jul 2025

https://github.com/v7labs/covid-19-xray-dataset

12000+ manually drawn pixel-level lung segmentations, with and without covid

chest-x-ray computer-vision covid-19 deep-learning medical-image-dataset pneumonia x-ray x-ray-images

Last synced: 08 Apr 2026

https://github.com/BMW-InnovationLab/BMW-IntelOpenVINO-Detection-Inference-API

This is a repository for a No-Code object detection inference API using the OpenVINO. It's supported on both Windows and Linux Operating systems.

computer-vision cpu deeplearning detection-algorithm detection-api docker inference inference-engine neural-network nocode object-detection openvino openvino-model-zoo openvino-toolkit resnet rest-api

Last synced: 04 Apr 2025

https://github.com/ncoevoet/facet

Local AI photo scoring, culling, and gallery — score, organise, and explore your library with face recognition and semantic search. No cloud, no subscriptions.

aesthetic-quality angular clip computer-vision face-recognition fastapi image-analysis image-quality photo photo-culling photo-management photo-scoring photography pytorch self-hosted selfhosted sqlite topiq

Last synced: 01 Apr 2026

https://github.com/telecombcn-dl/2017-persontyle

Applied Deep Learning Workshop London 2017

computer-vision deep-learning nlp

Last synced: 19 Jul 2025

https://github.com/jingkang50/iccv21_scood

The Official Implementation of the ICCV-2021 Paper: Semantically Coherent Out-of-Distribution Detection.

computer-vision model-robustness ood ood-detection out-of-distribution out-of-distribution-detection

Last synced: 21 Mar 2025

https://github.com/fortierq/mtgscan

Recognition of Magic cards on images. Detection with OCR.

card computer-vision deck detect magic magic-the-gathering mtg ocr recognition scan scanner screenshot vision

Last synced: 17 Jan 2026

https://github.com/joffman/ros_object_recognition

A framework for ROS-based 2D and 3D object recognition.

computer-vision cplusplus opencv pcl python robotics ros

Last synced: 19 Mar 2025

https://github.com/karisigurd4/DocumentLab

OCR using tesseract, ImageMagick, EmguCV, an advanced query language and a fluent query interface for C#

computer-vision ocr optical-character-recognition query-language text-classifier

Last synced: 14 Apr 2025

https://github.com/waiwnf/pilotguru

Gather training data for training a self-driving car with just a smartphone.

computer-vision machine-learning slam

Last synced: 04 May 2025

https://github.com/mattdeitke/cvpr2019

Displays all the 2019 CVPR Accepted Papers in a way that they are easy to parse.

computer-vision cvpr2019 imagemagick lda python web-crawler web-crawler-python

Last synced: 13 Apr 2025

https://github.com/nmhaddad/fast-track

Object tracking pipelines complete with RF-DETR, YOLOv9, YOLO-NAS, YOLOv8, and YOLOv7 detection and BYTETracker tracking

computer-vision deep-learning object-detection object-tracking onnx opencv rf-detr yolo yolo-nas yolov7 yolov8 yolov9

Last synced: 06 Apr 2025

https://github.com/manila95/BASS-Net

Band-Adaptive Spectral-Spatial Feature Learning Deep Neural Network for Hyperspectral Image Classification

bass-net computer-vision deep-neural-networks hyperspectral-images landcover-classification

Last synced: 08 May 2025

https://github.com/the-ai-summer/gans-in-computer-vision

GANs in computer vision AI Summer article series

adverserial computer-vision deep-learning gan gans learning

Last synced: 09 Jul 2025

https://github.com/yuanxiaosc/find-a-machine-learning-job

找一份机器学习工作(算法工程师),需要提纲(算法能力)挈领(编程能力),充分准备。 本人学习和在找工作期间受到了很多前辈们的帮助,目前已经找到心仪的工作,撰写此文献给那些在求职路上有梦有汗水的人们!2020秋招算法,难度剧增!没有选择,只能迎难而上。

computer-vision job-hunting machine-learning-tutorials natural-language-processing resume

Last synced: 24 Jan 2026

https://github.com/nota-netspresso/netspresso-trainer

A library for training, compressing and deploying computer vision models (including ViT) with edge devices

computer-vision fx netspresso onnx pytorch tensorrt trainer

Last synced: 15 Oct 2025

https://github.com/lessthanoptimal/pyboof

Python wrapper around the BoofCV Computer Vision Library

boofcv computer-vision micro-qr-code python qr-code stereo-camera

Last synced: 04 Apr 2025

https://github.com/lukasbommes/pv-hawk

Tool for the extraction and mapping of photovoltaics modules from IR drone videos of utility-scale PV plants (my PhD project)

computer-vision drone inspection instance-segmentation map mask-rcnn photovoltaic structure-from-motion thermography tracking

Last synced: 27 Jun 2025

https://github.com/dmitryryumin/wacv-2024-papers

WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

3d-computer-vision 3d-sensor adversarial-attacks autonomous-driving biometrics computer-vision datasets face-recognition generative-models gesture-recognition image-recognition image-understanding low-level machine-learning robotics video-recognition vision-transformer visualization wacv wacv2024

Last synced: 12 Apr 2025

https://github.com/upul/CarND-Vehicle-Detection

Vehicle Tracking and Detection Project Submitted for Udacity's CND using Traditional Computer Vision and Machine Learning Techniques.

computer-vision machine-learning object-tracking python-3 self-driving-car

Last synced: 19 Jul 2025

https://github.com/rishit-dagli/cppe-dataset

Code for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset

artificial-intelligence computer-vision cppe5 data dataset deep-learning machine-learning models object-detection pretrained-models pytorch tensorflow vision

Last synced: 15 Jun 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97