An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/iboing/CliqueNet

Convolutional Neural Networks with Alternately Updated Clique (to appear in CVPR 2018)

computer-vision deep-learning image-recognition

Last synced: 20 Mar 2025

https://github.com/clovaai/assembled-cnn

Tensorflow implementation of "Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network"

computer-vision convolutional-neural-networks deep-learning food-101 image-classification imagenet inference-throughput mce robustness tensorflow transfer-learning

Last synced: 01 Aug 2025

https://github.com/frgfm/Holocron

PyTorch implementations of recent Computer Vision tricks (ReXNet, RepVGG, Unet3p, YOLOv4, CIoU loss, AdaBelief, PolyLoss, MobileOne). Other additions: AdEMAMix

ademamix computer-vision cspdarknet53 darknet deep-learning object-detection pytorch resnet rexnet tridentnet unet-image-segmentation yolo yolov4

Last synced: 05 Apr 2025

https://github.com/brainhubeu/react-native-opencv-tutorial

๐Ÿ‘ฉโ€๐ŸซFully working example of the OpenCV library used together with React Native

camera computer-vision java javascript jest objective-c objective-c-plus-plus opencv react react-native react-native-camera reactnative

Last synced: 05 Apr 2025

https://github.com/IBM/powerai-counting-cars

Run a Jupyter Notebook to detect, track, and count cars in a video using Maximo Visual Insights (formerly PowerAI Vision) and OpenCV

computer-vision ibmcode jupyter-notebook mp4-video opencv powerai powerai-vision video

Last synced: 10 Apr 2025

https://github.com/gabrielarchanjo/marvinj

Javascript Image Processing Framework based on Marvin Framework

computer-vision image-processing javascript javascript-library

Last synced: 10 Jul 2025

https://github.com/vzhou842/cnn-from-scratch

A Convolutional Neural Network implemented from scratch (using only numpy) in Python.

computer-vision convolutional-neural-networks guide machine-learning neural-network python

Last synced: 09 Sep 2025

https://github.com/ishandutta0098/mukh

A comprehensive face analysis library that provides unified APIs for various face-related tasks

ai computer-vision deepfake-detection face-analysis face-detection face-reenactment facial-landmarks machine-learning opencv pip python torch

Last synced: 09 Oct 2025

https://github.com/khanhnamle1994/fashion-recommendation

A clothing retrieval and visual recommendation model for fashion images.

computer-vision deepfashion fashion-recommendation floydhub numpy pandas python3 resnet tensorflow

Last synced: 09 Apr 2025

https://github.com/z-x-yang/cfbi

The official implementation of CFBI(+): Collaborative Video Object Segmentation by (Multi-scale) Foreground-Background Integration.

computer-vision pytorch-implementation video-object-segmentation

Last synced: 25 Aug 2025

https://github.com/Sierkinhane/mtcnn-pytorch

A face detection algorithm

computer-vision facedetector pytorch

Last synced: 20 Mar 2025

https://github.com/google-research/rigl

End-to-end training of sparse deep neural networks with little-to-no performance loss.

computer-vision machine-learning neural-networks sparse-training

Last synced: 06 Apr 2025

https://github.com/sierkinhane/mtcnn-pytorch

A face detection algorithm

computer-vision facedetector pytorch

Last synced: 07 Apr 2025

https://github.com/rizwanmunawar/yolov8-object-tracking

YOLOv8 Object Tracking Using PyTorch, OpenCV and Ultralytics

computer-vision deep-learning object-detection object-tracking yolov8

Last synced: 16 May 2025

https://github.com/alexandre01/UltimateLabeling

A multi-purpose Video Labeling GUI in Python with integrated SOTA detector and tracker

computer-vision labeling labeling-tool object-detection object-tracking python pytorch video

Last synced: 23 Apr 2025

https://github.com/pylabel-project/pylabel

Python library for computer vision labeling tasks. The core functionality is to translate bounding box annotations between different formats-for example, from coco to yolo.

annotation-tool bounding-boxes coco computer-vision dataset object-detection yolov5

Last synced: 05 Apr 2025

https://github.com/LMD0311/PointMamba

PointMamba: A Simple State Space Model for Point Cloud Analysis

3d-point-clouds computer-vision mamba state-space-model

Last synced: 20 Mar 2025

https://github.com/BMW-InnovationLab/BMW-Labeltool-Lite

This repository provides you with an easy-to-use labeling tool for State-of-the-art Deep Learning training purposes. It supports Auto-Labeling.

annotaion auto-label autolabeling bounding-box boundingbox computer-vision deep-learning docker image-annotation inference label labeling-tool labeltool neural-network object-detection smart-labeling synthetic-data tensorflow voc yolov4

Last synced: 07 May 2025

https://github.com/yqyao/FCOS_PLUS

Some improvements (center sample) about FCOS (FCOS: Fully Convolutional One-Stage Object Detection).

anchor-free computer-vision fcos object-detection one-stage pytorch

Last synced: 13 Jul 2025

https://github.com/unity-technologies/robotics-object-pose-estimation

A complete end-to-end demonstration in which we collect training data in Unity and use that data to train a deep neural network to predict the pose of a cube. This model is then deployed in a simulated robotic pick-and-place task.

autonomy computer-vision deep-learning machine-learning manipulation model-training motion-planning perception physics-simulation pose-estimation robotics robotics-simulation ros simulation synthetic-data trajectory-generation tutorial unity ur3-robot-arm urdf

Last synced: 06 Apr 2025

https://github.com/andyzeng/arc-robot-vision

MIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.

3d amazon-robotics-challenge artificial-intelligence computer-vision deep-learning grasping manipulation mit-princeton rgbd vision

Last synced: 09 Apr 2025

https://github.com/0ssamaak0/dlta-ai

Data Labeling, Tracking and Annotation with AI

annotation computer-vision datasets-preparation pytorch

Last synced: 02 May 2026

https://github.com/linkedai/flip

Synthetic Image generation with Flip. Generate thousands of new 2D images from a small batch of objects and backgrounds.

computer-vision deep-learning generated-labels hacktoberfest object-detection synthetic-data

Last synced: 02 Apr 2026

https://github.com/andyzeng/apc-vision-toolbox

MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object segmentation and 6D object pose estimation.

3d amazon-picking-challenge artificial-intelligence computer-vision deep-learning marvin mit-princeton rgbd ros segmentation vision

Last synced: 09 Apr 2025

https://github.com/bfortuner/pytorch_tiramisu

FC-DenseNet in PyTorch for Semantic Segmentation

computer-vision deep-learning neural-network pytorch semantic-segmentation

Last synced: 09 Apr 2025

https://github.com/alexeyab/yolo2_light

Light version of convolutional neural network Yolo v3 & v2 for objects detection with a minimum of dependencies (INT8-inference, BIT1-XNOR-inference)

bit1-xnor-inference computer-vision machine-learning neural-network object-detection

Last synced: 09 Apr 2025

https://github.com/AlexeyAB/yolo2_light

Light version of convolutional neural network Yolo v3 & v2 for objects detection with a minimum of dependencies (INT8-inference, BIT1-XNOR-inference)

bit1-xnor-inference computer-vision machine-learning neural-network object-detection

Last synced: 20 Apr 2025

https://github.com/maciejczyzewski/neural-chessboard

โ™” An Extremely Efficient Chess-board Detection for Non-trivial Photos โ™”

chess chess-board computer-vision machine-learning

Last synced: 25 Jul 2025

https://github.com/bamwani/car-counting-and-speed-estimation-yolo-sort-python

This project imlements the following tasks in the project: 1. Vehicle counting, 2. Lane detection. 3.Lane change detection and 4.speed estimation

car car-counting computer-vision lane-change-detection lane-detection lane-segmentation object-detection opencv python3 sort sort-tracking speed-detection speed-estimation vehicle-counting vehicle-tracking video yolo

Last synced: 21 Apr 2025

https://github.com/WisconsinAIVision/few-shot-gan-adaptation

[CVPR '21] Official repository for Few-shot Image Generation via Cross-domain Correspondence

computer-vision few-shot-learning gans

Last synced: 08 May 2025

https://github.com/wisconsinaivision/few-shot-gan-adaptation

[CVPR '21] Official repository for Few-shot Image Generation via Cross-domain Correspondence

computer-vision few-shot-learning gans

Last synced: 09 Apr 2025

https://github.com/mbeyeler/opencv-python-blueprints

M. Beyeler (2015). OpenCV with Python Blueprints: Design and develop advanced computer vision projects using OpenCV with Python, Packt Publishing Ltd., ISBN 978-178528269-0.

computer-vision image-processing machine-learning opencv python

Last synced: 02 Sep 2025

https://github.com/luxonis/depthai-ros

Official ROS Driver for DepthAI Sensors.

camera camera-driver computer-vision cv depthai robotics ros

Last synced: 21 Jun 2025

https://github.com/ZumoLabs/zpy

Synthetic data for computer vision. An open source toolkit using Blender and Python.

ai blender blender-addon computer-vision data deep-learning ml python synthetic synthetic-data

Last synced: 11 May 2025

https://github.com/zhkkke/pixelssl

A PyTorch-based Semi-Supervised Learning (SSL) Codebase for Pixel-wise (Pixel) Vision Tasks [ECCV 2020]

computer-vision pixel-wise-task pytorch-implementation semi-supervised toolbox

Last synced: 08 Apr 2025

https://github.com/ZHKKKe/PixelSSL

A PyTorch-based Semi-Supervised Learning (SSL) Codebase for Pixel-wise (Pixel) Vision Tasks [ECCV 2020]

computer-vision pixel-wise-task pytorch-implementation semi-supervised toolbox

Last synced: 02 Apr 2025

https://github.com/paddlecv-sig/paddlelabel

้ฃžๆกจๆ™บ่ƒฝๆ ‡ๆณจ๏ผŒ่ฎฉๆ ‡ๆณจๅฟซไบบไธ€ๆญฅ

annotations classification computer-vision deep-learning image-annotation instance-segmentation python semantic-segmentation

Last synced: 14 Dec 2025

https://github.com/nvidia-ai-iot/deepstream_pose_estimation

This is a DeepStream application to demonstrate a human pose estimation pipeline.

computer-vision deepstream human-pose-estimation jetson real-time tensorrt tesla

Last synced: 28 Jun 2025

https://github.com/ethereon/lycon

A minimal and fast image library for Python and C++

computer-vision cpp image-processing python

Last synced: 05 Apr 2025

https://github.com/dwctod/eccv2022-papers-with-code-demo

ๆ”ถ้›† ECCV ๆœ€ๆ–ฐ็š„ๆˆๆžœ๏ผŒๅŒ…ๆ‹ฌ่ฎบๆ–‡ใ€ไปฃ็ ๅ’Œdemo่ง†้ข‘็ญ‰๏ผŒๆฌข่ฟŽๅคงๅฎถๆŽจ่๏ผ

ai computer-vision cv dataset diffusion eccv eccv2022 face-recognition image-segmentation multimodal-deep-learning nerf objection-detection vision-transformer

Last synced: 27 Jan 2026

https://github.com/DLR-RM/SingleViewReconstruction

Official Code: 3D Scene Reconstruction from a Single Viewport

computer-vision deep-learning eccv2020 open-source paper reconstruction singleview

Last synced: 14 Mar 2025

https://github.com/VlSomers/native-opencv-android-template

A tutorial for setting up OpenCV 4.6.0 (and other 4.x.y version) for Android in Android Studio with Native Development Kit (NDK) support for C++ development.

android android-ndk android-studio computer-vision cpp java java-native-interface jni kotlin kotlin-android ndk opencv opencv-android-release opencv-android-sdk opencv4android

Last synced: 02 Apr 2025

https://github.com/hsankesara/deepresearch

This repository is the collection of research papers in Deep learning, computer vision and NLP.

computer-vision deep-learning keras machine-learning nlp nueral-networks python3 research-paper

Last synced: 09 Apr 2025

https://github.com/amanovishnu/ineuron-full-stack-data-science-assignments

this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.

computer-vision data-science datascience deep-learning exploratory-data-analysis linear-regression machine-learning natural-language-processing python recommender-system sql statistics

Last synced: 08 Apr 2025

https://github.com/postech-cvlab/fastpointtransformer

Official source code of Fast Point Transformer, CVPR 2022

3d-vision computer-vision cvpr2022 point-cloud transformer

Last synced: 09 Apr 2025

https://github.com/debkbanerji/lego-art-remix

Powerful computer vision assisted Lego mosaic creator ยท Over 1 million images created (so far!)

computer-vision custom-pictures deep-neural-network javascript lego lego-art lego-art-remix lego-mosaic onnx

Last synced: 31 Mar 2025

https://github.com/POSTECH-CVLab/FastPointTransformer

Official source code of Fast Point Transformer, CVPR 2022

3d-vision computer-vision cvpr2022 point-cloud transformer

Last synced: 20 Mar 2025

https://github.com/fregu856/3DOD_thesis

3D Object Detection for Autonomous Driving in PyTorch, trained on the KITTI dataset.

3d-deep-learning 3d-detection autonomous-driving computer-vision deep-learning machine-learning object-detection pytorch

Last synced: 19 Jul 2025

https://github.com/preddy5/Im2Vec

[CVPR 2021 Oral] Im2Vec Synthesizing Vector Graphics without Vector Supervision

computer-graphics computer-vision cvpr2021 vector-graphics

Last synced: 03 Apr 2025

https://github.com/visipedia/annotation_tools

Visipedia Annotation Tools

annotations computer-vision datasets

Last synced: 04 Apr 2025

https://github.com/RizwanMunawar/yolov8-object-tracking

YOLOv8 Object Tracking Using PyTorch, OpenCV and Ultralytics

computer-vision deep-learning object-detection object-tracking yolov8

Last synced: 17 Apr 2025

https://github.com/amanovishnu/ineuron-full-stack-data-science-assignment-collection

this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.

computer-vision data-science datascience deep-learning exploratory-data-analysis linear-regression machine-learning natural-language-processing python recommender-system sql statistics

Last synced: 28 Feb 2025

https://github.com/gszfwsb/NCFM

Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function" (NCFM) in CVPR 2025.

computer-vision data-centric-ai dataset-distillation synthetic-data

Last synced: 01 Apr 2025

https://github.com/kkanshul/finegan

FineGAN: Unsupervised Hierarchical Disentanglement for Fine-grained Object Generation and Discovery

computer-vision deep-learning disentangled-representations fine-grained gans generative-adversarial-network image-generation image-manipulation pytorch

Last synced: 08 May 2025

https://github.com/DerrickXuNu/v2x-vit

[ECCV2022] Official Implementation of paper "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer"

3d-object-detection autonomous-driving collaborative-perception computer-vision deep-learning machine-learning multi-agent-system pytorch simulation v2x vehicle-to-everything vision-transformer

Last synced: 20 Mar 2025

https://github.com/lucasjinreal/thor

thor: C++ helper library, for deep learning purpose

computer-vision deeplearning object-detection tracking

Last synced: 07 Apr 2025

https://github.com/Haiyang-W/UniTR

[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Birdโ€™s-Eye-View Representation"

3d 3d-object-detection 3d-segmentation backbone bev camera computer-vision iccv2023 lidar multi-modal multi-view point-cloud transformer unified

Last synced: 20 Mar 2025

https://github.com/nianticlabs/map-free-reloc

[ECCV 2022] Map-free Visual Relocalization: Metric Pose Relative to a Single Image

benchmark computer-vision dataset eccv2022 pose-estimation pytorch relocalization visual-relocalization

Last synced: 06 Apr 2025

https://github.com/joaolages/diffusers-interpret

Diffusers-Interpret ๐Ÿค—๐Ÿงจ๐Ÿ•ต๏ธโ€โ™€๏ธ: Model explainability for ๐Ÿค— Diffusers. Get explanations for your generated images.

computer-vision deep-learning diffusers diffusion explainable-ai image-generation interpretability model-explainability primary-attributions pytorch text2image transformers

Last synced: 05 Apr 2025

https://github.com/HusseinYoussef/Arabic-OCR

OCR system for Arabic language that converts images of typed text to machine-encoded text.

arabic character-segmentation computer-vision dataset image-processing machine-learning neural-network ocr opencv-python scikit-learn segmentation

Last synced: 02 Apr 2025

https://github.com/prasunroy/stefann

:fire: [CVPR 2020] STEFANN: Scene Text Editor using Font Adaptive Neural Network (official code).

color-transfer colornet computer-vision cvpr cvpr2020 deep-learning fannet font-generation scene-text-editor stefann

Last synced: 15 Jul 2025

https://github.com/ducha-aiki/affnet

Code and weights for local feature affine shape estimation paper "Repeatability Is Not Enough: Learning Discriminative Affine Regions via Discriminability"

affine-shape-estimator computer-vision convolutional-networks convolutional-neural-networks deep-learning hessian image-matching image-retrieval local-features pytorch

Last synced: 09 Apr 2025

https://github.com/servicenow/highres-net

Pytorch implementation of HighRes-net, a neural network for multi-frame super-resolution, trained and tested on the European Space Agencyโ€™s Kelvin competition. This is a ServiceNow Research project that was started at Element AI.

computer-vision deep-learning esa highres-net mfsr multi-frame proba-v satellite-imagery super-resolution

Last synced: 09 Oct 2025

https://github.com/natanielruiz/disrupting-deepfakes

๐Ÿ”ฅ๐Ÿ”ฅDefending Against Deepfakes Using Adversarial Attacks on Conditional Image Translation Networks

adversarial-attacks computer-vision deep-learning deepfake-detection deepfakes defending defending-deepfakes disrupting-deepfakes face-swap faceswap fake-news machine-learning

Last synced: 04 Nov 2025

https://github.com/lannguyen0910/food-recognition

๐Ÿ”๐ŸŸ๐Ÿ— Food analysis baseline with Theseus. Integrate object detection, image classification and multi-class semantic segmentation ๐Ÿž๐Ÿ–๐Ÿ•

computer-vision deep-learning efficientnet food-analysis food-classification food-detection food-segmentation image-classification object-detection pytorch semantic-segmentation unetplusplus yolov5 yolov8

Last synced: 21 Apr 2025

https://github.com/ali-design/gan_steerability

On the "steerability" of generative adversarial networks

computer-vision deep-learning gan generative-adversarial-network

Last synced: 08 May 2025

https://github.com/eric-canas/qreader

Robust and Straight-Forward solution for reading difficult and tricky QR codes within images in Python. Powered by YOLOv8

computer-vision easy-to-use image-processing object-detection pip python pytorch pyzbar qr qrcode qrcode-reader qrcode-scanner yolov8

Last synced: 15 May 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97