An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/jiachens/ModelNet40-C

Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296

benchmark computer-vision corruption-robustness data-augmentation deep-learning ml-safety point-cloud-processing pytorch regularization robustness

Last synced: 20 Mar 2025

https://github.com/vinjn/opencv-2-cookbook-src

《OpenCV 2 计算机视觉编程手册》 配套代码,支持 OpenCV 3.x / 4.x

computer-vision opencv opencv2 opencv3 opencv4

Last synced: 13 Oct 2025

https://github.com/gaiasd/DFireDataset

D-Fire: an image data set for fire and smoke detection.

computer-vision dataset fire-detection smoke-detection yolo

Last synced: 21 Apr 2025

https://github.com/akshaybhatia10/ComputerVision-Projects

Some simple computer vision implementations using OpenCV

computer-vision opencv python

Last synced: 29 Mar 2025

https://github.com/morvanzhou/tensorflow-computer-vision-tutorial

Tutorials of deep learning for computer vision.

cnn computer-vision deepdream googlenet lenet resnet

Last synced: 10 May 2025

https://github.com/Project-AgML/AgML

AgML is a centralized framework for agricultural machine learning. AgML provides access to public agricultural datasets for common agricultural deep learning tasks, with standard benchmarks and pretrained models, as well the ability to generate synthetic data and annotations.

agriculture computer-vision dataset deep-learning image-classification object-detection pytorch semantic-segmentation synthetic-data

Last synced: 07 May 2025

https://github.com/project-agml/agml

AgML is a centralized framework for agricultural machine learning. AgML provides access to public agricultural datasets for common agricultural deep learning tasks, with standard benchmarks and pretrained models, as well the ability to generate synthetic data and annotations.

agriculture computer-vision dataset deep-learning image-classification object-detection pytorch semantic-segmentation synthetic-data

Last synced: 15 May 2025

https://github.com/1adrianb/binary-human-pose-estimation

This code implements a demo of the Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources paper by Adrian Bulat and Georgios Tzimiropoulos.

binary computer-vision deep-learning human-pose-estimation network-binarization torch7

Last synced: 19 Jul 2025

https://github.com/daveredrum/scanrefer

[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

3d computer-vision dataset deep-learning eccv natural-language-processing point-cloud pytorch visual-grounding

Last synced: 09 Oct 2025

https://github.com/kwotsin/tensorflow-xception

TensorFlow implementation of the Xception Model by François Chollet

computer-vision deep-learning machine-learning tensorflow tensorflow-xception xception-model

Last synced: 24 Jul 2025

https://github.com/kingyiusuen/clip-image-search

Search images with a text or image query, using Open AI's pretrained CLIP model.

computer-vision deep-learning elasticsearch image-search image-search-engine reverse-image-search search-engine streamlit-webapp

Last synced: 20 Aug 2025

https://github.com/kwotsin/TensorFlow-Xception

TensorFlow implementation of the Xception Model by François Chollet

computer-vision deep-learning machine-learning tensorflow tensorflow-xception xception-model

Last synced: 29 Mar 2025

https://github.com/maartengr/concept

Concept Modeling: Topic Modeling on Images and Text

computer-vision image-processing nlp topic-modeling

Last synced: 16 May 2025

https://github.com/angeladai/3DMV

[ECCV'18] 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation

computer-graphics computer-vision deep-learning scene-understanding

Last synced: 13 Apr 2025

https://github.com/lijx10/DeepI2P

DeepI2P: Image-to-Point Cloud Registration via Deep Classification. CVPR 2021

3d-vision cnn computer-vision deep-learning image-processing localization neural-network point-cloud registration

Last synced: 20 Mar 2025

https://github.com/rish-16/sight

👁 Sightseer: TensorFlow library for state-of-the-art Computer Vision and Object Detection models

computer-vision object-detection state-of-the-art tensorflow

Last synced: 26 Jun 2025

https://github.com/LeapLabTHU/EfficientTrain

1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.

computer-vision deep-learning efficient-training machine-learning pytorch

Last synced: 05 Apr 2025

https://github.com/albumentations-team/autoalbument

AutoML for image augmentation. AutoAlbument uses the Faster AutoAugment algorithm to find optimal augmentation policies. Documentation - https://albumentations.ai/docs/autoalbument/

augmentation automated-machine-learning automl computer-vision deep-learning image-augmentation machine-learning pytorch

Last synced: 07 Apr 2025

https://github.com/dmarnerides/pydlt

PyTorch based Deep Learning Toolbox

computer-vision deep-learning pytorch toolbox

Last synced: 04 Jan 2026

https://github.com/imfing/sketch-to-art

🖼 Create artwork from your casual sketch with GAN and style transfer

computer-vision deep-learning gan pix2pix style-transfer

Last synced: 13 Apr 2025

https://github.com/abhiTronix/deffcode

A cross-platform High-performance FFmpeg based Real-time Video Frames Decoder in Pure Python 🎞️⚡

computer-vision cross-platform decoder decoders ffmpeg ffmpeg-decoder framework opencv-python python python3 realtime video video-processing wrapper

Last synced: 09 Jul 2025

https://github.com/junyanz/light-field-video

Light field video applications (e.g. video refocusing, focus tracking, changing aperture and view)

caffe computer-graphics computer-vision deep-learning lightfield

Last synced: 08 Jul 2025

https://github.com/extreme-assistant/deep-learning-datasets

整理分类深度学习各方向公开数据集

computer-vision deep-learning

Last synced: 24 Dec 2025

https://github.com/manila95/monolayout

MonoLayout: Amodal Scene Layout from a single image

autonomous-driving computer-vision robotic-vision scene-layout

Last synced: 20 Mar 2025

https://github.com/kasperkamperman/mobilecameratemplate

A HTML5, JS, CSS Camera interface template. Feel free to use it in your next Computer Vision or AI project.

ai camera canvas-2d-context computer-vision javascript mobile-first webrtc-demos

Last synced: 12 Jul 2025

https://github.com/kaustubh-sadekar/VirtualCam

Virtual camera is created only using opencv and numpy. It simulates a camera where we can control all its parameters, intrinsic and extrinsic to get a better understanding how each component in the camera projection matrix affects the final image of the object captured by the camera.

camera-calibration camera-control camera-distortion camera-matrix camera-parameters camera-projection-matrix camera-rotation camera-translation computer-graphics computer-vision computer-vision-funamentals extrinsic-parameters fundamentals gui image-formation rendering trackbars transformation-matrix virtual-camera

Last synced: 14 Mar 2025

https://github.com/nvidia/dl4agx

Deep Learning tools and applications for NVIDIA AGX platforms.

autonomous-driving computer-vision cuda deep-learning drive-agx embedded

Last synced: 12 Apr 2025

https://github.com/jacobgil/pytorch-zssr

PyTorch implementation of 1712.06087 "Zero-Shot" Super-Resolution using Deep Internal Learning

computer-vision deep-learning pytorch super-resolution

Last synced: 21 Aug 2025

https://github.com/ibaiGorordo/ONNX-YOLOv7-Object-Detection

Python scripts performing object detection using the YOLOv7 model in ONNX.

computer-vision deep-learning object-detection onnx opencv python yolo yolov7

Last synced: 21 Apr 2025

https://github.com/automaticdai/rpi-object-detection

Real-time object detection and tracking with Raspberry Pi and OpenCV!

computer-vision object-detection opencv raspberry-pi

Last synced: 05 Apr 2025

https://github.com/rucaibox/negative-sampling-paper

This repository collects 100 papers related to negative sampling methods.

computer-vision contrastive-learning negative-sampling paper papers recommender-system

Last synced: 14 Feb 2026

https://github.com/junyanz/facedemo

A simple 3D face alignment and warping demo.

3d-face computer-vision face-swap

Last synced: 07 Sep 2025

https://github.com/seeed-projects/tutorial-of-ai-kit-with-raspberry-pi-from-zero-to-hero

This repository provides a comprehensive step-by-step guide to building AI projects using the Raspberry Pi AI Kit.

clip computer-vision hailo8 instance-segmentation object-detection ollama pose-estimation raspberry-pi

Last synced: 04 Apr 2025

https://github.com/huybery/visualizingcnn

:see_no_evil:A PyTorch implementation of the paper "Visualizing and Understanding Convolutional Networks." (ECCV 2014)

computer-vision convolutional-neural-networks deeplearning

Last synced: 22 Aug 2025

https://github.com/abhitronix/deffcode

A cross-platform High-performance FFmpeg based Real-time Video Frames Decoder in Pure Python 🎞️⚡

computer-vision cross-platform decoder decoders ffmpeg ffmpeg-decoder framework opencv-python python python3 realtime video video-processing wrapper

Last synced: 05 Apr 2025

https://github.com/krshrimali/no-reference-image-quality-assessment-using-brisque-model

Implementation of the paper "No Reference Image Quality Assessment in the Spatial Domain" by A Mittal et al. in OpenCV (using both C++ and Python)

computer-vision cpp image-processing image-quality image-quality-assessment libsvm machine-learning opencv python svm

Last synced: 24 Oct 2025

https://github.com/emla2805/vision-transformer

Tensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

computer-vision tensorflow transformer vision-transformer

Last synced: 09 Oct 2025

https://github.com/MaartenGr/Concept

Concept Modeling: Topic Modeling on Images and Text

computer-vision image-processing nlp topic-modeling

Last synced: 05 Apr 2025

https://github.com/pwlmc/imghash

Perceptual image hashing for Node.js

computer-vision image-processing imghash webscraping

Last synced: 25 Apr 2026

https://github.com/zcemycl/matlab-gan

MATLAB implementations of Generative Adversarial Networks -- from GAN to Pixel2Pixel, CycleGAN

aae acgan cgan computer-vision cyclegan dcgan deep-learning deep-neural-networks gans image-generation infogan lsgan matlab matlab-gan matlab-implementations pix2pix

Last synced: 10 Jul 2025

https://github.com/Imageomics/bioclip

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

clip computer-vision imageomics knowledge-guided-machine-learning taxonomy

Last synced: 05 Apr 2025

https://github.com/Gradiant/pyodi

Python Object Detection Insights

computer-vision deep-learning machine-learning object-detection

Last synced: 05 Apr 2025

https://github.com/amusi/daily-question

每日一题(涉及但不仅限于机器学习、深度学习和计算机视觉等方向)

computer-vision deep-learning machine-learning nerual-network

Last synced: 01 Apr 2026

https://github.com/bat67/pytorch-fcn-easiest-demo

PyTorch Implementation of Fully Convolutional Networks (a very simple and easy demo).

cnn computer-vision fcn pytorch pytorch-implementation pytorch-implmention semantic-segmentation

Last synced: 09 Jul 2025

https://github.com/rocm/mivisionx

MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.

amd-opencl amd-opencv amd-openvx computer-vision inference inference-engine khronos-openvx machine-learning neural-network nnef onnx opencl openvx openvx-extensions openvx-neural-network rocm ryzen virtual-reality windows-machine-learning winml

Last synced: 16 May 2025

https://github.com/olafenwamoses/idenprof

IdenProf dataset is a collection of images of identifiable professionals. It is been collected to enable the development of AI systems that can serve by identifying people and the nature of their job by simply looking at an image, just like humans can do.

ai-systems computer-vision datasets humans idenprof-dataset identifying-people image-classification image-recognition images machine-intelligence machine-learning machine-vision people professionals

Last synced: 16 Mar 2026

https://github.com/jwwangchn/AI-TOD

Official code for "Tiny Object Detection in Aerial Images".

aerial-images computer-vision dataset object-detection

Last synced: 03 Apr 2025

https://github.com/10x-Engineers/Infinite-ISP

A camera ISP (image signal processor) pipeline that contains modules with simple to complex algorithms implemented at the application level.

camera-pipeline computer-vision image-processing image-signal-processor isp

Last synced: 09 May 2025

https://github.com/cmsflash/beauty-net

A simple, flexible, and extensible template for PyTorch. It's beautiful.

cnn computer-vision deep-learning image-classification pytorch pytorch-template

Last synced: 18 Feb 2026

https://github.com/lingdong-/handpose-facemesh-demos

🎥🤟 8 minimalistic templates for tfjs mediapipe handpose and facemesh

computer-vision handpose machine-learning mediapipe networking p5js tensorflowjs threejs

Last synced: 09 Oct 2025

https://github.com/OlafenwaMoses/IdenProf

IdenProf dataset is a collection of images of identifiable professionals. It is been collected to enable the development of AI systems that can serve by identifying people and the nature of their job by simply looking at an image, just like humans can do.

ai-systems computer-vision datasets humans idenprof-dataset identifying-people image-classification image-recognition images machine-intelligence machine-learning machine-vision people professionals

Last synced: 09 May 2025

https://github.com/RUCAIBox/Negative-Sampling-Paper

This repository collects 100 papers related to negative sampling methods.

computer-vision contrastive-learning negative-sampling paper papers recommender-system

Last synced: 09 May 2025

https://github.com/LingDong-/handpose-facemesh-demos

🎥🤟 8 minimalistic templates for tfjs mediapipe handpose and facemesh

computer-vision handpose machine-learning mediapipe networking p5js tensorflowjs threejs

Last synced: 12 Apr 2026

https://github.com/lightforever/mlcomp

Distributed DAG (Directed acyclic graph) framework for machine learning with UI

artificial-intelligence automl computer-vision deep-learning distributed-computing infrastructure machine-learning python pytorch research

Last synced: 20 Aug 2025

https://github.com/catalyst-team/mlcomp

Distributed DAG (Directed acyclic graph) framework for machine learning with UI

artificial-intelligence automl computer-vision deep-learning distributed-computing infrastructure machine-learning python pytorch research

Last synced: 03 Mar 2025

https://github.com/nianticlabs/depth-hints

[ICCV 2019] Depth Hints are complementary depth suggestions which improve monocular depth estimation algorithms trained from stereo pairs

computer-vision deep-learning depth-estimation monodepth neural-network pytorch self-supervision stereo-matching

Last synced: 28 Feb 2026

https://github.com/SHI-Labs/Self-Similarity-Grouping

Self-similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-identification (ICCV 2019, Oral)

computer-vision deep-learning domain-adaptation person-reidentification

Last synced: 15 Mar 2025

https://github.com/shi-labs/self-similarity-grouping

Self-similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-identification (ICCV 2019, Oral)

computer-vision deep-learning domain-adaptation person-reidentification

Last synced: 19 Jul 2025

https://github.com/ROCm/MIVisionX

MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.

amd-opencl amd-opencv amd-openvx computer-vision inference inference-engine khronos-openvx machine-learning neural-network nnef onnx opencl openvx openvx-extensions openvx-neural-network rocm ryzen virtual-reality windows-machine-learning winml

Last synced: 18 Jul 2025

https://github.com/BMW-InnovationLab/BMW-Anonymization-API?referral=top-free-anonymization-tools-apis-and-open-source-models

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

anonymization-api anonymization-technique bmw computer-vision data-anonymization deep-learning docker image-transformation inference no-code object-detection openvino privacy-enhancing-technologies privacy-protection pytorch semantic-segmentation tensorflow tensorflow-training video video-anonymization

Last synced: 27 Jul 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97