An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/visioncortex/visioncortex

Core library for computer graphics & vision applications

computer-graphics computer-vision rust

Last synced: 06 Apr 2025

https://github.com/kwan3854/ceb_econ

AI based all-in-one character generator Blender plug-in. This project contains unofficial updates for the CEB_ECON Blender add-on.

3d-graphics 3d-modeling 3d-reconstruction blender-addon character-generator computer-graphics computer-vision game-development metaverse model-generator pifu stable-diffusion texture-generation virtual-humans

Last synced: 27 Sep 2025

https://github.com/jaisayush/Fatigue-Detection-System-Based-On-Behavioural-Characteristics-Of-Driver

A computer vision system made with the help of opencv that can automatically detect driver drowsiness in a real-time video stream and then play an alarm if the driver appears to be drowsy.

anaconda-environment anaconda3 computer-vision cv cv2 drowsiness-detection drowsy-driver-warning-system imutils miniproject numpy opencv opencv-python opencv2 playsound python python3 scipy

Last synced: 10 Apr 2025

https://github.com/interdigitalinc/compressai-vision

CompressAI-Vision helps you design, test and compare Video Compression for Machines pipelines. Compression methods can be either pulled from custom AI-based modules from CompressAI or traditional codecs such as H.266/VVC.

compression computer-vision deep-learning machine-to-machine-communication pytorch video-compression

Last synced: 10 Oct 2025

https://github.com/libmir/dcv

Computer Vision Library for D Programming Language

computer-vision image-processing

Last synced: 04 Jan 2026

https://github.com/imoonlab/hyper-yolov1.1

The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".

computer-vision deep-learning hypergraph hypergraph-neural-networks object-detection yolo

Last synced: 07 Apr 2025

https://github.com/ternaus/check_orientation

Model to check if image was rotated by 90, 180, 270 degrees.

computer-vision deep-learning image-classification orientation-detection

Last synced: 04 Jul 2025

https://github.com/distant-viewing/dvt

Distant Viewing Toolkit for the Analysis of Visual Culture

computer-vision cultural-analytics digital-humanities

Last synced: 17 Mar 2025

https://github.com/mdsrqbl/omnihuman

AI model that understands text & humanoids.

computer-vision generative-ai nlp pose-synthesis pytorch transformers

Last synced: 15 Apr 2025

https://github.com/cambridgeltl/visual-spatial-reasoning

[TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.

computer-vision multimodal-deep-learning nlp vision-and-language

Last synced: 03 Apr 2025

https://github.com/AIVFI/Video-Frame-Interpolation-Rankings-and-Video-Deblurring-Rankings

Rankings include: ABME AdaFNIO ALANET AMT BiT BVFI CDFI CtxSyn DBVI DeMFI DQBC DRVI EAFI EBME EDC EDENVFI EDSC EMA-VFI FGDCN FILM FLAVR H-VFI IFRNet IQ-VFI JNMR LADDER M2M MA-GCSPA NCM PerVFI PRF ProBoost-Net RIFE RN-VFI SoftSplat SSR ST-MFNet Swin-VFI TDPNet TTVFI UGFI UPR-Net UTI-VFI VFIformer VFIFT VFIMamba VFIT VIDUE VRT

artificial-intelligence computer-vision deblurring deep-learning frame-interpolation interpolation low-level-vision machine-learning motion-blur motion-deblurring ncnn restoration rife tensorrt vapoursynth video-deblurring video-frame-interpolation video-interpolation video-processing video-restoration

Last synced: 06 Mar 2025

https://github.com/robert80203/HuPR-A-Benchmark-for-Human-Pose-Estimation-Using-Millimeter-Wave-Radar

The official implementation of HuPR: A Benchmark for Human Pose Estimation Using Millimeter Wave Radar

computer-vision deep-learning millimeter-wave radar

Last synced: 18 Nov 2025

https://github.com/gantman/rps_tfjs_demo

Training a Rock Paper Scissors model in the browser via TFJS - Learn along style

computer-vision machine-learning machinelearning react rock-paper-scissors rps tensorflow tensorflow-js tensorflowjs

Last synced: 06 Apr 2025

https://github.com/jmmanley/vgg-multiple-view-geometry

A set of MATLAB utilities for multiple view geometry, provided alongside Hartley & Zisserman's "Multiple View Geometry in Computer Vision, Second Edition" (2004). Obtained from http://www.robots.ox.ac.uk/~vgg/hzbook/code/.

3d-reconstruction camera-calibration computer-vision multiple-view-geometry

Last synced: 05 Mar 2025

https://github.com/bytedeco/sbt-javacv

Start using OpenCV in your JVM project in just 1 line, no separate compiling, installing OpenCV, or fussing with your system required

computer-vision javacv opencv-java sbt-plugin scala

Last synced: 13 Apr 2025

https://github.com/qdata/lamp

ECML 2019: Graph Neural Networks for Multi-Label Classification

computer-vision graph-attention-networks graph-neural-networks multi-label-classification nlp transformers

Last synced: 13 Apr 2025

https://github.com/abcSup/NotEnoughSleepAI

Implementation of the Multi-Task Multi-Sensor Fusion for 3D Object Detection paper by Uber

computer-vision deep-learning neural-networks

Last synced: 20 Mar 2025

https://github.com/ndrplz/dreyeve

[TPAMI 2018] Predicting the Driver's Focus of Attention: the DR(eye)VE Project. A deep neural network learnt to reproduce the human driver focus of attention (FoA) in a variety of real-world driving scenarios.

adas attention automotive autonomous-driving autonomous-vehicles computer-vision convolutional-neural-networks deep-learning driver-focus

Last synced: 10 Jul 2025

https://github.com/datarootsio/tutorial-face-mask-detection

In this project, we develop a pipeline to detect unmasked faces in images. This can, for example, be used to alert people that do not wear a mask when entering a building.

classification computer-vision deep-learning face-mask-detection face-mask-detector facemaskdetection keras opencv python-3 tensorflow tutorial vggface2

Last synced: 11 Apr 2025

https://github.com/sayakpaul/dreambooth-keras

Implementation of DreamBooth in KerasCV and TensorFlow.

computer-vision dreambooth generative-ai keras keras-cv stable-diffusion tensorflow

Last synced: 19 Apr 2025

https://github.com/bluemirrors/cvu

Computer Vision deployment tools for dummies and experts. CVU aims at making CV pipelines easier to build and consistent around platforms, devices, and models.

computer-vision object-detection onnxruntime pytorch tensorflow2 tensorrt yolov5

Last synced: 08 May 2025

https://github.com/sahagobinda/GPM

Official [ICLR] Code Repository for "Gradient Projection Memory for Continual Learning"

computer-vision continual-learning deep-learning optimizaiton pytorch

Last synced: 08 May 2025

https://github.com/kuanghuei/clean-net

Tensorflow source code for "CleanNet: Transfer Learning for Scalable Image Classifier Training with Label Noise" (CVPR 2018)

computer-vision deep-learning label-noise large-scale-learning tensorflow transfer-learning

Last synced: 01 Mar 2026

https://github.com/khrylx/egopose

[ICCV 2019] Official PyTorch Implementation of "Ego-Pose Estimation and Forecasting as Real-Time PD Control".

character-controller computer-vision physics-based pose-estimation pose-forecasting reinforcement-learning

Last synced: 05 Mar 2026

https://github.com/funkelab/gunpowder

A library to facilitate machine learning on multi-dimensional images.

computer-vision gunpowder machine-learning multidimensional pipeline

Last synced: 06 Apr 2026

https://github.com/cvg/scrstudio

[CVPR 2025] A unified framework for Scene Coordinate Regression-based visual localization

3d-reconstruction 3d-vision computer-vision deep-learning pose-estimation scene-coordinate-regression visual-localization

Last synced: 11 Oct 2025

https://github.com/norlab-ulaval/PercepTreeV1

Implementation of Grondin et al. 2022 "Tree Detection and Diameter Estimation Based on Deep Learning". Also includes datasets and some of the pretrained models.

computer-vision datasets forestry

Last synced: 09 May 2025

https://github.com/wgcban/spin_roadmapper

Official implementation of our ICRA'22 paper: SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and Interaction Space Graph Reasoning for Autonomous Driving

aerial-imagery autonomous-driving computer-vision road-detection satellite-imagery segmentation

Last synced: 26 Apr 2025

https://github.com/nikolasent/road-semantic-segmentation

Udacity Self-Driving Car Engineer Nanodegree. Project: Road Semantic Segmentation

carnd classification computer-vision deep-learning fcn self-driving-car semantic-segmentation tensorflow

Last synced: 30 Apr 2025

https://github.com/skalskip/som

Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️

chatgpt computer-vision gpt-4 multimodal

Last synced: 22 Apr 2025

https://github.com/cjekel/tindetheus

Build personalized machine learning models for Tinder using Python

classification-model computer-vision database facenet facenet-models machine-learning machinelearning python tinder

Last synced: 20 Aug 2025

https://github.com/r3gm/insightsolver-colab

InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.

ai-ops aiops autogpt colab-notebook colorization computer-vision deep-learning llama-2 llama-cpp llm machine-learning object-detection stable-diffusion text-to-speech

Last synced: 12 Oct 2025

https://github.com/kitware/dive

Media annotation and analysis tools for web and desktop. Get started at https://viame.kitware.com

annotation computer-vision docker image-annotation machine-learning marine-biology object-detection video video-analytics video-annotation

Last synced: 05 Apr 2025

https://github.com/dipanjans/adversarial-learning-robustness

Contains materials for workshops pertaining to adversarial robustness in deep learning.

adversarial-attacks adversarial-learning computer-vision deep-learning python tensorflow

Last synced: 30 Apr 2025

https://github.com/miteshputhran/image-caption-generator

The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)

bleu-score caption-generator computer-vision flickr image-captioning jupyter-notebook lstm-model machine-learning natural-language-processing python

Last synced: 14 Apr 2025

https://github.com/kartik4949/tensorpipe

High-performance TensorFlow data pipeline with state-of-the-art augmentations and low-level optimizations.

computer-vision datapipeline tensorflow

Last synced: 26 Oct 2025

https://github.com/yahoo/graphkit

A lightweight Python module for creating and running ordered graphs of computations.

computer-vision machine-learning python

Last synced: 13 Apr 2025

https://github.com/fabioperez/skin-data-augmentation

Source code for the paper 'Data Augmentation for Skin Lesion Analysis' - 🏆 Best Paper Award at the ISIC Skin Image Analysis Workshop @ MICCAI 2018

computer-vision data-augmentation deep-learning machine-learning melanoma melanoma-classification pytorch skin-lesion-analysis

Last synced: 06 Apr 2025

https://github.com/moygcc/hpcwild

Human Performance Capture from Monocular Video in the Wild (3DV2021)

3d-vision 3dv2021 computer-vision

Last synced: 29 Sep 2025

https://github.com/sayakpaul/adaptive-gradient-clipping

Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.

colab-notebook computer-vision deep-neural-networks normalization-free-training tensorflow2

Last synced: 30 Apr 2025

https://github.com/ShuweiShao/IEBins

[NeurIPS2023] IEBins: Iterative Elastic Bins for Monocular Depth Estimation

computer-vision deep-learning depth-estimation

Last synced: 05 May 2025

https://github.com/wellflat/cat-fancier

cat detection, recognition, classifier engine :cat:

computer-vision machine-learning python

Last synced: 21 Mar 2025

https://github.com/thushv89/manning_tf2_in_action

The official code repository for "TensorFlow in Action" by Manning.

computer-vision deep-learning machine-learning nlp notebook python tensorflow tensorflow2 tf tf2

Last synced: 18 Mar 2025

https://github.com/l1997i/LiM3D

🔥 (CVPR 2023) Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation

complexity computer-vision cvpr cvpr2023 deep-learning lidar point-cloud semantic-segmentation

Last synced: 20 Mar 2025

https://github.com/magamig/duplicate-images-finder

๐Ÿž๐ŸŒ‰ Find and Delete Duplicate Images / Photos

computer-vision duplicate-images image-recognition lowe sift

Last synced: 18 Jan 2026

https://github.com/iotJumpway/RPI-Examples

Examples of using the IoT JumpWay with the Raspberry Pi 3.

computer-vision iot iot-jumpway mqtt python raspberry-pi

Last synced: 17 Jul 2025

https://github.com/svishnu88/TGS-SaltNet

Kaggle | 21st place solution for TGS Salt Identification Challenge

computer-vision deep-learning fastai kaggle-competition pytorch

Last synced: 19 Jul 2025

https://github.com/hongyehu/RG-Flow

This is the project page for the paper "RG-Flow: a hierarchical and explainable flow model based on renormalization group and sparse prior". Paper link: https://arxiv.org/abs/2010.00029

computer-vision generative-model hierarchical-models normalizing-flow renormalization-group representation-learning sparse-coding

Last synced: 15 Jul 2025

https://github.com/fabioperez/transito-cv

[DEPRECATED] Traffic sign detector and classifier that uses dlib and its implementation of Felzenszwalb's version of the Histogram of Oriented Gradients (HoG) detector

computer-vision dlib hog traffic-signs

Last synced: 10 Oct 2025

https://github.com/MiraPurkrabek/BBoxMaskPose

[ICCV 25] The official repository of the paper 'Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle'

computer-vision human-pose-estimation iccv iccv2025 keypoint-detection pose-estimation research-paper

Last synced: 30 Jan 2026

https://github.com/s32x/gamedetect

:space_invader: Game Detection API using Tensorflow and Go

computer-vision docker gaming golang tensorflow

Last synced: 23 Apr 2025

https://github.com/theos-ai/easy-yolov7

This is a clean and easy-to-use implementation of YOLOv7 in PyTorch, made with ❤️ by Theos AI.

computer-vision deep-learning machine-learning neural-networks object-detection python pytorch yolov5 yolov7

Last synced: 20 Apr 2025

https://github.com/mukosame/aoda

Official implementation of "Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis" (WACV 2022/CVPRW 2021)

computer-vision cvpr cvpr2021 deep-learning gan gans image-generation image-manipulation image2image open-domain pytorch sketch wacv wacv2022

Last synced: 20 Oct 2025

https://github.com/vision-cair/3dcompat-v2

3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

3d compositional-learning computer-vision deep-learning multimodal-deep-learning

Last synced: 15 Apr 2025

https://github.com/collidingscopes/threejs-handtracking-101

A threejs / WebGL / MediaPipe-powered interactive demo that allows you to control a 3D sphere using hand gestures.

3d computer-vision free gestures hand hand-tracking mediapipe open-source threejs tutorial webgl

Last synced: 14 Mar 2026

https://github.com/Suoivy/LPD-net

LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis, ICCV 2019, Seoul, Korea

computer-vision deep-learning place-recognition point-cloud

Last synced: 20 Mar 2025

https://github.com/thaoshibe/beautygan-pytorch-reimplementation

A re-implementation of BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network (ACM MM'18)

color-matching computer-vision computervision deep-learning gan makeup-transfer pytorch

Last synced: 07 Mar 2026

https://github.com/marteinn/wagtail-alt-generator

Insert image description and tags with the help of computer vision

computer-vision django wagtail

Last synced: 10 Apr 2025

https://github.com/harveyslash/patchmatch

GPU and CPU implementation of Patchmatch in Python!

computer-graphics computer-vision

Last synced: 08 Jul 2025

Computer vision Awesome Lists

Awesome-pytorch-list 707
awesome-multimodal-ml 480
awesome-self-supervised-learning 463
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-industrial-anomaly-detection 1,102
awesome-hand-pose-estimation 497
awesome-image-classification 220
Awesome-Crowd-Counting 468
awesome-human-pose-estimation 89
awesome-autonomous-vehicles 296
Awesome-World-Model 725
Awesome-Federated-Learning 558
Awesome-FL 4,069
awesome-low-light-image-enhancement 219
Awesome-pytorch-list-CNVersion 692
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-Implicit-NeRF-Robotics 191
iOS_ML 39
awesome-tensorflow-lite 110
CV-pretrained-model 103
awesome-attention-mechanism-in-cv 195
Awesome-Image-Colorization 150
awesome-grounding 157
openstl 43
Awesome-Open-Vocabulary 162
awesome-ai-awesomeness 236
awesome-capsule-networks 67
awesome-autonomous-vehicle 181
awesome-6d-object 600
awesome-multi-task-learning 233
awesome-robotics-3d 111
awesome-photogrammetry 90
awesome-open-data-centric-ai 56
awesome-ai-data-guided-projects 56
Awesome-3D-Object-Detection 169
Awesome-Skeleton-based-Action-Recognition 104
awesome-optical-flow 110
awesome-holistic-3d 129
awesome-data-annotation 93
Awesome-Parameter-Efficient-Transfer-Learning 124
awesome-panoptic-segmentation 45
awesome-computer-vision-models 189
awesome-robotics-datasets 79
awesome-state-of-depth-completion 71
awesome-nerf-editing 537
arctic 34
awesome-image-alignment-and-stitching 108
Awesome-Distributed-Deep-Learning 44
Awesome-Monocular-3D-detection 97