An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/longxiang-ai/human101

The official implementation of "Human101: Training 100+FPS Human Gaussians in 100s from 1 View".

3d 3d-reconstruction 3d-vision computer-vision digital-human gaussian gaussian-splatting human-reconstruction monocular real-time real-time-rendering view-synthesis

Last synced: 03 Jan 2026

https://github.com/hal-42/alchemycat

Alchemy Cat โ€”โ€” ๐Ÿ”ฅConfig System for SOTA

auto-tuning computer-vision config deep-learning machine-learning parameter-tuning

Last synced: 14 Jan 2026

https://github.com/pooya-mohammadi/deep_utils

An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!

augmentation coco computer-vision cutmix deep-learning face-detection face-recognition machine-learning modelcheckpoint nlp object-detection python pytorch senet tensorflow utils vggface2 yolov5

Last synced: 16 May 2025

https://github.com/li-plus/seam-carving

A super-fast Python implementation of seam carving algorithm for intelligent image resizing.

computer-graphics computer-vision image-processing image-resizing python seam-carving

Last synced: 04 Mar 2026

https://github.com/ravisvi/super-resolution-videos

Applying SRGAN technique implemented in https://github.com/zsdonghao/SRGAN on videos to super resolve them.

computer-vision srgan super-resolution tensorlayer

Last synced: 13 Apr 2025

https://github.com/nrl-ai/daisykit

DaisyKit is an easy AI toolkit with face mask detection, pose detection, background matting, barcode detection, face recognition and more. - with NCNN, OpenCV, Python wrappers

background-matting barcode-detection computer-vision cpp deep-learning deployment embedded face-detection face-mask-detection hand-pose inference-engine machine-learning mobile ncnn neural-network no-code object-detection python vulkan

Last synced: 09 Mar 2026

https://github.com/orrzohar/PROB

[CVPR 2023] Official Pytorch code for PROB: Probabilistic Objectness for Open World Object Detection

computer-vision object-detection open-set-object-detection open-world-object-detection owod

Last synced: 15 Jun 2025

https://github.com/ai4ce/msg

[NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)

computer-vision deep-learning multiview neurips-2024 scene-graph scene-representations visual-navigation

Last synced: 11 Oct 2025

https://github.com/soubhiksanyal/now_evaluation

This is the official repository for evaluation on the NoW Benchmark Dataset. The goal of the NoW benchmark is to introduce a standard evaluation metric to measure the accuracy and robustness of 3D face reconstruction methods from a single image under variations in viewing angle, lighting, and common occlusions.

2d-3d 3d-data 3d-face-alignment 3d-landmarks 3d-mesh 3d-reconstruction benchmark-datasets computer-vision face face-alignment face-reconstruction face-reconstruction-challenge flame flame-model now-evaluation python python3 single-image-reconstruction triplet-loss

Last synced: 16 Jan 2026

https://github.com/henzler/neuraltexture

Learning a Neural 3D Texture Space from 2D Exemplars [CVPR 2020]

computer-graphics computer-vision deep-learning machine-learning texture-synthesis

Last synced: 10 Oct 2025

https://github.com/RizwanMunawar/yolov5-object-tracking

YOLOv5 Object Tracking + Detection + Object Blurring + Streamlit Dashboard Using OpenCV, PyTorch and Streamlit

computer-vision object-detection object-tracking streamlit-dashboard yolov5

Last synced: 21 Apr 2025

https://github.com/rese1f/aurora

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

benchmark computer-vision deploy llm multimodal-large-language-models transformers video-understanding

Last synced: 19 Aug 2025

https://github.com/karayaman/lichess-with-a-real-board

Lichess.org client for real life chess boards.

computer-vision lichess

Last synced: 15 Apr 2025

https://github.com/yihongXU/TransCenter

This is the official implementation of TransCenter (TPAMI). The code and pretrained models are now available here: https://gitlab.inria.fr/yixu/TransCenter_official.

computer-vision deep-learning multiple-object-tracking pytorch transformers

Last synced: 05 May 2025

https://github.com/furkan-gulsen/sport-with-ai

The human body is detected with the help of the Mediapipe library. Then, using the mathematical methods applied, it is determined how much the exercise count is done.

ai artificial-intelligence computer-vision deep-learning image-processing keras machine-learning mediapipe python python3 sport tensorflow

Last synced: 16 Oct 2025

https://github.com/fasttrackorg/fasttrack

FastTrack is a cross-platform application designed to track multiple objects in video recording.

computer-vision image-processing linux macos opencv opencv4 qt qt6 research science tracking windows

Last synced: 05 Apr 2025

https://github.com/tonylianlong/crossmae

Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

computer-vision deep-learning mae masked-autoencoder self-supervised-learning

Last synced: 15 Sep 2025

https://github.com/by-sabbir/headposeestimation

Head Pose Estimation using OpenCV solving PNP.

computer-vision headpose-estimation opencv

Last synced: 05 Mar 2025

https://github.com/xiaosu-zhu/mcquic

Repository of CVPR'22 paper "Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression"

computer-vision cvpr2022 image-compression image-processing pytorch

Last synced: 12 Oct 2025

https://github.com/dthuerck/mapmap_cpu

A high-performance general-purpose MRF MAP solver, heavily exploiting SIMD instructions.

algorithm computer-vision dynamic-programming feedback-vertex-set markov-random-field massively-parallel mrf simd simd-programming vectorization

Last synced: 21 Apr 2025

https://github.com/ZichengDuan/EZIGen

[BMVC 2025] Official implementation for paper EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance

aigc computer-vision deep-learning diffusers diffusion-models generative-ai

Last synced: 14 Sep 2025

https://github.com/twke18/SPML

Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning

computer-vision deep-learning metric-learning semantic-segmentation weakly-supervised-learning

Last synced: 26 Mar 2025

https://github.com/dmitryryumin/neweraai-papers

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!

artificial-intelligence computer-vision cvpr deep-learning emnlp icassp iccv image-processing interspeech ismir mashine-learning natural-language-processing neural-networks signal-processing text-classification video-processing

Last synced: 12 Apr 2025

https://github.com/jhu-lcsr/good_robot

"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer; and โ€œGood Robot!โ€: Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer

computer-vision deep-learning deep-q-network deep-reinforcement-learning grasping learning-from-demonstration manipulation multi-step-dqn multi-step-learning reinforcement-learning robotic-manipulation robotics simulation task-to-task-transfer vrep-simulator zero-shot-learning

Last synced: 03 Oct 2025

https://github.com/giulioz/laser-scanning

๐Ÿ“ท๐Ÿ”ฆ๐Ÿ’ญ A 3D Scanner using Laser Structured Light, written in Python using OpenCV and NumPy.

3d-reconstruction 3d-scanning computer-vision opencv structured-light

Last synced: 09 Mar 2026

https://github.com/young-geng/m3ae_public

Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation

computer-vision flax jax natural-language-processing transformers

Last synced: 06 Apr 2025

https://github.com/ccextractor/rekognition

Free and Open Source alternative to Amazon's Rekognition service. CCExtractor Development | Poor Man's Rekognition

computer-vision deep-learning django django-rest-framework docker face-detection gsoc image-processing machine-learning machinelearning opencv python rest rest-api tensorflow tensorflow-serving video-processing

Last synced: 21 Aug 2025

https://github.com/alkasm/colorfilters

Image thresholding in multiple colorspaces.

computer-vision image-processing opencv

Last synced: 26 Oct 2025

https://github.com/amazon-science/exponential-moving-average-normalization

PyTorch implementation of EMAN for self-supervised and semi-supervised learning: https://arxiv.org/abs/2101.08482

computer-vision normalization self-supervised-learning semi-supervised-learning

Last synced: 03 May 2025

https://github.com/georg-wolflein/chesscog

Determining chess game state from an image.

chess chess-recognition computer-vision deep-learning pytorch

Last synced: 16 Jan 2026

https://github.com/kyegomez/rt-x

Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"

artificial-intelligence attention-is-all-you-need attention-model computer-vision gpt4 gpt4all multimodal vision vision-transformer

Last synced: 06 May 2025

https://github.com/nvidia/nvimagecodec

A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface

computer-vision cpp cuda dali data-processing deep-learning fast-data-pipeline gpu image-processing machine-learning nvidia python pytorch

Last synced: 12 Jan 2026

https://github.com/tatsuyah/Lane-Lines-Detection-Python-OpenCV

Lane Lines Detection using Python and OpenCV for self-driving car

computer-vision self-driving-car

Last synced: 02 Apr 2025

https://github.com/mdietrichstein/yolo-tiny-v1-mobile

Yolo for Android and iOS - Mobile Deep Learning Object Detection in Realtime written in Kotlin and Swift

android computer-vision deep-learning deep-neural-networks ios keras keras-tensorflow mobile tensorflow

Last synced: 10 Apr 2025

https://github.com/anupsarkar-dev/exovisix

Auto Attendance System Using Real Time Face Recognition With Various Computer Vision & Machine Learning Tools

bio-metric-attendance-system computer-vision eye-detection face-recognition gesture-detection haar-training javacv javafx mechine-learing motion-detection mysql-database ocr-recognition opencv

Last synced: 26 Oct 2025

https://github.com/etherealengine/digital-beings

A platform for letting researchers connect an intelligent AI directly to real time communication networks and 3D worlds. Your AI, Anywhere.

ai artificial-intelligence bot computer-vision cv digital-beings digital-humans machine-learning ml nlp telegram

Last synced: 11 Mar 2026

https://github.com/eliphatfs/calibur

Conversion between different conventions of camera matrices and transform matrices.

computer-graphics computer-vision developer-productivity lightweight

Last synced: 21 Aug 2025

https://github.com/microsoft/orbit-dataset

The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. The dataset is presented with a teachable object recognition benchmark task which aims to drive few-shot learning on challenging real-world data.

benchmark classification computer-vision dataset few-shot-learning machine-learning meta-learning microsoft object-recognition video

Last synced: 26 Oct 2025

https://github.com/plstcharles/litiv

C++ implementation pool for computer vision R&D projects.

algorithms computer-vision image-processing library litiv opencv segmentation

Last synced: 18 Mar 2025

https://github.com/yuanze-lin/revive

[NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering

computer-vision deep-learning gpt-3 knowledge-based multimodal-deep-learning neurips2022 ok-vqa pytorch question-answering vision-and-languge vqa

Last synced: 09 Apr 2025

https://github.com/shi-labs/semi-supervised-transfer-learning

[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

computer-vision semi-supervised-learning transfer-learning

Last synced: 12 Sep 2025

https://github.com/bytedance/twist

Official codes: Self-Supervised Learning by Estimating Twin Class Distribution

computer-vision deep-learning pretraining research self-supervised-learning twist

Last synced: 13 Apr 2025

https://github.com/skylab-tech/ffhqr-dataset

FFHQR -- the first large-scale retouching dataset for computer vision research.

computer-vision dataset deep-learning high-resolution large-scale retouching

Last synced: 25 Jan 2026

https://github.com/SHI-Labs/Semi-Supervised-Transfer-Learning

[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

computer-vision semi-supervised-learning transfer-learning

Last synced: 08 May 2025

https://github.com/Z-Zheng/FreeNet

FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification (TGRS 2020) https://ieeexplore.ieee.org/document/9007624

computer-vision convolutional-neural-network deep-learning geospatial hyperspectral-image-classification remote-sensing

Last synced: 29 Mar 2025

https://github.com/sayakpaul/convnext-tf

Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.

computer-vision convolutional-neural-networks efficient-models imagenet-1k imagenet-21k keras tensorflow

Last synced: 16 Apr 2025

https://github.com/nalepae/bounding-box

Bounding Box is a library to plot pretty bounding boxes with a simple Python API.

bounding-boxes computer-vision python visualization

Last synced: 10 Apr 2025

https://github.com/abhshkdz/neural-vqa-attention

:question: Attention-based Visual Question Answering in Torch

computer-vision deep-learning natural-language-processing torch

Last synced: 15 Apr 2025

https://github.com/imatge-upc/sentiment-2017-imavis

From Pixels to Sentiment: Fine-tuning CNNs for Visual Sentiment Prediction

affective-computing cnn computer-vision deep-learning fine-tuning-cnns sentiment sentiment-maps

Last synced: 11 May 2025

https://github.com/michaelzhang-ai/Speech2Video

Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"

3d accv aigc body computer-vision face gan generative-ai gesture hand pose rnn simulation speech speech2video-synthesis talking-head talking-heads video

Last synced: 10 May 2025

https://github.com/wl-zhao/diml

[ICCV 2021] Towards Interpretable Deep Metric Learning with Structural Matching

computer-vision deep-learning deep-metric-learning iccv2021 interpretable-deep-learning

Last synced: 15 May 2025

https://github.com/varunagrawal/bbox

Python library for 2D/3D bounding boxes

3d-bounding-boxes bounding-boxes computer-vision computer-vision-tools

Last synced: 24 Oct 2025

https://github.com/norlab-ulaval/perceptreev1

Implementation of Grondin et al. 2022 "Tree Detection and Diameter Estimation Based on Deep Learning". Also includes datasets and some of the pretrained models.

computer-vision datasets forestry

Last synced: 22 Jul 2025

https://github.com/karanchawla/monocular-visual-inertial-odometry

This contains the code(in development) for monocular visual odometry of a quadrotor. The visual data from the monocular camera is fused with onboard IMU to develop indoor control and navigation algorithms.

computer-vision fuse imu odometry opencv visual-odmetry

Last synced: 27 Jul 2025

https://github.com/jaisayush/fatigue-detection-system-based-on-behavioural-characteristics-of-driver

A computer vision system made with the help of opencv that can automatically detect driver drowsiness in a real-time video stream and then play an alarm if the driver appears to be drowsy.

anaconda-environment anaconda3 computer-vision cv cv2 drowsiness-detection drowsy-driver-warning-system imutils miniproject numpy opencv opencv-python opencv2 playsound python python3 scipy

Last synced: 12 Apr 2025

https://github.com/juglab/FourierImageTransformer

Fourier Image Transformer (FIT) can solve relevant image analysis tasks in Fourier space.

computer-vision image-processing transformer

Last synced: 08 May 2025

https://github.com/adityap27/face-mask-detector

๐‘๐ž๐š๐ฅ-๐“๐ข๐ฆ๐ž ๐…๐š๐œ๐ž ๐ฆ๐š๐ฌ๐ค ๐๐ž๐ญ๐ž๐œ๐ญ๐ข๐จ๐ง ๐ฎ๐ฌ๐ข๐ง๐  ๐๐ž๐ž๐ฉ๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐ฐ๐ข๐ญ๐ก ๐€๐ฅ๐ž๐ซ๐ญ ๐ฌ๐ฒ๐ฌ๐ญ๐ž๐ฆ ๐Ÿ’ป๐Ÿ””

artificial-intelligence computer-vision deep-learning machine-learning neural-network yolo

Last synced: 21 Apr 2025

https://github.com/tuvovan/vision_transformer_keras

Keras Implementation of Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

computer-vision image-recognition keras tensorflow transformer vision

Last synced: 21 Jul 2025

https://github.com/lingdong-/visionosc

PoseOSC + FaceOSC + HandOSC + OcrOSC + CatOSC + DogOSC

computer-vision facial-landmarks-detection hand-tracking ocr osc pose-estimation vision-framework

Last synced: 29 Dec 2025

https://github.com/baidut/PaQ-2-PiQ

Source code for "From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality"

computer-vision image-processing image-quality image-quality-assessment picture-quality

Last synced: 02 Apr 2025

https://github.com/KempnerInstitute/overcomplete

๐Ÿ‘‹ Overcomplete is a Vision-based SAE Toolbox

computer-vision deep-learning interpretability

Last synced: 27 Nov 2025

https://github.com/tensorush/awesome-graphics-programming

๐Ÿ˜Ž ๐ŸงŠ Collection of the most awesome learning resources on graphics programming in the form of videos, tutorials and books.

3d 3d-graphics augmented-reality awesome-list cg computer-graphics computer-vision game-development gamedev geometry-processing graphics graphics-programming learning opengl raytracing rendering shaders tutorials virtual-reality vulkan

Last synced: 08 Apr 2025

https://github.com/Placenote/PlacenoteSDK-iOS

PlacenoteSDK Sample app in native iOS using ARKit, written primarily in Swift

arkit augmented-reality computer-vision ios ios-swift scene-recognition scenekit slam

Last synced: 30 Jul 2025

https://github.com/opengvlab/piip

[NeurIPS 2024 Spotlight โญ๏ธ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)

computer-vision image-classification instance-segmentation multimodal-large-language-models object-detection semantic-segmentation vision-language-models vision-transformer

Last synced: 10 Oct 2025

https://github.com/eveningglow/multitask-CycleGAN

Pytorch implementation of multitask CycleGAN with auxiliary classification loss

computer-vision cyclegan deep-learning gan generative-adversarial-network image-translation pytorch

Last synced: 08 May 2025

https://github.com/aljosaosep/ciwt

This repository contains code for the tracking system as described in ''Combined Image- and World-Space Tracking in Traffic Scenes'', ICRA 2017.

3d-vision autonomous-driving autonomous-vehicles computer-vision multi-object-tracking research research-project stereo-vision tracking

Last synced: 18 Mar 2025

https://github.com/robertwgh/ezSIFT

ezSIFT: An easy-to-use standalone SIFT library written in C/C++

computer-vision cpp feature-detection feature-extraction feature-matching sift sift-algorithm sift-descriptors

Last synced: 08 May 2025

https://github.com/rehanguha/brisque

Blind/Referenceless Image Spatial QUality Evaluator (BRISQUE)

brisque computer-vision cv image-processing python statistical-analysis

Last synced: 22 Feb 2026

https://github.com/pliang279/multiviz

[ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models

computer-vision machine-learning multimodal-learning natural-language-processing

Last synced: 12 Apr 2025

https://github.com/yoonit-labs/nativescript-yoonit-camera

The most advanced and modern NativeScript Camera module for Android and iOS with a lot of awesome features

android camera camerax-api computer-vision hacktoberfest ios machine-learning mlkit nativescript vuejs-plugin

Last synced: 23 Oct 2025

https://github.com/rssr25/computer-vision

Some computer vision projects written using openCV and python.

computer-vision opencv python python27

Last synced: 09 Apr 2025

https://github.com/lzhbrian/deepfashion2-kps-agg-finetune

1st place solution (Team StylingAI Inc. & PKU AIIC) for CVPR 2020 DeepFashion2 Clothes Landmark Detection Track. Aggregation and Finetuning for Clothes Landmark Detection

ai clothes computer-vision deepfashion2 fashion keypoints landmark

Last synced: 23 Jun 2025

https://github.com/r00tman/eventnerf

Neural Radiance Fields from a Single Colour Event Camera [CVPR 2023]

computer-vision cvpr2023 event-camera eventnerf mesh nerf neural-radiance-fields neural-rendering pytorch volume-rendering

Last synced: 13 Oct 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97