An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/bmw-innovationlab/bmw-anonymization-api

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

anonymization-api anonymization-technique bmw computer-vision data-anonymization deep-learning docker image-transformation inference no-code object-detection openvino privacy-enhancing-technologies privacy-protection pytorch semantic-segmentation tensorflow tensorflow-training video video-anonymization

Last synced: 21 Jul 2025

https://github.com/joeymeyer/raspberryturk

The Raspberry Turk is a robot that can play chessโ€”it's entirely open source, based on Raspberry Pi, and inspired by the 18th century chess playing machine, the Mechanical Turk.

3d-printing chess computer-vision data-science machine-learning raspberry-pi robotics

Last synced: 12 May 2025

https://github.com/trekhleb/links-detector

๐Ÿ“– ๐Ÿ‘†๐Ÿป Links Detector makes printed links clickable via your smartphone camera. No need to type a link in, just scan and click on it.

computer-vision javascript machine-learning object-detection ocr tensorflowjs tesnorflow tesseract typescript

Last synced: 07 Oct 2025

https://github.com/willbrennan/semanticsegmentation

A framework for training segmentation models in pytorch on labelme annotations with pretrained examples of skin, cat, and pizza topping segmentation

birds bisenet bisenetv2 cats coco computer-vision labelme labelme-annotations pizza pizza-toppings pytorch segmentation semantic-segmentation skin-detection skin-segmentation torchvision

Last synced: 21 Aug 2025

https://github.com/mindee/react-mindee-js

Front-End Computer Vision SDK for React

computer-vision javascript ocr react reactjs sdk

Last synced: 14 Apr 2026

https://github.com/SurajDonthi/Multi-Camera-Person-Re-Identification

State-of-the-art model for person re-identification in Multi-camera Multi-Target Tracking. Benchmarked on Market-1501 and DukeMTMTC-reID datasets.

computer-vision convolutional-neural-networks deep-learning dukemtmc-reid market-1501 mtmct multi-camera-tracking person-reidentification pytorch pytorch-lightning spatial-temporal

Last synced: 20 Mar 2025

https://github.com/htdt/hyp_metric

Hyperbolic Vision Transformers: Combining Improvements in Metric Learning | Official repository

computer-vision machine-learning pytorch

Last synced: 08 May 2025

https://github.com/pathak22/videoseg

[CVPR 2017] Video motion segmentation and tracking

computer-vision machine-learning video-segmentation

Last synced: 08 Oct 2025

https://github.com/lacmus-foundation/lacmus

Lacmus is a cross-platform application that helps to find people who are lost in the forest using computer vision and neural networks.

computer-vision cv dataset forest hacktoberfest hacktoberfest2020 keras lacmus liza-alert machine-learning ml neural-networks object-detection ods pascal-voc python retina-net

Last synced: 13 Jul 2025

https://github.com/BMW-InnovationLab/BMW-Anonymization-API

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

anonymization-api anonymization-technique bmw computer-vision data-anonymization deep-learning docker image-transformation inference no-code object-detection openvino privacy-enhancing-technologies privacy-protection pytorch semantic-segmentation tensorflow tensorflow-training video video-anonymization

Last synced: 04 Apr 2025

https://github.com/tobybreckon/python-examples-cv

OpenCV Python Computer Vision Examples used for Teaching

computer-vision opencv python

Last synced: 13 Apr 2025

https://github.com/baegwangbin/irondepth

[BMVC 2022] IronDepth: Iterative Refinement of Single-View Depth using Surface Normal and its Uncertainty

3d-reconstruction bmvc2022 computer-vision deep-learning depth-estimation monocular-depth-estimation surface-normal surface-normals uncertainty

Last synced: 04 Mar 2026

https://github.com/jvgomez/fast_methods

N-Dimensional Fast Methods: Fast Marching, Fast Sweeping, Group Marching, Fast Iterative, etc.

algorithm c-plus-plus computer-vision fast-marching fast-methods gridmap path-planning

Last synced: 11 Feb 2026

https://github.com/lukasmosser/porousmediagan

Reconstruction of three-dimensional porous media using generative adversarial neural networks

computer-vision deep-learning generative-adversarial-network image-reconstruction machine-learning porous-media

Last synced: 20 Aug 2025

https://github.com/waybarrios/vllm-mlx

OpenAI-compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s.

apple-silicon audio-processing computer-vision image-understanding inference llm machine-learning macos mllm mlx multimodal-ai speech-to-text stt text-to-speech tts video-understanding vision-language-model vllm

Last synced: 23 Jan 2026

https://github.com/CommissarMa/Crowd_counting_from_scratch

This is an overview and tutorial about crowd counting. In this repository, you can learn how to estimate number of pedestrians in crowd scenes through computer vision and deep learning.

computer-vision crowd-counting deep-learning

Last synced: 16 Apr 2025

https://github.com/wasmvision/wasmvision

wasmVision gets you going with computer vision using WebAssembly.

c-language computer-vision golang opencv rust tinygo wasi wasm wasmcv webassembly

Last synced: 14 Apr 2025

https://github.com/alemart/speedy-vision

GPU-accelerated Computer Vision for JavaScript.

computer-vision gpgpu gpu image-processing linear-algebra machine-vision opencv

Last synced: 06 Apr 2025

https://github.com/idealo/cnn-exposed

๐Ÿ•ต๏ธโ€โ™‚๏ธ Interpreting Convolutional Neural Network (CNN) Results.

computer-vision deep-learning explainable-ai interpretable-machine-learning machine-learning neural-network

Last synced: 01 May 2025

https://github.com/Hemmingsson/FacePause

Look Away to Pause Youtube - Experimental Chrome Extension

chrome-extension computer-vision face-detection face-recognition youtube

Last synced: 16 May 2025

https://github.com/nv-tlabs/meta-sim

Meta-Sim: Learning to Generate Synthetic Datasets (ICCV 2019)

computer-vision iccv2019

Last synced: 20 Aug 2025

https://github.com/tensorlayer/dagan

The implementation code for "DAGAN: Deep De-Aliasing Generative Adversarial Networks for Fast Compressed Sensing MRI Reconstruction"

computer-vision deep-learning generative-adversarial-network mri-reconstruction

Last synced: 24 Apr 2025

https://github.com/tensorlayer/DAGAN

The implementation code for "DAGAN: Deep De-Aliasing Generative Adversarial Networks for Fast Compressed Sensing MRI Reconstruction"

computer-vision deep-learning generative-adversarial-network mri-reconstruction

Last synced: 27 Mar 2025

https://github.com/VSainteuf/pytorch-psetae

PyTorch implementation of the model presented in "Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention"

agriculture computer-vision cvpr cvpr2020 deep-learning earth-observation machine-learning pytorch remote-sensing satellite-image self-attention spatio-temporal time-series-classification transformer-architecture

Last synced: 08 May 2025

https://github.com/hao-lh/the-books-making-you-better

A list of time-lasting classic books, which not only help you figure out how it works, but also grasp when it works and why it works in that way.

bayesian-inference computer-architecture computer-vision deep-learning high-performance-computing linear-algebra machine-learning probabilistic-graphical-models reinforcement-learning statistical-learning

Last synced: 15 Apr 2025

https://github.com/ilmonteux/logohunter

Deep learning tool to find brand logos in everyday pictures

computer-vision logo-detection

Last synced: 05 Apr 2025

https://github.com/matajoh/fourier_feature_nets

Supplemental learning materials for "Fourier Feature Networks and Neural Volume Rendering"

computer-graphics computer-vision deep-learning pytorch

Last synced: 28 Oct 2025

https://github.com/adobe-research/sam_inversion

[CVPR 2022] GAN inversion and editing with spatially-adaptive multiple latent layers

computer-graphics computer-vision deep-learning gan generative-adversarial-network image-manipulation machine-learning pytorch

Last synced: 14 Aug 2025

https://github.com/rubenszimbres/repo-2018

Deep Learning Summer School + Tensorflow + OpenCV cascade training + YOLO + COCO + CycleGAN + AWS EC2 Setup + AWS IoT Project + AWS SageMaker + AWS API Gateway + Raspberry Pi3 Ubuntu Core

aws-ec2 aws-iot aws-sagemaker coco computer-vision cyclegan ec2-instance generative-adversarial-networks haarcascade iot keras opencv opencv-library opencv3 raspberry-pi raspberry-pi-3 sagemaker tensorflow ubuntucore yolo-model

Last synced: 10 Jul 2025

https://github.com/kupynorest/head_detector

Official repo for VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset..

3dmm computer-vision diffusion-models face-detection face-mesh-detection generative-ai head-detection mesh-network

Last synced: 05 Apr 2025

https://github.com/kangyeolk/Paint-by-Sketch

Stable Diffusion-based image manipulation method with a sketch and reference image

computer-vision diffusion-models image-editing image-generation image-manipulation pytorch stable-diffusion

Last synced: 28 Mar 2025

https://github.com/aisingapore/peekingduck

A modular framework built to simplify Computer Vision inference workloads.

computer-vision deep-learning nodes object-detection object-tracking pipeline pose-estimation python

Last synced: 13 Sep 2025

https://github.com/cloud-cv/origami

:unlock: :key: :closed_lock_with_key: Origami: Artificial Intelligence as a Service

ai-as-a-service artificial-intelligence computer-vision docker flask machine-learning mongodb python reactjs

Last synced: 24 Aug 2025

https://github.com/willbrennan/skindetector

A Python based skin detection system using OpenCV

computer-vision detection opencv python skin-detection skin-segmentation

Last synced: 02 Jul 2025

https://github.com/iago-suarez/efficient-descriptors

:rocket::rocket: Revisiting Binary Local Image Description for Resource Limited Devices

computer-vision descriptors local-features real-time robotics slam

Last synced: 08 May 2025

https://github.com/vietnh1009/ssd-pytorch

SSD: Single Shot MultiBox Detector pytorch implementation focusing on simplicity

computer-vision deep-learning deep-neural-networks deeplearning detection detector neural-networks object-detection pytorch ssd ssdlite

Last synced: 10 Oct 2025

https://github.com/batra-mlp-lab/visdial-rl

PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning

computer-vision deep-learning natural-language-processing pytorch visual-dialog

Last synced: 22 Feb 2026

https://github.com/mrdbourke/airbnb-amenity-detection

Repo for 42 days project to replicate/improve Airbnb's amenity (object) detection pipeline.

airbnb-amenity-detection computer-vision deep-learning detectron2 machine-learning machine-learning-project object-detection

Last synced: 09 Apr 2025

https://github.com/Nico31415/Drowning-Detector

Using YOLO object detection, this program will detect if a person is drowning. This project is still a work in progress, so it can only be implemented with a computer's webcam, and doesn't work completely yet.

computer-vision machine-learning python python3 yolov3

Last synced: 13 Mar 2025

https://github.com/mftnakrsu/automatic_number_plate_recognition_yolo_ocr

Automatic number plate recognition using tech: Yolo, OCR, Scene text detection, scene text recognation, flask, torch

ai artificial-intelligence computer-vision csv database deep-learning detection easyocr flask machine-learning object object-detection ocr opencv paddleocr python realtime yolo yolov5

Last synced: 11 May 2026

https://github.com/cortictechnology/cep

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

artificial-intelligence computer-vision deep-learning edge-computing iot lego-mindstorms natural-language-processing oak-d opencv raspberry-pi smarthome speech-generation speech-recognition visual-programming

Last synced: 11 Apr 2026

https://github.com/MikeS96/autonomous_landing_uav

ROS packages of the Autonomous landing system of a UAV in Gazebo

computer-vision cpp gazebo mavros python sitl uav

Last synced: 10 May 2025

https://github.com/ViCCo-Group/THINGSvision

Python package for extracting representations from state-of-the-art computer vision models

alignment cognitive-science computer-vision deep-learning neural-networks pytorch representations tensorflow

Last synced: 08 May 2025

https://github.com/allenpandas/bjtu-cs-notebook

๐Ÿ‘จโ€๐ŸŽ“ ๅŒ—ไบฌไบค้€šๅคงๅญฆ่ฎก็ฎ—ๆœบ็ง‘ๅญฆไธŽๆŠ€ๆœฏๅญฆ้™ข็ ”็ฉถ็”Ÿ่ฏพ็จ‹่ต„ๆ–™ใ€็ฌ”่ฎฐใ€ๅ›žๅฟ†ๅ’Œๆ•ด็†็š„ๆœŸๆœซ่€ƒ่ฏ•ๅทๅŠ่ฏพ็จ‹ไฝœไธšใ€‚ๅธŒๆœ›ๅฏนไฝ ไปฌๆœ‰ๆ‰€ๅธฎๅŠฉโค๏ธ๏ผŒๅฆ‚ๆžœๅ–œๆฌข่ฎฐๅพ—็ป™ไธชstar๐ŸŒŸ

artificial-intelligence beijing-jiaotong-university bjtu computer-engineering computer-graphics computer-networks computer-science computer-security computer-vision experiments homework lesson-material project-management software software-engineering software-testing

Last synced: 05 Apr 2025

https://github.com/leggedrobotics/wild_visual_navigation

Wild Visual Navigation: A system for fast traversability learning via pre-trained models and online self-supervision

computer-vision field-robotics legged-robots machine-learning navigation online-learning robot-navigation robotics self-supervised-learning traversability-estimation

Last synced: 04 Apr 2025

https://github.com/compvis/brushstroke-parameterized-style-transfer

TensorFlow implementation of our CVPR 2021 Paper "Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes".

computer-vision deep-learning differentiable-rendering style-transfer

Last synced: 24 Jul 2025

https://github.com/mahmoudnafifi/WB_color_augmenter

WB color augmenter improves the accuracy of image classification and image semantic segmentation methods by emulating different WB effects (ICCV 2019) [Python & Matlab].

cnn color-augmentation color-constancy color-correction computer-vision data-augmentation deep-learning deep-neural-network deeplearning iccv19 iccv2019 image-augmentation image-classification semantic-segmentation white-balance whitebalance

Last synced: 08 May 2025

https://github.com/CompVis/brushstroke-parameterized-style-transfer

TensorFlow implementation of our CVPR 2021 Paper "Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes".

computer-vision deep-learning differentiable-rendering style-transfer

Last synced: 03 Apr 2025

https://github.com/localdevices/pyorc

Surface velocity, object tracking, and river flow measurements in an open-source API

computer-vision hydrology hydrometry velocimetry

Last synced: 21 Oct 2025

https://github.com/vietnh1009/SSD-pytorch

SSD: Single Shot MultiBox Detector pytorch implementation focusing on simplicity

computer-vision deep-learning deep-neural-networks deeplearning detection detector neural-networks object-detection pytorch ssd ssdlite

Last synced: 19 Jul 2025

https://github.com/beiyuouo/arxiv-daily

๐ŸŽ“ Automatically Update Some Fields Papers Daily using Github Actions (Update Every 12th hours)

arxiv arxiv-api arxiv-daily arxiv-papers computer-vision federated-learning github-actions meta-learning robotics slam

Last synced: 04 Apr 2025

https://github.com/collidingscopes/3d-model-playground

Control 3D models using hand gestures and voice commands in real-time. Threejs / mediapipe computer vision

3d-model animation augmented-reality computer-vision hand-tracking mediapipe open-source rosebud threejs tutorial voice-recognition web-speech-api

Last synced: 10 Sep 2025

https://github.com/utkarsh-deshmukh/spatially-varying-blur-detection-python

python implementation of the paper "Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes" - cvpr 2017

blur blur-detection blur-detector computer-vision cvpr2017 image-processing image-quality image-quality-assessment python

Last synced: 20 Aug 2025

https://github.com/pwlmaciejewski/imghash

Perceptual image hashing for Node.js

computer-vision image-processing imghash webscraping

Last synced: 05 Apr 2025

https://github.com/gjy3035/gcc-sfcn

This is the official code of spatial FCN in the paper Learning from Synthetic Data for Crowd Counting in the Wild [CVPR2019].

computer-vision crowd-analysis crowd-counting cvpr2019

Last synced: 10 Apr 2025

https://github.com/lightaime/sgas

SGAS: Sequential Greedy Architecture Search (CVPR'2020) https://www.deepgcns.org/auto/sgas

3d-point-clouds automl bioinformatics computer-vision deep-gcns geometric-deep-learning graph-neural-networks neural-architecture-search

Last synced: 03 May 2025

https://github.com/Z-Zheng/ChangeStar

Change is Everywhere: Single-Temporal Supervised Object Change Detection in Remote Sensing Imagery (ICCV 2021) https://arxiv.org/abs/2108.07002

change-detection computer-vision deep-learning remote-sensing segmentation

Last synced: 11 May 2025

https://github.com/weihaox/gen3d

[CSUR 2023] A Survey on Deep Generative 3D-aware Image Synthesis

3d-aware-image-synthesis computer-graphics computer-vision

Last synced: 26 Jan 2026

https://github.com/kemfic/curved-lane-lines

detect curved lane lines using HSV filtering and sliding window search.

computer-vision lane-boundaries opencv python robotics self-driving-car

Last synced: 30 Apr 2025

https://github.com/zju3dv/instant-nvr

[CVPR 2023] Code for "Learning Neural Volumetric Representations of Dynamic Humans in Minutes"

computer-vision cvpr2023 digital-human nerf neural-rendering view-synthesis

Last synced: 24 Jun 2025

https://github.com/huggingface/chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

computer-vision dataloading datasets distributed-training document-understanding multi-modal-learning pdf-document webdataset

Last synced: 14 Oct 2025

https://github.com/DavexPro/pytorch-pose-estimation

PyTorch Implementation of Realtime Multi-Person Pose Estimation project.

computer-vision cvpr-2017 deep-learning human-pose-estimation python3 pytorch

Last synced: 19 Jul 2025

https://github.com/zoeyfyi/tex-match

Search through over 1000 different LaTeX symbols by sketching. A desktop version of detexify.

computer-vision latex rust tex

Last synced: 08 Jul 2025

https://github.com/jannisborn/covid19_ultrasound

Open source lung ultrasound (LUS) data collection initiative for COVID-19.

computer-vision covid-19 covid19-dataset deep-learning lung-ultrasound machine-learning pocus ultrasound

Last synced: 07 Apr 2025

https://github.com/daddydrac/-deprecated-NVIDIA-GPU-Tensor-Core-Accelerator-PyTorch-OpenCV

Computer vision container that includes Jupyter notebooks with built-in code hinting, Anaconda, CUDA 11.8, TensorRT inference accelerator for Tensor cores, CuPy (GPU drop in replacement for Numpy), PyTorch, PyTorch geometric for Graph Neural Networks, TF2, Tensorboard, and OpenCV for accelerated workloads on NVIDIA Tensor cores and GPUs.

computer-vision cupy deep-learning image-processing machine-learning opencv pytorch tensorboard tensorflow2 tensorrt-inference-accelerator vision

Last synced: 06 Apr 2025

https://github.com/trzy/fasterrcnn

Clean and readable implementations of Faster R-CNN in PyTorch and TensorFlow 2 with Keras.

computer-vision convolutional-neural-networks faster-rcnn machine-learning object-detection

Last synced: 12 Jun 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97