An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/meyerls/aruco-estimator

Automatic Scale Factor Estimation of 3D Reconstruction in COLMAP Utilizing Aruco Marker

3d-reconstruction ambiguity-resolver aruco aruco-markers colmap computer-vision ground-control-point opencv python scale-ambiguity

Last synced: 09 Sep 2025

https://github.com/mohabmes/otsu-thresholding

Image Processing: Segmentation Using Otsu Threshold Method

computer-vision image-processing otsu otsu-threshold

Last synced: 21 Mar 2025

https://github.com/ieee8023/countception

Count-Ception: Counting by Fully Convolutional Redundant Counting

cell-biology cell-counting computer-vision deep-learning machine-learning

Last synced: 02 May 2025

https://github.com/philbort/udacity_self_driving_car

Projects from Udacity Self Driving Car Nanodegree

computer-vision deep-learning robotics self-driving-car udacity

Last synced: 19 Jul 2025

https://github.com/q-viper/contour-based-writing

This is a simple concept to do writing like operation using the contours. Please follow the article https://q-viper.github.io/2020/08/28/gesture-based-visually-writing-system-web-app/ for further details.

computer-vision contour contour-plot gesture gesture-recognition opencv

Last synced: 04 Apr 2026

https://github.com/ollieboyne/found

Official code for FOUND: Foot Optimisation with Uncertain Normals for Surface Deformation using Synthetic Data

3d-reconstruction computer-vision surface-normals surface-normals-estimation surface-reconstruction

Last synced: 05 Oct 2025

https://github.com/oke-aditya/quickvision

An Easy To Use PyTorch Computer Vision Library

computer-vision deep-learning pytorch pytorch-lightning

Last synced: 16 Mar 2025

https://github.com/kevinzakka/torchsdf-fusion

Benchmarking PyTorch variants of TSDF fusion.

computer-vision pytorch robotics tsdf-fusion

Last synced: 24 Mar 2025

https://github.com/sparkfish/shabby-pages

ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents.

binarization born-digital computer-vision corpus data-science dataset denoising layout-detection

Last synced: 17 Aug 2025

https://github.com/jina-ai/example-multimodal-fashion-search

Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP

computer-vision deep-learning neural-search nlp python

Last synced: 10 May 2025

https://github.com/yunxiaoshi/pointnet-pytorch

PointNet in PyTorch with comprehensive experiments

computer-vision deep-learning

Last synced: 20 Mar 2025

https://github.com/sovit-123/vision_transformers

Vision Transformers for image classification, image segmentation, and object detection.

attention computer-vision transformer-models transformers vision-transformer

Last synced: 12 Sep 2025

https://github.com/bhollis/aruco-marker

JavaScript library and custom HTML element for generating Aruco marker images

aruco aruco-marker aruco-markers augmented-reality computer-vision robotics web-components

Last synced: 28 Oct 2025

https://github.com/bhavik-jikadara/ai-ml-roadmap

Welcome to the ultimate guide for starting your journey in Artificial Intelligence and Machine Learning in 2025! This roadmap provides a step-by-step approach to mastering AI and ML, from fundamentals to advanced topics.

artificial-intelligence computer-vision deep-learning deployment fundamentals-of-programming keras libraries machine-learning mathematics mlops natural-language-processing production-code pytorch reinforcement-learning roadmap scikit-learn tensorflow tools

Last synced: 30 Apr 2025

https://github.com/sankalp1999/image_captioning_with_attention

CaptionBot : Sequence to Sequence Modelling where Encoder is CNN(Resnet-50) and Decoder is LSTMCell with soft attention mechanism

attention-mechanism computer-vision encoder-decoder flickr8k-dataset pytorch show-attend-and-tell

Last synced: 03 Mar 2026

https://github.com/lingdong-/pmst

๐ŸŽจ Poor Man's Style Transfer - Painting an image with the style of another, without machine learning

computer-vision emoji impressionist-paintings opencv painting pointillism style-transfer

Last synced: 14 May 2025

https://github.com/strawlab/cam-geom

๐Ÿ“ท ๐Ÿ“ Geometric models of cameras for photogrammetry

camera-models computer-vision geometry photogrammetry rust

Last synced: 05 Apr 2025

https://github.com/SAP-samples/btp-ai-sustainability-bootcamp

This github repository contains the sample code and exercises of btp-ai-sustainability-bootcamp, which showcases how to build Intelligence and Sustainability into Your Solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.

computer-vision condition-monitoring deep-learning defect-detection image-segmentation machine-learning predictive-maintenance sac-planning sample sample-code sap-ai-core sap-ai-launchpad sap-analytics-cloud sound-classification sustainability

Last synced: 07 May 2025

https://github.com/andersy005/self-driving-car-nd

Udacity's Self-Driving Car Nanodegree project files and notes.

car computer-vision deep-learning localisation self-driving-car sensor-fusion sensors udacity

Last synced: 07 Jul 2025

https://github.com/kekeblom/strayvisualizer

Visualize Data From Stray Scanner https://keke.dev/blog/2021/03/10/Stray-Scanner.html

3d-reconstruction camera computer-vision dataset depth-maps mapping open3d pointclouds python reconstruction rgb-d slam

Last synced: 12 Apr 2025

https://github.com/ndrplz/semiparametric

[TPAMI 2020] Generating Novel Views of Vehicles via Semi-parametric Guidance. A semi-parametric approach for synthesizing novel views of a rigid object from a single monocular image.

computer-vision convolutional-neural-networks deep-learning image-synthesis machine-learning novel-viewpoint-synthesis pascal3d semi-parametric semi-parametric-learning

Last synced: 10 Jul 2025

https://github.com/amazon-science/gluonmm

A library of transformer models for computer vision and multi-modality research

computer-vision iccv-2021 multimodality pytorch transformer video

Last synced: 03 May 2025

https://github.com/elucideye/acf

Aggregated Channel Feature object detection in C++ and OpenGL ES 2.0 based on https://github.com/pdollar/toolbox

acf android cmake computer-vision cplusplus cplusplus-11 dollar face-detection hog hunter ios matlab object-detection opencv opengl opengl-es piotr polly simd toolbox

Last synced: 10 Apr 2025

https://github.com/cambricon/cnstream

CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/SolutionSDK/CNStream

c-plus-plus cambricon cambricon-cnstream computer-vision graph-framework inference machine-learning mlu pipeline-framework

Last synced: 26 Dec 2025

https://github.com/ROCm/rpp

AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.

agumentation amd bitwise channel-extract computer-vision contrast cpu gpu hip histogram hpc mivisionx opencl openvx radeon-performance-primitives rocm rpp warp-affine

Last synced: 14 Mar 2025

https://github.com/bmw-innovationlab/bmw-classification-inference-gpu-cpu

This is a repository for an image classification inference API using the Gluoncv framework. The inference REST API works on CPU/GPU. It's supported on Windows and Linux Operating systems. Models trained using our Gluoncv Classification training repository can be deployed in this API. Several models can be loaded and used at the same time.

classification computer-vision deep-learning gluoncv inference inference-api

Last synced: 03 Oct 2025

https://github.com/donny-hikari/viola-jones

A face detection program in python using Viola-Jones algorithm.

adaboost boosting computer-vision emsembling face-detection haar-cascade machine-learning viola-jones

Last synced: 12 Apr 2025

https://github.com/JuliaImages/ImageFeatures.jl

Image feature detection for the Julia language

computer-vision feature-detection feature-extraction image-processing

Last synced: 16 Nov 2025

https://github.com/sudheerachary/Manga_Colorization

cGAN-based Manga Colorization Using a Single Training Image.

cgan comics computer-vision image-processing machine-learning manga

Last synced: 26 Sep 2025

https://github.com/vvvv/vl.opencv

A VL wrapper for OpenCVSharp

computer-vision opencv visual-programming vl vvvv

Last synced: 13 Apr 2025

https://github.com/charmve/semantic-segmentation-pytorch

PyTorch implementation for Semantic Segmentation, include FCN, U-Net, SegNet, GCN, PSPNet, Deeplabv3, Deeplabv3+, Mask R-CNN, DUC, GoogleNet, and more dataset

charmve computer-vision deep-learning fcn image-classification image-processing mask-rcnn pytorch pytorch-implementation segnet semantic-segmentation unet-pytorch

Last synced: 21 Mar 2025

https://github.com/jackie-chou/mc-cnn-python

a python implementation of MC-CNN

cnn computer-vision python27 stereo-matching tensorflow

Last synced: 09 Apr 2025

https://github.com/seeed-studio/sensecraft-ai

An application suite including an open-source inference server and web UI to deploy any YOLOv8 model to NVIDIA Jetson devices and visualize captured streams, with one line of code.

computer-vision deep-learning instance-segmentation jetpack jetson-orin machine-learning nvidia-jetson object-detection orin-nano orin-nx pytorch yolov5 yolov8

Last synced: 24 Oct 2025

https://github.com/gabrieltseng/solar-panel-segmentation

A U-Net for solar panel identification and segmentation

computer-vision deep-learning machine-learning remote-sensing solar-energy

Last synced: 18 Jan 2026

https://github.com/DevipriyaSarkar/OCR-Reader

An Android app to extract text from camera preview directly.

android camera computer-vision ocr ocr-android ocr-recognition

Last synced: 09 Apr 2025

https://github.com/aecins/symseg

SymSeg: Model-Free Object Segmentation in Point Clouds Using the Symmetry Constraint.

computer-vision geometry pointcloud robotics segmentation symmetry

Last synced: 13 Apr 2025

https://github.com/ibaigorordo/depthai-android-jni-example

Android example to get the rgb and disparity images from the OAK-D device connected to a phone.

android computer-vision cpp deep-learning depthai jni-android object-detection

Last synced: 28 Jun 2025

https://github.com/cuge1995/cvpr-2020-point-cloud-analysis

CVPR 2020 papers focusing on point cloud analysis

computer-vision cvpr cvpr2020 deep-learning point-cloud

Last synced: 26 Feb 2026

https://github.com/darrenjkt/SEE-MTDA

(RA-L 2022) See Eye to Eye: A Lidar-Agnostic 3D Detection Framework for Unsupervised Multi-Target Domain Adaptation.

3d-object-detection autonomous-driving computer-vision deep-learning object-detection surface-completion unsupervised-domain-adaptation

Last synced: 20 Mar 2025

https://github.com/vida-nyu/city-surfaces

CitySurfaces semantic segmentation of sidewalk surfaces

computer-vision material sidewalk sidewalk-surface urban-analytics urban-data-science

Last synced: 10 Apr 2025

https://github.com/juliaimages/imagefeatures.jl

Image feature detection for the Julia language

computer-vision feature-detection feature-extraction image-processing

Last synced: 25 Jun 2025

https://github.com/ibaigorordo/onnx-raft-optical-flow-estimation

Python scripts for performing optical flow estimation using the RAFT model in ONNX

computer-vision onnx onnxruntime optical-flow python

Last synced: 04 Aug 2025

https://github.com/AsRaNi1/live-cctv

To detect any reasonable change in a live cctv to avoid large storage of data. Once, we notice a change, our goal would be track that object or person causing it. We would be using Computer vision concepts. Our major focus will be on Deep Learning and will try to add as many features in the process.

cctv cctv-cameras computer-vision convolutional-neural-networks cosine-similarity darknet deep-learning euclidean-distances frame-change-detection image-classification machine-learning motion-detection neural-network object-detection python yolo yolov3

Last synced: 07 Apr 2025

https://github.com/davidstutz/graph-based-image-segmentation

Implementation of efficient graph-based image segmentation as proposed by Felzenswalb and Huttenlocher [1] that can be used to generate oversegmentations.

computer-vision image-processing image-segmentation opencv superpixel-algorithm superpixels

Last synced: 23 Apr 2025

https://github.com/mseitzer/srgan

Pytorch implementation of "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"

computer-vision deep-learning gan generative-adversarial-network pytorch super-resolution

Last synced: 05 Oct 2025

https://github.com/geoffsmith82/symposium2023

Demonstrates Voice Recognition, Text to Speech, Language Translation, OAuth2, Image Generation, Face Detection and Voice Chatbot. Source code and Documentation for my 2023 ADUG Symposium Talk.

ai artificial-intelligence claude-3-haiku claude-3-opus claude-3-sonnet computer-vision gpt gpt-4 gpt-4o image-to-text oauth2 palm palm2 speech-to-text text-to-image text-to-speech translation voice-recognition websockets

Last synced: 01 Feb 2026

https://github.com/pedrol2b/react-native-vision-camera-mlkit

A powerful React Native Vision Camera plugin delivering high-performance Google ML Kit frame processor featuresโ€”including text recognition (OCR), face detection, barcode scanning, pose detection, and more. Seamlessly bridges native ML Kit capabilities for real-time, on-device computer vision in your React Native apps.

android barcode barcode-scanner camera computer-vision face-detection frame-processor google-ml-kit image-processing ios machine-learning mlkit native-module ocr pose-detection react-native react-native-module text-detection vision vision-camera

Last synced: 01 May 2026

https://github.com/engcang/tensorrt_yolov9_ros

(ROS, C++) YOLOv9 detection using TensorRT, now supporting TensorRT 10

computer-vision cpp object-detection ros yolo yolov9

Last synced: 25 Jun 2025

https://github.com/jinhseo/OD-WSCL

[ECCV2022] Official Pytorch Implementation of Object Discovery via Contrastive Learning for Weakly Supervised Object Detection

computer-vision object-detection weakly-supervised-object-detection

Last synced: 08 May 2025

https://github.com/jnbraun/bcnn

A minimalist Deep Learning framework for embedded Computer Vision

arm c99 computer-vision convolutional-neural-networks deep-learning edge-ai embedded-vision gpu high-performance

Last synced: 10 Apr 2025

https://github.com/bwconrad/flexivit

PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes

computer-vision pytorch transformer vision-transformer

Last synced: 26 Oct 2025

https://github.com/bgu-cs-vil/deepmcbm

"A Deep Moving-camera Background Model" [Erez, Shapira Weber, and Freifeld, ECCV 2022]

background-estimation computer-vision deep-learning moving-camera pytorch

Last synced: 23 Apr 2025

https://github.com/goodarzmehr/simbev

[ITSC 2026] SimBEV is a configurable and scalable synthetic driving data generation tool and dataset based on the CARLA Simulator.

3d-object-detection autonomous-driving bev bev-perception bev-segmentation carla-simulator computer-vision data-generation dataset depth-estimation semantic semantic-occupancy-prediction semantic-segmentation

Last synced: 30 May 2026

https://github.com/zyrolasting/racket-vulkan

Racket integration with all things Vulkan :boom:

3d computer-graphics computer-vision gpu racket racket-ffi-bindings vulkan vulkan-api

Last synced: 17 Mar 2025

https://github.com/wilddima/vodopad

Manage your photographs with the power of deep learning! ๐Ÿ“ท

cli computer-vision deep-learning python

Last synced: 22 Mar 2025

https://github.com/sidgan/whats_in_a_question

CVPR'17 Spotlight: Whatโ€™s in a Question: Using Visual Questions as a Form of Supervision

computer-vision deep-learning deep-neural-networks vqa

Last synced: 26 Aug 2025

https://github.com/pierluigiferrari/lane_tracker

A Python program for lane line detection and tracking using a traditional computer vision approach

computer-vision lane-detection lane-lines lane-lines-detection lane-tracker

Last synced: 13 May 2025

https://github.com/dnth/timm-flutter-pytorch-lite-blogpost

PyTorch at the Edge: Deploying Over 964 TIMM Models on Android with TorchScript and Flutter.

android computer computer-vision dart edge edge-ai edge-computing flutter image-classification pytorch torchscript vision

Last synced: 14 Apr 2025

https://github.com/GYXie/visual-search

A toy project for visual search, based on deep learning.

alexnet computer-vision deep-learning tensorflow visual-search

Last synced: 05 Apr 2025

https://github.com/Mattdl/ContinualPrototypeEvolution

Codebase for Continual Prototype Evolution (CoPE) to attain perpetually representative prototypes for online and non-stationary datastreams. Includes implementation of the Pseudo-Prototypical Proxy (PPP) loss.

computer-vision continual-learning datastreaming deep-learning online-learning prototypes representation-learning

Last synced: 08 May 2025

https://github.com/kupynorest/instance_augmentation

[ECCV 2024] Official Repo for: Dataset Enhancement with Instance-Level Augmentations

augmentation computer-vision diffusion-models eccv2024 image-generation image-processing machine-learning pytorch

Last synced: 03 Aug 2025

https://github.com/mattdl/continualprototypeevolution

Codebase for Continual Prototype Evolution (CoPE) to attain perpetually representative prototypes for online and non-stationary datastreams. Includes implementation of the Pseudo-Prototypical Proxy (PPP) loss.

computer-vision continual-learning datastreaming deep-learning online-learning prototypes representation-learning

Last synced: 15 Jul 2025

https://github.com/methyldragon/opencv-motion-detector

Detects transient motion in a video feed. If said motion is large enough, and recent enough, reports that there is motion!

computer-vision motion-detection opencv-python

Last synced: 15 Jul 2025

https://github.com/ethz-asl/autolabel

A project for computing high-quality ground truth training examples for RGB-D data.

computer-vision labeling-tool machine-learning nerf rgb-d robotics

Last synced: 11 Apr 2025

https://github.com/cansik/mediapipe-osc

MediaPipe examples which stream their detections over OSC.

computer-vision hand-tracking mediapipe osc pose-estimation

Last synced: 30 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-Skeleton-based-Action-Recognition 104 Awesome-3D-Object-Detection 169 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-state-of-depth-completion 71 awesome-robotics-datasets 79 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97