An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/isaaccorley/segmenter-pytorch

PyTorch implementation of "Segmenter: Transformer for Semantic Segmentation" Strudel et al. (2021)

artificial-intelligence computer-vision deep-learning semantic-segmentation transformers vision-transformer

Last synced: 12 Apr 2025

https://github.com/lamm-mit/fieldcompleter

GAN/convolutional and Transformer models to predict missing mechanical information given limited known data in part of the domain, and further characterize the composite geometries from the recovered mechanical fields for 2D and 3D complex microstructures

analysis computer-vision convolutional-neural-networks deep-learning deep-neural-networks design engineering fracture materials materials-informatics non-destructive-testing science strain stress testing transformers

Last synced: 30 Apr 2025

https://github.com/bes-dev/haar_pytorch

Pytorch implementation of forward and inverse Haar Wavelets 2D

computer-vision deeplearning generative-adversarial-network pytorch

Last synced: 10 Apr 2025

https://github.com/clarifai/examples

Examples for Clarifai Python SDK and Integrations. Give the repo a star โญ

clarifai clarifai-python computer-vision examples generative-ai llm natural-language-processing pretrained-language-model pretrained-models python

Last synced: 13 Apr 2025

https://github.com/davanstrien/flyswot

Command Line Interface for running ๐Ÿค— Transformers Image Classification locally

cli command-line-tool computer-vision glam huggingface-transformers image-classification python

Last synced: 17 Mar 2025

https://github.com/junyanz/mirrormirror

An expression training App that helps users mimic their own expressions.

3d-face computer-vision face-swap

Last synced: 06 May 2025

https://github.com/hemangjoshi37a/personalgoalassistant

AI-driven Personal Goal Assistant: Reinforcement learning-powered software mimics user behavior, interacts with computer inputs, and autonomously achieves goals in finance, social networking, and productivity. Open-source, Python-based RL agent.

ai automation autonomous-agents computer-vision data-preprocessing deep-learning finance goal-setting machine-learning mimic natural-language-processing open-source personal-goals productivity python reinforcement-learning social-media user-data

Last synced: 12 Aug 2025

https://github.com/dnstanciu/pool-tracker-pro

Error-free 3D printing. Defect detection and melt pool monitoring in a metal 3D printer. Tracks area of the melt pool, intensity, and radius. The variations of the melt pool are used to detect build defects.

3d-printer 3d-printing additive-manufacturing computer-vision direct-metal-laser-sintering dmls error-free manufacturing melt melt-pool pool pool-tracker selective-laser-melting slm tracking-algorithm

Last synced: 09 Apr 2025

https://github.com/michel-leonard/ciede2000-color-matching

The ๐‚๐ˆ๐„๐ƒ๐„๐Ÿ๐ŸŽ๐ŸŽ๐ŸŽ color difference formula written in 40+ programming languages.

c color computer-vision dart education go image-processing java javascript kotlin ktm620enduro linux python ruby rust swift testing windows

Last synced: 18 Feb 2026

https://github.com/google-research/pangea

Panoramic Graph Environment Annotation toolkit, for collecting audio and text annotations in panoramic graph environments such as Matterport3D and StreetLearn.

annotation-tool computer-vision crowdsourcing nlp

Last synced: 11 Jul 2025

https://github.com/3dlg-hcvc/tricolo

[WACV 2024] TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval

3d computer-vision multimodal-learning natual-language-processing pytorch pytorch-lightning

Last synced: 05 Apr 2025

https://github.com/bbelk/handirokuremote

Control your Roku with hand gestures using Mediapipe and Python

accessibility computer-vision cv2 hand-detection mediapipe opencv python roku

Last synced: 13 Aug 2025

https://github.com/knorth55/chainer-light-head-rcnn

[This project has moved to ChainerCV] Chainer Implementation of Light Head RCNN

chainer chainercv computer-vision deep-learning machine-learning object-detection python

Last synced: 28 Oct 2025

https://github.com/czoido/stereo-camera-visualization

Render a point cloud in real time from a cheap stereo camera stream using OpenGL and OpenCV. The point cloud rendering tries to imitate the style from Radiohead's video House of Cards.

computer-vision opencv radiohead stereo-camera

Last synced: 15 Aug 2025

https://github.com/ibaigorordo/onnx-object-localization-network

Python scripts performing class agnostic object localization using the Object Localization Network model in ONNX.

class-agnostic-detection computer-vision object-detection object-localization onnx onnxruntime opencv python

Last synced: 27 Aug 2025

https://github.com/pottekkat/virtual-drums

A virtual drum set built using Open CV

computer-vision machine-learning opencv virtual-drums

Last synced: 20 Aug 2025

https://github.com/koldim2001/image_captioning

ะ“ะตะฝะตั€ะฐั†ะธั ะพะฟะธัะฐะฝะธะน ะบ ะธะทะพะฑั€ะฐะถะตะฝะธัะผ ั ะฟะพะผะพั‰ัŒัŽ ั€ะฐะทะปะธั‡ะฝั‹ั… ะฐั€ั…ะธั‚ะตะบั‚ัƒั€ ะฝะตะนั€ะพะฝะฝั‹ั… ัะตั‚ะตะน

attention-mechanism captioning-images computer-vision image-captioning lstm model-deployment model-inference nlp soft-attention streamlit website word-embeddings

Last synced: 23 Oct 2025

https://github.com/wkentaro/yolo-world-onnx

ONNX models of YOLO-World (an open-vocabulary object detection).

computer-vision deep-learning foundation-models object-detection

Last synced: 15 Apr 2025

https://github.com/teddyoweh/omark

Omarke is a facial search algorithm that implements a binary search algorithm on a facial recognition model which efficiently identifies faces that are absent in pictures based on a dataset.

algorithms algorithms-and-data-structures computer-vision computer-vision-algorithms face-detection face-recognition python

Last synced: 09 Apr 2025

https://github.com/emmanuelezenwere/magicmirror

Computer Vision and Deep Learning photo editing app to virtually try on different hairstyles from reference photos.

cnn-model computer-vision dlib image-processing opencv restful-api

Last synced: 28 Apr 2025

https://github.com/roboflow/cookbooks

Templates for computer vision projects, referenced in Roboflow blog posts.

computer-vision

Last synced: 05 May 2025

https://github.com/ylp1455/motionheatmapgenerator

Generate motion heatmaps from video sequences with Python. Ideal for analyzing motion patterns in images. Install via pip and start analyzing motion today!

computer-vision heatmap motion pypi-package

Last synced: 23 Jun 2025

https://github.com/adamspannbauer/pixel_art

Turn images into 'iconified' pixel art with Python and OpenCV

computer-vision opencv python

Last synced: 28 Oct 2025

https://github.com/ndrplz/computer_vision_utils

Everything that I code more than twice during my PhD will end up here.

bounding-boxes computer-vision image-processing python stitching

Last synced: 10 Jul 2025

https://github.com/sergioskar/convolutional-neural-network

Computer vision framework based on deep learning and GPU programming

computer-vision convolutional-neural-network deep-learning gpu-computing opencl

Last synced: 10 Jul 2025

https://github.com/guaishou74851/dccm

(Nature Communications Engineering 2024) Compressive Confocal Microscopy Imaging at the Single-Photon Level with Ultra-Low Sampling Ratios [PyTorch]

compressed-sensing compressive-sampling compressive-sensing computer-vision deep-neural-networks deep-unfolding deep-unrolling image-reconstruction

Last synced: 20 Mar 2025

https://github.com/sunsided/stroke-width-transform

Python implementation of and experiments with the Stroke Width Transformation and connected components filtering.

computer-vision image-processing ocr python stroke-width

Last synced: 18 Mar 2025

https://github.com/afathi/turicreate-notebooks

Demonstrating how to create custom Core ML models ๐Ÿ‘พ๐Ÿค–

computer-vision coreml ios machine-learning macos python watchos

Last synced: 04 Apr 2025

https://github.com/snu-vgilab/instaorder

Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes".

computer-vision depth-order occlusion-order panoptic-segmentation scene-understanding

Last synced: 14 Apr 2025

https://github.com/ilijamihajlovic/coreml-and-vision-with-a-pre-trained-deep-learning-ssd-model

This project shows how to use CoreML and Vision with a pre-trained deep learning SSD (Single Shot MultiBox Detector) model. There are many variations of SSD. The one weโ€™re going to use is MobileNetV2 as the backbone this model also has separable convolutions for the SSD layers, also known as SSDLite. This app can find the locations of several different types of objects in the image. The detections are described by bounding boxes, and for each bounding box, the model also predicts a class.

computer-vision coreml coreml-framework coreml-image-classifier coreml-model coreml-models coreml-vision image-classification image-recognition machine-learning object-detection swift vison

Last synced: 08 Oct 2025

https://github.com/maple-research-lab/CaCo

CaCo: Both Positive and Negative Samples are Directly Learnable via Cooperative-adversarial Contrastive Learning

caco computer-vision contrastive-learning representation-learning self-supervised-learning unsupervised-learning

Last synced: 01 May 2025

https://github.com/tpsatish95/indus-script-ocr

The Indus script optical grapheme recognition engine (from archaeological artifact images)

ancient-texts caffe computer-vision deep-learning digital-humanities epigraphy opencv optical-character-recognition pipeline scikit-image scikit-learn

Last synced: 14 Apr 2025

https://github.com/4211421036/facemind

application uses computer vision and machine learning to analyze mental health based on facial expressions. The app includes login system, and real-time mental health analysis through facial landmarks, using OpenCV, Mediapipe, and PyQt5. This Application Delegation Paper Competition for

ai competition computer-vision instagram mental-health paper python

Last synced: 07 Apr 2025

https://github.com/krisharul26/motorbike-helmet_detection-using-yolov3

This project related to road safety. We have to wear a helmet according to road safety rules when riding a motorcycle. Therefore, these rules are monitored by the police. Also, Due to the invention of high-quality cameras and the development of the state of art in AI, the surveillance system willing to use AI system to detect whether passengers are wearing helmets or not. In this category I have built a model, first, it can be able to detect the motorcycle, then it will find out who is riding the bike. Finally, it will determine whether the person is wearing a helmet or not. To that end, I trained the pre-trained model SSD-MobileNet-V1 to detect motorcycles and humans.

artificial-intelligence computer-vision image-processing imageclassification keras-tensorflow labelimg-tool machine-learning objectdetection tensorflow2 transfer-learning yolov3

Last synced: 12 May 2025

https://github.com/hfahrudin/facex

Python library for detecting faces and classifying emotions in images lightweight, efficient threading and object pooling for concurrent processing.

computer-vision deep-learning machine-learning opencv python raspberry-pi tensorflow

Last synced: 31 Jan 2026

https://github.com/retkowsky/video-search-azure

Video search using Azure Computer Vision 4 (Florence)

azure-ai azure-cognitive-services azure-computer-vision computer-vision

Last synced: 14 Sep 2025

https://github.com/voxel51/reconstruction-error-ratios

Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!

annotation clip-model computer-vision data-centric-ai machine-learning python umap

Last synced: 16 Mar 2026

https://github.com/harshit-saraswat/face-recognition-based-image-separator

This is a small fun project which uses face recognition techniques to separate images from a large dataset into images of different people according to faces.

ai automation beginner blog code computer-vision dl dlib dlib-face-recognition face-recognition medium ml project python

Last synced: 05 May 2025

https://github.com/geekquad/color-recognizer

An application that provides color names and HTML/RGB mappings as output.

color-recognition computer-vision cv2

Last synced: 23 Aug 2025

https://github.com/poodarchu/deep3dbox

Deep3DBox's MXNet implementation. (In Progress: %95)

3d-object-detection autonomous-driving computer-vision kitti object-detection

Last synced: 27 Jul 2025

https://github.com/living-with-machines/nnanno

nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset

computer-vision glam historic-data jupyter-notebook nbdev tutorial

Last synced: 26 Jul 2025

https://github.com/khuyentran1401/dog_classifier

A simple app to classify dogs using fastai and streamlit.

computer-vision deep-learning fastai streamlit streamlit-application streamlit-webapp

Last synced: 13 Aug 2025

https://github.com/lightly-ai/lightly-train

LightlyTrain is the first PyTorch framework to pretrain computer vision models on unlabeled data for industrial applications

computer-vision contrastive-learning deep-learning distillation embeddings machine-learning pretrained-models python pytorch self-supervised self-supervised-learning

Last synced: 18 Apr 2025

https://github.com/tortuvshin/intelligo-game

Augmented reality game development repository - Art & Technology Tohoku 2018 contest

computer-vision express mongodb nodejs threejs

Last synced: 22 Apr 2025

https://github.com/fatchur/simple-tensor

A simplification of Tensorflow Tensor Operations

computer-vision deep-learning inceptionv4 lstm object-detection tensorflow yolov3

Last synced: 06 May 2025

https://github.com/qxresearch/all-opencv-projects

List of all OpenCV projects using #python to further increase the computer vision community.

computer-vision hacktoberfest hacktoberfest-accepted hacktoberfest2020 opencv python

Last synced: 25 Jul 2025

https://github.com/hysts/pytorch_cutout

A PyTorch implementation of Cutout

cifar10 computer-vision pytorch

Last synced: 09 Oct 2025

https://github.com/alvinwan/finger-detection-lite

Real-time Index Finger Detection for the โ˜๏ธ Gesture Proof-of-Concept

cnn-classification computer-vision finger-detection hand-detection object-detection pytorch

Last synced: 08 Aug 2025

https://github.com/nasa/upsp-processing

Software for processing high-speed video recordings from Unsteady Pressure-Sensitive Paint (UPSP) measurement systems. https://nasa.github.io/upsp-processing

aeroacoustics computer-graphics computer-vision hpc linux macos mpi nasa opencv opencv-python openmp parallel-computing photron python scientific-computing scientific-computing-with-python signal-processing turbulence video-processing

Last synced: 31 Aug 2025

https://github.com/Charrrrrlie/Mask-as-Supervision

[ECCV2024] Official Implementation of Our Paper "Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation"

3d-pose-estimation computer-vision human-pose-estimation pose-estimation unsupervised-learning

Last synced: 24 Jul 2025

https://github.com/zmyzheng/3d-semantic-segmentation-for-scene-parsing

A new approach for the real time 3D semantic segmentation based on feature abstract and deep learning method

3d computer-vision keras neural-networks semantic-segmentation

Last synced: 22 Jul 2025

https://github.com/fedebotu/vision-cartpole-dqn

Implementation of the CartPole from OpenAI's Gym using only visual input for Reinforcement Learning control with DQN

cartpole computer-vision dqn inverted-pendulum model-free model-free-control model-free-rl openai-gym reinforcement-learning vision-cartpole

Last synced: 10 Sep 2025

https://github.com/sa-y-an/omrnet

Neural Networks to evaluate OMR Sheets

computer-vision neural-network quantization scikit-learn tensorflow

Last synced: 15 Oct 2025

https://github.com/zhmiao/iterative-human-and-automated-identification-of-wildlife-images

Pytorch implementation for "Iterative Human and Automated Identification of Wildlife Images" (Nature -Machine Intelligence, 2021)

active-learning computer-vision conservation deep-learning ecology humans-in-the-loop lifelong-learning wildlife wildlife-conservation

Last synced: 03 Mar 2026

https://github.com/wazzabeee/image-video-colorization

Colorize your black and white images and YouTube videos for free. Streamlit application based on CNN deployed on Hugging Face.

automatic-colorization cnn colorization colorize-image computer-vision deep-learning huggingface-spaces image-colorization papers streamlit video-colorization youtube-colorization

Last synced: 28 Oct 2025

https://github.com/charlesyuan02/emotion-music-player

A music player that recommends songs based on your detected mood. Created for TOHacks 2021.

computer-vision deep-learning django emotion-recognition html-css-javascript music-player

Last synced: 25 Oct 2025

https://github.com/vidursatija/photowct

Style transfer using WCT transforms

computer-vision deep-learning python3 style-transfer torch

Last synced: 13 Oct 2025

https://github.com/feat7/thermal-screening

Thermal screening using computer vision techniques.

computer-vision screening thermal-camera

Last synced: 13 Feb 2026

https://github.com/rostyq/pico-detect

Rust library for Pixel Intensity Comparison-based Object (PICO) Detection.

computer-vision detection

Last synced: 16 May 2025

https://github.com/AlexanderMelde/Verkehrszeichenerkennung

Traffic Sign Detection ๐Ÿ‡ฉ๐Ÿ‡ช Praktische Implementierung einer Verkehrszeichenerkennung mit OpenCV und Tensorflow im Rahmen einer Studienarbeit.

autonomous-driving cnn computer-vision dataset gtsrb machine-learning opencv paper tensorflow traffic-sign-classification

Last synced: 06 May 2025

https://github.com/vision-cair/3dcompat

Official repository for the 3DCoMPaT dataset (ECCV2022 Oral)

compositional-data computer-vision dataset deep-learning

Last synced: 12 Apr 2025

https://github.com/alexandrevl/pyscreen

PyScreen is an AI-powered tool that extracts, analyzes, and visualizes data from screen recordings. It leverages advanced computer vision, text processing, and AI techniques to provide valuable insights such as dominant color schemes, frequently used words, and content descriptions using OpenAI's GPT-4 model.

artificial-intelligence color-analysis computer-vision gpt-4 ocr open-source openai opencv python screen-recording screenshot text-processing wordcloud

Last synced: 17 Mar 2025

https://github.com/andresberejnoi/opencv_traffic_counter

A car-counting system using background subtraction on a video feed. It makes use of OpenCV API.

background-subtraction car-counting computer computer-vision image-processing opencv python3 traffic-counter video

Last synced: 14 Jul 2025

https://github.com/pt-perkasa-pilar-utama/ppu-ocv

A type-safe, modular, chainable image processing library built on top of OpenCV.js with a fluent API leveraging pipeline processing.

bun canvas chainable computer-vision contours edge-detection image-processing image-processor modular open-cv opencv-js perspective-transformation pipeline threshold type-safe

Last synced: 04 Mar 2026

https://github.com/osamhack2020/web_guardian_guardian

GUARDIAN - CCTV๊ฐ์ง€์ฒด๊ณ„

cctv computer-vision golang opencv react web-application yolov4

Last synced: 28 Oct 2025

https://github.com/kemfic/simple-slam

wip - a simple slam implementation so i can learn stuff

computer-vision multiple-view-geometry opencv opencv3 python3 robotics slam visual-odometry

Last synced: 08 Apr 2026

https://github.com/adendek/DeepLaetitia

Deep Reinforcement Learning that makes you smile

computer-vision deep-neural-networks mathematica reinforcement-learning

Last synced: 08 Jul 2025

https://github.com/mxbonn/ltmp

Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transformers to any desired size with minimal loss of accuracy.

computer-vision deep-learning efficiency pruning transformer vision-transformer

Last synced: 07 May 2025

https://github.com/sayakpaul/parkinson-s-disease-classifier

Deep learning experiments to design a model to predict Parkinson's diseases with the images of Spiral/Wave test

computer-vision deep-learning fastai resnet-34

Last synced: 07 May 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-Skeleton-based-Action-Recognition 104 Awesome-3D-Object-Detection 169 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97