An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/anishlearnstocode/image2sketch

Converts a 2D color image into a hand-drawn sketch using a novel technique.

computer-vision cv digital-image-processing dip image-processing

Last synced: 22 Jul 2025

https://github.com/rishit-dagli/cv-with-tf-demos

My talk about Computer Vision with TensorFlow

cnn cnn-classification cnn-keras computer-vision deep-learning tensorflow

Last synced: 07 May 2025

https://github.com/natmlx/meet-unity

MediaPipe selfie segmentation in Unity Engine.

computer-vision matting mediapipe meet natml selfie-segmentation unity

Last synced: 27 Jul 2025

https://github.com/bhattbhavesh91/pixellib-tutorial

Learn how to change backgrounds efficiently and easily in just 3-4 lines of code with Python's PixelLib library, which uses DeepLab's pre-trained deep learning model.

computer-vision deep-learning deeplab deeplabv3 instance-segmentation machine-learning maskr-cnn pixellib tensorflow video-segmentation

Last synced: 17 Apr 2025

https://github.com/ankitsharma-007/angular-computer-vision-azure-cognitive-services

An Optical Character Recognition (OCR) application using Angular and Azure Computer Vision Cognitive Services

angular ankit-sharma article azure cognitive-services computer-vision optical-character-recognition

Last synced: 30 Jul 2025

https://github.com/aleksandergrzyb/mostitch

Program that automatically stitches a set of retinal angiography eye images.

computer-vision cpp image-processing opencv stitched-images stitching

Last synced: 13 Apr 2025

https://github.com/nathancooperjones/super-mario-64-ds-wanted-autoplay

A computer vision model designed to autonomously play the "Wanted!" minigame from Super Mario 64 DS

computer-vision game mario nintendo-ds object-detection

Last synced: 11 Apr 2025

https://github.com/aayusharora/aftershoot

An AI to filter technically bad photos #AndroidDevChallenge

androiddevchallenge automl computer-vision photographers

Last synced: 22 Jun 2025

https://github.com/idinsight/mosaiks

Parallelized encoding of satellite imagery into easy-to-use features using the MOSAIKS algorithm.

computer-vision feature-engineering satellite-imagery

Last synced: 14 Jan 2026

https://github.com/prashant-mahajan/canny-edge-detector-from-scratch

Building a Canny Edge Detector from scratch using Python

canny-edge-detection computer-vision python3

Last synced: 28 Feb 2026

https://github.com/rbbrdckybk/minigpt-4

Simplified local Windows OS setup of MiniGPT-4 running in an Anaconda environment; includes example local server and client.

artificial-intelligence chatgpt computer-vision machine-learning minigpt4 vicuna

Last synced: 26 Jun 2025

https://github.com/geetu040/depthpro-beyond-depth

Depth Estimation model, DepthPro by Apple, trained for Image Segmentation and Image Super Resolution.

computer-vision depth-estimation image-segmentation super-resolution vision-transformer

Last synced: 03 Sep 2025

https://github.com/cepdnaclk/e15-4yp-sports-action-recognition

This repository presents a novel method for measuring the quality of actions performed in Olympic weightlifting using human action recognition in videos. Human action recognition is a well-studied problem in computer vision, whereas action quality assessment has received comparatively little research and experimentation because of the lack of datasets that can be used to assess the quality of actions. In this research, we introduce a method to assess player technique in weightlifting using skeleton-based human action recognition. Furthermore, we introduce a new video dataset for action recognition in weightlifting, annotated at frame level. We intended to develop a viable automated scoring system through action recognition that would benefit the sports industry.

action-quality-assessment computer-vision final-year-project human-action-recognition openpose vision

Last synced: 03 Mar 2026

https://github.com/wenbihan/octobos_ijcv2016

OCTOBOS, overcomplete transform, learning and application codes, Matlab implementation, IJCV2015 paper

clustering computer-vision denoising ijcv sparse-coding transform-learning unsupervised-learning

Last synced: 25 Feb 2025

https://github.com/sefakcmn00/gym_project-with-python

This project implements an AI-powered gym application using pose estimation. MediaPipe and Python are used to detect different poses with a webcam, and up-and-down body exercises are tracked using MediaPipe pose points.

computer-vision mediapipe-pose numpy opencv python

Last synced: 21 Sep 2025

https://github.com/chrockey/fastpointtransformer-votenet

A forked repo from Torch-Points3D for VoteNet with the Fast Point Transformer backbone.

3d-vision computer-vision cvpr2022 point-cloud transformer

Last synced: 10 Jul 2025

https://github.com/nrc-cnrc/nrc-gamma

Large labelled dataset of real-life gas meter images.

ai computer-vision creative-commons crowdsourcing data data-science datanalytics dataset iot machine-learning machinelearning opendata

Last synced: 05 Mar 2026

https://github.com/shreydan/scratchformers

Building various transformer model architectures and their modules from scratch.

computer-vision multimodal nlp pytorch transformers

Last synced: 12 Jul 2025

https://github.com/ruskakimov/raised-hand-detection

University Final Year Project (C++, OpenCV)

computer-vision face-detection motion-detection

Last synced: 14 Apr 2025

https://github.com/ashutosh1919/mdp-diffusion

Text-guided image editing by manipulating the diffusion path without any training.

computer-vision diffusion diffusion-models generative-model image-editing image-generation-model markov-chain pytorch

Last synced: 19 Mar 2025

https://github.com/ekinakyurek/real-time-face-filtering

The program renders glasses onto the user's face, which is captured via webcam.

computer-vision face-filtering

Last synced: 13 Apr 2025

https://github.com/davide710/scanner

Your pocket scanner: effortless A4 document capture and transformation into a clean PDF

computer-vision opencv python scanner

Last synced: 24 Jul 2025

https://github.com/rizwanmunawar/streamgrid

Multi-stream video inference with Ultralytics YOLO - Display multiple video streams in a grid layout with real-time object detection.

computer-vision deep-learning multi-stream-player multistreaming notebook object-detection object-tracking open-source opencv-python python-apps research-and-development streamgrid threading ultralytics ultralytics-yolo yolo11 yolov8

Last synced: 21 Jul 2025

https://github.com/goldsborough/googlecloudvisionocrexample

Using the Google Cloud Vision API for OCR in Swift

computer-vision google-cloud ml ocr ocr-swift

Last synced: 18 Aug 2025

https://github.com/smaranjitghose/applefoliarai

Using machine learning to diagnose foliar diseases in apple plants

apple-foliar-disease apple-leaf-disease computer-vision deep-learning

Last synced: 03 Jan 2026

https://github.com/cleanpegasus/emotion-recognition

Deep learning code to classify emotions from a given image or video.

computer-vision deep-learning python tensorflow

Last synced: 03 Aug 2025

https://github.com/abhifuturetech/cnn-imageproc-robotics

This project aims to develop and implement robust object recognition techniques for robotics applications using CNNs and advanced image processing methods.

computer-vision convolutional-neural-networks image-processing python robotics simulation tensorflow

Last synced: 09 Jul 2025

https://github.com/kyegomez/simpleunet

A simple implementation of U-Net, because all the implementations I've seen are way too complicated.

artificial-intelligence biomedical biomedical-image-processing computer-vision gpt4 image image-classification image-segmentation texttovide unet vllm

Last synced: 07 May 2025

https://github.com/kumar-shridhar/pytorch-super-resolution

Super Resolution of low resolution Images in PyTorch

cnn computer-vision highresolution pytorch superresolution text

Last synced: 12 Apr 2025

https://github.com/satyajitghana/tsai-deepvision-eva4.0-phase-2

This contains the solutions to the assignments given in The School of AI - Extensive Vision AI 4.0 EVA4 Phase2

aws computer-vision deep-vision edge eva4 lambda phase2 pytorch serverless torchvision tsai tsai-deepvision-eva4

Last synced: 05 May 2025

https://github.com/wtbates99/opencv2-posture-corrector

Toolbar application that leverages OpenCV's powerful computer vision capabilities and MediaPipe's pose detection to create an intelligent posture monitoring system. Running discreetly in your system tray, it provides real-time feedback and gentle reminders when your posture needs attention.

computer-vision ergonomics health-monitoring machine-learning mediapipe opencv pose-estimation posture-detection productivity pyqt real-time-processing system-tray-application visionprocessing workplace-health

Last synced: 13 Apr 2025

https://github.com/camara94/convolutional-neural-networks-tensorflow

In Course 2 of the deeplearning.ai TensorFlow Specialization, you will learn advanced techniques to improve the computer vision model you built in Course 1. You will explore how to work with real-world images in different shapes and sizes, visualize the journey of an image through convolutions to understand how a computer "sees" information, plot loss and accuracy, and explore strategies to prevent overfitting, including augmentation and dropout. Finally, Course 2 will introduce you to transfer learning and how learned features can be extracted from models.

computer-vision convolutional-neural-networks data-science deep-learning deep-neural-networks tensorflow

Last synced: 23 Jun 2025

https://github.com/nok/modanet-eval

Initial data evaluation of ModaNet by eBay.

computer-vision dataset segmentation

Last synced: 07 May 2025

https://github.com/josephpai/fashionai-attributes

Attributes Recognition of Apparel

computer-vision deep-learning fashionai tianchi

Last synced: 06 May 2025

https://github.com/hasibzunair/research-notes

Stuff I find useful, mostly on AI research and computer vision.

computer-vision deep-learning list notes software-engineering

Last synced: 06 Mar 2026

https://github.com/rishav-hub/deep-face-authenticator

This is a modern face authentication system that uses state-of-the-art algorithms to detect faces and generate face embeddings. The system exposes endpoints that can be integrated with any device, depending on the requirements.

acr azure blob-storage computer-vision facenet-trained-models fastapi machine-learning mongodb mtcnn-face-detection tensorflow

Last synced: 10 Apr 2025

https://github.com/ar-ray-code/rendertexture2ros2image

Bridge between RenderTexture (Unity) object and sensor_msgs/Image (ROS2).

bridge computer-vision rendertexture ros2 unity

Last synced: 29 Jul 2025

https://github.com/blzq/tf_rodrigues

TensorFlow implementation of Rodrigues rotation conversion from axis-angle (rotation vector) to rotation matrix which supports batch inputs

computer-graphics computer-vision tensorflow

Last synced: 20 Mar 2025

https://github.com/rishikesh-jadhav/adaptive-neural-network-based-control-of-autonomous-car-in-airsim

This repository highlights the integration of neural network-based control with PID and MPC approaches in the AirSim simulator to enhance steering inputs for autonomous vehicles. Employing imitation learning and a hybrid neural network architecture, the project aims to create a robust and unbiased model for improved autonomous vehicle control.

computer-vision deep-learning deployment imitation-learning keras modelpredictivecontrol pid-control pytorch

Last synced: 16 Jul 2025

https://github.com/webarkit/jsfeatnext

TypeScript version of jsfeat with TypeScript definitions.

computer-vision jsfeat jsfeatnext typescript webar webarkit

Last synced: 14 May 2025

https://github.com/lapix-ufsc/lapixdl

Python package with Deep Learning utilities for Computer Vision

computer-vision deep-learning evaluation-framework image-processing

Last synced: 09 Apr 2026

https://github.com/mxagar/open3d_guide

My personal guide to the great Python library Open3D.

3d computer-vision icp open3d point-cloud vision

Last synced: 03 Jul 2025

https://github.com/choaib-elmadi/computer-vision

Computer vision, artificial intelligence applications and projects.

ai arduino artificial-intelligence computer-vision cpp cprogramming python

Last synced: 05 Oct 2025

https://github.com/pacifiquem/opencv-extract-text-from-image

Using OpenCV and Tesseract to extract the reading from an image as text.

computer-vision opencv python

Last synced: 16 Oct 2025

https://github.com/rifkybujana/sam3.c

Efficient SAM3 (Segment Anything Model 3) inference from scratch in pure C: Metal GPU + multithreaded CPU, no Python dependencies

apple-silicon c computer-vision from-scratch ggml image-segmentation inference machine-learning metal pure-c sam3 segment-anything

Last synced: 04 Jun 2026

https://github.com/smsharma/lensing-neural-fields

Implicit neural representations for strong lensing source reconstruction. Code repository associated with https://arxiv.org/abs/2206.14820.

astrophysics computer-vision jax lensing lensing-reconstruction nerf strong-lensing

Last synced: 15 Apr 2026

https://github.com/junweiliang/forkingpaths

Dataset from the paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction"

3d-simulation computer-vision trajectory-prediction trajectory-prediction-benchmark video-understanding

Last synced: 27 Jan 2026

https://github.com/voletiv/gridcorpus-experiments

My experiments with lip reading using GRIDcorpus dataset

computer-vision deep-learning lip-reading

Last synced: 09 Oct 2025

https://github.com/xiaohu2015/cs231n-assignment

The assignment for cs231n course

computer-vision cs231n deeplearning

Last synced: 25 Jun 2025

https://github.com/stemann/Cameras.jl

A Julia interface for cameras

camera computer-vision julia machine-vision

Last synced: 26 Feb 2025

https://github.com/stemann/cameras.jl

A Julia interface for cameras

camera computer-vision julia machine-vision

Last synced: 11 Dec 2025

https://github.com/greatdrake/reparameterized-volume-sampling

Official code for Differentiable Rendering with Reparameterized Volume Sampling (AISTATS 2024)

3d-reconstruction computer-vision monte-carlo nerf neural-radiance-fields neural-rendering reparameterization

Last synced: 08 Sep 2025

https://github.com/lingyzhu0101/udu

[ECCV'24] Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement

computer-vision deep-learning low-light-vision video-enhancement

Last synced: 08 Jul 2025

https://github.com/ai-forever/aggme

Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)

aggregation-pipleline annotation-tool computer-vision crowdsourcing image-segmentation object-detection video-captioning

Last synced: 19 Apr 2025

https://github.com/photron-limited/pypuclib

Python package for the INFINICAM SDK

camera computer-vision machine-vision opencv python

Last synced: 25 Apr 2026

https://github.com/jimiolaniyan/cvm

Python implementation of the algorithms in the book Computer Vision: Models, Learning and Inference

algorithm computer-vision machine-learning

Last synced: 23 Apr 2025

https://github.com/lusinlu/image-entropy-gpu

Calculation of the entropy of a batch of images (whole image or patches)

computer-vision deep-learning entropy entropy-measures gpu pytorch

Last synced: 03 Sep 2025

https://github.com/andreaconti/torch_kitti

PyTorch utilities to handle the KITTI Vision Benchmark Suite

computer-vision deep-learning kitti-dataset machine-learning pytorch

Last synced: 12 May 2025

https://github.com/talfig/sign-language-recognition-v1

Sign language recognition system using CNN architecture, accurately translating images into defined sign language.

computer-vision resnet-18 sign-language-recognition

Last synced: 24 Jun 2025

https://github.com/rfonod/stabilo

Stabilo is a Python package for stabilizing video frames or object trajectories using advanced transformation techniques. It supports user-defined masks to exclude specific areas, making it ideal for dynamic scenes with moving objects. Unlike traditional stabilization, it stabilizes content relative to a chosen reference frame.

computer-vision feature-extraction feature-matching homography-estimation image-processing image-registration opencv ransac stabilization video-processing video-stabilization

Last synced: 07 Oct 2025

https://github.com/moatifbutt/r2s100k

We introduce R2S100K, a large-scale dataset and benchmark for training and evaluating road segmentation on challenging unstructured roadways.

autonomous-driving autonomous-vehicles carl computer-vision dataset deep-learning r2s100k road-segmentation scene-segmentation scene-understanding semantic-segmentation teacher-student-learning

Last synced: 27 Aug 2025

https://github.com/sarthak268/nesca-pytorch

PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.

action-anticipation attention computer-vision human-robot-collaboration human-robot-interaction pytorch robot-and-automation short-context video-understanding

Last synced: 11 Aug 2025

https://github.com/alirezasaharkhiz9/computer-vision

This is my undergraduate project at Ferdowsi University of Mashhad, focusing comprehensively on computer vision.

cnn computer-vision deep-learning image-classification image-generation image-manipulation image-segmentation keras object-detection python pytorch-lightning

Last synced: 10 Mar 2026

https://github.com/allenpandas/awesome-computer-career-opportunities

A repository for computer science career opportunities (an aggregated collection of job-search information for people in computer-related fields).

2024-campus ai artificial-intelligence campus-recruitment computer-engineering computer-networks computer-science computer-vision cybersecurity doctors-information intership job-search-website jobs-search jobsearch postdoctoral security software-engineering software-testing

Last synced: 04 Jan 2026

https://github.com/cuixing158/cocoapi

Simple, efficient and convenient!

api coco computer-vision deep-learning deep-learning-datasets

Last synced: 02 May 2025

https://github.com/michaeltroger/feature-matching-android

Augmented Reality Template Matching (Feature Matching) using OpenCV 4 for >= Android 5

android augmented-reality augmented-reality-applications computer-vision feature-matching opencv template-matching

Last synced: 17 Aug 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707
awesome-multimodal-ml 480
awesome-self-supervised-learning 463
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-industrial-anomaly-detection 1,102
awesome-hand-pose-estimation 497
awesome-image-classification 220
Awesome-Crowd-Counting 468
awesome-human-pose-estimation 89
awesome-autonomous-vehicles 296
Awesome-World-Model 725
Awesome-Federated-Learning 558
Awesome-FL 4,069
awesome-low-light-image-enhancement 219
Awesome-pytorch-list-CNVersion 692
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-Implicit-NeRF-Robotics 191
iOS_ML 39
awesome-tensorflow-lite 110
CV-pretrained-model 103
awesome-attention-mechanism-in-cv 195
Awesome-Image-Colorization 150
awesome-grounding 157
openstl 43
Awesome-Open-Vocabulary 162
awesome-ai-awesomeness 236
awesome-capsule-networks 67
awesome-autonomous-vehicle 181
awesome-6d-object 600
awesome-multi-task-learning 233
awesome-robotics-3d 111
awesome-photogrammetry 90
awesome-open-data-centric-ai 56
awesome-ai-data-guided-projects 56
Awesome-Skeleton-based-Action-Recognition 104
Awesome-3D-Object-Detection 169
awesome-optical-flow 110
awesome-holistic-3d 129
awesome-data-annotation 93
Awesome-Parameter-Efficient-Transfer-Learning 124
awesome-panoptic-segmentation 45
awesome-computer-vision-models 189
awesome-state-of-depth-completion 71
awesome-robotics-datasets 79
awesome-nerf-editing 537
arctic 34
awesome-image-alignment-and-stitching 108
Awesome-Distributed-Deep-Learning 44
Awesome-Monocular-3D-detection 97