Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2026-07-04 00:06:12 UTC
- JSON Representation
https://github.com/natmlx/meet-unity
MediaPipe selfie segmentation in Unity Engine.
computer-vision matting mediapipe meet natml selfie-segmentation unity
Last synced: 27 Jul 2025
https://github.com/bobmcdear/efficientnet-pytorch
PyTorch implementation of EfficientNet
computer-vision deep-learning efficientnet machine-learning pytorch
Last synced: 19 Sep 2025
https://github.com/ivandonadello/semantic-pascal-part
This is a curated semantic version of the PASCAL-Part dataset for part-based object detection. Objects are aligned with WordNet and Yago concepts. The dataset is both in PASCAL-Voc and RDF format.
computer-vision dataset knowledge-graph object-detection ontologies ontology owl owl-ontology part-whole partof-ontology pascal-part pascal-voc rdf rdf-format rdflib semantic-web wordnet yago
Last synced: 22 Aug 2025
https://github.com/saptakbhoumik/tinyvision
TinyVision is an evolving project focused on designing ultra-lightweight image classification models with minimal parameter counts. The goal is to explore what’s actually necessary for fundamental vision tasks by combining handcrafted feature preprocessing with highly efficient CNN architectures.
computer-vision machine-learning python python3 pytorch vision
Last synced: 16 Aug 2025
https://github.com/eric-canas/drums-app
Play drums in your browser with your webcam
browser-game computer-vision deep-learning keras music-generation neural-network tensorflow-js
Last synced: 09 Apr 2025
https://github.com/ankitsharma-007/angular-computer-vision-azure-cognitive-services
An Optical Character Recognition (OCR) using Angular and Azure Computer Vision Cognitive Services
angular ankit-sharma article azure cognitive-services computer-vision optical-character-recognition
Last synced: 30 Jul 2025
https://github.com/srohit0/food_mnist
This dataset has 10 food categories, with 5,000 images. For each class, 125 manually reviewed test images are provided as well as 375 training images. All images were rescaled to have a maximum side length of 512 pixels.
cnn cnn-classification computer-vision convolutional-neural-networks dataset mnist mnist-classification mnist-dataset mnist-image-dataset
Last synced: 15 Apr 2025
https://github.com/Eric-Canas/Drums-app
Play drums in your browser with your webcam
browser-game computer-vision deep-learning keras music-generation neural-network tensorflow-js
Last synced: 10 Mar 2025
https://github.com/koldim2001/jupiterlabcv
Готовый сервис JupiterLab для разработок в сфере компьютерного зрения
computer-vision docker docker-compose gpu-computing jupyter-lab jupyter-notebook jupyterlab
Last synced: 03 Sep 2025
https://github.com/IBM/vision-terraform
IBM Maximo Visual Inspection sample terraform templates for deployment in IBM Cloud (formerly IBM PowerAI Vision and IBM Visual Insights)
computer-vision deep-learning ibm-cloud terraform-template vpc
Last synced: 13 May 2025
https://github.com/urholaukkarinen/template-matching
GPU-accelerated template matching for Rust
computer-vision gpu image-recognition rust template-matching
Last synced: 18 Mar 2025
https://github.com/darienmt/carnd-lanelines-p1
Udacity Self Driving Car Nanodegree - Finding Lane Lines in a Video Stream
computer-vision hough-transformation image-processing jupyter-notebook opencv self-driving-car udacity udacity-self-driving-car
Last synced: 15 Aug 2025
https://github.com/neurodata/value-of-ood-data
The value of out-of-distribution data (ICML 2023)
computer-vision machine-learning ood-generalization statistical-learning
Last synced: 25 Apr 2025
https://github.com/ndrplz/cifar-10
Python plug-and-play wrapper to CIFAR-10 dataset.
cifar cifar-10 cifar10 computer-vision convolutional-neural-networks dataset deep-learning machine-learning python36 wrapper
Last synced: 10 Jul 2025
https://github.com/etun/canny-edges-hough-circles
android canny-edge-detection computer-vision java opencv
Last synced: 12 May 2025
https://github.com/rbbrdckybk/minigpt-4
Simplified local Windows OS setup of MiniGPT-4 running in an Anaconda environment; includes example local server and client.
artificial-intelligence chatgpt computer-vision machine-learning minigpt4 vicuna
Last synced: 26 Jun 2025
https://github.com/smaranjitghose/applefoliarai
Using machine learning to diagnose foliar diseases in apple plants
apple-foliar-disease apple-leaf-disease computer-vision deep-learning
Last synced: 03 Jan 2026
https://github.com/sarthak268/nesca-pytorch
PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.
action-anticipation attention computer-vision human-robot-collaboration human-robot-interaction pytorch robot-and-automation short-context video-understanding
Last synced: 11 Aug 2025
https://github.com/abraia/abraia-multiple
Abraia computer vision SDK for edge applications
abraia-api computer-vision convert-images hyperspectral-image hyperspectral-image-analysis image-analysis
Last synced: 08 Mar 2026
https://github.com/lingyzhu0101/udu
[ECCV'24] Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement
computer-vision deep-learning low-light-vision video-enhancement
Last synced: 08 Jul 2025
https://github.com/kumar-shridhar/pytorch-super-resolution
Super Resolution of low resolution Images in PyTorch
cnn computer-vision highresolution pytorch superresolution text
Last synced: 12 Apr 2025
https://github.com/andreaconti/torch_kitti
PyTorch utilities to handle the KITTI Vision Benchmark Suite
computer-vision deep-learning kitti-dataset machine-learning pytorch
Last synced: 12 May 2025
https://github.com/greatdrake/reparameterized-volume-sampling
Official code for Differentiable Rendering with Reparameterized Volume Sampling (AISTATS 2024)
3d-reconstruction computer-vision monte-carlo nerf neural-radiance-fields neural-rendering reparameterization
Last synced: 08 Sep 2025
https://github.com/moatifbutt/r2s100k
we introduce R2S100K---a large-scale dataset and benchmark for training and evaluation of road segmentation in challenging unstructured roadways.
autonomous-driving autonomous-vehicles carl computer-vision dataset deep-learning r2s100k road-segmentation scene-segmentation scene-understanding semantic-segmentation teacher-student-learning
Last synced: 27 Aug 2025
https://github.com/wtbates99/opencv2-posture-corrector
Toolbar application that leverages OpenCV's powerful computer vision capabilities and MediaPipe's pose detection to create an intelligent posture monitoring system. Running discreetly in your system tray, it provides real-time feedback and gentle reminders when your posture needs attention.
computer-vision ergonomics health-monitoring machine-learning mediapipe opencv pose-estimation posture-detection productivity pyqt real-time-processing system-tray-application visionprocessing workplace-health
Last synced: 13 Apr 2025
https://github.com/geetu040/depthpro-beyond-depth
Depth Estimation model, DepthPro by Apple, trained for Image Segmentation and Image Super Resolution.
computer-vision depth-estimation image-segmentation super-resolution vision-transformer
Last synced: 03 Sep 2025
https://github.com/ekinakyurek/real-time-face-filtering
The program renders a glass on to user's face which is captured via webcam
computer-vision face-filtering
Last synced: 13 Apr 2025
https://github.com/ar-ray-code/rendertexture2ros2image
Bridge between RenderTexture (Unity) object and sensor_msgs/Image (ROS2).
bridge computer-vision rendertexture ros2 unity
Last synced: 29 Jul 2025
https://github.com/lucasvandroux/pytorch-rocket-esrgan
PyTorch Rocket ESRGAN - Tutorial 1: The Magic Mirror
beginner beginner-friendly computer-vision deep-learning diy-tool easy-to-use gan pytorch pytorch-tutorial super-resolution
Last synced: 13 May 2025
https://github.com/webarkit/jsfeatnext
Typescript version of jsfeat with typescript definitions.
computer-vision jsfeat jsfeatnext typescript webar webarkit
Last synced: 14 May 2025
https://github.com/jacobmarks/fiftyone_florence2_plugin
Run SOTA Vision-Language Model Florence-2 on your data!
computer-vision datacentric fiftyone-datasets florence-2 ml transformer vision-language-model
Last synced: 31 Oct 2025
https://github.com/shreydan/scratchformers
building various transformer model architectures and its modules from scratch.
computer-vision multimodal nlp pytorch transformers
Last synced: 12 Jul 2025
https://github.com/sefakcmn00/gym_project-with-python
In this project, its own AI powered gym implementation was implemented using AI Powered Pose estimation. MediaPipe and Python were used to detect different posts with a webcam. Body up and down gym practice was performed using Mediapipe pose points.
computer-vision mediapipe-pose numpy opencv python
Last synced: 21 Sep 2025
https://github.com/alvinwan/emotion-based-dog-filter
Real-time Emotion-Based, Snapchat-esque Dog Filter using Computer Vision
adversarial-examples computer-vision dog-filter face-detection face-recognition machine-learning opencv python3 pytorch
Last synced: 30 Jul 2025
https://kumuji.github.io/ugains/
[GCPR 2023] UGainS: Uncertainty Guided Anomaly Instance Segmentation
anomaly-detection anomaly-segmentation computer-vision deep-learning deep-neural-networks instance-segmentation pytorch segment-anything segment-anything-model
Last synced: 24 Jul 2025
https://github.com/ruskakimov/raised-hand-detection
University Final Year Project (C++, OpenCV)
computer-vision face-detection motion-detection
Last synced: 14 Apr 2025
https://github.com/ai-forever/aggme
Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)
aggregation-pipleline annotation-tool computer-vision crowdsourcing image-segmentation object-detection video-captioning
Last synced: 19 Apr 2025
https://github.com/jammer345/SL-CycleGAN-Blind-Motion-Deblurring-in-Cycles-using-Sparse-Learning
SL-CycleGAN: Blind Motion Deblurring in Cycles using Sparse Learning
computer-vision deblurring generative-adversarial-network image-processing sparse state-of-the-art
Last synced: 05 Apr 2025
https://github.com/allenpandas/awesome-computer-career-opportunities
👨💻 A repository for computer science career opportunities.(面向计算机相关专业人员的求职信息聚合仓库)
2024-campus ai artificial-intelligence campus-recruitment computer-engineering computer-networks computer-science computer-vision cybersecurity doctors-information intership job-search-website jobs-search jobsearch postdoctoral security software-engineering software-testing
Last synced: 04 Jan 2026
https://github.com/cleanpegasus/emotion-recognition
A deep learning code to classify Emotions from a given image or video.
computer-vision deep-learning python tensorflow
Last synced: 03 Aug 2025
https://github.com/wenbihan/octobos_ijcv2016
OCTOBOS, overcomplete transform, learning and application codes, Matlab implementation, IJCV2015 paper
clustering computer-vision denoising ijcv sparse-coding transform-learning unsupervised-learning
Last synced: 25 Feb 2025
https://github.com/davide710/scanner
Your pocket scanner: effortless A4 document capture and transformation into a clean PDF
computer-vision opencv python scanner
Last synced: 24 Jul 2025
https://github.com/lusinlu/image-entropy-gpu
Calculation of the entropy of the batch of images (whole image or patches)
computer-vision deep-learning entropy entropy-measures gpu pytorch
Last synced: 03 Sep 2025
https://github.com/camara94/convolutional-neural-networks-tensorflow
In Course 2 of the deeplearning.ai TensorFlow Specialization, you will learn advanced techniques to improve the computer vision model you built in Course 1. You will explore how to work with real-world images in different shapes and sizes, visualize the journey of an image through convolutions to understand how a computer “sees” information, plot loss and accuracy, and explore strategies to prevent overfitting, including augmentation and dropout. Finally, Course 2 will introduce you to transfer learning and how learned features can be extracted from models.
computer-vision convolutional-neural-networks data-science deep-learning deep-neural-networks tensorflow
Last synced: 23 Jun 2025
https://github.com/pacifiquem/opencv-extract-text-from-image
Using OpenCV and Tesseract to extract the reading from image as a text.
Last synced: 16 Oct 2025
https://github.com/gtesei/deepexperiments
TensorFlow/Keras experiments on computer vision and natural language processing
alexnet autoencoders computer-vision convolution convolutional-neural-networks deep-learning dropout generative-adversarial-network keras keras-neural-networks mnist natural-language-processing neural-networks regularization tensorflow tensorflow-experiments tensorflow-tutorials word2vec word2vec-model
Last synced: 24 Oct 2025
https://github.com/rifkybujana/sam3.c
Efficient SAM3 (Segment Anything Model 3) inference from scratch in pure C — Metal GPU + multithreaded CPU, no Python dependencies
apple-silicon c computer-vision from-scratch ggml image-segmentation inference machine-learning metal pure-c sam3 segment-anything
Last synced: 04 Jun 2026
https://github.com/cuge1995/cvpr-2021-point-cloud-analysis
CVPR 2021 papers focusing on point cloud analysis
3d-point-clouds classification computer-vision cvpr2021 deep-learning detection few-shot paper point-cloud point-cloud-analysis segmentation
Last synced: 10 Feb 2026
https://github.com/chrockey/fastpointtransformer-votenet
A forked repo from Torch-Points3D for VoteNet with the Fast Point Transformer backbone.
3d-vision computer-vision cvpr2022 point-cloud transformer
Last synced: 10 Jul 2025
https://github.com/hasibzunair/research-notes
Stuff I find useful, mostly on AI research and computer vision.
computer-vision deep-learning list notes software-engineering
Last synced: 06 Mar 2026
https://github.com/rishav-hub/deep-face-authenticator
This is a modern Face Authentication System which includes state-of-art algorithms to detect face and generate face embedding. This system contains endpoints which can be integrated to any device depending on the requirements.
acr azure blob-storage computer-vision facenet-trained-models fastapi machine-learning mongodb mtcnn-face-detection tensorflow
Last synced: 10 Apr 2025
https://github.com/smaranjitghose/movenet
Fastest Pose Estimation on any device
artificial-intelligence computer-vision deeplearning machine-learning movenet pose-estimator tensorflow
Last synced: 13 Apr 2025
https://github.com/talfig/sign-language-recognition-v1
Sign language recognition system using CNN architecture, accurately translating images into defined sign language.
computer-vision resnet-18 sign-language-recognition
Last synced: 24 Jun 2025
https://github.com/ashutosh1919/mdp-diffusion
Text-guided image editing by manipulating diffusion path without any training.
computer-vision diffusion diffusion-models generative-model image-editing image-generation-model markov-chain pytorch
Last synced: 19 Mar 2025
https://github.com/abhifuturetech/cnn-imageproc-robotics
This project aims to develop and implement robust object recognition techniques for robotics applications using CNNs and advanced image processing methods.
computer-vision convolutional-neural-networks image-processing python robotics simulation tensorflow
Last synced: 09 Jul 2025
https://github.com/johnt84/BlazorServerImageRecognitionApp
Simple Image Recognition Blazor Server app
azure-computer-vision blazor blazor-server computer-vision csharp image-recognition net8 unittest
Last synced: 13 May 2025
https://github.com/satyajitghana/tsai-deepvision-eva4.0-phase-2
This contains the solutions to the assignments given in The School of AI - Extensive Vision AI 4.0 EVA4 Phase2
aws computer-vision deep-vision edge eva4 lambda phase2 pytorch serverless torchvision tsai tsai-deepvision-eva4
Last synced: 05 May 2025
https://github.com/imsanjoykb/computer-vision-bootcamp
Computer Vision Bootcamp
artificial-intelligence cnn-classification computer-vision deep-learning opencv opencv-dnn opencv-project opencv-python
Last synced: 30 Oct 2025
https://github.com/blzq/tf_rodrigues
TensorFlow implementation of Rodrigues rotation conversion from axis-angle (rotation vector) to rotation matrix which supports batch inputs
computer-graphics computer-vision tensorflow
Last synced: 20 Mar 2025
https://github.com/abuzarmushtaq/tensorflow.js-development-setup
Tensorflow.js + Webpack + Babel + ESLint Development Environment. 🎉😎🚀
ai artificial-intelligence bable computer-vision deep-learning eslint javascript machine-learning tensorflow tensorflow-examples tensorflow-experiments tensorflow-js tensorflowjs webpack
Last synced: 17 Aug 2025
https://github.com/nok/modanet-eval
Initial data evaluation of ModaNet by eBay.
computer-vision dataset segmentation
Last synced: 07 May 2025
https://github.com/rishikesh-jadhav/adaptive-neural-network-based-control-of-autonomous-car-in-airsim
This repository highlights the integration of neural network-based control with PID and MPC approaches in the AirSim simulator to enhance steering inputs for autonomous vehicles. Employing imitation learning and a hybrid neural network architecture, the project aims to create a robust and unbiased model for improved autonomous vehicle control.
computer-vision deep-learning deployment imitation-learning keras modelpredictivecontrol pid-control pytorch
Last synced: 16 Jul 2025
https://github.com/rizwanmunawar/streamgrid
Multi-stream video inference with Ultralytics YOLO - Display multiple video streams in a grid layout with real-time object detection.
computer-vision deep-learning multi-stream-player multistreaming notebook object-detection object-tracking open-source opencv-python python-apps research-and-development streamgrid threading ultralytics ultralytics-yolo yolo11 yolov8
Last synced: 21 Jul 2025
https://github.com/josephpai/fashionai-attributes
Attributes Recognition of Apparel
computer-vision deep-learning fashionai tianchi
Last synced: 06 May 2025
https://github.com/coding-ai/raspberrypi_handwritten_recognition
Virtual Pen + Recognition of handwritten digits
artificial-intelligence computer-vision handwriting-recognition handwritten-character-recognition handwritten-digit-recognition image-processing machine-learning machine-learning-algorithms mnist mnist-handwriting-recognition model python python37 raspberry-pi raspberry-pi-4 raspberry-pi-camera
Last synced: 28 Oct 2025
https://github.com/matthew-lyu/fusion-model-solution-for-fgvc8-workshop-at-cvpr-2021
FGVC8 Workshop at CVPR 2021
computer-vision deep-learning image-classification pytorch resnet
Last synced: 19 Jun 2025
https://github.com/cpbridge/canopy
The Header-Only Library For Random Forests
circular-regression circular-statistics classification computer-vision machine-learning random-forest template-library
Last synced: 29 Oct 2025
https://github.com/kyegomez/simpleunet
An simple implementation of Unet because all the implementations i've seen are wayy tooo complicated.
artificial-intelligence biomedical biomedical-image-processing computer-vision gpt4 image image-classification image-segmentation texttovide unet vllm
Last synced: 07 May 2025
https://github.com/lapix-ufsc/lapixdl
Python package with Deep Learning utilities for Computer Vision
computer-vision deep-learning evaluation-framework image-processing
Last synced: 09 Apr 2026
https://github.com/isarandi/rlemasklib
Manipulate run-length encoded image masks
compression-algorithm computer-vision cython image-mask image-masking image-processing image-segmentation mask python run-length run-length-encoding runlengthencoding segmentation
Last synced: 14 Apr 2025
https://github.com/cepdnaclk/e15-4yp-sports-action-recognition
This includes a novel method to measure the quality of the actions performed in Olympic weightlifting using human action recognition in videos. Human action recognition is a well-studied problem in computer vision and on the other hand action quality assessment is researched and experimented comparatively low. This is due to the lack of datasets that can be used to assess the quality of actions. In this research, we introduce a method to assess player techniques in weightlifting by using skeleton-based human action recognition. Furthermore, we introduce a new video dataset for action recognition in weightlifting which is annotated to frame level. We intended to develop a viable automated scoring system through action recognition that would be beneficial in the sports industry.
action-quality-assessment computer-vision final-year-project human-action-recognition openpose vision
Last synced: 03 Mar 2026
https://github.com/alirezasaharkhiz9/computer-vision
This is my undergraduate project at Ferdowsi University of Mashhad, focusing comprehensively on computer vision.
cnn computer-vision deep-learning image-classification image-generation image-manipulation image-segmentation keras object-detection python pytorch-lightning
Last synced: 10 Mar 2026
https://github.com/hairymax/yandex.practicum.datascience
My projects from the Yandex Practicum Data Science course.
catboost computer-vision data-analysis data-science keras lightgbm matplotlib nltk numpy pandas python scikit-learn seaborn tensorflow xgboost
Last synced: 25 Sep 2025
https://github.com/junweiliang/forkingpaths
Dataset from the paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction"
3d-simulation computer-vision trajectory-prediction trajectory-prediction-benchmark video-understanding
Last synced: 27 Jan 2026
https://github.com/stemann/cameras.jl
A Julia interface for cameras
camera computer-vision julia machine-vision
Last synced: 11 Dec 2025
https://github.com/goldsborough/googlecloudvisionocrexample
Using the Google Cloud Vision API for OCR in Swift
computer-vision google-cloud ml ocr ocr-swift
Last synced: 18 Aug 2025
https://github.com/amazon-science/regression-constraint-model-upgrade
Regression Constraint Model Upgrade
Last synced: 22 Jul 2025
https://github.com/stemann/Cameras.jl
A Julia interface for cameras
camera computer-vision julia machine-vision
Last synced: 26 Feb 2025
https://github.com/xiaohu2015/cs231n-assignment
The assignment for cs231n course
computer-vision cs231n deeplearning
Last synced: 25 Jun 2025
https://github.com/cedrickchee/pytorch-mobile-kit
PyTorch Mobile starter kit.
android-app computer-vision edge-machine-learning ios-app mobilenet-v2 pytorch-mobile react-native resnext sdk starter-kit torchscript
Last synced: 11 Sep 2025
https://github.com/keyurparalkar/semantic-segmentation-with-unets
Semantic segmentation via UNETs
computer-vision deep-learning fastai pascal-voc-dataset pytorch segmentation-task
Last synced: 23 Jul 2025
https://github.com/lucasvandroux/pytorch-rocket-yolov3-retinanet50-retinanet101
PyTorch Rocket Yolov3 RetinaNet SSD - Tutorial 2: A Tale of 3 Rockets
beginner beginner-friendly computer-vision darknet53 deep-learning diy-tool easy-to-use object-detection pytorch pytorch-tutorial retinanet yolov3
Last synced: 21 Aug 2025
https://github.com/choaib-elmadi/computer-vision
Computer vision, artificial intelligence applications and projects.
ai arduino artificial-intelligence computer-vision cpp cprogramming python
Last synced: 05 Oct 2025
https://github.com/mxagar/open3d_guide
My personal guide to the great Python library Open3D.
3d computer-vision icp open3d point-cloud vision
Last synced: 03 Jul 2025
https://github.com/prashant-mahajan/canny-edge-detector-from-scratch
Building a Canny Edge Detector from scratch using Python
canny-edge-detection computer-vision python3
Last synced: 28 Feb 2026
https://github.com/photron-limited/pypuclib
python package for INFINICAM SDK
camera computer-vision machine-vision opencv python
Last synced: 25 Apr 2026
https://github.com/smsharma/lensing-neural-fields
Implicit neural representations for strong lensing source reconstruction. Code repository associated with https://arxiv.org/abs/2206.14820.
astrophysics computer-vision jax lensing lensing-reconstruction nerf strong-lensing
Last synced: 15 Apr 2026
https://github.com/voletiv/gridcorpus-experiments
My experiments with lip reading using GRIDcorpus dataset
computer-vision deep-learning lip-reading
Last synced: 09 Oct 2025
https://github.com/leeyunjai/simple-face-recognition
train/recognize face and detect landmark, using dlib and opencv
computer-vision deep-learning dlib face face-detection face-recognition facedetection facelandmarkdetect machine-learning machine-vision opencv opencv3-python raspberry-pi raspberry-pi-camera
Last synced: 13 Jun 2025
https://github.com/mohamed-samy26/red-eye-smart-cities-surviellance-system
City wide smart suviellance system that can detect weapons and fires in real-time and report detailed information to the responsible authorities.
automation aws cnn colab-notebook computer-vision deep-learning flutter hackathon jupyter-notebook linux neural-networks object-detection python python3 raspberry-pi-4 raspberry-pi-camera serverless smart-cities smart-surveillance yolov3
Last synced: 28 Oct 2025
https://github.com/and2797/camera_calib
Camera calibration tool in Python + OpenCV
automatic camera camera-calibration checkerboard computer-vision multi-view-geometry multi-view-stereo opencv opencv-python python python-3 stereo-calibration stereo-vision
Last synced: 08 Oct 2025
https://github.com/jimiolaniyan/cvm
Python implementation of the algorithms in the book Computer Vision: Models, Learning and Inference
algorithm computer-vision machine-learning
Last synced: 23 Apr 2025
https://github.com/nrc-cnrc/nrc-gamma
Large labelled dataset of real-life gas meter images — Vaste ensemble d'images réelles et étiquetées de compteurs de gaz.
ai computer-vision creative-commons crowdsourcing data data-science datanalytics dataset iot machine-learning machinelearning opendata
Last synced: 05 Mar 2026
https://github.com/ammarlodhi255/image-captioning-system-to-assist-the-blind
An image captioning system that is able to predict and speak out a caption of an image taken by visually impaired.
bidirectional-lstm cnn computer-vision computer-vision-algorithms css3 deep-learning html5 image-captioning javascript javascript-es6 lstm-neural-network ml-web nlp nlp-machine-learning resnet resnet-50 vgg16
Last synced: 23 Apr 2025
https://github.com/rfonod/stabilo
🌀 Stabilo is a Python package for stabilizing video frames or object trajectories using advanced transformation techniques. It supports user-defined masks to exclude specific areas, making it ideal for dynamic scenes with moving objects. Unlike traditional stabilization, it stabilizes content relative to a chosen reference frame.
computer-vision feature-extraction feature-matching homography-estimation image-processing image-registration opencv ransac stabilization video-processing video-stabilization
Last synced: 07 Oct 2025
https://github.com/cuixing158/cocoapi
Simple, efficient and convenient!
api coco computer-vision deep-learning deep-learning-datasets
Last synced: 02 May 2025
https://github.com/abhi-abhi86/disease-predictor
AI disease detection and prediction for humans, plants, and animals. Complete ML project with custom training, offline operation, no API keys. Detect diseases from images using deep learning and computer vision. Open-source disease detection system for healthcare, agriculture, and veterinary applications. Full code and deployment guides.
agriculture-data ai-agent animals cnn computer-vision custom-training customtraining deeplearning diagnosis diseases healthcare-application imageprocessing machine-learning medical-application offline opensource plants prediction predictor veterinary
Last synced: 04 Mar 2026
https://github.com/hunterdii/tensorflow-advanced-techniques-solution
Tensorflow Advanced Technique Specialization - Solution
computer-vision coursera coursera-specialization custom-model custom-training deep-learning deeplearning-ai distributed-training generative-ai image-detection image-segmentation-tensorflow machine-learning object-detection object-detection-model semantic-segmentation specialization tensorflow tensorflow-tutorials visualization
Last synced: 24 Oct 2025
https://github.com/tahmid-saj/nutri-sight
Nutrition tracker developed to provide predicted nutrition info via image / food description, keep track of meal consumption and burned calories. Also provides numerous recipes and their nutrition info. Developed using MERN, Firebase, Django, GraphQL, Redis, Postgresql, and ML based APIs.
computer-vision django express firebase graphql material-ui mongodb node openai postgresql react redis redux tailwind
Last synced: 15 Apr 2025
https://github.com/compphoto/intrinsichdr
Repo for the paper "Intrinsic Single-Image HDR Reconstruction" (ECCV 2024)
computational-photography computer-graphics computer-vision hdr machine-learning
Last synced: 28 Feb 2025