Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2026-07-02 00:06:11 UTC
- JSON Representation
https://github.com/afondiel/cs-books
A curated collection of computer science books.
ai algorithms books coding computer-science computer-science-books computer-vision computer-vision-books data-science data-structures dl image-processing ml programming software-engineering
Last synced: 13 Apr 2025
https://github.com/meyerls/aruco-estimator
Automatic Scale Factor Estimation of 3D Reconstruction in COLMAP Utilizing Aruco Marker
3d-reconstruction ambiguity-resolver aruco aruco-markers colmap computer-vision ground-control-point opencv python scale-ambiguity
Last synced: 09 Sep 2025
https://github.com/mohabmes/otsu-thresholding
Image Processing: Segmentation Using Otsu Threshold Method
computer-vision image-processing otsu otsu-threshold
Last synced: 21 Mar 2025
https://github.com/moverseai/moai
moai is a PyTorch-based AI Model Development Kit (MDK) created to improve data-driven model workflows, design and reproducibility.
artificial-intelligence cnn computer-vision convolutional-neural-networks data-driven deep-learning depth-estimation experimentation human-pose-estimation hydra mdk neural-network object-pose-estimation pytorch reproducibility reproducible-research vae vae-pytorch
Last synced: 20 Jan 2026
https://github.com/ieee8023/countception
Count-Ception: Counting by Fully Convolutional Redundant Counting
cell-biology cell-counting computer-vision deep-learning machine-learning
Last synced: 02 May 2025
https://github.com/philbort/udacity_self_driving_car
Projects from Udacity Self Driving Car Nanodegree
computer-vision deep-learning robotics self-driving-car udacity
Last synced: 19 Jul 2025
https://github.com/jpleorx/opencv-text-deskew
Tutorial on how to deskew (straighten) text images
computer-vision deskew document-parser image-processing opencv opencv-python python tutorial
Last synced: 10 May 2025
https://github.com/q-viper/contour-based-writing
This is a simple concept to do writing like operation using the contours. Please follow the article https://q-viper.github.io/2020/08/28/gesture-based-visually-writing-system-web-app/ for further details.
computer-vision contour contour-plot gesture gesture-recognition opencv
Last synced: 04 Apr 2026
https://github.com/ollieboyne/found
Official code for FOUND: Foot Optimisation with Uncertain Normals for Surface Deformation using Synthetic Data
3d-reconstruction computer-vision surface-normals surface-normals-estimation surface-reconstruction
Last synced: 05 Oct 2025
https://github.com/oke-aditya/quickvision
An Easy To Use PyTorch Computer Vision Library
computer-vision deep-learning pytorch pytorch-lightning
Last synced: 16 Mar 2025
https://github.com/lucabonfiglioli/nnviz
Torch nn vizualization
computer-vision deep-learning natural-language-processing neural-network pytorch visualization
Last synced: 30 Dec 2025
https://github.com/zubair-irshad/articulated-object-nerf
Experimental repo for modelling Articulated Object Neural Radiance Field
3d-reconstruction 3d-vision articulated-objects artificial-intelligence computer-vision deep-learning implicit-neural-representation machine-learning multi-view-learning nerf neural-networks neural-radiance-fields pytorch rendering
Last synced: 13 Mar 2026
https://github.com/kevinzakka/torchsdf-fusion
Benchmarking PyTorch variants of TSDF fusion.
computer-vision pytorch robotics tsdf-fusion
Last synced: 24 Mar 2025
https://github.com/rthorst/mint_condition
Automatic Sports Trading Card Grading
computer-vision data-science machine-learning sports trading-cards
Last synced: 29 Jul 2025
https://github.com/sparkfish/shabby-pages
ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents.
binarization born-digital computer-vision corpus data-science dataset denoising layout-detection
Last synced: 17 Aug 2025
https://github.com/smartdataanalytics/horus-ner
HORUS: A framework to boost NLP tasks
computer-vision horus information-retrieval machine-learning microblog named-entity-recognition ner noise text-mining twitter
Last synced: 11 Jul 2025
https://github.com/jina-ai/example-multimodal-fashion-search
Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP
computer-vision deep-learning neural-search nlp python
Last synced: 10 May 2025
https://github.com/yunxiaoshi/pointnet-pytorch
PointNet in PyTorch with comprehensive experiments
Last synced: 20 Mar 2025
https://github.com/sovit-123/vision_transformers
Vision Transformers for image classification, image segmentation, and object detection.
attention computer-vision transformer-models transformers vision-transformer
Last synced: 12 Sep 2025
https://github.com/bhollis/aruco-marker
JavaScript library and custom HTML element for generating Aruco marker images
aruco aruco-marker aruco-markers augmented-reality computer-vision robotics web-components
Last synced: 28 Oct 2025
https://github.com/arkanivasarkar/retinal-vessel-segmentation-using-variants-of-unet
Retinal vessel segmentation using U-NET, Res-UNET, Attention U-NET, and Residual Attention U-NET (RA-UNET)
attention-unet computer-vision image-processing intersection-over-union jaccard-coefficient jaccard-loss keras keras-tensorflow patchwise-training residual-attention-unet residual-unet retinal-fundus-images retinal-vessel-segmentation unet
Last synced: 02 Aug 2025
https://github.com/ikomia-dev/ikomiastudio
Your open-source UI software for Computer Vision applications
artificial-intelligence computer-vision deep-learning desktop-app image-processing opencv qt5-gui software video-processing workflow
Last synced: 12 May 2025
https://github.com/westlake-ai/semireward
[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning
audio-classification cifar-100 computer-vision esc-50 label-noise machine-learning natural-language-processing regression reward-model semi-supervised-learning transformer vision-transformer weakly-supervised-learning yahoo-answers
Last synced: 13 May 2025
https://github.com/abdur75648/utrnet-high-resolution-urdu-text-recognition
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)
computer-vision deep-learning document-analysis high-resolution hrnet icdar icdar2023 machine-learning ocr pytorch scene-text-recognition text-detection text-recognition unet urdu urdu-nlp urdu-ocr urdu-synth utrnet
Last synced: 24 Aug 2025
https://github.com/bhavik-jikadara/ai-ml-roadmap
Welcome to the ultimate guide for starting your journey in Artificial Intelligence and Machine Learning in 2025! This roadmap provides a step-by-step approach to mastering AI and ML, from fundamentals to advanced topics.
artificial-intelligence computer-vision deep-learning deployment fundamentals-of-programming keras libraries machine-learning mathematics mlops natural-language-processing production-code pytorch reinforcement-learning roadmap scikit-learn tensorflow tools
Last synced: 30 Apr 2025
https://github.com/sankalp1999/image_captioning_with_attention
CaptionBot : Sequence to Sequence Modelling where Encoder is CNN(Resnet-50) and Decoder is LSTMCell with soft attention mechanism
attention-mechanism computer-vision encoder-decoder flickr8k-dataset pytorch show-attend-and-tell
Last synced: 03 Mar 2026
https://github.com/visionjo/facerec-bias-bfw
Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).
bias bias-mitigation classification computer-vision data-analysis data-visualization dataset ethnic-diversity ethnicity-analysis face-dataset face-recognition face-verification gender-bias machine-learning python
Last synced: 11 Jan 2026
https://github.com/lingdong-/pmst
๐จ Poor Man's Style Transfer - Painting an image with the style of another, without machine learning
computer-vision emoji impressionist-paintings opencv painting pointillism style-transfer
Last synced: 14 May 2025
https://github.com/strawlab/cam-geom
๐ท ๐ Geometric models of cameras for photogrammetry
camera-models computer-vision geometry photogrammetry rust
Last synced: 05 Apr 2025
https://github.com/SAP-samples/btp-ai-sustainability-bootcamp
This github repository contains the sample code and exercises of btp-ai-sustainability-bootcamp, which showcases how to build Intelligence and Sustainability into Your Solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.
computer-vision condition-monitoring deep-learning defect-detection image-segmentation machine-learning predictive-maintenance sac-planning sample sample-code sap-ai-core sap-ai-launchpad sap-analytics-cloud sound-classification sustainability
Last synced: 07 May 2025
https://github.com/andersy005/self-driving-car-nd
Udacity's Self-Driving Car Nanodegree project files and notes.
car computer-vision deep-learning localisation self-driving-car sensor-fusion sensors udacity
Last synced: 07 Jul 2025
https://github.com/kekeblom/strayvisualizer
Visualize Data From Stray Scanner https://keke.dev/blog/2021/03/10/Stray-Scanner.html
3d-reconstruction camera computer-vision dataset depth-maps mapping open3d pointclouds python reconstruction rgb-d slam
Last synced: 12 Apr 2025
https://github.com/ndrplz/semiparametric
[TPAMI 2020] Generating Novel Views of Vehicles via Semi-parametric Guidance. A semi-parametric approach for synthesizing novel views of a rigid object from a single monocular image.
computer-vision convolutional-neural-networks deep-learning image-synthesis machine-learning novel-viewpoint-synthesis pascal3d semi-parametric semi-parametric-learning
Last synced: 10 Jul 2025
https://github.com/amazon-science/gluonmm
A library of transformer models for computer vision and multi-modality research
computer-vision iccv-2021 multimodality pytorch transformer video
Last synced: 03 May 2025
https://github.com/elucideye/acf
Aggregated Channel Feature object detection in C++ and OpenGL ES 2.0 based on https://github.com/pdollar/toolbox
acf android cmake computer-vision cplusplus cplusplus-11 dollar face-detection hog hunter ios matlab object-detection opencv opengl opengl-es piotr polly simd toolbox
Last synced: 10 Apr 2025
https://github.com/cambricon/cnstream
CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/SolutionSDK/CNStream
c-plus-plus cambricon cambricon-cnstream computer-vision graph-framework inference machine-learning mlu pipeline-framework
Last synced: 26 Dec 2025
https://github.com/ROCm/rpp
AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.
agumentation amd bitwise channel-extract computer-vision contrast cpu gpu hip histogram hpc mivisionx opencl openvx radeon-performance-primitives rocm rpp warp-affine
Last synced: 14 Mar 2025
https://github.com/guaishou74851/adcsr
(CVPR 2025) Adversarial Diffusion Compression for Real-World Image Super-Resolution [PyTorch]
adversarial-diffusion-compression adversarial-distillation computer-vision cvpr2025 deep-learning deep-neural-networks diffusion-models image-reconstruction image-restoration one-step-diffusion one-step-diffusion-model pruning python python3 pytorch super-resolution
Last synced: 15 Apr 2025
https://github.com/bmw-innovationlab/bmw-classification-inference-gpu-cpu
This is a repository for an image classification inference API using the Gluoncv framework. The inference REST API works on CPU/GPU. It's supported on Windows and Linux Operating systems. Models trained using our Gluoncv Classification training repository can be deployed in this API. Several models can be loaded and used at the same time.
classification computer-vision deep-learning gluoncv inference inference-api
Last synced: 03 Oct 2025
https://github.com/donny-hikari/viola-jones
A face detection program in python using Viola-Jones algorithm.
adaboost boosting computer-vision emsembling face-detection haar-cascade machine-learning viola-jones
Last synced: 12 Apr 2025
https://github.com/JuliaImages/ImageFeatures.jl
Image feature detection for the Julia language
computer-vision feature-detection feature-extraction image-processing
Last synced: 16 Nov 2025
https://github.com/sudheerachary/Manga_Colorization
cGAN-based Manga Colorization Using a Single Training Image.
cgan comics computer-vision image-processing machine-learning manga
Last synced: 26 Sep 2025
https://github.com/vvvv/vl.opencv
A VL wrapper for OpenCVSharp
computer-vision opencv visual-programming vl vvvv
Last synced: 13 Apr 2025
https://github.com/jpleorx/detectron2-publaynet
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
artificial-intelligence computer-vision deep-learning detectron2 document-analysis document-classification document-layout document-layout-analysis faster-rcnn instance-segmentation layout-analysis machine-learning neural-network neural-networks object-detection publaynet python python3 pytorch
Last synced: 10 May 2025
https://github.com/utkarsh-deshmukh/blurry-image-detector
Robust Python implementation for detecting blurry images using ROI estimation and DCT analysis.
blur-detection blur-image blur-recognition blurry-images computer-vision image-blur-detection image-processing machine-learning opencv opencv-python python
Last synced: 12 Apr 2025
https://github.com/charmve/autopilot-perception
End to End Autopilot Perception Playbook
autonomous-driving autonomous-vehicles classification computer-vision data-driven deep-learning machine-learning perception
Last synced: 07 Mar 2026
https://github.com/charmve/semantic-segmentation-pytorch
PyTorch implementation for Semantic Segmentation, include FCN, U-Net, SegNet, GCN, PSPNet, Deeplabv3, Deeplabv3+, Mask R-CNN, DUC, GoogleNet, and more dataset
charmve computer-vision deep-learning fcn image-classification image-processing mask-rcnn pytorch pytorch-implementation segnet semantic-segmentation unet-pytorch
Last synced: 21 Mar 2025
https://github.com/jackie-chou/mc-cnn-python
a python implementation of MC-CNN
cnn computer-vision python27 stereo-matching tensorflow
Last synced: 09 Apr 2025
https://github.com/zhenye-na/crnn-pytorch
โ๏ธ Convolutional Recurrent Neural Network in Pytorch | Text Recognition
computer-vision crnn-ocr data-augmentations data-labeling deep-learning emnist-dataset full-stack iam-lines-dataset natural-language-processing serving-predictions text-recognition
Last synced: 02 May 2025
https://github.com/seeed-studio/sensecraft-ai
An application suite including an open-source inference server and web UI to deploy any YOLOv8 model to NVIDIA Jetson devices and visualize captured streams, with one line of code.
computer-vision deep-learning instance-segmentation jetpack jetson-orin machine-learning nvidia-jetson object-detection orin-nano orin-nx pytorch yolov5 yolov8
Last synced: 24 Oct 2025
https://github.com/gabrieltseng/solar-panel-segmentation
A U-Net for solar panel identification and segmentation
computer-vision deep-learning machine-learning remote-sensing solar-energy
Last synced: 18 Jan 2026
https://github.com/julianeuralgraphics/gaussiansplatting.jl
Gaussian Splatting in pure Julia
computer-graphics computer-vision gaussian-splatting gpu radiance-field
Last synced: 21 Oct 2025
https://github.com/openvisionapi/ova-client
OpenVisionAPI Python Client
computer-vision machine-learning object-detection python
Last synced: 17 Apr 2025
https://github.com/DevipriyaSarkar/OCR-Reader
An Android app to extract text from camera preview directly.
android camera computer-vision ocr ocr-android ocr-recognition
Last synced: 09 Apr 2025
https://github.com/gsurma/edge_detector
HED real-time iOS edge detector.
artificial-intelligence computer-vision coreml deep-learning edge-detection hed ios ios-metal machine-learning metal mobile swift xcode
Last synced: 02 Aug 2025
https://github.com/aecins/symseg
SymSeg: Model-Free Object Segmentation in Point Clouds Using the Symmetry Constraint.
computer-vision geometry pointcloud robotics segmentation symmetry
Last synced: 13 Apr 2025
https://github.com/ibaigorordo/depthai-android-jni-example
Android example to get the rgb and disparity images from the OAK-D device connected to a phone.
android computer-vision cpp deep-learning depthai jni-android object-detection
Last synced: 28 Jun 2025
https://github.com/cuge1995/cvpr-2020-point-cloud-analysis
CVPR 2020 papers focusing on point cloud analysis
computer-vision cvpr cvpr2020 deep-learning point-cloud
Last synced: 26 Feb 2026
https://github.com/darrenjkt/SEE-MTDA
(RA-L 2022) See Eye to Eye: A Lidar-Agnostic 3D Detection Framework for Unsupervised Multi-Target Domain Adaptation.
3d-object-detection autonomous-driving computer-vision deep-learning object-detection surface-completion unsupervised-domain-adaptation
Last synced: 20 Mar 2025
https://github.com/ahmedbesbes/understanding-deep-convolutional-neural-networks-with-a-practical-use-case-in-tensorflow-and-keras
What makes convnets so powerful at image classification?
article blog computer-vision convnet convolution-filter convolutional-neural-networks data-science dataset deep-learning deep-learning-tutorial image-classification kaggle kaggle-cats kdd keras keras-tutorials python tensorflow
Last synced: 14 Jul 2025
https://github.com/alleveenstra/attentionocr
Attention OCR in Tensorflow 2.0
computer-vision machine-learning ocr python3 tensorflow2
Last synced: 18 Mar 2025
https://github.com/singh1aryan/intro-to-flutter
๐ธ Basic Flutter stuff.
android computer-vision dart dart-library firebase-functions firebasefirestore firestore flutter flutter-app flutter-demo flutter-examples flutter-ui hackathon ios machine-learning mobile-app mobile-development nodejs open-source widgets
Last synced: 07 Mar 2026
https://github.com/datagym-ai/datagym-core
Open source annotation and labeling tool for image and video assets
annotation annotations bounding-box computer-vision data-labeling dataset image-annotation image-labeling image-labeling-tool label-images label-videos labeling labeling-tool semantic-segmentation video-annotation video-labeling
Last synced: 17 Jan 2026
https://github.com/vida-nyu/city-surfaces
CitySurfaces semantic segmentation of sidewalk surfaces
computer-vision material sidewalk sidewalk-surface urban-analytics urban-data-science
Last synced: 10 Apr 2025
https://github.com/juliaimages/imagefeatures.jl
Image feature detection for the Julia language
computer-vision feature-detection feature-extraction image-processing
Last synced: 25 Jun 2025
https://github.com/ibaigorordo/onnx-raft-optical-flow-estimation
Python scripts for performing optical flow estimation using the RAFT model in ONNX
computer-vision onnx onnxruntime optical-flow python
Last synced: 04 Aug 2025
https://github.com/hairymax/face-antispoofing
Face Anti-Spoofing project. Lanit-Tercom summer school 2022
computer-vision face-antispoofing numpy onnx onnxruntime opencv pandas pytorch scikit-learn tensorboard torchvision
Last synced: 14 Apr 2025
https://github.com/AsRaNi1/live-cctv
To detect any reasonable change in a live cctv to avoid large storage of data. Once, we notice a change, our goal would be track that object or person causing it. We would be using Computer vision concepts. Our major focus will be on Deep Learning and will try to add as many features in the process.
cctv cctv-cameras computer-vision convolutional-neural-networks cosine-similarity darknet deep-learning euclidean-distances frame-change-detection image-classification machine-learning motion-detection neural-network object-detection python yolo yolov3
Last synced: 07 Apr 2025
https://github.com/samanyougarg/group-emotion-recognition
Group Emotion Recognition using deep neural networks and Bayesian classifiers.
bayesian-network cnn computer-vision convolutional-neural-networks deep-learning emotion emotion-analysis emotion-detection emotion-recognition facial-expression-recognition flask group-emotion-recognition image-processing keras numpy pandas
Last synced: 27 Jun 2025
https://github.com/davidstutz/graph-based-image-segmentation
Implementation of efficient graph-based image segmentation as proposed by Felzenswalb and Huttenlocher [1] that can be used to generate oversegmentations.
computer-vision image-processing image-segmentation opencv superpixel-algorithm superpixels
Last synced: 23 Apr 2025
https://github.com/mseitzer/srgan
Pytorch implementation of "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"
computer-vision deep-learning gan generative-adversarial-network pytorch super-resolution
Last synced: 05 Oct 2025
https://github.com/ahmedbesbes/Understanding-deep-Convolutional-Neural-Networks-with-a-practical-use-case-in-Tensorflow-and-Keras
What makes convnets so powerful at image classification?
article blog computer-vision convnet convolution-filter convolutional-neural-networks data-science dataset deep-learning deep-learning-tutorial image-classification kaggle kaggle-cats kdd keras keras-tutorials python tensorflow
Last synced: 19 Jul 2025
https://github.com/geoffsmith82/symposium2023
Demonstrates Voice Recognition, Text to Speech, Language Translation, OAuth2, Image Generation, Face Detection and Voice Chatbot. Source code and Documentation for my 2023 ADUG Symposium Talk.
ai artificial-intelligence claude-3-haiku claude-3-opus claude-3-sonnet computer-vision gpt gpt-4 gpt-4o image-to-text oauth2 palm palm2 speech-to-text text-to-image text-to-speech translation voice-recognition websockets
Last synced: 01 Feb 2026
https://github.com/pedrol2b/react-native-vision-camera-mlkit
A powerful React Native Vision Camera plugin delivering high-performance Google ML Kit frame processor featuresโincluding text recognition (OCR), face detection, barcode scanning, pose detection, and more. Seamlessly bridges native ML Kit capabilities for real-time, on-device computer vision in your React Native apps.
android barcode barcode-scanner camera computer-vision face-detection frame-processor google-ml-kit image-processing ios machine-learning mlkit native-module ocr pose-detection react-native react-native-module text-detection vision vision-camera
Last synced: 01 May 2026
https://github.com/engcang/tensorrt_yolov9_ros
(ROS, C++) YOLOv9 detection using TensorRT, now supporting TensorRT 10
computer-vision cpp object-detection ros yolo yolov9
Last synced: 25 Jun 2025
https://github.com/jinhseo/OD-WSCL
[ECCV2022] Official Pytorch Implementation of Object Discovery via Contrastive Learning for Weakly Supervised Object Detection
computer-vision object-detection weakly-supervised-object-detection
Last synced: 08 May 2025
https://github.com/jnbraun/bcnn
A minimalist Deep Learning framework for embedded Computer Vision
arm c99 computer-vision convolutional-neural-networks deep-learning edge-ai embedded-vision gpu high-performance
Last synced: 10 Apr 2025
https://github.com/prem-ium/metahuman-emotion-recognition
Emotionally responsive Virtual Metahuman CV with Real-Time User Facial Emotion Detection (Unreal Engine 5).
computer-vision emotion emotion-classification emotion-detection emotion-recognition emotions facial-expression-recognition machine-learning metahuman metahumans opencv opencv-python unreal-engine unreal-engine-4 unreal-engine-5 virtual-human-agent virtual-humans
Last synced: 15 Oct 2025
https://github.com/bwconrad/flexivit
PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes
computer-vision pytorch transformer vision-transformer
Last synced: 26 Oct 2025
https://github.com/bgu-cs-vil/deepmcbm
"A Deep Moving-camera Background Model" [Erez, Shapira Weber, and Freifeld, ECCV 2022]
background-estimation computer-vision deep-learning moving-camera pytorch
Last synced: 23 Apr 2025
https://github.com/goodarzmehr/simbev
[ITSC 2026] SimBEV is a configurable and scalable synthetic driving data generation tool and dataset based on the CARLA Simulator.
3d-object-detection autonomous-driving bev bev-perception bev-segmentation carla-simulator computer-vision data-generation dataset depth-estimation semantic semantic-occupancy-prediction semantic-segmentation
Last synced: 30 May 2026
https://github.com/zyrolasting/racket-vulkan
Racket integration with all things Vulkan :boom:
3d computer-graphics computer-vision gpu racket racket-ffi-bindings vulkan vulkan-api
Last synced: 17 Mar 2025
https://github.com/mirzaim/cs231n
Note and Assignments for CS231n: Convolutional Neural Networks for Visual Recognition
2021 computer-vision convolutional-neural-networks cs231 cs231n cs231n-assignment deep-learning machine-learning neural-network notes pytorch solutions spring-2021 stanford
Last synced: 27 Apr 2025
https://github.com/wilddima/vodopad
Manage your photographs with the power of deep learning! ๐ท
cli computer-vision deep-learning python
Last synced: 22 Mar 2025
https://github.com/sidgan/whats_in_a_question
CVPR'17 Spotlight: Whatโs in a Question: Using Visual Questions as a Form of Supervision
computer-vision deep-learning deep-neural-networks vqa
Last synced: 26 Aug 2025
https://github.com/pierluigiferrari/lane_tracker
A Python program for lane line detection and tracking using a traditional computer vision approach
computer-vision lane-detection lane-lines lane-lines-detection lane-tracker
Last synced: 13 May 2025
https://github.com/henzler/platonicgan
Escaping Platoโs Cave: 3D Shape from Adversarial Rendering [ICCV 2019]
3d-reconstruction artificial-intelligence computer-graphics computer-vision deeplearning differentiable-rendering iccv2019 machine-learning
Last synced: 14 Apr 2025
https://github.com/dnth/timm-flutter-pytorch-lite-blogpost
PyTorch at the Edge: Deploying Over 964 TIMM Models on Android with TorchScript and Flutter.
android computer computer-vision dart edge edge-ai edge-computing flutter image-classification pytorch torchscript vision
Last synced: 14 Apr 2025
https://github.com/GYXie/visual-search
A toy project for visual search, based on deep learning.
alexnet computer-vision deep-learning tensorflow visual-search
Last synced: 05 Apr 2025
https://github.com/Mattdl/ContinualPrototypeEvolution
Codebase for Continual Prototype Evolution (CoPE) to attain perpetually representative prototypes for online and non-stationary datastreams. Includes implementation of the Pseudo-Prototypical Proxy (PPP) loss.
computer-vision continual-learning datastreaming deep-learning online-learning prototypes representation-learning
Last synced: 08 May 2025
https://github.com/kupynorest/instance_augmentation
[ECCV 2024] Official Repo for: Dataset Enhancement with Instance-Level Augmentations
augmentation computer-vision diffusion-models eccv2024 image-generation image-processing machine-learning pytorch
Last synced: 03 Aug 2025
https://github.com/mattdl/continualprototypeevolution
Codebase for Continual Prototype Evolution (CoPE) to attain perpetually representative prototypes for online and non-stationary datastreams. Includes implementation of the Pseudo-Prototypical Proxy (PPP) loss.
computer-vision continual-learning datastreaming deep-learning online-learning prototypes representation-learning
Last synced: 15 Jul 2025
https://github.com/methyldragon/opencv-motion-detector
Detects transient motion in a video feed. If said motion is large enough, and recent enough, reports that there is motion!
computer-vision motion-detection opencv-python
Last synced: 15 Jul 2025
https://github.com/ethz-asl/autolabel
A project for computing high-quality ground truth training examples for RGB-D data.
computer-vision labeling-tool machine-learning nerf rgb-d robotics
Last synced: 11 Apr 2025
https://github.com/cansik/mediapipe-osc
MediaPipe examples which stream their detections over OSC.
computer-vision hand-tracking mediapipe osc pose-estimation
Last synced: 30 Apr 2025
https://github.com/vikramparimi/vision-object-tracking
Object Tracking using Apple's VISION Framework
computer-vision image-recognition ios ios-swift ios-vision ios11 machine-learning object-tracking rectangle-detection vision-framework
Last synced: 07 May 2025
https://github.com/sravb/computer-vision-weightlifting-coach
Analyzes weightlifting videos for correct posture using pose estimation with OpenCV
computer-vision elastic-net ensemble-learning keypoint-detection lasso-regression logistic-regression machine-learning mpii-dataset opencv openpose pose-estimation random-forest ridge-regression weightlifting
Last synced: 07 May 2025