Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2026-07-02 00:06:11 UTC
- JSON Representation
https://github.com/umercodez/websocketcam
Access Android camera via WebSocket client API and run AI / computer vision tasks with ease on live camera stream
ai-image-detection android-camera camera camera-streamer computer-vision computer-vision-opencv computer-vision-tools image-processing live-camera live-image-recognition object-detection pose-detection websocket
Last synced: 14 Jun 2026
https://github.com/Angzz/panoptic-fpn-gluon
Panoptic Feature Pyramid Networks
computer-vision deep-learning gluon-cv panoptic-segmentation
Last synced: 28 Mar 2025
https://github.com/vbhavank/unstructured-change-detection-using-cnn
Unstructured Change detection in satellite imagery
aerial-imagery computer-vision machine-learning neural-networks remote-sensing
Last synced: 12 Apr 2025
https://github.com/fregu856/ebms_3dod
Official implementation of "Accurate 3D Object Detection using Energy-Based Models", CVPR Workshops 2021.
3d-object-detection computer-vision deep-learning energy-based-model machine-learning object-detection pytorch
Last synced: 20 Mar 2025
https://github.com/ayushexel/trolo
An SDK for Transformers + YOLO and other SSD family models
computer-vision object-detection opencv segmentation transformers yolo
Last synced: 06 Oct 2025
https://github.com/vbhavank/Unstructured-change-detection-using-CNN
Unstructured Change detection in satellite imagery
aerial-imagery computer-vision machine-learning neural-networks remote-sensing
Last synced: 30 Mar 2025
https://github.com/ahmedfgad/cifar10cnnflask
Building a HTTP-accessed convolutional neural network model using TensorFlow NN (tf.nn), CIFAR10 dataset, Python and Flask.
cnn computer-vision convnet convolutional-neural-network css dropout flask fully-connected-network html http image-classification image-processing javascript max-pooling python relu tensorflow web-application
Last synced: 23 Apr 2025
https://github.com/2b-t/stereo-matching
Stereo-image depth reconstruction with different matching costs and matching algorithms in Python using Numpy and Numba
computer-vision docker jupyter-notebook ncc normalized-cross-correlation numba numpy python sad scipy semi-global-matching sgm ssd sum-of-squares winner-take-all wta
Last synced: 12 Apr 2025
https://github.com/tsarjak/simulate-correct-colorblindness
This Script simulates images based on how people with colorblindness would perceive it naturally. You can also vary the degree of colorblindness for simulating. This script can also be used to correct images, making differenciation of certain colors in an image easier for people with colorblindness.
colorblind colorblind-people colorblindness colorblindness-simulator colour-blind-people colours computer-vision computervision converts correct-colorblindness corrector daltonization-algorithm hsv-shifting-algorithm image image-processing simulate social-cause vision
Last synced: 21 Mar 2025
https://github.com/rubixml/cifar-10
Use the famous CIFAR-10 dataset to train a multi-layer neural network to recognize images of cats, dogs, and other things.
artificial-intelligence cifar-10 computer-vision cross-validation deep-learning deep-neural-networks example-project image-classification image-recognition machine-learning machine-learning-tutorial neural-network object-detection php php-ai php-machine-learning php-ml pipeline rubix-ml tutorial
Last synced: 30 Jul 2025
https://github.com/breathingcyborg/mediapipe-face-effects
Realtime Face Effects in Browser using mediapipe and three js.
computer-vision mediapipe mediapipe-facemesh realtime try-on
Last synced: 03 Mar 2026
https://github.com/bagaturchess/chessboardscanner
Java based Chess Board Scanner, which converts 2D chess board image into a machine readable format a.k.a. Forsyth–Edwards Notation (FEN). It uses OpenCV and Deeplearning4j frameworks, complemented with some proprietary algorithms implemented for realizing the goal. It currently supports the chess board and pieces sets of the most common online chess platforms chess.com and lichess.org.
chess chess-board chessboard chessboard-detection computer-vision deeplearning deeplearning4j image-recognition imagerecognition java opencv
Last synced: 10 Sep 2025
https://github.com/themattinthehatt/behavenet
Toolbox for analyzing behavioral videos and neural activity
behavioral-videos computer-vision neuroscience-methods
Last synced: 28 Jul 2025
https://github.com/cxhernandez/cellcount
A Convolutional Neural Network for Segmenting and Counting Cells in Microscopy Images
biology computer-vision python pytorch segmentation
Last synced: 21 Mar 2025
https://github.com/yochengliu/mlic-kd-wsd
Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection (ACM MM 2018)
computer-vision deep-learning knowledge-distillation multi-label-classification weakly-supervised-detection
Last synced: 12 Jun 2025
https://github.com/dsgiitr/reading-group
Discussions on papers, frameworks, blogs and ideas every Saturday.
computer-vision deep-learning discussions machine-learning neural-networks nlp papers
Last synced: 15 Apr 2025
https://github.com/nikolasent/vehicle-detection-and-tracking
Udacity Self-Driving Car Engineer Nanodegree. Project: Vehicle Detection and Tracking
carnd classifier computer-vision opencv self-driving-car svm
Last synced: 30 Apr 2025
https://github.com/katanaml/sparrow-donut
Data extraction with Donut ML model
computer-vision huggingface machine-learning nlp-machine-learning python
Last synced: 07 May 2025
https://github.com/sayakpaul/swin-transformers-tf
Implementation of Swin Transformers in TensorFlow along with converted pre-trained models, code for off-the-shelf classification and fine-tuning.
attention computer-vision hierarchies image-recognition relative-attention swin-transformers tensorflow
Last synced: 30 Apr 2025
https://github.com/hxndev/virtual-mouse-using-opencv
In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.
ai-mouse autopy code computer-vision cv2 handtracking handtrackingmodule mediapipe opencv opencv-python virtual-mouse virtual-mouse-using-hand-gesture webcam
Last synced: 15 Jun 2025
https://github.com/cheese-roll/light-anime-face-detector
A fast and light-weighted anime face detection based on LFFD.
anime computer-vision face-detection lffd mxnet
Last synced: 20 Nov 2025
https://github.com/KushajveerSingh/resize_network_cv
PyTorch implementation of the paper "Learning to Resize Images for Computer Vision Tasks" on Imagenette and Imagewoof datasets
computer-vision pytorch pytorch-implementation pytorch-lightning
Last synced: 08 May 2025
https://github.com/kelvins/lbph
Local Binary Patterns Histograms (LBPH) implementation in Go
computer-vision face-recognition face-recognizer image-processing lbph local-binary-patterns texture-classification
Last synced: 08 May 2025
https://github.com/ucdvision/gen2seg
Code for "gen2seg: Generative Models Enable Generalizable Instance Segmentation"
computer-vision generative-ai machine-learning segmentation self-supervised-learning stable-diffusion
Last synced: 23 Aug 2025
https://github.com/bailool/placerecognition-loopdetection
Light-weight place recognition and loop detection using road markings
computer-vision cpp11 loop-detection matlab place-recognition road-markings
Last synced: 26 Jun 2025
https://github.com/balavenkatesh3322/face_unlock
We can lock and unlock our Ubuntu system using face recognition(currently only on Ubuntu).
computer-vision face-recognition lockscreen numpy opencv os python3 security-automation ubuntu
Last synced: 22 Apr 2025
https://github.com/iterative/example-get-started-experiments
Get started DVC project
computer-vision dvc example machine-learning python reproducibility reproducible-research
Last synced: 18 Jun 2025
https://github.com/Masao-Taketani/FOTS_OCR
TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.
computer-vision deep-learning image-recognition ocr scene-text-recognition tensorflow
Last synced: 02 Apr 2025
https://github.com/real-stanford/umpnet
[RA-L / ICRA 2022] UMPNet: Universal Manipulation Policy Network for Articulated Objects
computer-vision robotics simulation
Last synced: 05 May 2025
https://github.com/mathworks-robotics/mujoco-simulink-blockset
Blocks for accessing MuJoCo physics engine within Simulink
biomimetics computer-vision control-systems matlab mujoco physics-simulation robotics robotics-control robotics-simulation simulink
Last synced: 30 Apr 2025
https://github.com/ffiirree/alchemy
CV DL
computer-vision cv deep-learning machine-learning matrix opencv
Last synced: 08 Aug 2025
https://github.com/groundlight/python-sdk
Python SDK for accessing Groundlight API
artificial-intelligence computer-vision machine-learning
Last synced: 16 Jan 2026
https://github.com/harboryuan/polyphonicformer
[ECCV 2022] 🎵PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
Last synced: 25 Oct 2025
https://github.com/eric-canas/homography.js
Lightweight, High-Performance and easy-to-use library for performing Affine, Projective or Piecewise Affine transformations over any Image or HTMLElement from only a set of reference points. In Javascript.
2d affine-transformations computer-vision easy-to-use fast homography image-manipulation javascript node-js perspective-transform piecewise-affine projective-transformations warp
Last synced: 10 Apr 2025
https://github.com/bfortuner/computer-vision
Computer vision sabbatical study materials
computer-vision deep-learning jupyter machine-learning opencv pytorch scikit-learn
Last synced: 24 Apr 2025
https://github.com/Charmve/Semantic-Segmentation-PyTorch
PyTorch implementation for Semantic Segmentation, include FCN, U-Net, SegNet, GCN, PSPNet, Deeplabv3, Deeplabv3+, Mask R-CNN, DUC, GoogleNet, and more dataset
charmve computer-vision deep-learning fcn image-classification image-processing mask-rcnn pytorch pytorch-implementation segnet semantic-segmentation unet-pytorch
Last synced: 13 Feb 2026
https://github.com/calciferzh/minimal-body
A very simple baseline to estimate 2D & 3D SMPL-compatible keypoints from a single color image.
3d-pose-estimation computer-vision deep-learning
Last synced: 21 Apr 2025
https://github.com/fcjian/LOCE
Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021
classification-equilibrium computer-vision ebl equilibrium-loss iccv2021 iccv21 long-tail long-tailed-detection loss-margin mask-rcnn memory-augmented-feature-samling mfs object-detection samplier two-stage-detector
Last synced: 05 Apr 2025
https://github.com/douglasrizzo/dodo_detector_ros
Object detection from images/point cloud using ROS
computer-vision kinect kinect-v2 object-detection opencv3 ros tensorflow tensorflow-models
Last synced: 21 Feb 2026
https://github.com/pushkalkatara/darknet_ros
Robotics Operating System Package for Yolo v3 based on darknet with optimized tracking using Kalman Filter and Optical Flow.
alexeyab-darknet computer-vision darknet darknet-ros deep-learning kalman-filter object-detection ros yolo
Last synced: 15 Oct 2025
https://github.com/agentmaker/webai.js
A simple Web AI model deployment tool using JavaScript based on OpenCV.js and ONNXRuntime
ai classification computer-vision detection javascript js node objectdetection onnx onnxruntime onnxruntime-web opencv segmentation web
Last synced: 30 Aug 2025
https://github.com/ibaiGorordo/ONNX-CenterSnap-6D-Pose-and-Shape-Estimation
Python scripts for performing 6D pose estimation and shape reconstruction using the CenterSnap model in ONNX
3d-object-detection 3d-object-reconstruction 6dof-pose azure-kinect computer-vision depthai kinect oak-d onnx onnxruntime onnxruntime-gpu opencv python
Last synced: 20 Mar 2025
https://github.com/asahi417/wikiart-image-dataset
We release WikiART Crawler, a python-library to download/process images from WikiART via WikiART API, and two image datasets: `WikiART Face` and `WikiART General`.
art computer-vision dataset generative-model
Last synced: 19 Jul 2025
https://github.com/o-tawab/Video-Pixel-Networks
Video Pixel Networks in Tensorflow
computer-vision video-pixel-network vpn
Last synced: 11 Mar 2025
https://github.com/adeelh/pytorch-fpn
PyTorch implementations of some FPN-based semantic segmentation architectures: vanilla FPN, Panoptic FPN, PANet FPN; with ResNet and EfficientNet backbones.
computer-vision deep-learning deeplearning efficientnet feature-pyramid-network fpn implementation-of-research-paper machine-learning neural-network pytorch pytorch-fpn pytorch-implementation resnet semantic-segmentation semantic-segmentation-architectures
Last synced: 14 Apr 2025
https://github.com/gbourniq/bank-statement-analysis
Flask application generating interactive visualisations from bank statements PDF documents
bank-statement-documents computer-vision docker flask-application machine-learning powerbi sql-database web-application
Last synced: 05 Mar 2025
https://github.com/isarandi/metro-pose3d
Metric-Scale Truncation-Robust Heatmaps for 3D Human Pose Estimation
3d-human-pose computer-vision deep-learning human-pose-estimation python tensorflow
Last synced: 14 Apr 2025
https://github.com/alsora/ros2-tensorflow
ROS2 nodes for computer vision tasks in Tensorflow
computer-vision image-classification image-detection ros2 ros2-tensorflow tensorflow
Last synced: 06 Apr 2026
https://github.com/inkbytefo/ScreenMonitorMCP
A REVOLUTIONARY Model Context Protocol (MCP) server! Gives AI real-time vision capabilities and enhanced UI intelligence power. This isn't just screen capture - it gives AI the power to truly "see" and understand your digital world!
ai artificial-intelligence automation computer-vision mcp model-context-protocol openai predictive-ai python real-time revolutionary screen-monitoring ui-automation
Last synced: 27 Sep 2025
https://github.com/esimov/colidr
Coherent Line Drawing implementation in Go.
coherent-line-drawing computer-graphics computer-vision gocv golang non-photorealistic-rendering opencv rendering
Last synced: 13 Apr 2025
https://github.com/edusense/edusense
EduSense: Practical Classroom Sensing at Scale
audio classroom computer-vision gaze hand-raise instructors machine-learning pedagogy posture sensing speech-detection teachers tracking
Last synced: 12 Jan 2026
https://github.com/universaldatatool/coronavirus-mask-image-dataset
Image dataset from Instagram of people wearing medical masks, no mask, or a non-medical (DIY) mask
computer-vision coronavirus covid-19 covid19 dataset fastai images machine-learning
Last synced: 19 Apr 2025
https://github.com/wkentaro/chainer-mask-rcnn
Chainer Implementation of Mask R-CNN. (Training code to reproduce the original result is available.)
chainer coco computer-vision deep-learning detectron iccv-2017 instance-segmentation mask-rcnn object-detection
Last synced: 14 Jan 2026
https://github.com/stephanecharette/darkplate
License plate parsing using Darknet and YOLO
alpr anpr computer-vision darkhelp darknet neural-network opencv vlpr yolo
Last synced: 04 Oct 2025
https://github.com/nikolasent/advanced-lane-lines
Udacity Self-Driving Car Engineer Nanodegree. Project: Advanced Lane Finding
carnd computer-vision opencv self-driving-car
Last synced: 12 Mar 2026
https://github.com/Scitator/catalyst-examples
Examples
computer-vision data-science deep-learning deep-neural-networks deep-reinforcement-learning machine-learning python pytorch
Last synced: 19 Jul 2025
https://github.com/hemansnation/ai-ml-14-hours-free-course
7 Day AI ML Fundamentals Workshop The purpose of this FREE workshop is 1. To give you a boost of getting started with AI. 2. A life-long community with a similar mindset. 3. strong grip on fundamentals that the advanced concepts will be easy to understand.
computer-vision datastructures-algorithms deep-learning generative-ai machine-learning mlops nlp numpy pandas python system-design
Last synced: 03 Sep 2025
https://github.com/seeed-studio/seeed_studio_courses
Edge AI 101 Course using Nvidia Jetson
ai-education computer-vision edge-ai nvidia-jetson-nano python
Last synced: 13 Apr 2025
https://github.com/prs-eth/scene-recognition-in-3d
[IROS 2020] Indoor Scene Recognition in 3D
3dvision computer-vision scene-recognition sparse-convolution
Last synced: 24 Aug 2025
https://github.com/eora-ai/torchok
Production-oriented Computer Vision models training pipeline for common tasks: classification, segmentation, detection and representation🥤
computer-vision deep-learning image-classification image-retrieval image-segmentation representation-learning
Last synced: 08 May 2025
https://github.com/sashido/teachablemachine-node
Using Teachable Machine Models in Node.js
computer-vision machine-learning nodejs teachablemachine tensorflowjs
Last synced: 19 Jul 2025
https://github.com/fidler-lab/efficient-annotation-cookbook
Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)
annotations computer-vision cvpr2021 simulated-workers
Last synced: 14 Apr 2025
https://github.com/ducha-aiki/mods-light-zmq
MODS with external deep descriptors/detectors
affnet cnn computer-vision descriptor hardnet hessian-affine image-matching local-features matching pytorch ransac sift wbs wide-baseline-stereo wxbs
Last synced: 02 Jul 2025
https://github.com/achillesrasquinha/spockpy
:fist: :hand: :v: :point_up: 🖖 A Python hand gesture recognition library for Kinetic User Interface (KUI).
computer-vision gesture gesture-recognition-toolkit gesture-recognizer hand-gestures opencv python
Last synced: 03 Mar 2026
https://github.com/ibaigorordo/onnx-yolo-world-open-vocabulary-object-detection
Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.
computer-vision object-detection onnx onnxruntime open-vocabulary-detection python
Last synced: 19 Aug 2025
https://github.com/kdmayer/3D-PV-Locator
Dockerized Repo for "3D-PV-Locator: Large-scale detection of rooftop-mounted photovoltaic systems in 3D" based on Applied Energy publication.
ai climate-change computer-vision deep-learning deeplabv3 deepsolar inception-v3 network-planning neurips-2020 pv-systems remote-sensing renewable-energy satellite-imagery solar solar-panels
Last synced: 07 May 2025
https://github.com/nianticlabs/airplanes
[CVPR 2024] AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings
computer-vision cvpr2024 machine-learning plane-detection
Last synced: 14 Sep 2025
https://github.com/wkentaro/reorientbot
ReorientBot: Learning Object Reorientation for Specific-Posed Placement, ICRA 2022
artificial-intelligence computer-vision deep-learning machine-learning robotics ros
Last synced: 15 Apr 2025
https://github.com/axiscommunications/acap-computer-vision-sdk-examples
Example applications that provide developers with the tools and knowledge to use Axis Camera Application Platform (ACAP) Computer Vision solution
acap analytics axis camera computer-vision edge python video
Last synced: 25 Jun 2025
https://github.com/wkentaro/safepicking
SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022
artificial-intelligence computer-vision deep-learning machine-learning reinforcement-learning robotics ros
Last synced: 15 Apr 2025
https://github.com/bwconrad/soft-moe
PyTorch implementation of "From Sparse to Soft Mixtures of Experts"
computer-vision machine-learning mixture-of-experts pytorch transformer vision-transformer
Last synced: 11 Apr 2025
https://github.com/cggos/ptam_cg
Modified version of the monocular SLAM framework PTAM
artificial-intelligence computer-vision ptam slam
Last synced: 21 Mar 2025
https://github.com/davidstutz/seeds-revised
Implementation of the superpixel algorithm called SEEDS [1].
computer-vision image-processing opencv superpixel-algorithm superpixels
Last synced: 23 Oct 2025
https://github.com/pixano/pixano
Data-centric AI building blocks for computer vision applications
computer-vision data-annotation data-visualization deep-learning machine-learning python
Last synced: 27 Aug 2025
https://github.com/swarm-lab/ropencvlite
Utility package to install OpenCV in R
Last synced: 05 Mar 2026
https://github.com/erfaniaa/persian-ocr
Optical character recognition of Farsi and Arabic letters (تشخیص حروف فارسی و عربی)
computer-vision convolutional-neural-networks deep-learning delphi farsi image-processing neural-networks ocr optical-character-recognition persian persian-ocr
Last synced: 05 Jan 2026
https://github.com/iago-suarez/fsg
:straight_ruler: :rocket: FSG: A statistical approach to line detection via fast segments grouping
computer-vision image-processing line-detection line-segment-detector vanishing-points
Last synced: 08 May 2025
https://github.com/rohit901/cooperative-foundational-models
[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
computer-vision deep-learning novel-objects object-detection open-set-object-detection open-vocabulary-detection pytorch zero-shot-object-detection
Last synced: 24 Jul 2025
https://github.com/ybabakhin/zindi_wheat_growth
1st place solution for CGIAR Wheat Growth Stage Challenge
computer-vision deep-learning image-classification pytorch pytorch-lightning resnet
Last synced: 02 Sep 2025
https://github.com/sunsided/viola-jones-adaboost
Training a face detection cascade using Adaptive Boosting after Viola and Jones.
adaboost computer-vision face-detection haar-cascade image-processing jupyter-notebook machine-learning python viola-jones
Last synced: 18 Mar 2025
https://github.com/matlab-deep-learning/pretrained-yolo-v4
Object detection and transfer learning using pretrained YOLO v4 models in MATLAB.
computer-vision deep-learning image-processing matlab object-detection pretrained-models tiny-yolov4 transfer-learning yolov4 yolov4-coco
Last synced: 07 May 2025
https://github.com/sayakpaul/knowledge-distillation-in-keras
Demonstrates knowledge distillation for image-based models in Keras.
computer-vision keras knowledge-distillation model-optimization tensorflow
Last synced: 11 Jul 2025
https://github.com/soumik12345/kinect-vision
A computer vision based gesture detection system that automatically detects the number of fingers as a hand gesture and enables you to control simple button pressing games using you hand gestures.
computer-vision gaming gesture-recognition opencv opencv3-python python-3 sign-language-recognition-system
Last synced: 07 May 2025
https://github.com/alladinian/Visionaire
Streamlined, ergonomic APIs around Apple's Vision framework
computer-vision ios macos swift vision
Last synced: 22 Jul 2025
https://github.com/aw-junaid/computer-science
Explore a collection of resources and projects in Computer Science, covering algorithms, data structures, programming languages, and emerging technologies. Ideal for learners and enthusiasts looking to enhance their knowledge and skills in the field
algorithms assembly-language automata computer-architecture computer-networks computer-science computer-vision cpp cybersecurity data-science data-science-projects data-structures database game-development machine-learning networking operating-system python
Last synced: 26 Mar 2025
https://github.com/sap-samples/btp-ai-sustainability-bootcamp
This github repository contains the sample code and exercises of btp-ai-sustainability-bootcamp, which showcases how to build Intelligence and Sustainability into Your Solutions on SAP Business Technology Platform with SAP AI Core and SAP Analytics Cloud for Planning.
computer-vision condition-monitoring deep-learning defect-detection image-segmentation machine-learning predictive-maintenance sac-planning sample sample-code sap-ai-core sap-ai-launchpad sap-analytics-cloud sound-classification sustainability
Last synced: 01 Mar 2026
https://github.com/dsgiitr/adversarial_lab
Web-based Tool for visualisation and generation of adversarial examples by attacking ImageNet Models like VGG, AlexNet, ResNet etc.
adversarial-attacks computer-vision flask html-css-javascript imagenet machine-learning python pytorch visualization
Last synced: 15 Apr 2025
https://github.com/wkentaro/osam
Get up and running with SAM, EfficientSAM, YOLO-World, and other promptable vision models locally.
computer-vision deep-learning foundation-models onnx segment-anything
Last synced: 16 Jan 2026
https://github.com/csiro-robotics/TCE
This repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).
action-recognition computer-vision contrastive-learning contrastive-loss deep-learning embeddings hmdb51 kinetics-datasets metric-learning pytorch representation-learning self-supervised-learning tsne-visualisations ucf-101
Last synced: 26 Apr 2025
https://github.com/Erfaniaa/Persian-OCR
Optical character recognition of Farsi and Arabic letters
computer-vision convolutional-neural-networks deep-learning delphi farsi image-processing neural-networks ocr optical-character-recognition persian persian-ocr
Last synced: 08 Jul 2025
https://github.com/dongjunlee/notes
The notes for Math, Machine Learning, Deep Learning and Research papers.
computer-vision deep-learning generative-model machine-learning natural-language-processing notes optimization reinforcement-learning summary
Last synced: 15 Apr 2025
https://github.com/princeton-vl/packit
Code for reproducing results in ICML 2020 paper "PackIt: A Virtual Environment for Geometric Planning"
computer-vision icml-2020 open-ai-gym unity-ml-agents virtual-environments
Last synced: 08 May 2025
https://github.com/utiasstars/cat-net
Canonical Appearance Transformations
cat-net computer-vision deep-learning robotics
Last synced: 28 Jun 2025
https://github.com/saharshleo/easyroi
Custom ROI in Computer Vision Applications
computer-vision custom-region-of-interest line-roi multiple-region-of-interest multiple-roi opencv polygon region-of-interest
Last synced: 26 Jun 2025
https://github.com/bakercp/ofxdlib
An openFrameworks wrapper for dlib. http://dlib.net/
computer-vision deep-learning dlib dnn machine-learning openframeworks
Last synced: 18 Mar 2025
https://github.com/mattdeitke/cvpr-buzz
🐝 Explore Trending Papers at CVPR
computer-vision cvpr cvpr-buzz cvpr2021 d3 emotionjs gatsby graphql react tsx typescript
Last synced: 13 Apr 2025
https://github.com/breta01/docus
Android application for scanning and managing documents.
android android-application application computer-vision docus hacktoberfest opencv scanner scanner-app scanner-cam scanning
Last synced: 20 Aug 2025
https://github.com/sayakpaul/ml-bootcamp-launchpad
Contains notebooks prepared for ML Bootcamp organized by Google Developers Launchpad.
ai-platform computer-vision deep-learning google-cloud-platform tensorflow2
Last synced: 30 Apr 2025
https://github.com/afondiel/cs-books
A curated collection of computer science books.
ai algorithms books coding computer-science computer-science-books computer-vision computer-vision-books data-science data-structures dl image-processing ml programming software-engineering
Last synced: 13 Apr 2025
https://github.com/jpleorx/opencv-text-deskew
Tutorial on how to deskew (straighten) text images
computer-vision deskew document-parser image-processing opencv opencv-python python tutorial
Last synced: 10 May 2025