An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/tiger-ai-lab/viescore

Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024 main)

computer-vision gpt4vision image-editing image-generation visual-question-answering

Last synced: 13 Jun 2025

https://github.com/xN1ckuz/Crosswalks-Detection-using-YOLO

Crosswalks Detection using YOLO, project for Computer Vision and Machine Perception course at University of Basilicata, Computer Science and Engineering.

computer-vision convolutional-neural-networks crosswalk-detection crosswalks deep-learning detection-model neural-networks object-detection onnx onnx-models opencv pytorch yolo

Last synced: 21 Apr 2025

https://github.com/feyzaakyurek/subspace-reg

Code for the ICLR2022 paper on Subspace Regularization for few-shot class incremental image classification

class-incremental-learning computer-vision continual-learning few-shot-learning image-classification incremental-learning machine-learning subspace-learning

Last synced: 18 Apr 2025

https://github.com/muhd-umer/pyramidtabnet

Official PyTorch implementation of PyramidTabNet: Transformer-based Table Recognition in Image-based Documents

computer-vision deep-learning document-analysis implementation pytorch table-detection table-structure-recognition

Last synced: 13 Oct 2025

https://github.com/zcemycl/robotics

Python implementations of University of Pennsylvania Robotics Specialization, and more.

computer-vision robotics robotics-specialization

Last synced: 24 Apr 2025

https://github.com/junweiliang/aicity_action

Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)

action-recognition ai ai-city-challenge computer-vision

Last synced: 19 Apr 2025

https://github.com/maxim5/cs231n-2016-winter

All lecture notes and assignments for CS231n: Convolutional Neural Networks for Visual Recognition class by Stanford

computer-vision convolutional-neural-networks cs231n deep-learning machine-learning

Last synced: 28 Jan 2026

https://github.com/Haleshot/marimo-tutorials

Collection of marimo tutorials which encompass notebook/app examples in varying domains - CS/AI/ML

artificial-intelligence computer-vision data-science image-processing jax llm machine-learning marimo marimo-notebook pytorch recommender-system tensorflow

Last synced: 19 Apr 2025

https://github.com/mees/hulc2

[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data

computer-vision deep-learning grounding manipulation natural-language-processing pytorch robotics vision vision-and-language vision-language

Last synced: 18 Jan 2026

https://github.com/tychenjiajun/art

AI-PP3 is a command-line tool that uses artificial intelligence to analyze RAW photos and generate optimized processing profiles (PP3 files) for RawTherapee.

ai arw batch-processing cli computer-vision cr2 dng image-processing multimodal nef photo-editing photography pp3 raw rawtherapee

Last synced: 11 Jun 2025

https://github.com/denadai2/google_street_view_deep_neural

Deep Neural Network model to predict security perception from Google Street View images. Model based on AlexNet CNNs

computational-social-science computer-vision data-science deep-learning urban-planning urban-science

Last synced: 15 Mar 2025

https://github.com/awsaf49/gcvit-tf

Tensorflow 2.0 Implementation of GCViT: Global Context Vision Transformer

attention cnn computer-vision image-classification image-recognition imagenet self-attention transformer

Last synced: 14 Jan 2026

https://github.com/jjateen/aiot-workshop

This repository contains resources, including circuit diagrams, code, and project files from the IoTics AIoT Workshop, focusing on integrating Artificial Intelligence (AI) with the Internet of Things (IoT). It features hands-on projects exploring sensor integration, cloud services, machine learning, and robotics.

adafruit-io aiot arduino blynk cloud-integration computer-vision cpp deep-learning embedded-systems esp32 gesture-recognition haar-cascade iot machine-learning mediapipe object-detection python sensor-data surveillance wokwi

Last synced: 14 Oct 2025

https://github.com/ahmedfgad/opencvandroid

Using OpenCV in Android Devices

android canny computer-vision java opencv

Last synced: 23 Apr 2025

https://github.com/LIU42/TrafficRules

ๅŸบไบŽ YOLO11 ็š„่ทฏๅฃไบค้€šไฟกๅท็ฏ้€š่กŒ่ง„ๅˆ™่ฏ†ๅˆซ

autonomous-driving computer-vision object-detection onnxruntime opencv traffic traffic-light-detection traffic-rules yolo

Last synced: 15 Jul 2025

https://github.com/haleshot/marimo-tutorials

Collection of marimo tutorials which encompass notebook/app examples in varying domains - CS/AI/ML

artificial-intelligence computer-vision data-science image-processing jax llm machine-learning marimo marimo-notebook pytorch recommender-system tensorflow

Last synced: 07 May 2025

https://github.com/pinto0309/simple_fisheye_calibrator

Simple GUI-based correction of fisheye images. The correction parameters specified on the screen can be diverted to opencv's fisheye correction parameters. Supports execution via Docker.

calibration camera computer-vision cvui docker fisheye gui numpy opencv pillow python

Last synced: 01 May 2025

https://github.com/wellflat/opencv-samples

OpenCV sample codes (C++/Python)

computer-vision cpp machine-learning opencv python

Last synced: 21 Mar 2025

https://github.com/cansik/opencv-processing

OpenCV for Processing. A creative coding computer vision library based on the official OpenCV Java API

computer-vision creative-coding opencv processing

Last synced: 05 May 2025

https://github.com/cuicaihao/split_raster

Split Raster is an open-source and highly versatile Python package designed to easily break down large images into smaller, more manageable tiles. While the package is particularly useful for deep learning and computer vision tasks, it can be applied to a wide range of applications.

computer-vision deep-learning image-recognition image-segmentation image-splitting remote-sensing

Last synced: 02 Apr 2026

https://github.com/groundlight/mcp-vision

Computer vision models as MCP servers

computer-vision mcp

Last synced: 29 Apr 2026

https://github.com/altanai/ramudroid

Ramudroid, autonomous solar-powered robot to clean roads, realtime object detection and webrtc based streaming

arduino computer-vision deep-learning deep-neural-networks deep-reinforcement-learning garbage-classification garbage-collection machinelearning motors opencv raspberry-pi robot streaming

Last synced: 25 Jul 2025

https://github.com/muhamuttaqien/AI-Lab

๐Ÿ”ฌ Absolutely comfort lab for me to work around with my own AIs and to empirically observe how powerful and impactful these technologies are. I do love these technologies!

artificial-intelligence autonomous-systems computer-vision deep-learning machine-learning natural-language-processing reinforcement-learning robotics

Last synced: 14 Mar 2025

https://github.com/iampukar/deep-learning-in-computer-vision

Course work from Deep Learning in Computer Vision from National Research University Higher School of Economics

computer-vision coursera deep-learning national-research-university

Last synced: 12 Apr 2025

https://github.com/brenopolanski/aiEyes

:robot: :eye: Describes photos using audio for Blind and Visually-Impaired Users

angular4 azure blindness cognitive-services computer-vision google-translate ionic-apps ionic3 typescript

Last synced: 14 Apr 2025

https://github.com/justin900429/pytorch-dssn-mer

PyTorch version for the "Dual-Dtream Shallow Networks for Facial Micro-Expression Recognition"

computer-vision micro-expression-recognition paper-implementation python pytorch

Last synced: 02 Sep 2025

https://github.com/tobybreckon/cpp-examples-ipcv

C++ Image Processing and Computer Vision OpenCV Examples used for Teaching

computer-vision image-processing opencv webcam

Last synced: 13 Sep 2025

https://github.com/liu42/inscriptions

2024 ๅนด MathorCup ๆ•ฐๅญฆๅบ”็”จๆŒ‘ๆˆ˜่ต› B ้ข˜๏ผŒๅŸบไบŽ YOLOv8 ็š„็”ฒ้ชจๆ–‡ๅŽŸๅง‹ๆ‹“็‰‡ๅ›พๅƒๅ•ๅญ—ๅˆ†ๅ‰ฒ่ฏ†ๅˆซๆจกๅž‹ใ€‚

computer-vision flask image-classification object-detection onnxruntime opencv webapp yolo yolov8

Last synced: 09 Aug 2025

https://github.com/mv-lab/ViT-FGVC8

"Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8

computer-vision cub200-2011 cvpr deep-learning fgvc fine-grained-classification stanford-dogs-dataset transformers vit

Last synced: 04 Apr 2025

https://github.com/maheshpaulj/lane_detection

Real-time lane and car detection system using YOLOv8 and OpenCV, with distance estimation for vehicles. Ideal for autonomous driving and traffic analysis.

computer-vision opencv python realtime-detection yolo yolov8

Last synced: 02 Mar 2025

https://github.com/autodistill/autodistill-gpt-4v

GPT-4V(ision) module for use with Autodistill.

autodistill computer-vision gpt-4 gpt-4v object-detection

Last synced: 09 Mar 2026

https://github.com/dovyski/setup-opencv-action

Github Action to download and setup OpenCV

ci computer-vision github-action github-actions opencv opencv-library

Last synced: 14 Apr 2025

https://github.com/verifid/mocr

Meaningful Optical Character Recognition from identity cards with Deep Learning.

cards computer-vision deep-learning face-detection identity opencv optical-character-recognition tessaract

Last synced: 13 Jul 2025

https://github.com/kevalmorabia97/object-and-semantic-part-detection-pytorch

Joint detection of Object and its Semantic parts using Attention-based Feature Fusion on PASCAL Parts 2010 dataset

computer-vision convolutional-neural-networks faster-rcnn feature-fusion object-detection pytorch self-attention

Last synced: 10 Jul 2025

https://github.com/nickorzha/road-lane-detection

A computer vision project for detecting road lanes in real-time video streams. This project makes use of OpenCV, a popular open-source computer vision library, to process video frames and identify the road lanes.

artificial-neural-networks computer-vision object-detection opencv opencv-python

Last synced: 29 Oct 2025

https://github.com/sovit-123/image-deblurring-using-deep-learning

PyTorch implementation of image deblurring using deep learning. Use a simple convolutional autoencoder neural network to deblur Gaussian blurred images.

computer-vision computer-vision-neural-networks convolutional-neural-networks deep-learning deep-neural-networks deeplearning-image-deblur image-deblur image-deblurring machine-learning neural-networks pytorch

Last synced: 11 Apr 2025

https://github.com/ankit1khare/Easy_street_parking_with_MASK-RCNN

A prototype that can be worked into a full-fledge real-time parking management app. Park_clever notebook contains the latest code.

colab-notebook colaboratory computer-vision mask-rcnn opencv-python opencv3 parking-automation parking-management parking-spots street-parking

Last synced: 19 Jul 2025

https://github.com/shiyuanh/TANE

Code Repository for "Task-Adaptive Negative Envision for Few-Shot Open-Set Recognition"

computer-vision cvpr2022 few-shot few-shot-open-set-recognition meta-learning

Last synced: 08 May 2025

https://github.com/soumik12345/colorization-using-optimization

Python and C++ implementations of a user-guided image/video colorization technique as proposed by the paper Colorization Using Optimization

colorization computer-vision cpp14 eigen opencv python

Last synced: 14 Jul 2025

https://github.com/allenai/phone2proc

๐Ÿ“ฑ๐Ÿ‘‰๐Ÿ  Perform conditional procedural generation to generate houses like your own!

ai2-thor artificial-intelligence computer-vision embodied-ai procthor robotics

Last synced: 13 Oct 2025

https://github.com/auduno/mosse

Fast javascript correlation filters for tracking objects in video

computer-vision

Last synced: 13 Jun 2025

https://github.com/amogh7joshi/engagement-detection

Engagement Detection, including facial detection and emotion recognition, using CNNs/LSTMs.

cnn computer-vision emotion-recognition engagement-prediction facial-detection fer2013 keras lstm neural-networks tensorflow

Last synced: 28 Apr 2025

https://github.com/yassersouri/task-driven-object-detection

Author's implementation of "What Object Should I Use? - Task Driven Object Detection" (CVPR 2019)

computer-vision cvpr cvpr2019 deep-learning

Last synced: 02 Feb 2026

https://github.com/avik-pal/deeplearningbenchmarks

Benchmarks across Deep Learning Frameworks in Julia and Python

benchmark computer-vision conv2d flux gpu julia machine-learning pytorch

Last synced: 17 Mar 2026

https://github.com/sgrvinod/wide-residual-nets-for-seti

Classification of simulated radio signals using Wide Residual Networks for use in the search for extra-terrestrial intelligence

astronomy astronomy-instrumentation astrophysics computer-vision ml4seti seti signal wide-residual-networks

Last synced: 10 Apr 2025

https://github.com/bytedance/ohta

[CVPR2024] OHTA: One-shot Hand Avatar via Data-driven Implicit Priors

3d-hand-reconstruction 3d-vision avatar computer-vision cvpr cvpr2024 deep-learning hand-pose-estimation neural-rendering research

Last synced: 13 Apr 2025

https://github.com/roboflow/quickstart-python

Start using computer vision in two minutes with our interactive Python notebook experience.

computer-vision

Last synced: 05 May 2025

https://github.com/victorca25/augmennt

Augmentations for Neural Networks. Implementation of Torchvision's transforms using OpenCV and additional augmentations for super-resolution, restoration and image to image translation.

anisotropic bsrgan computer-vision data-augmentation deblur denoise image-processing opencv pytorch real-esrgan super-resolution superpixels unprocessing-images

Last synced: 08 May 2025

https://github.com/rishit-dagli/3d-transforms

3D Transforms is a library to easily work with 3D data and make 3D transformations. This library originally started as a few functions here and there for my own work which I then turned into a library.

3d 3d-graphics 3d-models 3d-reconstruction collaborate computer-vision deep-learning github github-campus-experts github-codespaces github-pages machine-learning tensorflow

Last synced: 04 Aug 2025

https://github.com/tonysy/drn-mxnet

Dense Relation Network: Learning Consistent and Context-Aware Representation For Semantic Image Segmentation. Modification of DRN source code

artificial-intelligence computer-vision deep-learning semantic-segmentation

Last synced: 04 Oct 2025

https://github.com/donny-hikari/facelock

Recognize your face, lock your computer when you are absent or when others appear.

cnn computer-vision deep-learning face-recognition machine-learning opencv

Last synced: 12 Apr 2025

https://github.com/orestis-z/facial-beauty-predictor

Deep learning model to predict a beauty score for faces in images. Outperforms the state-of-the-art by up to 18% (2019).

computer-vision deep-learning deep-neural-networks facial-beauty-prediction scikit-learn tensorflow

Last synced: 18 Sep 2025

https://github.com/yochengliu/scasnet

Semantic Labeling in VHR Images via A Self-Cascaded CNN (ISPRS JPRS, IF=6.942)

computer-vision context-aware deep-learning multi-context multi-scale remote-sensing semantic-labelling semantic-segmentation

Last synced: 20 Feb 2026

https://github.com/matlab-deep-learning/mtcnn-face-detection

Face detection and alignment using deep learning

computer-vision deep-learning face-detection matlab mtcnn

Last synced: 07 May 2025

https://github.com/ai-forever/easy_sign

Easy_sign is an open source russian sign language recognition project that uses small CPU model for predictions and is designed for easy deployment via Streamlit.

ai4good computer-vision deep-learning gesture-recognition open-source russian-language sign-language-recognition sign-language-recognition-system

Last synced: 19 Apr 2025

https://github.com/krasch/simple_landmarks

Ridiculously simple landmark annotation tool

annotation computer-vision computer-vision-annotation-tool

Last synced: 03 Jul 2025

https://github.com/yuhui-zh15/autoconverter

Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 2025)

computer-vision machine-learning natural-language-processing vision-language vision-language-model

Last synced: 19 Nov 2025

https://github.com/davidstutz/hierarchical-graph-based-video-segmentation

Implementation of the hierarchical graph-based video segmentation algorithm proposed by Grundmann et al. [1] based on the image segmentation algorithm by Felzenswalb and Huttenlocher [2].

computer-vision opencv video-segmentation

Last synced: 23 Apr 2025

https://github.com/tsaishien-chen/SPAN

Semantics-guided Part Attention Network (ECCV 2020 Oral)

computer-vision eccv2020 pytorch-implementation span vehicle-reidentification veri-776

Last synced: 16 Apr 2025

https://github.com/goktug97/pydbow

Python Implementation of Bags of Binary Words

bag-of-visual-words computer-vision python slam visual-odometry

Last synced: 26 Apr 2025

https://github.com/carlo-/sepconv-ios

CoreML+Metal implementation of "Video Frame Interpolation via Adaptive Separable Convolution"

accelerate computer-vision coreml cv deep-learning frame-interpolation ios metal metalkit ml onnx pytorch sepconv swift video-frame-interpolation

Last synced: 16 May 2025

https://github.com/cloud-annotations/javascript-sdk

Use custom trained Cloud Annotations models with TensorFlow.js

computer-vision javascript machine-learning object-detection tensorflow

Last synced: 08 Jul 2025

https://github.com/Sukhdip-Sandhu/Automatic-Watermark-Removal

Python computer vision project that aims to automatically remove the watermarks of stock images. The algorithm is designed off of those of Google researchers

algorithm computer-vision google python

Last synced: 13 Apr 2025

https://github.com/codait/magicat

๐Ÿง™๐Ÿ˜บ magicat - Deep learning magic.. with the convenience of cat!

cli-utilities computer-vision deep-learning image-recognition image-segmentation

Last synced: 13 Apr 2025

https://github.com/inexplicablemagic/photodedupe

A utility for locating near duplicate photos irrespective of image resolution, compression settings or file format.

computer-vision computer-vision-tools deduplication duplicate-detection image-deduplication

Last synced: 11 Jan 2026

https://github.com/outsorcerer/pocket-automl-android-tutorial

Pocket AutoML: Tutorial for Creating an Android App for Image Classification with Deep Learning

android automl computer-vision deep-learning deep-learning-tutorial tensorflow tensorflow-examples tensorflow-lite tensorflow-tutorials tutorial

Last synced: 23 Jun 2025

https://github.com/krshrimali/digit-recognition-mnist-svhn-pytorch-cpp

Implementing CNN for Digit Recognition (MNIST and SVHN dataset) using PyTorch C++ API

cnn computer-vision cpp deeplearning digit-recognition libtorch mnist mnist-dataset pytorch

Last synced: 12 Jul 2025

https://github.com/kadirnar/yolov7-pip

This repo is a packaged version of the Yolov7 model.

computer-vision object-detection yolov7

Last synced: 13 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-Skeleton-based-Action-Recognition 104 Awesome-3D-Object-Detection 169 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-state-of-depth-completion 71 awesome-robotics-datasets 79 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97