An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/victorqribeiro/naivecorners

A naive algorithm to identify corners on a image

computer-vision corner-detection image-processing naive python

Last synced: 05 May 2025

https://github.com/trishume/smartheadtracker

WIP OpenCV webcam head tracker based on coloured markers

computer-vision halide head-tracking opencv

Last synced: 14 Apr 2025

https://github.com/neelanjan00/parkinsons-detection

An AI-based mobile application that is able to diagnose Parkinson's Disease using two independent tests that require only a pencil and a paper. Based on the 2017 research paper Distinguishing Different Stages of Parkinson's Disease Using Composite Index of Speed and Pen-Pressure of Sketching a Spiral by Zham et. al. The trained models were deployed using a Flask backend server, along with a Flutter based frontend mobile application frontend to interact with the REST API.

computer-vision flask machine-learning parkinsons-disease python

Last synced: 23 Mar 2025

https://github.com/mli0603/tatoo

TAToo ("Vision-based Joint Tracking of Anatomy and Tool for Skull-base Surgery"), IPCAI 2023.

computer-vision deep-learning object-tracking

Last synced: 20 Aug 2025

https://github.com/lukexyz/flightvision

๐ŸŽฅ๐ŸŽฏ Tracking dart coordinates with fastai v2

computer-vision dart

Last synced: 19 Mar 2025

https://github.com/jakarto3d/jakarnotator

The Jakarnotator is an annotation tool to create your own database for instance segmentation problem.

annotations computer-vision data database deep-learning detectron instance-segmentation mscoco training-data

Last synced: 15 May 2025

https://github.com/wonder-tree/posecamera

PoseCamera: Realtime Human Pose Estimation Python SDK

computer-vision mobilenet opencv openpose pose-estimation python pytorch

Last synced: 23 Jul 2025

https://github.com/mmasana/ternaryfeaturemasks

Implementation of "Ternary Feature Masks: zero-forgetting for task-incremental learning"

computer-vision continual-learning machine-learning task-incremental-learning zero-forgetting

Last synced: 05 Feb 2026

https://github.com/kachayev/ssl-in-one-epoch

Reproduce results of "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch" paper.

computer-vision pytorch self-supervised-learning

Last synced: 14 Oct 2025

https://github.com/ahadcove/legoblobdetection

Let's walk through on counting Lego using OpenCV's Blob Detection

blob-detection computer-vision lego opencv python

Last synced: 21 Aug 2025

https://github.com/andrewliao11/longperceptualthoughts

[COLM'25] The official implementation of "LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception"

computer-vision large-language-models reasoning reasoning-language-models vision-language-model visual-reasoning

Last synced: 23 Sep 2025

https://github.com/wkentaro/chainer-bicyclegan

Chainer implementation of "Toward Multimodal Image-to-Image Translation".

chainer computer-graphics computer-vision deep-learning gan generative-adversarial-network image-generation pix2pix

Last synced: 19 Jun 2025

https://github.com/oriyarden/computer-vision-image-processing-object-detection-tracking-in-python-from-scratch

Video image processing computer vision to remove background and foreground for object detection and tracking in Python from scratch using only numpy arrays.

algorithms computer-vision from-scratch google-colab-notebook image-processing kernel numpy numpy-arrays object-detection object-tracking python video-processing

Last synced: 10 May 2026

https://github.com/ivanrs297/endoscopycorruptions

The endoscopycorruptions Python package provides utilities to simulate common image corruptions that might occur during endoscopic procedures. This tool is designed to assist in the development and testing of image processing algorithms intended for endoscopic imagery by introducing realistic corruptions into clean images.

computer-vision data-science machine-learning medical-imaging python

Last synced: 25 Apr 2026

https://github.com/dalageo/iddroadsegmentation

Distinguishing Urban Roads from Non-Road Regions in the Indian Driving Dataset(IDD) Using Binary Deep Learning Segmentation ๐Ÿ›ฃ๏ธ

cnn computer-vision deep-learning efficientnet fpn image-segmentation python pytorch segmentation-models u-net

Last synced: 04 Sep 2025

https://github.com/aqafridi/tensorflow-advanced-techniques

Most advanced Machine Learning and Artificial Intelligence techniques using TensorFlow: a free and open-source software library. AI is fun to grow exponentially in all aspect Once you know, how to use it.

computer-vision deep-learning gan machine-learning tensorflow tensorflow-tutorials

Last synced: 24 Oct 2025

https://github.com/sondosaabed/introduction-to-computer-vision

introduction to computer vision including fundamentals, methods for application and machine learning classification.

computer-vision image-analysis image-processing segmentation

Last synced: 09 Apr 2025

https://github.com/junjuew/tf_object_detection

A Thin Wrapper around Tensorflow Object Detection API for Easy Installation and Use

computer-vision deep-learning deep-neural-networks dependency object-detection object-recognition pypi tensorflow tensorflow-models

Last synced: 24 Oct 2025

https://github.com/cvjena/tensordecompositions4pinns

Code accompanying manuscript "Functional Tensor Decompositions for Physics-Informed Neural Networks"

computer-vision pde-solver pinns

Last synced: 28 Jan 2026

https://github.com/dipghoshraj/virtual-pen

virtual pen is a computer vision program to track object movement and draw line according the movement.

computer-vision object-tracking opencv python python3 virtual-pen

Last synced: 12 Apr 2025

https://github.com/darcyai/darcyai

Darcy AI Engine is a Python library for building AI apps

ai ai-pipeline computer-vision edge edge-computing edgeai iot open-source opencv python raspberry-pi

Last synced: 04 Jul 2025

https://github.com/kennethleungty/image-augmentation-libraries

Sample implementation codes for a variety of popular image augmentation Python packages

albumentations augmentor computer-vision deep-learning image-augmentation imgaug opencv python skimage torchvision

Last synced: 12 Jul 2025

https://github.com/nburgdorfer/cvtkit

A collection of some useful (or maybe just interesting) computer vision tools.

3d-reconstruction computer-vision python

Last synced: 14 Jan 2026

https://github.com/unixpickle/autorot

Use Machine Learning to rotate your images automatically

computer-vision machine-learning

Last synced: 02 May 2025

https://github.com/amaansayyad/covid-19_face_mask_recognition_system

The Face Mask Recognition System uses AI technology to detect the person with or without a mask. It can be connected with any surveillance system installed at your premises. The authorities or admin can check the person through the system to confirm whether the person is wearing a mask or not.

computer-vision coronavirus covid-19 covid19 deep-learning face-detection face-recognition facemask-detection human-pose-estimation jupyter-notebook keras keras-tensorflow machinelearning opencv python3 tenserflow trending

Last synced: 02 Jul 2025

https://github.com/margaretmz/icon-classifier

An Icon classifier made with TFLite Model Maker and deployed to Android with ML Model Binding.

android computer-vision deep-learning tensorflow-lite tflite

Last synced: 07 Apr 2025

https://github.com/lindronics/flir_app

Android application for the real-time classification of FLIR One thermal camera images using a deep convolutional neural network built with TensorFlow. Part of my final year honours project.

android computer-vision flir flir-cameras flir-one tensorflow tensorflow-lite

Last synced: 14 Oct 2025

https://github.com/o-b-one/pose-estimator

An application that is able to estimate what pose a person found at, and count the number of repeats during live/recorded exercise.

computer-vision knn python react resnet-50 tensorflow tflite

Last synced: 10 Apr 2025

https://github.com/kareimgazer/perception-stack

software pipeline to identify the lane boundaries and cars in a video for autonomous self-driving cars

car-detection computer-vision deep-learning hog image-processing lane-detection object-detection perception self-driving-car svm yolov3

Last synced: 14 Feb 2026

https://github.com/bonsai-rx/sleap

A Bonsai interface for real-time multi-animal pose tracking using SLEAP

bonsai-rx computer-vision

Last synced: 23 Jul 2025

https://github.com/suriyaavijay/posture-recognition-using-opencv-and-mediapipe

Posture correction using computer vision and Mediapipe library enables the detection and correction of poor posture in images and live videos, promoting a healthier lifestyle in the digital era.

computer-vision digital-wellbeing ipynb jupyter-notebook mediapipe opencv posture-advisor posture-recognition python

Last synced: 28 Oct 2025

https://github.com/tarikseyceri/ollamaoptivus.ai

Ollama Optivus is a video processing AI that extracts meaningful insights from videos using AI technologies like YOLO for object detection, EasyOCR for text extraction, and Whisper for speech-to-text. Integrated with Ollama LLM, it provides detailed summaries and event analysis, supporting various advanced AI model prompting.

ai artificial-intelligence computer-vision deepseek deepseek-r1 docker express-js llm node nodejs ocr ollama ollama-api opencv python video-processing video-summarization whisper-ai yolo yolov8

Last synced: 11 May 2025

https://github.com/q-viper/corn-infection-detection

A Computer Vision Project done from corn field to COLAB.

agriculture-research computer-vision object-detection

Last synced: 04 Apr 2026

https://github.com/aniekanbane/touch-free-keyboard

Virtual touchscreen keyboard using OpenCV and cvzone module

computer-vision ironman mediapipe opencv

Last synced: 18 Apr 2026

https://github.com/yyccphil/kegms

Key frame extraction based on global motion statistics for team-sport videos

computer-vision globalmotion key-frame key-frames-extraction optical-flow sports-analytics statistics team-sports video-analysis

Last synced: 18 Jan 2026

https://github.com/siyuanwu99/dsec

Paper Reproduction for "Learning Monocular Dense Depth from Events" | CS4245 Computer Vision by Deep Learning course project

computer-vision course-project event-camera paper-reproduction

Last synced: 26 Apr 2025

https://github.com/muntahashams/object-detection-using-yolov3

YOLO is a state-of-the-art, real-time object detection algorithm. In this notebook, I had applied the YOLO algorithm to detect objects in images ,videos and webcam

computer-vision deep-learning detect-objects detection-algorithm image machine-learning object-detection object-detection-model udacity-nanodegree video webcam yolo yolo-algorithm

Last synced: 09 Apr 2025

https://github.com/imwildcat/aitk

Artificial Intelligence Toolkit, a powerful tool that makes your life better.

ai baidu cloud-ai cloud-ml-engine computer-vision google machine-learning machine-translation nlp speech-recognition tencent text-to-speech

Last synced: 19 Apr 2025

https://github.com/tomrunia/repetitionestimation

CVPR 2018: Real-World Repetition Estimation by Div, Grad and Curl

computer-vision cvpr2018 machine-learning motion-estimation research video

Last synced: 02 Mar 2025

https://github.com/ashok-arjun/gaussian-poisson-gans-for-image-blending

Blending composite images using a generative model and a Gaussian-Poisson equation with a Laplacian Pyramid

blending-images computer-vision deep-learning gan gans generative-adversarial-network machine-learning pytorch

Last synced: 28 Jul 2025

https://github.com/blurred-machine/computer-vision-image-classification

In this repository I have implemented computer vision on MNIST dataset for images classification for digits between 0-9, fashion clothings and sign language hand signals. The models are implemented using TesorFlow. Feel free to send a PR for any oprimization or modification.

computer-vision data-science deeplearning images-classification machine-learning mnist-dataset python

Last synced: 10 Sep 2025

https://github.com/sayakpaul/analytics-vidhya-game-of-deep-learning-hackathon

Contains my experiments for the Game of Deep Learning Hackathon conducted by Analytics Vidhya

active-learning analytics-vidhya computer-vision data-cleaning deep-learning fastai label-noise

Last synced: 28 Jul 2025

https://github.com/hritik5102/fundamentals_of_ds_ml_dl

The repository encompasses the core concepts of Python, Statistics, Machine Learning, Deep Learning, Computer Vision, and Natural Language Processing.

computer-vision data-science deep-learning machine-learning neural-network statistics

Last synced: 13 May 2025

https://github.com/alesiong/template-matching

Simple template matching by GPU (CUDA)

computer-vision cuda template-matching

Last synced: 30 Apr 2025

https://github.com/butlerlabs/docai

DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications

computer-vision information-extraction information-retrieval language-models machine-learning machine-learning-library natural-language-processing nlp nlp-library ocr ocr-python pretrained-models python

Last synced: 01 Mar 2026

https://github.com/neka-nat/cuimage

Rust implementation of image processing library with CUDA

computer-vision cuda rust

Last synced: 13 Apr 2025

https://github.com/shiqimei/digit-recognition

Recognize the handwritten digits online with FCNet which is powered by MNIST dataset ๐Ÿ˜„

computer-vision handwritten-character-recognition neural-network visualization

Last synced: 19 Mar 2025

https://github.com/boechat107/eye-boof

Clojure image processing library based on BoofCV.

boofcv clojure computer-vision image-processing

Last synced: 30 Apr 2025

https://github.com/mr7495/Pesteh-Set

Pesteh-Set, the pistachios dataset from the paper https://doi.org/10.1007/s42044-021-00090-6

closed-mouth-pistachios computer-vision deep-learning deep-neural-networks open-mouth-pistachios pistachio

Last synced: 05 Apr 2025

https://github.com/xtalism/object-detection

Object detection with AI using YOLO V8, Opencv and Python 3.8.

artificial-intelligence computer-vision object-detection python3 yolov8

Last synced: 10 Apr 2025

https://github.com/yahia3200/face-detection-with-viola-jones-algorithm

๐Ÿ“ธ A face detector uses the Viola Jones algorithm implemented with Python.

computer-vision face-detection machine-learning python

Last synced: 12 Jul 2025

https://github.com/wondervictor/dpl.pytorch

Pytorch Implementation for "Deep Patch Learning for Weakly Supervised Object Classification and Discovery"

computer-vision deep-learning object-detection weakly-supervised-detection weakly-supervised-learning

Last synced: 13 Jul 2025

https://github.com/emalagoli92/gcvit-tensorflow

TensorFlow 2.X reimplementation of Global Context Vision Transformers, Ali Hatamizadeh, Hongxu (Danny) Yin, Jan Kautz Pavlo Molchanov.

computer-vision deep-learning image-classification python pytorch tensorflow transformers

Last synced: 12 Apr 2025

https://github.com/li-plus/ssm-vos

Separable Structure Modeling for Semi-supervised Video Object Segmentation

computer-vision davis-challenge machine-learning segmentation semi-supervised video-object-segmentation vos youtube-vos

Last synced: 18 Jul 2025

https://github.com/gustavz/computer_vision

TUM Master Course "Computer Vision" Assignment Solutions

coding computer-vision matlab tum university-course

Last synced: 20 Jul 2025

https://github.com/mayukhdeb/deep-chicken-saviour

using adversarial attacks to confuse deep-chicken-terminator :shield: :chicken:

adversarial-attacks adversarial-examples computer-vision fgsm object-detection opencv pytorch

Last synced: 18 Mar 2025

https://github.com/owalid/leaf-valley

[ ๐Ÿค–๐ŸŒฟ COMPUTER VISION ] Contain AI models and plants images processing. The programs aim to recognize from leaf pictures, the species and the disease if present.

cnn-classification cnn-keras computer-vision kubernetes machine-learning nuxt segmentation

Last synced: 17 Jan 2026

https://github.com/kupynorest/amurtigercvwc

[ICCV2019] Challenge - Computer Vision for Wildlife Conservation Solution

computer-vision deep-learning iccv iccv2019 object-detection wildlife-conservation

Last synced: 09 May 2025

https://github.com/edaaydinea/python-ml-dl-ds-projects

This repository is included artificial intelligence, machine learning, data science, computer vision projects which are written Python language.

computer-vision data-science deep-learning machine-learning projects python

Last synced: 02 Jul 2025

https://github.com/daleonpz/cv_yolo_on_esp32

Screw type detection using ESP-EYE, YOLOv5, and TensorFlow Lite Micro for real-time classification on ESP32.

computer-vision esp32 machne-learning tinyml yolo yolov5

Last synced: 09 Oct 2025

https://github.com/pekkaran/violet

A toy stereo visual inertial odometry (VIO) system

computer-vision ekf extended-kalman-filter kalman-filter msckf slam vio visual-inertial-odometry

Last synced: 18 Jan 2026

https://github.com/yakovenkodenis/dogs-vs-cats-kaggle

A tiny CNN for Dogs vs Cats Redux Kernels Edition on Kaggle

computer-vision convolutional-neural-networks deeplearning kaggle-competiton keras

Last synced: 09 Apr 2025

https://github.com/khyatimahendru/eigenfaceswithsvd

Facial Recognition on 'Labelled Faces in the Wild Dataset' using the concept of Eigenfaces. I have used Singular Value Decomposition to obtain the eigenfaces used.

classification computer-vision decomposition eigenfaces face-recognition facial-recognition labelled-faces linear-algebra svd svd-factorization

Last synced: 07 Apr 2025

https://github.com/aiwithqasim/streamlit-bootcamp

I'll add all the classes material & projects related to the HacktoberFest2021 Open source Event & Streamlit . https://streamlit.io/

computer-vision hacktoberfest hacktoberfest2021 streamlit-webapp

Last synced: 17 Mar 2025

https://github.com/Cheffromspace/MCPControl

Cross-platform MCP server for OS automation

ai claude computer-control computer-vision mcp os-automation windows

Last synced: 16 Mar 2025

https://github.com/natmlx/blazeface-unity

Realtime face detection in Unity Engine with NatML and NatDevice.

computer-vision deep-learning face-detection machine-learning natml

Last synced: 25 Jul 2025

https://github.com/salim4n/pro-pretorian-system

This project is a Computer Vision application that allows users to customize their detection parameters according to their needs. Whether you want to detect specific objects, define the area of interest, or schedule detection times, this application provides flexible and powerful options.

azure-app-service azure-storage azure-table-storage ci-cd computer-vision mediapipe nextjs python shadcn-ui telegram-bot tensorflowjs typescript yolov8 zod

Last synced: 14 Jun 2025

https://github.com/madeyoga/emnist-cnn

Handwritten Character Recognition. EMNIST dataset on Kaggle. Tensorflow2 - Keras - CNN - 0.85 evaluation.

computer-vision emnist emnist-classification handwritten-character-recognition tensorflow2

Last synced: 28 Oct 2025

https://github.com/quva-lab/timeception

Please check the updated repository https://github.com/noureldien/timeception

action-recognition computer-vision convolutional-neural-networks temporal-models

Last synced: 01 Aug 2025

https://github.com/autodistill/autodistill-sam-hq

Segment Anything HQ (SAM HQ) module for use with Autodistill.

autodistill computer-vision instance-segmentation samhq segment-anything

Last synced: 14 Apr 2025

https://github.com/azazdeaz/homography

Rust reimplementation of OpenCV's homography estimation algorithm

computer-vision computer-vision-algorithms homography homography-estimator rust

Last synced: 12 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-state-of-depth-completion 71 awesome-robotics-datasets 79 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97