An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/aimaster-dev/image-recognition-service

Image Recognition as a Service is a scalable AWS-powered web app that exposes a deep learning model via a REST API, using EC2 auto-scaling and load balancing to handle image input and return predictions efficiently.

api-development auto-scaling aws aws-lambda aws-s3 cloud-computing computer-vision deep-learning ec2 elastic-search image-analysis image-recognition load-balancer python rest-api serverless

Last synced: 09 May 2026

https://github.com/dayyass/synthetic-health-data-hackathon-2020

Synthetic Health Data Hackathon 2020 Alzheimer's MRI Analysis solution.

bioinformatics computer-vision image-classification mri-images pytorch synthetic-data

Last synced: 03 Feb 2026

https://github.com/fancyboi999/ai-engineering-from-scratch-zh

Master AI engineering from scratch · 20 stages, 468 lessons · Full Chinese translation plus companion site. A guide to becoming an Agent engineer.

agents ai ai-agents ai-engineering chinese chinese-translation computer-vision course deep-learning from-scratch generative-ai learn-ai llm machine-learning mcp nlp python reinforcement-learning transformers tutorial

Last synced: 01 Jun 2026

https://github.com/maheshkkumar/adacrowd

AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting (TMM 2021)

computer-vision crowd-counting few-shot-learning pytorch

Last synced: 15 Feb 2026

https://github.com/cbaezp/roulette

Proof of concept using a custom-trained vision model for object tracking. First step in exploring AI risks and use cases for online casinos.

computer-vision machine-learning roulette

Last synced: 04 Apr 2026

https://github.com/mc-cat-tty/DoorbellCamDaemon

Part of the DoorbellCam project: a daemon for people recognition with YOLO from an RTSP video stream.

cicd computer-vision container cpp docker doorbell ezviz mqtt mqtt-client opencv rtsp-client rtsp-stream smart-door smart-doorbell yolo yolov5

Last synced: 21 Apr 2025

https://github.com/cvjena/tensordecompositions4pinns

Code accompanying manuscript "Functional Tensor Decompositions for Physics-Informed Neural Networks"

computer-vision pde-solver pinns

Last synced: 28 Jan 2026

https://github.com/oriyarden/computer-vision-image-processing-object-detection-tracking-in-python-from-scratch

Background and foreground removal in video for object detection and tracking, implemented in Python from scratch using only NumPy arrays.

algorithms computer-vision from-scratch google-colab-notebook image-processing kernel numpy numpy-arrays object-detection object-tracking python video-processing

Last synced: 10 May 2026

https://github.com/aiwithqasim/streamlit-bootcamp

I'll add all the class material and projects related to the Hacktoberfest 2021 open source event and Streamlit. https://streamlit.io/

computer-vision hacktoberfest hacktoberfest2021 streamlit-webapp

Last synced: 17 Mar 2025

https://github.com/kachayev/ssl-in-one-epoch

Reproduce results of "EMP-SSL: Towards Self-Supervised Learning in One Training Epoch" paper.

computer-vision pytorch self-supervised-learning

Last synced: 14 Oct 2025

https://github.com/yyccphil/kegms

Key frame extraction based on global motion statistics for team-sport videos

computer-vision globalmotion key-frame key-frames-extraction optical-flow sports-analytics statistics team-sports video-analysis

Last synced: 18 Jan 2026

https://github.com/pekkaran/violet

A toy stereo visual inertial odometry (VIO) system

computer-vision ekf extended-kalman-filter kalman-filter msckf slam vio visual-inertial-odometry

Last synced: 18 Jan 2026

https://github.com/owalid/leaf-valley

[ 🤖🌿 COMPUTER VISION ] Contains AI models and plant image processing code. The programs aim to recognize the species, and the disease if present, from leaf pictures.

cnn-classification cnn-keras computer-vision kubernetes machine-learning nuxt segmentation

Last synced: 17 Jan 2026

https://github.com/kupynorest/amurtigercvwc

[ICCV2019] Challenge - Computer Vision for Wildlife Conservation Solution

computer-vision deep-learning iccv iccv2019 object-detection wildlife-conservation

Last synced: 09 May 2025

https://github.com/xtalism/object-detection

Object detection with AI using YOLOv8, OpenCV, and Python 3.8.

artificial-intelligence computer-vision object-detection python3 yolov8

Last synced: 10 Apr 2025

https://github.com/yahia3200/face-detection-with-viola-jones-algorithm

📸 A face detector using the Viola-Jones algorithm, implemented in Python.

computer-vision face-detection machine-learning python

Last synced: 12 Jul 2025

https://github.com/butlerlabs/docai

DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications

computer-vision information-extraction information-retrieval language-models machine-learning machine-learning-library natural-language-processing nlp nlp-library ocr ocr-python pretrained-models python

Last synced: 01 Mar 2026

https://github.com/wondervictor/dpl.pytorch

PyTorch implementation of "Deep Patch Learning for Weakly Supervised Object Classification and Discovery"

computer-vision deep-learning object-detection weakly-supervised-detection weakly-supervised-learning

Last synced: 13 Jul 2025

https://github.com/alesiong/template-matching

Simple template matching by GPU (CUDA)

computer-vision cuda template-matching

Last synced: 30 Apr 2025

https://github.com/emalagoli92/gcvit-tensorflow

TensorFlow 2.X reimplementation of Global Context Vision Transformers by Ali Hatamizadeh, Hongxu (Danny) Yin, Jan Kautz, and Pavlo Molchanov.

computer-vision deep-learning image-classification python pytorch tensorflow transformers

Last synced: 12 Apr 2025

https://github.com/li-plus/ssm-vos

Separable Structure Modeling for Semi-supervised Video Object Segmentation

computer-vision davis-challenge machine-learning segmentation semi-supervised video-object-segmentation vos youtube-vos

Last synced: 18 Jul 2025

https://github.com/mr7495/Pesteh-Set

Pesteh-Set, the pistachios dataset from the paper https://doi.org/10.1007/s42044-021-00090-6

closed-mouth-pistachios computer-vision deep-learning deep-neural-networks open-mouth-pistachios pistachio

Last synced: 05 Apr 2025

https://github.com/neka-nat/cuimage

Rust implementation of image processing library with CUDA

computer-vision cuda rust

Last synced: 13 Apr 2025

https://github.com/boechat107/eye-boof

Clojure image processing library based on BoofCV.

boofcv clojure computer-vision image-processing

Last synced: 30 Apr 2025

https://github.com/shiqimei/digit-recognition

Recognize the handwritten digits online with FCNet which is powered by MNIST dataset 😄

computer-vision handwritten-character-recognition neural-network visualization

Last synced: 19 Mar 2025

https://github.com/mayukhdeb/deep-chicken-saviour

Using adversarial attacks to confuse deep-chicken-terminator 🛡️ 🐔

adversarial-attacks adversarial-examples computer-vision fgsm object-detection opencv pytorch

Last synced: 18 Mar 2025

https://github.com/hritik5102/fundamentals_of_ds_ml_dl

The repository encompasses the core concepts of Python, Statistics, Machine Learning, Deep Learning, Computer Vision, and Natural Language Processing.

computer-vision data-science deep-learning machine-learning neural-network statistics

Last synced: 13 May 2025

https://github.com/edaaydinea/python-ml-dl-ds-projects

This repository includes artificial intelligence, machine learning, data science, and computer vision projects written in Python.

computer-vision data-science deep-learning machine-learning projects python

Last synced: 02 Jul 2025

https://github.com/gustavz/computer_vision

TUM Master Course "Computer Vision" Assignment Solutions

coding computer-vision matlab tum university-course

Last synced: 20 Jul 2025

https://github.com/unixpickle/autorot

Use Machine Learning to rotate your images automatically

computer-vision machine-learning

Last synced: 02 May 2025

https://github.com/jakarto3d/jakarnotator

The Jakarnotator is an annotation tool for creating your own database for instance segmentation problems.

annotations computer-vision data database deep-learning detectron instance-segmentation mscoco training-data

Last synced: 15 May 2025

https://github.com/autodistill/autodistill-sam-hq

Segment Anything HQ (SAM HQ) module for use with Autodistill.

autodistill computer-vision instance-segmentation samhq segment-anything

Last synced: 14 Apr 2025

https://github.com/div99/crashdetection

Automatic detection and alert of vehicle accidents

accident-prediction azure computer-vision detection machine-learning

Last synced: 18 Jul 2025

https://github.com/kennethleungty/image-augmentation-libraries

Sample implementation code for a variety of popular image augmentation Python packages

albumentations augmentor computer-vision deep-learning image-augmentation imgaug opencv python skimage torchvision

Last synced: 12 Jul 2025

https://github.com/ahadcove/legoblobdetection

Let's walk through counting Lego using OpenCV's blob detection

blob-detection computer-vision lego opencv python

Last synced: 21 Aug 2025

https://github.com/ashok-arjun/gaussian-poisson-gans-for-image-blending

Blending composite images using a generative model and a Gaussian-Poisson equation with a Laplacian Pyramid

blending-images computer-vision deep-learning gan gans generative-adversarial-network machine-learning pytorch

Last synced: 28 Jul 2025

https://github.com/sayakpaul/analytics-vidhya-game-of-deep-learning-hackathon

Contains my experiments for the Game of Deep Learning Hackathon conducted by Analytics Vidhya

active-learning analytics-vidhya computer-vision data-cleaning deep-learning fastai label-noise

Last synced: 28 Jul 2025

https://github.com/tarikseyceri/ollamaoptivus.ai

Ollama Optivus is a video processing AI that extracts meaningful insights from videos using AI technologies like YOLO for object detection, EasyOCR for text extraction, and Whisper for speech-to-text. Integrated with Ollama LLM, it provides detailed summaries and event analysis, supporting various advanced AI model prompting.

ai artificial-intelligence computer-vision deepseek deepseek-r1 docker express-js llm node nodejs ocr ollama ollama-api opencv python video-processing video-summarization whisper-ai yolo yolov8

Last synced: 11 May 2025

https://github.com/wonder-tree/posecamera

PoseCamera: Realtime Human Pose Estimation Python SDK

computer-vision mobilenet opencv openpose pose-estimation python pytorch

Last synced: 23 Jul 2025

https://github.com/aniekanbane/touch-free-keyboard

Virtual touchscreen keyboard using OpenCV and cvzone module

computer-vision ironman mediapipe opencv

Last synced: 18 Apr 2026

https://github.com/pvnieo/deep-levi

Automatic colorization using deep neural networks.

automatic-colorization colorization computer-vision deep-learning keras

Last synced: 01 Aug 2025

https://github.com/margaretmz/icon-classifier

An Icon classifier made with TFLite Model Maker and deployed to Android with ML Model Binding.

android computer-vision deep-learning tensorflow-lite tflite

Last synced: 07 Apr 2025

https://github.com/khyatimahendru/eigenfaceswithsvd

Facial recognition on the 'Labelled Faces in the Wild Dataset' using the concept of eigenfaces. I used singular value decomposition to obtain the eigenfaces.

classification computer-vision decomposition eigenfaces face-recognition facial-recognition labelled-faces linear-algebra svd svd-factorization

Last synced: 07 Apr 2025

https://github.com/wkentaro/chainer-bicyclegan

Chainer implementation of "Toward Multimodal Image-to-Image Translation".

chainer computer-graphics computer-vision deep-learning gan generative-adversarial-network image-generation pix2pix

Last synced: 19 Jun 2025

https://github.com/andrewliao11/longperceptualthoughts

[COLM'25] The official implementation of "LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception"

computer-vision large-language-models reasoning reasoning-language-models vision-language-model visual-reasoning

Last synced: 23 Sep 2025

https://github.com/amaansayyad/covid-19_face_mask_recognition_system

The Face Mask Recognition System uses AI technology to detect whether a person is wearing a mask. It can be connected to any surveillance system installed on your premises, so that the authorities or an admin can check through the system whether a person is wearing a mask.

computer-vision coronavirus covid-19 covid19 deep-learning face-detection face-recognition facemask-detection human-pose-estimation jupyter-notebook keras keras-tensorflow machinelearning opencv python3 tenserflow trending

Last synced: 02 Jul 2025

https://github.com/joehowarth/goscanner

Converts images of Go boards to .sgf file using OpenCV and Keras

computer-vision deep-learning opencv python sgf-files

Last synced: 22 Jul 2025

https://github.com/veb-101/tensorflow-specialization

This repository contains course notebooks from the TensorFlow for practice specialization on Coursera

computer-vision coursera-specialization deep-learning python3 sequence-models tensorflow2 time-series

Last synced: 06 Apr 2025

https://github.com/rembertdesigns/gemma3n-disaster-assistant

AI-powered disaster response platform with offline-first architecture using Gemma 3n. Provides computer vision hazard detection, voice analysis with emergency keywords, PDF report generation, and multi-user coordination - all working without internet access.

ai computer-vision emergency-response fastapi gemma3n kaggle kaggle-competition multimodal offline-disaster-response offline-sync ondevice-ai pwa triageservice voice-processing

Last synced: 21 Jul 2025

https://github.com/natmlx/blazeface-unity

Realtime face detection in Unity Engine with NatML and NatDevice.

computer-vision deep-learning face-detection machine-learning natml

Last synced: 25 Jul 2025

https://github.com/quva-lab/timeception

Please check the updated repository https://github.com/noureldien/timeception

action-recognition computer-vision convolutional-neural-networks temporal-models

Last synced: 01 Aug 2025

https://github.com/bonsai-rx/sleap

A Bonsai interface for real-time multi-animal pose tracking using SLEAP

bonsai-rx computer-vision

Last synced: 23 Jul 2025

https://github.com/siyuanwu99/dsec

Paper Reproduction for "Learning Monocular Dense Depth from Events" | CS4245 Computer Vision by Deep Learning course project

computer-vision course-project event-camera paper-reproduction

Last synced: 26 Apr 2025

https://github.com/pankajr141/experiments

This Repository contains the code base for performing some experiments. I will be reusing some of the codebase present here as a full scale repository

audio-processing computer-vision deep-learning reinforcement-learning

Last synced: 26 Jun 2025

https://github.com/salim4n/pro-pretorian-system

This project is a Computer Vision application that allows users to customize their detection parameters according to their needs. Whether you want to detect specific objects, define the area of interest, or schedule detection times, this application provides flexible and powerful options.

azure-app-service azure-storage azure-table-storage ci-cd computer-vision mediapipe nextjs python shadcn-ui telegram-bot tensorflowjs typescript yolov8 zod

Last synced: 14 Jun 2025

https://github.com/kareimgazer/perception-stack

Software pipeline to identify lane boundaries and cars in a video for self-driving cars

car-detection computer-vision deep-learning hog image-processing lane-detection object-detection perception self-driving-car svm yolov3

Last synced: 14 Feb 2026

https://github.com/aleksandergrzyb/masters-thesis

Spatial coregistration algorithms for angiographic imaging with optical coherence tomography.

computer-vision image-processing latex master-thesis spatial-coregistration-algorithms stitching

Last synced: 13 Apr 2025

https://github.com/junjuew/tf_object_detection

A Thin Wrapper around Tensorflow Object Detection API for Easy Installation and Use

computer-vision deep-learning deep-neural-networks dependency object-detection object-recognition pypi tensorflow tensorflow-models

Last synced: 24 Oct 2025

https://github.com/yakovenkodenis/dogs-vs-cats-kaggle

A tiny CNN for Dogs vs Cats Redux Kernels Edition on Kaggle

computer-vision convolutional-neural-networks deeplearning kaggle-competiton keras

Last synced: 09 Apr 2025

https://github.com/daleonpz/cv_yolo_on_esp32

Screw type detection using ESP-EYE, YOLOv5, and TensorFlow Lite Micro for real-time classification on ESP32.

computer-vision esp32 machne-learning tinyml yolo yolov5

Last synced: 09 Oct 2025

https://github.com/aqafridi/tensorflow-advanced-techniques

Advanced machine learning and artificial intelligence techniques using TensorFlow, a free and open-source software library. AI is fun and grows exponentially in every aspect once you know how to use it.

computer-vision deep-learning gan machine-learning tensorflow tensorflow-tutorials

Last synced: 24 Oct 2025

https://github.com/shan18/tensornet

A high-level deep learning library built on top of PyTorch.

computer-vision convolutional-neural-networks deep-learning deep-learning-framework python pytorch

Last synced: 18 Apr 2025

https://github.com/o-b-one/pose-estimator

An application that estimates a person's pose and counts repetitions during live or recorded exercise.

computer-vision knn python react resnet-50 tensorflow tflite

Last synced: 10 Apr 2025

https://github.com/lindronics/flir_app

Android application for the real-time classification of FLIR One thermal camera images using a deep convolutional neural network built with TensorFlow. Part of my final year honours project.

android computer-vision flir flir-cameras flir-one tensorflow tensorflow-lite

Last synced: 14 Oct 2025

https://github.com/suriyaavijay/posture-recognition-using-opencv-and-mediapipe

Posture correction using computer vision and Mediapipe library enables the detection and correction of poor posture in images and live videos, promoting a healthier lifestyle in the digital era.

computer-vision digital-wellbeing ipynb jupyter-notebook mediapipe opencv posture-advisor posture-recognition python

Last synced: 28 Oct 2025

https://github.com/mmasana/ternaryfeaturemasks

Implementation of "Ternary Feature Masks: zero-forgetting for task-incremental learning"

computer-vision continual-learning machine-learning task-incremental-learning zero-forgetting

Last synced: 05 Feb 2026

https://github.com/tomrunia/repetitionestimation

CVPR 2018: Real-World Repetition Estimation by Div, Grad and Curl

computer-vision cvpr2018 machine-learning motion-estimation research video

Last synced: 02 Mar 2025

https://github.com/imwildcat/aitk

Artificial Intelligence Toolkit, a powerful tool that makes your life better.

ai baidu cloud-ai cloud-ml-engine computer-vision google machine-learning machine-translation nlp speech-recognition tencent text-to-speech

Last synced: 19 Apr 2025

https://github.com/ivanrs297/endoscopycorruptions

The endoscopycorruptions Python package provides utilities to simulate common image corruptions that might occur during endoscopic procedures. This tool is designed to assist in the development and testing of image processing algorithms intended for endoscopic imagery by introducing realistic corruptions into clean images.

computer-vision data-science machine-learning medical-imaging python

Last synced: 25 Apr 2026

https://github.com/muntahashams/object-detection-using-yolov3

YOLO is a state-of-the-art, real-time object detection algorithm. In this notebook, I applied the YOLO algorithm to detect objects in images, videos, and webcam feeds.

computer-vision deep-learning detect-objects detection-algorithm image machine-learning object-detection object-detection-model udacity-nanodegree video webcam yolo yolo-algorithm

Last synced: 09 Apr 2025

https://github.com/darcyai/darcyai

Darcy AI Engine is a Python library for building AI apps

ai ai-pipeline computer-vision edge edge-computing edgeai iot open-source opencv python raspberry-pi

Last synced: 04 Jul 2025

https://github.com/sondosaabed/introduction-to-computer-vision

Introduction to computer vision, including fundamentals, methods for application, and machine learning classification.

computer-vision image-analysis image-processing segmentation

Last synced: 09 Apr 2025

https://github.com/madeyoga/emnist-cnn

Handwritten Character Recognition. EMNIST dataset on Kaggle. Tensorflow2 - Keras - CNN - 0.85 evaluation.

computer-vision emnist emnist-classification handwritten-character-recognition tensorflow2

Last synced: 28 Oct 2025

https://github.com/q-viper/corn-infection-detection

A computer vision project carried from the corn field to Colab.

agriculture-research computer-vision object-detection

Last synced: 04 Apr 2026

https://github.com/nburgdorfer/cvtkit

A collection of some useful (or maybe just interesting) computer vision tools.

3d-reconstruction computer-vision python

Last synced: 14 Jan 2026

https://github.com/azazdeaz/homography

Rust reimplementation of OpenCV's homography estimation algorithm

computer-vision computer-vision-algorithms homography homography-estimator rust

Last synced: 12 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707
awesome-multimodal-ml 480
awesome-self-supervised-learning 463
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-industrial-anomaly-detection 1,102
awesome-hand-pose-estimation 497
awesome-image-classification 220
Awesome-Crowd-Counting 468
awesome-human-pose-estimation 89
awesome-autonomous-vehicles 296
Awesome-World-Model 725
Awesome-Federated-Learning 558
Awesome-FL 4,069
awesome-low-light-image-enhancement 219
Awesome-pytorch-list-CNVersion 692
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-Implicit-NeRF-Robotics 191
iOS_ML 39
awesome-tensorflow-lite 110
CV-pretrained-model 103
awesome-attention-mechanism-in-cv 195
Awesome-Image-Colorization 150
awesome-grounding 157
openstl 43
Awesome-Open-Vocabulary 162
awesome-ai-awesomeness 236
awesome-capsule-networks 67
awesome-autonomous-vehicle 181
awesome-6d-object 600
awesome-multi-task-learning 233
awesome-robotics-3d 111
awesome-photogrammetry 90
awesome-open-data-centric-ai 56
awesome-ai-data-guided-projects 56
Awesome-3D-Object-Detection 169
Awesome-Skeleton-based-Action-Recognition 104
awesome-optical-flow 110
awesome-holistic-3d 129
awesome-data-annotation 93
Awesome-Parameter-Efficient-Transfer-Learning 124
awesome-panoptic-segmentation 45
awesome-computer-vision-models 189
awesome-state-of-depth-completion 71
awesome-robotics-datasets 79
awesome-nerf-editing 537
arctic 34
awesome-image-alignment-and-stitching 108
Awesome-Distributed-Deep-Learning 44
Awesome-Monocular-3D-detection 97