An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/wmylxmj/YOLO-V3-IOU

YOLO3 动漫人脸检测 (Based on keras and tensorflow) 2019-1-19

anime cnn computer-vision deep-learning iou keras object-detection real-time yolo yolov3

Last synced: 21 Apr 2025

https://github.com/EhsanKia/CatalogScanner

Scans Animal Crossing: New Horizon catalog from video of user scrolling through.

animal-crossing animal-crossing-new-horizons computer-vision hacktoberfest

Last synced: 15 Apr 2025

https://github.com/ehsankia/catalogscanner

Scans Animal Crossing: New Horizon catalog from video of user scrolling through.

animal-crossing animal-crossing-new-horizons computer-vision hacktoberfest

Last synced: 05 Apr 2025

https://github.com/louisfb01/top-10-cv-papers-2021

A curated list of the top 10 computer vision papers in 2021 with video demos, articles, code and paper reference.

2021 ai computer-vision deep-learning gans innovation machine-learning machinelearning paper python research research-paper sota sota-technique state-of-art state-of-the-art technology transformers

Last synced: 10 Jul 2025

https://github.com/LayneH/self-adaptive-training

[TPAMI2022 & NeurIPS2020] Official implementation of Self-Adaptive Training

adversarial-robustness computer-vision generalization label-noise machine-learning overfitting

Last synced: 19 Jul 2025

https://github.com/rpng/android-camera-calibration

Updated (opencv3 and camera2 API) android camera calibration application

android camera-calibration camera2-api computer-vision opencv

Last synced: 27 Jun 2025

https://github.com/capjamesg/cv-book-svg

Turn an image of a bookshelf into an interactive SVG.

books computer-vision image-analysis library personal-library

Last synced: 08 Jan 2026

https://github.com/dahyun-kang/ifsl

[CVPR'22] Official PyTorch implementation of Integrative Few-Shot Learning for Classification and Segmentation

computer-vision cvpr2022 deep-learning few-shot-classification few-shot-learning few-shot-segmentation pytorch pytorch-lightning

Last synced: 12 Apr 2025

https://github.com/lancedb/yoloexplorer

YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds

computer-vision object-detection yolov5 yolov8

Last synced: 05 Apr 2025

https://github.com/hellonico/origami

Lowest barrier of entry to Image Processing, Computer Vision and Neural Networks on the JavaVM

clojure computer-vision deep-learning dnn java kotlin opencv yolov8

Last synced: 16 May 2025

https://github.com/simonw/cougar-or-not

An API for identifying cougars v.s. bobcats v.s. other USA cat species

computer-vision fastai inaturalist starlette zeit-now

Last synced: 27 Mar 2026

https://github.com/voxel51/papers-with-data

A curated list of papers that released datasets along with their work

ai artificial-intelligence computer-vision data-science datasets deep-learning machine-learning papers

Last synced: 16 Jul 2025

https://github.com/ispc-lab/LiDAR4D

💫 [CVPR 2024] LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis

autonomous-driving computer-vision cvpr2024 dynamic-scene lidar neural-rendering novel-view-synthesis point-cloud reconstruction

Last synced: 20 Mar 2025

https://github.com/sayakpaul/robustness-vit

Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).

computer-vision jax pytorch robustness self-attention tensorflow transformers

Last synced: 15 Oct 2025

https://github.com/dragonfly90/mxnet_Realtime_Multi-Person_Pose_Estimation

This is a mxnet version of Realtime_Multi-Person_Pose_Estimation, origin code is here https://github.com/ZheC/Realtime_Multi-Person_Pose_Estimation

computer-vision deep-learning humanpose

Last synced: 17 Apr 2025

https://github.com/vt-vl-lab/reading_group

Virginia Tech Vision and Learning Reading Group

computer-vision deep-learning machine-learning

Last synced: 06 Mar 2026

https://github.com/lyken17/sparsenet

[ECCV 2018] Sparsely Aggreagated Convolutional Networks https://arxiv.org/abs/1801.05895

computer-vision convolutional-neural-networks deep-learning

Last synced: 17 Mar 2025

https://github.com/e-candeloro/driver-state-detection

A real time, webcam based, driver attention state detection/monitoring system in Python3 using OpenCV and Mediapipe

automotive collaborate computer-vision distracted-driving-detection dlib driver-safety human-attention machine-learning mediapipe opencv-python perclos pose-estimation python3 real-time

Last synced: 13 Aug 2025

https://github.com/ibaiGorordo/ONNX-CREStereo-Depth-Estimation

Python scripts performing stereo depth estimation using the CREStereo model in ONNX.

computer-vision crestereo depth-estimation onnx onnxruntime opencv python stereo-depth-estimation stereo-matching stereo-vision

Last synced: 20 Mar 2025

https://github.com/LingDong-/chinese-hershey-font

Convert Chinese Characters to Single-Line Fonts using Computer Vision

chinese computer-vision convert font hershey-fonts typography

Last synced: 27 Jul 2025

https://github.com/andrewekhalel/edafa

Test Time Augmentation (TTA) wrapper for computer vision tasks: segmentation, classification, super-resolution, ... etc.

augmentation classification computer-vision keras pytorch segmentation super-resolution tensorflow wrapper

Last synced: 19 Aug 2025

https://github.com/yuweihao/kern

Code for Knowledge-Embedded Routing Network for Scene Graph Generation (CVPR 2019)

computer-vision graph-neural-network pytorch scene-graph scene-graph-generation

Last synced: 07 Apr 2025

https://github.com/yuweihao/KERN

Code for Knowledge-Embedded Routing Network for Scene Graph Generation (CVPR 2019)

computer-vision graph-neural-network pytorch scene-graph scene-graph-generation

Last synced: 02 Apr 2025

https://github.com/hugozanini/realtime-semantic-segmentation

Implementation of RefineNet to perform real time instance segmentation in the browser using TensorFlow.js

computer-vision real-time-machine-learning semantic-segmentation tensorflow tensorflowjs

Last synced: 21 Jul 2025

https://github.com/leverxgroup/esrgan

Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution

computer-vision deep-neural-networks generative-adversarial-network image-processing pytorch super-resolution

Last synced: 15 Mar 2025

https://github.com/101arrowz/scanner

Document scanning from scratch

computer-vision rust typescript wasm

Last synced: 17 Mar 2025

https://github.com/seva100/optic-nerve-cnn

Code repository for a paper "Optic Disc and Cup Segmentation Methods for Glaucoma Detection with Modification of U-Net Convolutional Neural Network"

computer-vision cup-segmentation-methods glaucoma-detection ipynb medical-imaging optic-disc paper

Last synced: 12 Mar 2026

https://github.com/maps-as-data/mapreader

A computer vision pipeline for exploring and analyzing images at scale

article computer-vision deep-learning digital-humanities hut23 hut23-96 machine-learning maps pytorch spatial-data

Last synced: 08 Oct 2025

https://github.com/mheriyanto/machine-learning-in-computer-vision

:memo: References list for machine learning and deep learning in computer vision.

artificial-intelligence computer-vision deep-learning machine-learning python pytorch roadmap tensorflow

Last synced: 22 Nov 2025

https://github.com/olafenwamoses/deepstack_exdark

A DeepStack custom model for detecting common objects in dark/night images and videos.

ai cctv computer-vision deep-learning deepstack machine-learning night-vision night-vision-camera object-detection yolo yolov3 yolov5

Last synced: 07 Oct 2025

https://github.com/jeeliz/jeelizpupillometry

Real-time pupillometry in the web browser using a 4K webcam video feed processed by this WebGL/Javascript library. 2 demo experiments are included.

camera cctv computer-vision detection eye eyelink face iris javascript measurement pupil pupillometry radius real-time size tracking video webcam webgl

Last synced: 16 Apr 2025

https://github.com/real-stanford/flingbot

[CoRL 2021 Best System Paper] This repository contains code for training and evaluating FlingBot in both simulation and real-world settings on a dual-UR5 robot arm setup for Ubuntu 18.04

cloth-simulation computer-vision robotics

Last synced: 05 May 2025

https://github.com/ihhub/penguinv

Computer vision library with focus on heterogeneous systems

avx computer-vision cpp cuda gpu hacktoberfest heterogeneous-systems image-processing opencl python simd sse thread-pool

Last synced: 30 Oct 2025

https://github.com/dahyun-kang/renet

[ICCV'21] Official PyTorch implementation of Relational Embedding for Few-Shot Classification

computer-vision deep-learning few-shot-classifcation few-shot-learning iccv2021 neural-networks pytorch

Last synced: 16 Aug 2025

https://github.com/freedomofkeima/moeflow

Repository for anime characters recognition website, powered by TensorFlow

anime classification computer-vision python3 tensorflow transfer-learning

Last synced: 20 Jul 2025

https://github.com/scrubbbbs/cbird

Command-line program for Content-Based Image Retrieval of images and videos. Includes tools for general search and de-duplication.

command-line-interface computer-vision content-based-image-retrieval duplicate-detection duplicate-files duplicates ffmpeg opencv qt6 similarity-search

Last synced: 06 Apr 2025

https://github.com/voxel51/fiftyone-plugins

A curated list of plugins that you can add to your FiftyOne install!

artificial-intelligence computer-vision data-science deep-learning fiftyone machine-learning plugins python

Last synced: 11 Apr 2025

https://github.com/open-edge-platform/geti-sdk

Software Development Kit (SDK) for the Intel® Geti™ platform for Computer Vision AI model training.

annotations computer-vision deep-learning model-deployment model-development openvino python3 sdk

Last synced: 29 Jun 2025

https://github.com/utkarsh-deshmukh/single-image-dehazing-python

python implementation of the paper: "Efficient Image Dehazing with Boundary Constraint and Contextual Regularization"

airlight-estimation computer-vision defogging fog-removal haze-removal haze-removal-algorithm ieee image-dehazing opencv python single-image-defogging single-image-dehazing

Last synced: 09 Apr 2025

https://github.com/noshluk2/ROS2-Self-Driving-Car-AI-using-OpenCV

ROS2 Self Driving Car using Deeplearning and Object Tracking through openCV

computer-vision deep-learning gazebo object-detection opencv ros2 self-driving-car

Last synced: 24 Apr 2026

https://github.com/filipbasara0/simple-diffusion

A minimal implementation of a denoising diffusion model in PyTorch.

attention-mechanism computer-vision deep-learning diffusion image-generation pytorch stable-diffusion unet

Last synced: 15 Oct 2025

https://github.com/jmisilo/clip-gpt-captioning

CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.

computer-vision cv deep-learning image-caption image-caption-generator image-captioning machine-learning nlp python pytorch

Last synced: 09 Apr 2025

https://github.com/pollen-robotics/pollen-vision

Simple and unified interface to zero-shot computer vision models curated for robotics use cases.

computer-vision grasping object-detection object-segmentation robotics

Last synced: 05 Apr 2025

https://github.com/raoyongming/PointGLR

[CVPR 2020] Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds

3d-point-clouds computer-vision deep-learning metric-learning representation-learning unsupervised-learning

Last synced: 20 Mar 2025

https://github.com/honlan/dmt

Disentangled Makeup Transfer with Generative Adversarial Network

computer-vision deep-learning disentangled-representations generative-adversarial-network makeup-transfer tensorflow

Last synced: 30 Aug 2025

https://github.com/mindspore-lab/d2l-mindspore

《动手学深度学习》的MindSpore实现。供MindSpore学习者配合李沐老师课程使用。

computer-vision deep-learning machine-learning mindspore natural-language-processing notebook

Last synced: 17 Jan 2026

https://github.com/mars885/doc-skanner

An Android application that makes it possible to automatically scan and digitize documents from photos.

android android-application computer-vision doc-scanner document-scanner opencv

Last synced: 06 Apr 2025

https://github.com/margaretmz/cartoonizer-with-tflite

How to create a Cartoonizer Android app with TensorFlow Lite models.

android computer-vision gans tensorflow-lite tflite

Last synced: 07 Apr 2025

https://github.com/deep-diver/mlops-hf-tf-vision-models

MLOps for Vision Models (TensorFlow) from 🤗 Transformers with TensorFlow Extended (TFX)

computer-vision huggingface-transformers mlops tensorflow tensorflow-extended

Last synced: 07 Mar 2026

https://github.com/real-stanford/semantic-abstraction

[CoRL 2022] This repository contains code for generating relevancies, training, and evaluating Semantic Abstraction.

computer-vision deep-learning robotics

Last synced: 05 May 2025

https://github.com/sergio11/vehicle_detection_tracker

🚗 VehicleDetectionTracker: Real-time vehicle detection and tracking powered by YOLO. 🚙🚕 Enhance your computer vision projects with speed, precision, and adaptability.

automated-surveillance computer-vision object-detection python3 traffic-analysis ultralytics vehicle-tracking yolo yolov8

Last synced: 07 Apr 2025

https://github.com/skalskip/fashion-assistant

Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from images. We pass the prompt, along with the extracted features, to LLM, allowing for advanced image dataset queries.

computer-vision llm natural-language-processing openai openai-api yolo

Last synced: 15 Apr 2025

https://github.com/rizwanmunawar/yolov5-object-tracking

YOLOv5 Object Tracking + Detection + Object Blurring + Streamlit Dashboard Using OpenCV, PyTorch and Streamlit

computer-vision object-detection object-tracking streamlit-dashboard yolov5

Last synced: 05 Apr 2025

https://github.com/ismail-mebsout/Parsing-PDFs-using-YOLOV3

Parsing pdf tables using YOLOV3

computer-vision pdf python

Last synced: 21 Apr 2025

https://github.com/UT-Austin-RPL/Ditto

Code for Ditto: Building Digital Twins of Articulated Objects from Interaction

3d-reconstruction computer-vision deep-learning digital-twin robotics

Last synced: 11 Apr 2025

https://github.com/mybridge/learn-machine-learning

Learn to Build a Machine Learning Application from Top Articles

computer-vision data-science deep-learning machine-learning neural-networks

Last synced: 27 Jan 2026

https://github.com/bytedance-seed/traceanything

Trace Anything: Representing Any Video in 4D via Trajectory Fields

3d-reconstruction 4d-reconstruction computer-vision

Last synced: 29 Oct 2025

https://github.com/McGill-NLP/weblinx

WebLINX is a benchmark for building web navigation agents with conversational capabilities

agent agents computer-vision llm multimodal navigation nlp web

Last synced: 06 Mar 2025

https://github.com/sunriseapps/imagesorcery-mcp

An MCP server providing tools for image processing operations

computer-vision image-editing image-manipulation image-processing mcp mcp-server ocr opencv

Last synced: 13 Aug 2025

https://github.com/chrisprasanna/exercise_recognition_ai

Developing a virtual personal fitness tracker and exercise activity recognition A.I. using computer vision and deep learning.

activity-recognition attention-mechanism classification-model computer-vision deep-learning human-pose-estimation lstm-neural-networks machine-learning pose-estimation real-time

Last synced: 21 Jul 2025

https://github.com/sunset1995/hohonet

"HoHoNet: 360 Indoor Holistic Understanding with Latent Horizontal Features" official pytorch implementation.

360-photo computer-vision cvpr2021 depth-estimation hohonet room-layout semantic-segmentation

Last synced: 17 Jun 2025

https://mcgill-nlp.github.io/weblinx/

WebLINX is a benchmark for building web navigation agents with conversational capabilities

agent agents computer-vision llm multimodal navigation nlp web

Last synced: 11 May 2025

https://github.com/n2cholas/jax-resnet

Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).

computer-vision deep-learning flax jax machine-learning neural-networks resnet

Last synced: 16 Jan 2026

https://github.com/abd-shoumik/Social-distance-detection

Social distance detection , a deep learning computer vision project with yolo object detection

computer-vision covid-19 dataset deeplearning deeplearning-ai socialdistancing yolo

Last synced: 21 Apr 2025

https://github.com/landing-ai/landingai-python

LandingAI Python library that enables you to use LandingLens with ease. (https://app.landing.ai/)

computer-vision deep-learning machine-learning

Last synced: 30 Jan 2026

https://github.com/into-ai/deeplearning2020

course materials for introduction to deep learning 2020

computer-vision course-materials deep-learning lab learning mooc openhpi tensorflow

Last synced: 10 Oct 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97