An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/agentmorris/megadetector

MegaDetector is an AI model that helps conservation folks spend less time doing boring things with camera trap images.

camera-traps cameratraps computer-vision conservation ecology machine-learning megadetector wildlife

Last synced: 16 May 2025

https://github.com/chengchunhsu/everypixelmatters

Implementation of ECCV 2020 paper "Every Pixel Matters: Center-aware Feature Alignment for Domain Adaptive Object Detector"

adversarial-learning anchor-free computer-vision domain-adaptation eccv eccv-2020 eccv2020 fcos object-detection pytorch transfer-learning unsupervised-domain-adaptation

Last synced: 30 Jan 2026

https://github.com/ahundt/grl

Robotics tools in C++11. Implements soft real time arm drivers for Kuka LBR iiwa plus V-REP, ROS, Constrained Optimization based planning, Hand Eye Calibration and Inverse Kinematics integration.

atracsys c-plus-plus calibration computer-vision constrained-optimization control driver grl hand-eye-calibration iiwa inverse-kinematics kuka kuka-lbr-iiwa optimization real-time robot robotics ros vrep vrep-plugin

Last synced: 12 Jul 2025

https://github.com/joelibaceta/video-keyframe-detector

It is a simple python tool to extract key-frames from a video file using peak estimation from frame difference.

cli-tools computer-vision key-frame opencv peakutils python3 video

Last synced: 04 Apr 2025

https://github.com/lumiguide/haskell-opencv

Haskell binding to OpenCV-3.x

computer-vision haskell image-processing opencv

Last synced: 09 Apr 2025

https://github.com/Julian-Wyatt/AnoDDPM

CVPR Workshop paper - AnoDDPM: Anomaly Detection with Denoising Diffusion Probabilistic Models using Simplex Noise

anomaly-detection computer-vision cvpr ddpm dl medical segmentation

Last synced: 14 Mar 2025

https://github.com/vsymbol/CUTIE

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

computer-vision deep-learning text-extraction

Last synced: 02 Apr 2025

https://github.com/imoonlab/hyper-yolo

The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".

computer-vision deep-learning hypergraph-neural-networks object-detection yolo

Last synced: 05 Apr 2025

https://github.com/thebabylonai/babylog

A lightweight logger for machine learning teams to log images and predictions in production.

computer-vision cvops data-science logger logging-library machine-learning ml mlops python python3

Last synced: 07 May 2025

https://github.com/customdiffusion360/custom-diffusion360

CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control

camera-pose-control computer-vision customization generative-model pytorch stable-diffusion text-to-image

Last synced: 27 Mar 2025

https://github.com/arthurdouillard/cvpr2021_plop

Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation

computer-vision continual-learning cvpr2021 deep-learning semantic-segmentation

Last synced: 21 Jun 2025

https://github.com/viplix3/BoTSORT-cpp

C++ implementation of BoT-SORT MOT algorithm with Re-ID and Camera Motion Compensation

computer-vision cpp kalman-filter multi-object-tracking reid

Last synced: 18 Mar 2025

https://github.com/sanyam5/arc-pytorch

The first public PyTorch implementation of Attentive Recurrent Comparators

computer-vision deep-learning deep-neural-networks machine-learning pytorch recurrent-neural-networks

Last synced: 19 Jul 2025

https://github.com/swarm-lab/Rvision

Basic computer vision library for R

computer-vision opencv r

Last synced: 13 Jul 2025

https://github.com/devrimcavusoglu/pybboxes

Light weight toolkit for bounding boxes providing conversion between bounding box types and simple computations.

computer-vision deep-learning image-recognition machine-learning numpy python pytorch tensorflow

Last synced: 08 Apr 2025

https://github.com/hzwer/wacv2024-safa

WACV2024 - Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution

computer-vision deep-learning super-resolution video-processing

Last synced: 05 Apr 2025

https://github.com/skalskip/sketchy-vision

Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV stick around!

computer-vision convolutional-neural-networks deep-learning neural-network non-maximum-suppression numpy object-detection

Last synced: 14 Apr 2025

https://github.com/erogol/raspisecurity

Home Surveillance for Raspberry

computer-vision opencv raspberry-pi surveillance

Last synced: 28 Oct 2025

https://github.com/line/lighthouse

[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.

audio audio-moment-retrieval audio-processing computer-vision highlight-detection moment-retrieval multimodal natural-language-processing video video-moment-retrieval video-processing

Last synced: 04 Jul 2025

https://github.com/ofa-sys/ofasys

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models

audio computer-vision deep-learning motion multimodal-learning multitask-learning nlp pretrained-models pytorch transformers vision-and-language

Last synced: 24 Oct 2025

https://github.com/patrikhuber/eos-model-viewer

3D model viewer for the eos Morphable Model library

3d-face 3d-morphable-face-model computer-vision face-models

Last synced: 21 Jul 2025

https://github.com/eddieoz/youtube-clips-automator

MARCELO: an AI powered bot to automate the editing and thumbnail creation for your Youtube clips channel

ai audio-processing automation bot clip computer-vision editing thumbnail video video-processing youtube

Last synced: 20 Oct 2025

https://github.com/ai-forever/ru-clip

CLIP implementation for Russian language

clip computer-vision nlp

Last synced: 20 Jun 2025

https://github.com/oxedom/parker

Parking detection and monitoring webapp that runs entirely in the browser

computer-vision object-detection parking-management tensorflowjs webrtc yolov7

Last synced: 02 May 2025

https://github.com/VedankPurohit/LiveRecall

Welcome to **LiveRecall**, the open-source alternative to Microsoft's Recall. LiveRecall captures snapshots of your screen and allows you to recall them using natural language queries, leveraging semantic search technology. For added security, all images are encrypted.

computer-vision memory productivity python recall sementic

Last synced: 11 Sep 2025

https://github.com/lit26/streamlit-img-label

streamlit-img-label is a graphical image annotation tool using streamlit. Annotations are saved as XML files in PASCAL VOC format.

computer-vision image-annotation image-processing

Last synced: 27 Jun 2025

https://github.com/LeapLabTHU/Pseudo-Q

[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

computer-vision cvpr2022 deep-learning multimodal-deep-learning pytorch vision-and-language visual-grounding

Last synced: 03 Apr 2025

https://github.com/astra-vision/rain-rendering

Rain Rendering for Evaluating and Improving Robustness to Bad Weather (Tremblay et al., 2020) (S. S. Halder et al., 2019)

computer-vision fog gan particles-simulation physically-based-rendering rain

Last synced: 06 Feb 2026

https://github.com/rayryeng/xiaohuluvpdetection

This is a Python + OpenCV implementation of the Vanishing Point algorithm by Xiaohu Lu et al. - http://xiaohulugo.github.io/papers/Vanishing_Point_Detection_WACV2017.pdf

computer-vision image-processing line-detection opencv python rotation-matrix vanishing-points

Last synced: 27 Oct 2025

https://github.com/megvii-basedetection/autoassign

Pytorch implementation of "AutoAssign: Differentiable Label Assignment for Dense Object Detection"

coco-dataset computer-vision object-detection

Last synced: 09 Oct 2025

https://github.com/zekunhao1995/DualSDF

DualSDF: Semantic Shape Manipulation using a Two-Level Representation. CVPR'20

computer-vision machine-learning

Last synced: 20 Mar 2025

https://github.com/Megvii-BaseDetection/AutoAssign

Pytorch implementation of "AutoAssign: Differentiable Label Assignment for Dense Object Detection"

coco-dataset computer-vision object-detection

Last synced: 14 Mar 2025

https://github.com/gjy3035/gcc-cl

GCC dataset Collector and Labeler (GCC CL) [CVPR2019]

computer-vision crowd-analysis crowd-counting crowd-segmentation cvpr2019 gtav

Last synced: 10 Apr 2025

https://github.com/allenai/dnw

Discovering Neural Wirings (https://arxiv.org/abs/1906.00586)

computer-vision machine-learning neural-networks

Last synced: 24 Oct 2025

https://github.com/MCG-NJU/MeMOTR

[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

computer-vision deep-learning multi-object-tracking tracking

Last synced: 20 Mar 2025

https://github.com/microsoft/snca.pytorch

Improving Generalization via Scalable Neighborhood Component Analysis

computer-vision deep-learning eccv-2018 few-shot-learning nearest-neighbors transfer-learning visual-recognition

Last synced: 24 Oct 2025

https://github.com/arthurdouillard/dytox

Dynamic Token Expansion with Continual Transformers, accepted at CVPR 2022

cifar100 computer-vision continual-learning dynamic-network imagenet transformer

Last synced: 16 Mar 2025

https://github.com/giannisdaras/ylg

[CVPR 2020] Official Implementation: "Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models".

attention-mechanism computer-vision cvpr cvpr20 cvpr2020 gans generative-adversarial-networks inverse-problems machine-learning

Last synced: 17 Mar 2025

https://github.com/VSainteuf/utae-paps

PyTorch implementation of U-TAE and PaPs for satellite image time series panoptic segmentation.

agriculture computer-vision deep-learning iccv iccv2021 panoptic-segmentation satellite-imagery self-attention semantic-segmentation sentinel-2

Last synced: 08 May 2025

https://github.com/bytedance/sptsv2

The official implementation of SPTS v2: Single-Point Text Spotting

artificial-intelligence computer-vision deep-learning ocr research

Last synced: 18 Aug 2025

https://github.com/dlpbc/keras-kinetics-i3d

keras implementation of inflated 3d from Quo Vardis paper + weights

action-recognition computer-vision deep-learning pretrained-kinetics-model

Last synced: 26 Jul 2025

https://github.com/juniorxsound/threadeddepthcleaner

๐Ÿ“ท Threaded depth-map cleaning and inpainting using OpenCV

computer-vision depth-camera depth-map image-filters image-processing librealsense2 opencv

Last synced: 02 Mar 2026

https://github.com/arthurdouillard/deepcourse

Learn the Deep Learning for Computer Vision in three steps: theory from base to SotA, code in PyTorch, and space-repetition with Anki

anki computer-vision course deep-learning pytorch research

Last synced: 30 Oct 2025

https://github.com/likojack/bnv_fusion

This repository implements our CVPR2022 paper "BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion"

3d-reconstruction computer-vision

Last synced: 29 Mar 2025

https://github.com/hv0905/nekoimagegallery

An AI-powered natural language & reverse Image Search Engine powered by CLIP & qdrant.

clip computer-vision image-search image-search-engine search-engine transformers

Last synced: 06 Apr 2025

https://github.com/sayakpaul/mirnet-tflite-trt

TensorFlow Lite models for MIRNet for low-light image enhancement.

computer-vision tensorflow2 tflite tinyml

Last synced: 15 Mar 2026

https://github.com/ibaigorordo/sapiens-pytorch-inference

Minimal code and examnples for inferencing Sapiens foundation human models in Pytorch

computer-vision deep-learning depth-estimation inference pose-estimation python pytorch semantic-segmentation

Last synced: 04 Apr 2025

https://github.com/wtct-hungary/UnityVision-iOS

This native plugin enables Unity to take advantage of specific features of Core-ML and Vision Framework on the iOS platform.

c-sharp computer-vision core-ml coreml ios machine-learning unity unity3d

Last synced: 08 May 2025

https://github.com/z-x-yang/aot

Associating Objects with Transformers for Video Object Segmentation

computer-vision video-object-segmentation

Last synced: 12 Jan 2026

https://github.com/lingdong-/chinese-hershey-font

Convert Chinese Characters to Single-Line Fonts using Computer Vision

chinese computer-vision convert font hershey-fonts typography

Last synced: 14 May 2025

https://github.com/Ground-A-Video/Ground-A-Video

Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)

computer-vision diffusion-models iclr stable-diffusion text-to-video video-editing

Last synced: 28 Mar 2025

https://github.com/geekquad/pixel-processing

๐Ÿ“ท This repository is focused on having various feature implementation of OpenCV in Python. The aim is to have a minimal implementation of all OpenCV features together, under one roof.

computer-vision hacktoberfest hacktoberfest2021 image-processing implementation jupyter-notebook machine-learning open-source opencv opencv-python python scratch-implementation

Last synced: 15 Mar 2025

https://github.com/xilaili/AOGNet

Code for CVPR 2019 paper: " Learning Deep Compositional Grammatical Architectures for Visual Recognition"

aognet cifar10 cifar100 computer-vision deep-learning deep-neural-networks image-classification imagenet mxnet

Last synced: 16 Apr 2025

https://github.com/see2sound/see2sound

Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

audio-processing computer-vision see-2-sound

Last synced: 07 Apr 2026

https://github.com/hkchengrex/mask-propagation

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.

computer-vision cvpr2021 deep-learning pytorch segmentation video-object-segmentation video-segmentation

Last synced: 22 Apr 2025

https://github.com/vita-group/orthogonality-in-cnns

[NeurIPS '18] "Can We Gain More from Orthogonality Regularizations in Training Deep CNNs?" Official Implementation.

cnns computer-vision deep-learning orthogonal

Last synced: 06 Mar 2026

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97