An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/lukasbommes/pv-hawk

Tool for the extraction and mapping of photovoltaics modules from IR drone videos of utility-scale PV plants (my PhD project)

computer-vision drone inspection instance-segmentation map mask-rcnn photovoltaic structure-from-motion thermography tracking

Last synced: 27 Jun 2025

https://github.com/YifanXu74/Evo-ViT

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

computer-vision deep-learning image-classification vision-transformer vision-transformers

Last synced: 03 Oct 2025

https://github.com/rishit-dagli/cppe-dataset

Code for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset

artificial-intelligence computer-vision cppe5 data dataset deep-learning machine-learning models object-detection pretrained-models pytorch tensorflow vision

Last synced: 15 Jun 2025

https://github.com/justint/stringless

Real-time markerless facial motion capture into Maya.

camera computer-vision dlib maya motion-capture opencv

Last synced: 17 Mar 2025

https://github.com/akash-rajak/real-time-human-detection-counting

A tensorflow based Faster RCNN inception v2 python model to detect and count humans in real time images, videos & camera.

computer-vision counting cv2 deep-learning detection detection-model faster-rcnn-inception-v2 human-detection machine-learning matplotlib numpy opencv os pil python python3 tensorflow time tkinter video

Last synced: 18 Oct 2025

https://github.com/nikolasent/lyft-perception-challenge

The 4th place and the fastest solution of the Lyft Perception Challenge (Image semantic segmentation with PyTorch)

computer-vision deep-learning linknet pytorch sdcnd self-driving-car semantic-segmentation

Last synced: 30 Apr 2025

https://github.com/akash-rajak/Real-Time-Human-Detection-Counting

A tensorflow based Faster RCNN inception v2 python model to detect and count humans in real time images, videos & camera.

computer-vision counting cv2 deep-learning detection detection-model faster-rcnn-inception-v2 human-detection machine-learning matplotlib numpy opencv os pil python python3 tensorflow time tkinter video

Last synced: 06 Mar 2025

https://github.com/dataxujing/yolact_pytorch

:fire: :fire: :fire:Train Your Own DataSet for YOLACT and YOLACT++ Instance Segmentation Model!!!

computer-vision convolutional-neural-networks deep-learning instance-segmentation pytorch yolact

Last synced: 16 Aug 2025

https://github.com/The-AI-Summer/GANs-in-Computer-Vision

GANs in computer vision AI Summer article series

adverserial computer-vision deep-learning gan gans learning

Last synced: 08 May 2025

https://github.com/NikolasEnt/Lyft-Perception-Challenge

The 4th place and the fastest solution of the Lyft Perception Challenge (Image semantic segmentation with PyTorch)

computer-vision deep-learning linknet pytorch sdcnd self-driving-car semantic-segmentation

Last synced: 25 Jul 2025

https://github.com/iago-suarez/beblid-opencv-demo

How to improve a 14% your image matching with only one line of code? BEBLID is the key!

augmented-reality computer-vision image-descriptor machile-learning python-notebook

Last synced: 14 Jul 2025

https://github.com/erotemic/shitspotter

An open source algorithm and dataset for finding poop in pictures.

ai computer-vision dogs poop

Last synced: 09 May 2025

https://github.com/qinjian623/pytorch_toys

Personal Pytorch toy script.

computer-vision deep-learning pytorch

Last synced: 31 Jan 2026

https://github.com/Imageomics/bioclip-2

Repository for the BioCLIP 2 model project. [NeurIPS'25 Spotlight] BioCLIP 2 is a biological foundation model trained on TreeOfLife-200M. Despite the narrow training objective, BioCLIP 2 yields extraordinary accuracy when applied to various biological visual tasks such as habitat classification and trait prediction.

biology clip computer-vision imageomics knowledge-guided-machine-learning taxonomy

Last synced: 29 Jun 2026

https://github.com/JohnBetaCode/Social-Distancing-Analyser

Considering the big change that the world is facing, as well as our lives due to the COVID-19, we provide to people and companies a complete open-source tool to analyze the social distancing for streets, parks, offices, and even crowded places like malls, train stations, and others.

ai anaconda calibration calibration-process computer-vision covid19 deep-learning detection-model extrinsic image-processing intrinsic object-detection opencv opencv4 opensource social-distancing tensorflow yolo yolov4

Last synced: 21 Apr 2025

https://github.com/visual-layer/visuallayer

Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, mislabels and others.

cleaning computer computer-vision data data-science dataset datasets-preparation generative machine-learning python vision

Last synced: 19 Apr 2025

https://github.com/pathak-ashutosh/eye-blink-detection

Detect eye blinks based on eye aspect ratio (EAR) introduced by Soukupová and Čech in their 2016 paper, Real-Time Eye Blink Detection Using Facial Landmarks.

blink-detection-algorithm computer-vision deep-learning eye-detection eye-tracking hog machine-learning svm

Last synced: 23 Apr 2025

https://github.com/vra/dinov2-retrieval

A cli program of image retrieval using dinov2

ai computer-vision deep-learning dinov2 image-retrieval pytorch

Last synced: 12 Apr 2025

https://github.com/picsart-ai-research/zero-painter

🔥 [CVPR 2024] The official repo for Zero-Painter!

computer-vision cvpr2024 generative-ai zero-painter

Last synced: 12 Apr 2025

https://github.com/charmve/paperweeklyai

📚「@MaiweiAI」Studying papers in the fields of computer vision, NLP, and machine learning algorithms every week.

advanced applied-machine-learning computer-vision data-mining data-science deep-learning machine-learning machine-learning-algorithms nlp paper-with-code papers study-papers tutorials

Last synced: 23 Jun 2025

https://github.com/ai-forever/digital_peter_aij2020

Materials of the AI Journey 2020 competition dedicated to the recognition of Peter the Great's manuscripts, https://ai-journey.ru/contest/task01

ancient-texts computer-vision handwritten-text-recognition nlp python3 transfer-learning

Last synced: 19 Apr 2025

https://github.com/jinxins/adversarial-automixup

Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight)

cars196 classification computer-vision convolutional-neural-networks corruption deep-learning fgvc-aircraft imagenet pytorch

Last synced: 14 Apr 2025

https://github.com/jason718/game-feature-learning

Code for paper "Cross-Domain Self-supervised Multi-task Feature Learning using Synthetic Imagery", Ren et al., CVPR'18

computer-vision deep-learning domain-adaptation representation-learning self-supervised synthetic-data

Last synced: 10 Jul 2025

https://github.com/fedml-ai/fedcv

FedCV: An Industrial-grade Federated Learning Framework for Diverse Computer Vision Tasks

computer-vision federated-learning image-classifiation image-segmentation medical-imaging object-detection

Last synced: 31 Jan 2026

https://aibluefisher.github.io/DReg-NeRF/

Official Implementation of our ICCV 2023 paper: "DReg-NeRF: Deep Registration for Neural Radiance Fields"

3d-computer-vision computer-vision nerf registration

Last synced: 26 Mar 2025

https://github.com/aangelopoulos/conformal-risk

Conformal prediction for controlling monotonic risk functions. Simple accompanying PyTorch code for conformal risk control in computer vision and natural language processing.

computer-vision conformal conformal-prediction natural-language-processing python pytorch pytorch-implementation uncertainty-estimation uncertainty-quantification

Last synced: 05 Apr 2025

https://github.com/kyegomez/kosmos-x

The Next Generation Multi-Modality Superintelligence

computer-vision gemini gpt gpt3 gpt4 multi-modal pytorch vision

Last synced: 15 Apr 2025

https://github.com/eurus-holmes/research_papers

Record some papers I have read and paper notes I have taken, also including some awesome papers reading lists and academic blog posts.

computer-vision deep-learning natural-language-processing paper-writing papers-reading research-paper

Last synced: 04 Mar 2026

https://github.com/unclearness/vacancy

Vacancy: A Voxel Carving implementation in C++

3d-reconstruction c-plus-plus computer-vision marching-cubes shape-from-silhouette

Last synced: 08 May 2025

https://github.com/ycheng517/tabletop-handybot

A low-cost AI powered robotic arm assistant that listens to your voice commands and can carry out a variety of tabletop tasks.

ai chatgpt computer-vision llm robot-assistant robotic-arm robotics ros2

Last synced: 24 Mar 2025

https://github.com/hemansnation/machine-learning-mlops-generativeai-nlp-cv-mlsystem-design

MLOps - Deploy models at scale, Generative AI - Build applications with LLMs, NLP - Understand Transformers & Text Generation Models, Computer Vision - Build GANs projects like Deepfakes, ML System Design, hands-on project building and code algorithms from scratch.

computer-vision data-science deep-learning generative-ai machine-learning natural-language-processing python

Last synced: 15 Apr 2025

https://github.com/una-dinosauria/rayuela.jl

Code for my PhD thesis. Library of quantization-based methods for fast similarity search in high dimensions. Presented at ECCV 18.

computer-vision eccv-18 julia nearest-neighbor-search

Last synced: 26 Mar 2025

https://github.com/pathak22/ccnn

[ICCV 2015] Framework for optimizing CNNs with linear constraints for Semantic Segmentation

computer-vision constraints deep-learning fcn fully-convolutional-networks linear-constraints machine-learning segmentation

Last synced: 17 Jun 2025

https://github.com/eth-siplab/RAP

Restore Anything Pipeline (RAP): Segment Anything Meets Image Restoration

computer-vision deep-learning image-deblurring image-denoising image-restoration jpeg-artifact-removal segment-anything

Last synced: 24 Jul 2025

https://github.com/LaihoE/did-it-spill

Check if you have training samples in your test set

computer-vision data-science deep-learning pytorch semantic-similarity time-series

Last synced: 01 May 2025

https://github.com/evercam/ex_nvr

Video recording and computer vision for edge devices

computer-vision elixir ip-camera nvr video-streaming

Last synced: 17 Feb 2026

https://github.com/hellloxiaotian/ctnet

A cross Transformer for image denoising(Information Fusion, 2024)

cnn computer-vision deep-learning deep-neural-networks image-denoising pytorch transformer

Last synced: 16 Mar 2026

https://github.com/tnybny/Frame-level-anomalies-in-videos

Frame level anomaly detection and localization in videos using auto-encoders

autoencoder computer-vision video video-anomaly-detection

Last synced: 04 Apr 2025

https://github.com/Orion-AI-Lab/KuroSiwo

Code and data for Kuro Siwo flood mapping dataset

computer-vision flood remote-sensing sar synthetic-aperture-radar

Last synced: 20 Jul 2025

https://github.com/atticuszeller/gsplatloc

😎 GsplatLoc 🎯: Ultra-Precise Pose Optimization via 3D Gaussian Reprojection 🌟

3d-reconstruction 6dof camerapose-estimate computer-vision deeplearning gaussian-splatting gsplat localization python pytorch slam

Last synced: 09 Apr 2025

https://github.com/khaledsabry97/Argus

[IEEE Research paper + Project] Real Time Road Accidents Detection System based on crash estimation; a computer vision techniques that detects road accidents and reports them in real-time as well as allowing the monitoring of accidents using a client server architecture and an interactive GUI.

accidents autonomous-vehicles cctv collision collision-estimation computer-vision crash detect detection detects-road-accidents ieee monitoring-tool real-time research-paper road-accidents road-crashes roads traffic-analysis yolo

Last synced: 21 Apr 2025

https://github.com/junyanz/realismcnn

code for predicting and improving visual realism in composite images

computer-vision deep-learning image-manipulation realism-prediction

Last synced: 30 Apr 2025

https://github.com/tomrunia/pytorchsteerablepyramid

PyTorch implementation of the Complex Steerable Pyramid

batch computer-vision cuda image-processing mkl pyramid pytorch

Last synced: 04 May 2025

https://github.com/aimagelab/pacscore

[CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation

captioning captioning-images captioning-videos computer-vision cvpr cvpr2023 vision-and-language

Last synced: 18 Oct 2025

https://github.com/MinaGhadimiAtigh/hyperbolic_representation_learning

The repository for Hyperbolic Representation Learning for Computer Vision, ECCV 2022

computer-vision eccv2022 hyperbolic-embeddings tutorial

Last synced: 03 Apr 2025

https://github.com/baiksung/MeTAL

Official PyTorch implementation of "Meta-Learning with Task-Adaptive Loss Function for Few-Shot Learning" (ICCV2021 Oral)

computer-vision deep-learning few-shot-learning iccv machine-learning maml meta-learning pytorch

Last synced: 08 May 2025

https://github.com/harleyszhang/cv-books

These are open and classic books for computer version engineers, it can help us from entry to practice.

books computer-vision cpp deep-learning machine-learning open-books programming python

Last synced: 23 Apr 2025

https://github.com/roboflow/rf100-vl

Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"

computer-vision multimodal-datasets object-detection object-detection-benchmarks rf100

Last synced: 06 Sep 2025

https://github.com/jordicorbilla/ocular-disease-intelligent-recognition-deep-learning

ODIR-2019: Ocular Disease Intelligent Recognition is a project leveraging state-of-the-art deep learning architectures to analyze and classify ocular diseases based on medical imaging data. This repository implements advanced machine learning techniques and modern neural network architectures to push the boundaries of intelligent recognition

computer-vision convolutional-neural-networks deep-learning deep-neural-networks fundus-image-analysis image-classification keras-tensorflow machine-learning medical-image-analysis python python3 tensorflow trasfer-learning

Last synced: 09 Apr 2025

https://github.com/acecoooool/interview-computer-vision

:books: 计算机视觉笔记和总结

computer-vision interview summary

Last synced: 28 Jun 2026

https://akshatdave.github.io/pandora/

Official Pytorch implementation of PANDORA: Polarization-Aided Neural Decomposition of Radiance

3d-reconstruction computer-vision inverse-rendering nerf neural-rendering polarization polarization-imaging

Last synced: 06 May 2025

https://github.com/ehsanik/segan

SeGAN: Segmenting and Generating the Invisible (https://arxiv.org/pdf/1703.10239.pdf)

computer-vision deep-learning generative-adversarial-network image-generation segan segmentation

Last synced: 14 Oct 2025

https://github.com/konfuzio-ai/konfuzio-sdk

Run OCR, extract information from documents and classify them. In addition, annotate documents and build custom NLP and computer vision models tailored for your specific use cases. Find examples with code in our Tutorials section of dev.konfuzio.com and get inspiration from Use Cases section of our blog: https://konfuzio.com/en/category/marketplace

computer-vision document-annotate document-annotation document-annotation-tool document-extraction nlp ocr python text-classification text-processing

Last synced: 08 Aug 2025

https://github.com/hustvl/gneuvox

GNeuVox: Generalizable Neural Voxels for Fast Human Radiance Fields

3d computer-graphics computer-vision humannerf monocular nerf pytorch view-synthesis

Last synced: 12 May 2025

https://github.com/ericsujw/Matterport3DLayoutAnnotation

Layout annotation on a subset of Matterport3D dataset

3d-layout 3d-reconstruction computer-vision room-layout

Last synced: 20 Nov 2025

https://github.com/jderobot/behaviormetrics

Autonomous driving neural network comparison tool

computer-vision deeplearning machine-learning reinforcement-learning

Last synced: 20 Jun 2025

https://github.com/visioncortex/visionmagic

Collection of vision & graphics algorithms

computer-graphics computer-vision

Last synced: 25 Apr 2025

https://github.com/pyronear/pyro-vision

Computer vision library for wildfire detection 🌲 Deep learning models in PyTorch & ONNX for inference on edge devices (e.g. Raspberry Pi)

computer-vision deep-learning image-classification keypoint-detection object-detection onnx python pytorch wildfire

Last synced: 27 Jul 2025

https://github.com/lkevinzc/dance

Code for "DANCE: A Deep Attentive Contour Model for Efficient Instance Segmentation", WACV2021

coco computer-vision deep-learning instance-segmentation pytorch wacv2021

Last synced: 30 Apr 2025

https://github.com/rocm/rpp

AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.

agumentation amd bitwise channel-extract computer-vision contrast cpu gpu hip histogram hpc mivisionx opencl openvx radeon-performance-primitives rocm rpp warp-affine

Last synced: 11 Apr 2025

https://github.com/wiseplat/hackathon-finam-nn-trade-robot

Торговый робот с использованием нейросетей на основе компьютерного зрения для поиска определенных формаций на торговом графике акций, который используя лучшую обученную модель осуществляет торговые операции. ## A trading robot using neural networks based on computer vision to search for certain formations on a stock trading chart with live trading.

algo-trading algo-trading-strategies algotrading computer-vision finam hackathon neural-network trading trading-bot trading-strategies

Last synced: 12 Apr 2025

https://github.com/Link009/LPEX

Detect vehicles license plate location

alpr computer-vision license-plate-detection

Last synced: 28 Mar 2025

https://github.com/sayakpaul/breast-cancer-detection-using-deep-learning

Experiments to show the usage of deep learning to detect breast cancer from breast histopathology images

class-imbalance computer-vision deep-learning fastai medical-imaging resnet50

Last synced: 30 Apr 2025

https://github.com/sayakpaul/sharpness-aware-minimization-tensorflow

Implements sharpness-aware minimization (https://arxiv.org/abs/2010.01412) in TensorFlow 2.

computer-vision deep-neural-networks generalization loss-landscape tensorflow tpu-acceleration

Last synced: 11 Mar 2026

https://github.com/alladinian/visionaire

Streamlined, ergonomic APIs around Apple's Vision framework

computer-vision ios macos swift vision

Last synced: 01 Aug 2025

https://github.com/tirth27/skin-cancer-classification-using-deep-learning

Classify Skin cancer from the skin lesion images using Image classification. The dataset for the project is obtained from the Kaggle SIIM-ISIC-Melanoma-Classification competition.

computer-vision convolutional-neural-networks deep-learning image-classification kaggle-competition keras-tensorflow machine-learning medical-imaging melanoma-diagnosis skin-cancer skin-lesion-image

Last synced: 05 Mar 2025

https://github.com/saturncloud/dask-pytorch-ddp

dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.

computer-vision dask deep-learning distributed-computing machine-learning nlp pytorch

Last synced: 14 Dec 2025

https://github.com/recursionpharma/maes_microscopy

Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.

biology computer-vision deep-learning generative-ai masked-autoencoder microscopy phenomics

Last synced: 06 Feb 2026

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-Skeleton-based-Action-Recognition 104 Awesome-3D-Object-Detection 169 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-state-of-depth-completion 71 awesome-robotics-datasets 79 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97