An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/opendrivelab/persformer_3dlane

[ECCV 2022 Oral] Perspective Transformer on 3D Lane Detection

3d-lane-detection autonomous-driving computer-vision deep-learning lane-detection

Last synced: 05 Apr 2025

https://github.com/OpenDriveLab/PersFormer_3DLane

[ECCV 2022 Oral] Perspective Transformer on 3D Lane Detection

3d-lane-detection autonomous-driving computer-vision deep-learning lane-detection

Last synced: 20 Mar 2025

https://github.com/NVlabs/intrinsic3d

Intrinsic3D - High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting (ICCV 2017)

3d 3d-reconstruction computer-vision geometry iccv iccv-2017 iccv2017 intrinsic3d keyframes lighting nvidia rgbd sdf shape-from-shading spherical-harmonics surface-reconstruction tum

Last synced: 04 May 2025

https://github.com/Gladiator07/Harvestify

A machine learning based website that recommends the best crop to grow, fertilizers to use, and the diseases caught by your crops.

computer-vision crop-recommendation crops deep-learning fertilizer-recommendation fertilizers machinelearning precisionagriculture

Last synced: 05 May 2025

https://github.com/nv-tlabs/xcube

[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

3d-generation computer-vision generative-ai graphics voxel

Last synced: 16 May 2025

https://github.com/raoyongming/GFNet

[NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification

computer-vision deep-learning image-classification image-recognition vision-transformer

Last synced: 08 May 2025

https://github.com/PaulDanielML/MuJoCo_RL_UR5

A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.

computer-vision gym-environment mujoco pick-and-place reinforcement-learning robotics

Last synced: 11 Jul 2025

https://github.com/JunweiLiang/Object_Detection_Tracking

Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20

activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking

Last synced: 07 Apr 2025

https://github.com/skhadem/3D-BoundingBox

PyTorch implementation for 3D Bounding Box Estimation Using Deep Learning and Geometry

3d computer-vision localization pytorch

Last synced: 20 Mar 2025

https://github.com/danini/graph-cut-ransac

The Graph-Cut RANSAC algorithm proposed in paper: Daniel Barath and Jiri Matas; Graph-Cut RANSAC, Conference on Computer Vision and Pattern Recognition, 2018. It is available at http://openaccess.thecvf.com/content_cvpr_2018/papers/Barath_Graph-Cut_RANSAC_CVPR_2018_paper.pdf

computer-vision essential-matrix fundamental-matrix graph-cut-ransac homography pattern-recognition ransac robust robust-estimators

Last synced: 18 Mar 2025

https://github.com/vlgiitr/DL_Topics

List of DL topics and resources essential for cracking interviews

computer-vision deep-learning generative-models linear-algebra natural-language-processing probability-statistics

Last synced: 01 May 2025

https://github.com/junweiliang/object_detection_tracking

Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20

activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking

Last synced: 05 Apr 2025

https://github.com/abdur75648/deep-learning-specialization-coursera

This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.

andrew-ng andrew-ng-machine-learning computer-vision convolutional-neural-networks coursera deep-learning deep-learning-andrew-ng deep-learning-coursera deep-learning-specialization face-recognition lstm machine-learning neural-networks nlp object-detection recurrent-neural-networks tensorflow transformer-architecture unet-segmentation updated

Last synced: 04 Apr 2025

https://github.com/davidcaron/pclpy

Python bindings for the Point Cloud Library (PCL)

computer-vision lidar pcl point-cloud python

Last synced: 04 Apr 2025

https://github.com/CopterExpress/clover

ROS-based framework and RPi image to control PX4-powered drones ๐Ÿ€

aruco computer-vision drone education mavros optical-flow px4 quadcopter raspberry-pi robotics ros

Last synced: 07 May 2025

https://github.com/ruiminshen/yolo2-pytorch

PyTorch implementation of the YOLO (You Only Look Once) v2

caffe2 computer-vision deep-learning deep-neural-networks object-detection onnx python pytorch yolo2

Last synced: 03 Apr 2025

https://weiyithu.github.io/NerfingMVS/

[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

3d-reconstruction 3dvision computer-vision deep-learning multi-view-stereo neural-radiance-fields

Last synced: 21 Nov 2025

https://github.com/Mohitkr95/Best-Data-Science-Resources

This repository contains the best Data Science free hand-picked resources to equip you with all the industry-driven skills and interview preparation kit.

ai artificial-intelligence artificial-intelligence-algorithms aws computer-vision data data-structures datascience deep-learning git github jupyter-notebook machine-learning mongodb natural-language-processing neural-network python sql statistics

Last synced: 07 May 2025

https://github.com/weiyithu/NerfingMVS

[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

3d-reconstruction 3dvision computer-vision deep-learning multi-view-stereo neural-radiance-fields

Last synced: 11 Apr 2025

https://github.com/ibaiGorordo/ONNX-YOLOv8-Object-Detection

Python scripts performing object detection using the YOLOv8 model in ONNX.

computer-vision deep-learning object-detection onnx onnxruntime opencv python yolo yolov8

Last synced: 21 Apr 2025

https://github.com/omerbsezer/fast-pytorch

Pytorch Tutorial, Pytorch with Google Colab, Pytorch Implementations: CNN, RNN, DCGAN, Transfer Learning, Chatbot, Pytorch Sample Codes

cnn cnn-pytorch colab-notebook colab-tutorial colaboratory computer-vision deep-learning machine-learning mlp python pytorch pytorch-implementation pytorch-tutorial rnn rnn-pytorch transfer-learning

Last synced: 01 Oct 2025

https://github.com/albarji/mixture-of-diffusers

Mixture of Diffusers for scene composition and high resolution image generation

ai computer-vision diffusion-models stable-diffusion

Last synced: 04 Apr 2025

https://github.com/tanjeffreyz/auto-maple

Artificial intelligence software for MapleStory that uses various machine learning and computer vision techniques to navigate challenging in-game environments

ai bot computer-vision deep-learning maplestory python

Last synced: 11 Apr 2025

https://github.com/facebookresearch/ego4d

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

computer-vision dataset feature-extraction video visuzalization

Last synced: 15 May 2025

https://github.com/ocampor/image-quality

Image quality is an open source software library for Image Quality Assessment (IQA).

artificial-intelligence computer-vision image-analysis image-quality-assessment machine-learning python tensorflow

Last synced: 21 Oct 2025

https://github.com/eliahuhorwitz/deepsim

Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)

computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image

Last synced: 06 Sep 2025

https://github.com/fabric-project/fabric

Node Creative Coding / 3D / Image Processing tool inspired by Quartz Composer

3d computer-vision creative-coding graphics llm metal mlx multimedia node-based post-processing realtime shaders swift swiftui video vlm

Last synced: 29 Jan 2026

https://actors-hq.com/

Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"

3d-reconstruction computer-graphics computer-vision machine-learning nerf siggraph2023

Last synced: 26 Mar 2025

https://github.com/eliahuhorwitz/DeepSIM

Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)

computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image

Last synced: 19 Jul 2025

https://github.com/aimagelab/multimodal-garment-designer

This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023

computer-vision diffusion-models dresscode fashionai generative-models iccv2023 image-editing latent-diffusion-models multimodal-fashion-image-editing stable-diffusion viton-hd

Last synced: 05 Apr 2025

https://github.com/ashishpatel26/resourcebank_cv_nlp_mlops_2022

This repository offers a goldmine of materials for students of computer vision, natural language processing, and machine learning operations.

computer-vision data-science deep-learning mlops natural-language-processing

Last synced: 05 Apr 2025

https://github.com/bchao1/Anime-Face-Dataset

๐Ÿ–ผ A collection of high-quality anime faces.

anime computer-vision dataset gan image-generation machine-learning

Last synced: 28 Apr 2025

https://github.com/hysts/anime-face-detector

Anime Face Detector using mmdet and mmpose

anime computer-vision face-detection face-landmark-detection pytorch

Last synced: 04 Apr 2025

https://github.com/bchao1/anime-face-dataset

๐Ÿ–ผ A collection of high-quality anime faces.

anime computer-vision dataset gan image-generation machine-learning

Last synced: 06 Apr 2025

https://github.com/IvanDrokin/torch-conv-kan

This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implementations of 1D, 2D, and 3D convolutions with different kernels, ResNet-like and DenseNet-like models, training code based on accelerate/PyTorch, as well as scripts for experiments with CIFAR-10 and Tiny ImageNet.

computer-vision convolutional-neural-networks kolmogorov-arnold-networks

Last synced: 01 Aug 2025

https://github.com/google-deepmind/dm_pix

PIX is an image processing library in JAX, for JAX.

computer-vision image image-processing jax machine-learning python

Last synced: 17 Jun 2025

https://github.com/una-dinosauria/human-motion-prediction

Simple baselines and RNNs for predicting human motion in tensorflow. Presented at CVPR 17.

computer-vision cvpr-2017 tensorflow

Last synced: 05 Apr 2025

https://github.com/alicevision/popsift

PopSift is an implementation of the SIFT algorithm in CUDA.

computer-vision cuda feature-extraction gpu image-processing sift

Last synced: 05 Apr 2025

https://github.com/francescosaveriozuppichini/vit

Implementing Vi(sion)T(transformer)

computer-vision deep-learning

Last synced: 06 Apr 2025

https://github.com/Ank-Cha/Social-Distancing-Analyser-COVID-19

A social distancing analyzer AI tool to regulate social distancing protocol using video surveillance of CCTV cameras and drones. Social Distancing Analyser to prevent COVID19

artificial-intelligence computer-vision covid-19 covid19 python social-distancing stay-home yolo

Last synced: 21 Apr 2025

https://github.com/vincentfung13/MINE

Code and models for our ICCV 2021 paper "MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis"

3d-reconstruction 3d-vision computer-vision deep-learning depth-estimation nerf novel-view-synthesis

Last synced: 20 Mar 2025

https://github.com/imfing/ava_downloader

:arrow_double_down: Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)

aesthetic-visual-analysis ava computer-vision dataset python

Last synced: 05 Apr 2025

https://github.com/kunalj101/Data-Science-Hacks

Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.

computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks

Last synced: 05 May 2025

https://github.com/Sha-Lab/FEAT

The code repository for "Few-Shot Learning via Embedding Adaptation with Set-to-Set Functions"

computer-vision cvpr2020 few-shot few-shot-learning machine-learning meta-learning

Last synced: 15 Mar 2025

https://github.com/davidstutz/superpixel-benchmark

An extensive evaluation and comparison of 28 state-of-the-art superpixel algorithms on 5 datasets.

benchmark computer-vision evaluation image-procesing opencv superpixel-algorithms superpixels

Last synced: 05 Apr 2025

https://github.com/patrikhuber/superviseddescent

C++11 implementation of the supervised descent optimisation method

computer-vision landmark-detection machine-learning modern-cpp

Last synced: 06 Apr 2025

https://github.com/kunalj101/data-science-hacks

Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.

computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks

Last synced: 28 Oct 2025

https://github.com/lvis-dataset/lvis-api

Python API for LVIS Dataset

computer-vision object-detection

Last synced: 21 Oct 2025

https://github.com/foo123/filter.js

Video and Image Processing and Computer Vision Library in pure JavaScript (Browser and Node.js)

browser computer-vision glsl image-processing machine-learning node-js parallel-processing real-time video-processing wasm web-assembly webgl

Last synced: 16 May 2025

https://github.com/amosgropp/IGR

Implicit Geometric Regularization for Learning Shapes

3d-reconstruction computer-vision deep-learning

Last synced: 11 May 2025

https://github.com/mihaibujanca/dynamicfusion

Implementation of Newcombe et al. CVPR 2015 DynamicFusion paper

3d-reconstruction computer-vision cuda non-rigid opencv

Last synced: 30 Jan 2026

https://github.com/yassouali/CCT

:page_facing_up: Semi-Supervised Semantic Segmentation with Cross-Consistency Training (CVPR 2020).

computer-vision semantic-segmentation semi-supervised-learning

Last synced: 02 Apr 2025

https://github.com/landskape-ai/triplet-attention

Official PyTorch Implementation for "Rotate to Attend: Convolutional Triplet Attention Module." [WACV 2021]

arxiv attention-mechanism attention-mechanisms computer-vision convolutional-neural-networks deep-learning detection gradcam imagenet paper triplet-attention

Last synced: 08 May 2025

https://github.com/yassouali/cct

:page_facing_up: Semi-Supervised Semantic Segmentation with Cross-Consistency Training (CVPR 2020).

computer-vision semantic-segmentation semi-supervised-learning

Last synced: 06 Apr 2025

https://github.com/amazon-science/bigdetection

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

computer-vision few-shot object-detection pretraining

Last synced: 16 May 2025

https://github.com/Yang7879/3D-BoNet

๐Ÿ”ฅ3D-BoNet in Tensorflow (NeurIPS 2019, Spotlight)

3d-object-detection 3d-point-clouds 3d-vision computer-vision instance-segmentation

Last synced: 20 Mar 2025

https://github.com/foo123/FILTER.js

Video and Image Processing and Computer Vision Library in pure JavaScript (Browser and Node.js)

browser computer-vision glsl image-processing machine-learning node-js parallel-processing real-time video-processing wasm web-assembly webgl

Last synced: 05 Apr 2025

https://github.com/mkocabas/pare

Code for ICCV2021 paper PARE: Part Attention Regressor for 3D Human Body Estimation

3d-human-reconstruction 3d-human-shape-and-pose-estimation computer-graphics computer-vision human-pose-estimation

Last synced: 05 Apr 2025

https://github.com/allenai/procthor

๐Ÿ˜๏ธ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses

ai2-thor computer-vision embodied-ai procedural-generation robotics

Last synced: 24 Oct 2025

https://github.com/ShirAmir/dino-vit-features

Official implementation for the paper "Deep ViT Features as Dense Visual Descriptors".

co-segmentation computer-vision deep-learning dino part-segmentation pytorch semantic-correspondence vision-transformers

Last synced: 03 Apr 2025

https://github.com/eddyhkchiu/mahalanobis_3d_multi_object_tracking

[NeurIPS Workshop 2019] Official code of the paper "Probabilistic 3D Multi-Object Tracking for Autonomous Driving." First Place of the First NuScenes Tracking Challenge in the AI Driving Olympics Workshop of NeurIPS.

3d-multi-object-tracking autonomous-driving computer-vision kalman-filter robotics

Last synced: 20 Mar 2025

Computer vision Awesome Lists
Awesome-pytorch-list 707 awesome-multimodal-ml 480 awesome-self-supervised-learning 463 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-industrial-anomaly-detection 1,102 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 468 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 Awesome-World-Model 725 Awesome-Federated-Learning 558 Awesome-FL 4,069 awesome-low-light-image-enhancement 219 Awesome-pytorch-list-CNVersion 692 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 191 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 150 awesome-grounding 157 openstl 43 Awesome-Open-Vocabulary 162 awesome-ai-awesomeness 236 awesome-capsule-networks 67 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-multi-task-learning 233 awesome-robotics-3d 111 awesome-photogrammetry 90 awesome-open-data-centric-ai 56 awesome-ai-data-guided-projects 56 Awesome-3D-Object-Detection 169 Awesome-Skeleton-based-Action-Recognition 104 awesome-optical-flow 110 awesome-holistic-3d 129 awesome-data-annotation 93 Awesome-Parameter-Efficient-Transfer-Learning 124 awesome-panoptic-segmentation 45 awesome-computer-vision-models 189 awesome-robotics-datasets 79 awesome-state-of-depth-completion 71 awesome-nerf-editing 537 arctic 34 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 Awesome-Monocular-3D-detection 97