Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2026-06-29 00:06:16 UTC
- JSON Representation
https://github.com/opendrivelab/persformer_3dlane
[ECCV 2022 Oral] Perspective Transformer on 3D Lane Detection
3d-lane-detection autonomous-driving computer-vision deep-learning lane-detection
Last synced: 05 Apr 2025
https://github.com/OpenDriveLab/PersFormer_3DLane
[ECCV 2022 Oral] Perspective Transformer on 3D Lane Detection
3d-lane-detection autonomous-driving computer-vision deep-learning lane-detection
Last synced: 20 Mar 2025
https://github.com/NVlabs/intrinsic3d
Intrinsic3D - High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting (ICCV 2017)
3d 3d-reconstruction computer-vision geometry iccv iccv-2017 iccv2017 intrinsic3d keyframes lighting nvidia rgbd sdf shape-from-shading spherical-harmonics surface-reconstruction tum
Last synced: 04 May 2025
https://github.com/Gladiator07/Harvestify
A machine learning based website that recommends the best crop to grow, fertilizers to use, and the diseases caught by your crops.
computer-vision crop-recommendation crops deep-learning fertilizer-recommendation fertilizers machinelearning precisionagriculture
Last synced: 05 May 2025
https://github.com/genielabs/homegenie
HomeGenie: The Programmable Intelligence with 100% Local Agentic AI.
artificial-intelligence automation building-automation computer-vision gpio home-automation iot knx machine-learning open-source raspberrypi self-hosted smart-home x10 zigbee zwave
Last synced: 26 Apr 2026
https://github.com/kscottz/PythonFromSpace
Python Examples for Remote Sensing
computer-vision gis image-processing imaging jupyter-notebook python satellite vision
Last synced: 08 May 2025
https://github.com/nv-tlabs/xcube
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
3d-generation computer-vision generative-ai graphics voxel
Last synced: 16 May 2025
https://github.com/raoyongming/GFNet
[NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification
computer-vision deep-learning image-classification image-recognition vision-transformer
Last synced: 08 May 2025
https://github.com/PaulDanielML/MuJoCo_RL_UR5
A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.
computer-vision gym-environment mujoco pick-and-place reinforcement-learning robotics
Last synced: 11 Jul 2025
https://github.com/francescosaveriozuppichini/glasses
High-quality Neural Networks for Computer Vision ๐
classification computer-vision convolutional-networks deep-learning deep-neural-networks machine-learning neural-network pretrained-models pytorch
Last synced: 04 Apr 2025
https://github.com/FrancescoSaverioZuppichini/glasses
High-quality Neural Networks for Computer Vision ๐
classification computer-vision convolutional-networks deep-learning deep-neural-networks machine-learning neural-network pretrained-models pytorch
Last synced: 27 Mar 2025
https://github.com/JunweiLiang/Object_Detection_Tracking
Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20
activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking
Last synced: 07 Apr 2025
https://github.com/skhadem/3D-BoundingBox
PyTorch implementation for 3D Bounding Box Estimation Using Deep Learning and Geometry
3d computer-vision localization pytorch
Last synced: 20 Mar 2025
https://github.com/danini/graph-cut-ransac
The Graph-Cut RANSAC algorithm proposed in paper: Daniel Barath and Jiri Matas; Graph-Cut RANSAC, Conference on Computer Vision and Pattern Recognition, 2018. It is available at http://openaccess.thecvf.com/content_cvpr_2018/papers/Barath_Graph-Cut_RANSAC_CVPR_2018_paper.pdf
computer-vision essential-matrix fundamental-matrix graph-cut-ransac homography pattern-recognition ransac robust robust-estimators
Last synced: 18 Mar 2025
https://github.com/vlgiitr/DL_Topics
List of DL topics and resources essential for cracking interviews
computer-vision deep-learning generative-models linear-algebra natural-language-processing probability-statistics
Last synced: 01 May 2025
https://github.com/junweiliang/object_detection_tracking
Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20
activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking
Last synced: 05 Apr 2025
https://github.com/vita-epfl/monoloco
A 3D vision library from 2D keypoints: monocular and stereo 3D detection for humans, social distancing, and body orientation.
3d-deep-learning 3d-detection 3d-object-detection 3d-vision computer-vision covid-19 deep-learning human-pose-estimation iccv2019 icra2021 kitti-dataset machine-learning object-detection openpifpaf pifpaf pose-estimation pytorch uncertainty
Last synced: 07 May 2025
https://github.com/abdur75648/deep-learning-specialization-coursera
This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.
andrew-ng andrew-ng-machine-learning computer-vision convolutional-neural-networks coursera deep-learning deep-learning-andrew-ng deep-learning-coursera deep-learning-specialization face-recognition lstm machine-learning neural-networks nlp object-detection recurrent-neural-networks tensorflow transformer-architecture unet-segmentation updated
Last synced: 04 Apr 2025
https://github.com/davidcaron/pclpy
Python bindings for the Point Cloud Library (PCL)
computer-vision lidar pcl point-cloud python
Last synced: 04 Apr 2025
https://github.com/CopterExpress/clover
ROS-based framework and RPi image to control PX4-powered drones ๐
aruco computer-vision drone education mavros optical-flow px4 quadcopter raspberry-pi robotics ros
Last synced: 07 May 2025
https://github.com/KevinMusgrave/powerful-benchmarker
A library for ML benchmarking. It's powerful.
benchmarking computer-vision deep-learning domain-adaptation machine-learning metric-learning pytorch transfer-learning
Last synced: 08 May 2025
https://github.com/ruiminshen/yolo2-pytorch
PyTorch implementation of the YOLO (You Only Look Once) v2
caffe2 computer-vision deep-learning deep-neural-networks object-detection onnx python pytorch yolo2
Last synced: 03 Apr 2025
https://weiyithu.github.io/NerfingMVS/
[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo
3d-reconstruction 3dvision computer-vision deep-learning multi-view-stereo neural-radiance-fields
Last synced: 21 Nov 2025
https://github.com/Mohitkr95/Best-Data-Science-Resources
This repository contains the best Data Science free hand-picked resources to equip you with all the industry-driven skills and interview preparation kit.
ai artificial-intelligence artificial-intelligence-algorithms aws computer-vision data data-structures datascience deep-learning git github jupyter-notebook machine-learning mongodb natural-language-processing neural-network python sql statistics
Last synced: 07 May 2025
https://github.com/cambrian-mllm/cambrian-s
Cambrian-S: Towards Spatial Supersensing in Video
computer-vision llm multimodal-large-language-models spatial-understanding vision-language-model
Last synced: 17 Jan 2026
https://github.com/weiyithu/NerfingMVS
[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo
3d-reconstruction 3dvision computer-vision deep-learning multi-view-stereo neural-radiance-fields
Last synced: 11 Apr 2025
https://github.com/Haochen-Wang409/U2PL
[CVPR'22] Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels
cityscapes computer-vision contrastive-learning cvpr2022 deep-learning mechine-learning pascal-voc pytorch semantic-segmentation semi-supervised-learning
Last synced: 02 Apr 2025
https://github.com/kevinmusgrave/powerful-benchmarker
A library for ML benchmarking. It's powerful.
benchmarking computer-vision deep-learning domain-adaptation machine-learning metric-learning pytorch transfer-learning
Last synced: 17 Nov 2025
https://github.com/ibaiGorordo/ONNX-YOLOv8-Object-Detection
Python scripts performing object detection using the YOLOv8 model in ONNX.
computer-vision deep-learning object-detection onnx onnxruntime opencv python yolo yolov8
Last synced: 21 Apr 2025
https://github.com/omerbsezer/fast-pytorch
Pytorch Tutorial, Pytorch with Google Colab, Pytorch Implementations: CNN, RNN, DCGAN, Transfer Learning, Chatbot, Pytorch Sample Codes
cnn cnn-pytorch colab-notebook colab-tutorial colaboratory computer-vision deep-learning machine-learning mlp python pytorch pytorch-implementation pytorch-tutorial rnn rnn-pytorch transfer-learning
Last synced: 01 Oct 2025
https://github.com/alturosdestinations/alturos.yolo
C# Yolo Darknet Wrapper (real-time object detection)
computer-vision csharp dotnet image-classification neural-network object-detection objectdetection visual-studio yolo yolo2 yolo3 yolov2 yolov3
Last synced: 16 May 2025
https://github.com/albarji/mixture-of-diffusers
Mixture of Diffusers for scene composition and high resolution image generation
ai computer-vision diffusion-models stable-diffusion
Last synced: 04 Apr 2025
https://github.com/tanjeffreyz/auto-maple
Artificial intelligence software for MapleStory that uses various machine learning and computer vision techniques to navigate challenging in-game environments
ai bot computer-vision deep-learning maplestory python
Last synced: 11 Apr 2025
https://github.com/facebookresearch/ego4d
Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset
computer-vision dataset feature-extraction video visuzalization
Last synced: 15 May 2025
https://github.com/ocampor/image-quality
Image quality is an open source software library for Image Quality Assessment (IQA).
artificial-intelligence computer-vision image-analysis image-quality-assessment machine-learning python tensorflow
Last synced: 21 Oct 2025
https://github.com/AlturosDestinations/Alturos.Yolo
C# Yolo Darknet Wrapper (real-time object detection)
computer-vision csharp dotnet image-classification neural-network object-detection objectdetection visual-studio yolo yolo2 yolo3 yolov2 yolov3
Last synced: 20 Apr 2025
https://github.com/nicholaskajoh/ivy
Video-based object counting software.
computer-vision object-counter object-counting video-analysis video-processing
Last synced: 13 Jul 2025
https://github.com/eliahuhorwitz/deepsim
Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)
computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image
Last synced: 06 Sep 2025
https://github.com/fabric-project/fabric
Node Creative Coding / 3D / Image Processing tool inspired by Quartz Composer
3d computer-vision creative-coding graphics llm metal mlx multimedia node-based post-processing realtime shaders swift swiftui video vlm
Last synced: 29 Jan 2026
https://github.com/3dlg-hcvc/plan2scene
Official implementation of the paper Plan2Scene.
3d-reconstruction computer-vision indoor-reconstruction machine-learning texture-synthesis
Last synced: 05 Apr 2025
https://actors-hq.com/
Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"
3d-reconstruction computer-graphics computer-vision machine-learning nerf siggraph2023
Last synced: 26 Mar 2025
https://github.com/LSH9832/edgeyolo
an edge-real-time anchor-free object detector with decent performance
anchor-free ascend computer-vision deep-learning edge-computing mnn object-detection object-detector onnx pytorch rk3588 rk3588s rknn tensorrt yolo
Last synced: 20 Mar 2025
https://github.com/eliahuhorwitz/DeepSIM
Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)
computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image
Last synced: 19 Jul 2025
https://github.com/DigitalSlideArchive/HistomicsTK
A Python toolkit for pathology image analysis algorithms.
bioimage-informatics computer-vision digital-slide-archive histology machine-learning medical-image-processing python
Last synced: 06 May 2025
https://github.com/aimagelab/multimodal-garment-designer
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
computer-vision diffusion-models dresscode fashionai generative-models iccv2023 image-editing latent-diffusion-models multimodal-fashion-image-editing stable-diffusion viton-hd
Last synced: 05 Apr 2025
https://github.com/rohan-paul/machinelearning-deeplearning-code-for-my-youtube-channel
The full collection of all codes for my Youtube Channel segregated as per topic.
computer-vision data-science data-science-portfolio datascience deep-learning deep-neural-networks machine-learning machine-learning-algorithms math neural-network python pytorch pytorch-implementation pytorch-tutorial statistics tensorflow tensorflow-examples tensorflow-tutorials tensorflow2 youtube
Last synced: 04 Apr 2025
https://github.com/ashishpatel26/resourcebank_cv_nlp_mlops_2022
This repository offers a goldmine of materials for students of computer vision, natural language processing, and machine learning operations.
computer-vision data-science deep-learning mlops natural-language-processing
Last synced: 05 Apr 2025
https://github.com/cyrusbehr/YOLOv8-TensorRT-CPP
YOLOv8 TensorRT C++ Implementation
computer-vision cpp machine-learning tensorrt yolo yolov8
Last synced: 21 Apr 2025
https://github.com/bchao1/Anime-Face-Dataset
๐ผ A collection of high-quality anime faces.
anime computer-vision dataset gan image-generation machine-learning
Last synced: 28 Apr 2025
https://github.com/hysts/anime-face-detector
Anime Face Detector using mmdet and mmpose
anime computer-vision face-detection face-landmark-detection pytorch
Last synced: 04 Apr 2025
https://github.com/bchao1/anime-face-dataset
๐ผ A collection of high-quality anime faces.
anime computer-vision dataset gan image-generation machine-learning
Last synced: 06 Apr 2025
https://github.com/IvanDrokin/torch-conv-kan
This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implementations of 1D, 2D, and 3D convolutions with different kernels, ResNet-like and DenseNet-like models, training code based on accelerate/PyTorch, as well as scripts for experiments with CIFAR-10 and Tiny ImageNet.
computer-vision convolutional-neural-networks kolmogorov-arnold-networks
Last synced: 01 Aug 2025
https://github.com/google-deepmind/dm_pix
PIX is an image processing library in JAX, for JAX.
computer-vision image image-processing jax machine-learning python
Last synced: 17 Jun 2025
https://github.com/MMMU-Benchmark/MMMU
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
computer-vision deep-learning deep-neural-networks evaluation foundation-models large-language-models large-multimodal-models llm llms machine-learning multimodal multimodal-deep-learning multimodal-learning multimodality natural-language-processing question-answering stem visual-question-answering
Last synced: 17 Apr 2025
https://github.com/digitalslidearchive/histomicstk
A Python toolkit for pathology image analysis algorithms.
bioimage-informatics computer-vision digital-slide-archive histology machine-learning medical-image-processing python
Last synced: 14 May 2025
https://github.com/frankkramer-lab/MIScnn
A framework for Medical Image Segmentation with Convolutional Neural Networks and Deep Learning
clinical-decision-support computer-vision convolutional-neural-networks deep-learning framework healthcare-imaging medical-image-analysis medical-image-processing medical-image-segmentation medical-imaging neural-network pip segmentation tensorflow
Last synced: 09 May 2025
https://github.com/una-dinosauria/human-motion-prediction
Simple baselines and RNNs for predicting human motion in tensorflow. Presented at CVPR 17.
computer-vision cvpr-2017 tensorflow
Last synced: 05 Apr 2025
https://github.com/ankonzoid/artificio
Deep Learning Computer Vision Algorithms for Real-World Use
ai applications artificial-intelligence auto-encoders computer-vision convolutional-neural-networks data-science deep-learning image-classification image-finder image-processing image-recognition image-retrieval machine-learning neural-networks object-recognition python recommender-system recommender-systems transfer-learning
Last synced: 08 May 2025
https://github.com/alicevision/popsift
PopSift is an implementation of the SIFT algorithm in CUDA.
computer-vision cuda feature-extraction gpu image-processing sift
Last synced: 05 Apr 2025
https://github.com/francescosaveriozuppichini/vit
Implementing Vi(sion)T(transformer)
Last synced: 06 Apr 2025
https://github.com/ahmedbesbes/cartoonify
Deploy and scale serverless machine learning app - in 4 steps.
aws aws-lambda cartoongan cartoonify computer-vision deployment gan machine-learning-production netlify netlify-deployment production pytorch react scalable-applications serverless serverless-framework streamlit style-transfer
Last synced: 05 Apr 2025
https://github.com/Ank-Cha/Social-Distancing-Analyser-COVID-19
A social distancing analyzer AI tool to regulate social distancing protocol using video surveillance of CCTV cameras and drones. Social Distancing Analyser to prevent COVID19
artificial-intelligence computer-vision covid-19 covid19 python social-distancing stay-home yolo
Last synced: 21 Apr 2025
https://github.com/vincentfung13/MINE
Code and models for our ICCV 2021 paper "MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis"
3d-reconstruction 3d-vision computer-vision deep-learning depth-estimation nerf novel-view-synthesis
Last synced: 20 Mar 2025
https://github.com/hustvl/queryinst
[ICCV 2021] Instances as Queries
computer-vision instance-segmentation mmdetection video-instance-segmentation
Last synced: 05 Apr 2025
https://github.com/imfing/ava_downloader
:arrow_double_down: Download AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)
aesthetic-visual-analysis ava computer-vision dataset python
Last synced: 05 Apr 2025
https://github.com/hustvl/QueryInst
[ICCV 2021] Instances as Queries
computer-vision instance-segmentation mmdetection video-instance-segmentation
Last synced: 03 Oct 2025
https://github.com/pochih/FCN-pytorch
๐ Easiest Fully Convolutional Networks
computer-vision deep-learning fully-convolutional-networks pytorch semantic-segmentation
Last synced: 13 May 2025
https://github.com/megengine/megflow
Efficient ML solution for long-tailed demands.
computer-vision deep-learning megengine megenginelite pipeline pipeline-framework python3 rust
Last synced: 05 Apr 2025
https://github.com/kunalj101/Data-Science-Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks
Last synced: 05 May 2025
https://github.com/dctian/DeepPiCar
Deep Learning Autonomous Car based on Raspberry Pi, SunFounder PiCar-V Kit, TensorFlow, and Google's EdgeTPU Co-Processor
artificial-intelligence autonomous-vehicles colab-notebook computer-vision convolutional-neural-networks deep-learning edgetpu end-to-end-machine-learning nvidia opencv picar python raspberry-pi sunfounder tensorflow tensorflow-tutorials transfer-learning
Last synced: 05 May 2025
https://github.com/Sha-Lab/FEAT
The code repository for "Few-Shot Learning via Embedding Adaptation with Set-to-Set Functions"
computer-vision cvpr2020 few-shot few-shot-learning machine-learning meta-learning
Last synced: 15 Mar 2025
https://github.com/dctian/deeppicar
Deep Learning Autonomous Car based on Raspberry Pi, SunFounder PiCar-V Kit, TensorFlow, and Google's EdgeTPU Co-Processor
artificial-intelligence autonomous-vehicles colab-notebook computer-vision convolutional-neural-networks deep-learning edgetpu end-to-end-machine-learning nvidia opencv picar python raspberry-pi sunfounder tensorflow tensorflow-tutorials transfer-learning
Last synced: 05 Apr 2025
https://github.com/davidstutz/superpixel-benchmark
An extensive evaluation and comparison of 28 state-of-the-art superpixel algorithms on 5 datasets.
benchmark computer-vision evaluation image-procesing opencv superpixel-algorithms superpixels
Last synced: 05 Apr 2025
https://github.com/MegEngine/MegFlow
Efficient ML solution for long-tailed demands.
computer-vision deep-learning megengine megenginelite pipeline pipeline-framework python3 rust
Last synced: 26 Apr 2025
https://github.com/patrikhuber/superviseddescent
C++11 implementation of the supervised descent optimisation method
computer-vision landmark-detection machine-learning modern-cpp
Last synced: 06 Apr 2025
https://github.com/kunalj101/data-science-hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks
Last synced: 28 Oct 2025
https://github.com/lvis-dataset/lvis-api
Python API for LVIS Dataset
computer-vision object-detection
Last synced: 21 Oct 2025
https://github.com/carefree0910/carefree-learn
Deep Learning โค๏ธ PyTorch
algorithm automl computer-vision data-science deep-learning ensemble machine-learning numpy python pytorch tabular-data tabular-datasets
Last synced: 13 Apr 2025
https://github.com/foo123/filter.js
Video and Image Processing and Computer Vision Library in pure JavaScript (Browser and Node.js)
browser computer-vision glsl image-processing machine-learning node-js parallel-processing real-time video-processing wasm web-assembly webgl
Last synced: 16 May 2025
https://github.com/sparkfish/augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
augmentation-pipeline computer-vision crappification data-augmentation data-pipeline deep-neural-networks image-processing machine-learning synthetic-data synthetic-dataset-generation training-data
Last synced: 10 Apr 2025
https://github.com/amosgropp/IGR
Implicit Geometric Regularization for Learning Shapes
3d-reconstruction computer-vision deep-learning
Last synced: 11 May 2025
https://github.com/mihaibujanca/dynamicfusion
Implementation of Newcombe et al. CVPR 2015 DynamicFusion paper
3d-reconstruction computer-vision cuda non-rigid opencv
Last synced: 30 Jan 2026
https://github.com/yassouali/CCT
:page_facing_up: Semi-Supervised Semantic Segmentation with Cross-Consistency Training (CVPR 2020).
computer-vision semantic-segmentation semi-supervised-learning
Last synced: 02 Apr 2025
https://github.com/landskape-ai/triplet-attention
Official PyTorch Implementation for "Rotate to Attend: Convolutional Triplet Attention Module." [WACV 2021]
arxiv attention-mechanism attention-mechanisms computer-vision convolutional-neural-networks deep-learning detection gradcam imagenet paper triplet-attention
Last synced: 08 May 2025
https://github.com/yassouali/cct
:page_facing_up: Semi-Supervised Semantic Segmentation with Cross-Consistency Training (CVPR 2020).
computer-vision semantic-segmentation semi-supervised-learning
Last synced: 06 Apr 2025
https://github.com/overeasy-sh/overeasy
Orchestrate zero-shot computer vision models
agent agents artificial-intelligence computer-vision llms open-source vision-framework
Last synced: 25 Sep 2025
https://github.com/amarlearning/finger-detection-and-tracking
Finger Detection and Tracking using OpenCV and Python
computer-vision finger-detection hand-detection matplotlib numpy object-detection opencv opencv-python python stargazers tracking-by-detection
Last synced: 04 Apr 2025
https://github.com/amazon-science/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
computer-vision few-shot object-detection pretraining
Last synced: 16 May 2025
https://github.com/choyingw/synergynet
3DV 2021: Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry
2d-3d 3d 3d-face-alignment 3d-face-reconstruction 3dv2021 3dvision computer-graphics computer-vision deep-neural-networks facial-keypoints facial-landmarks head-pose-estimation pytorch
Last synced: 08 Apr 2025
https://github.com/akanametov/yolov8-face
YOLO Face ๐ in PyTorch
computer-vision detection face face-detection object-detection pytorch yolov10 yolov10-face yolov11 yolov11-face yolov8 yolov8-drone yolov8-face yolov8-football yolov8-parking
Last synced: 02 Apr 2025
https://github.com/commaai/rednose
Kalman filter library
computer-vision hacktoberfest kalman kalman-filter odometry slam
Last synced: 19 Jun 2025
https://github.com/Yang7879/3D-BoNet
๐ฅ3D-BoNet in Tensorflow (NeurIPS 2019, Spotlight)
3d-object-detection 3d-point-clouds 3d-vision computer-vision instance-segmentation
Last synced: 20 Mar 2025
https://github.com/foo123/FILTER.js
Video and Image Processing and Computer Vision Library in pure JavaScript (Browser and Node.js)
browser computer-vision glsl image-processing machine-learning node-js parallel-processing real-time video-processing wasm web-assembly webgl
Last synced: 05 Apr 2025
https://github.com/shoumikchow/bbox-visualizer
Make drawing and labeling bounding boxes easy as cake
annotation annotations bboxes bounding-box bounding-boxes boundingbox computer-vision computer-vision-tools cv data-labeling deep-learning image-annotation image-labeling image-labeling-tool labeling object-detection object-recognition python3
Last synced: 07 Apr 2025
https://github.com/koldim2001/yolo-patch-based-inference
Python library for YOLO small object detection and instance segmentation
computer-vision detection fastsam inference non-maximum-suppression patch-based patch-based-inference patch-inference patchify pip-package pypi-package rtdetr sahi slicing-inference small-object-detection yolo yolo11 yolov8 yolov8-seg yolov9
Last synced: 14 Apr 2025
https://github.com/mkocabas/pare
Code for ICCV2021 paper PARE: Part Attention Regressor for 3D Human Body Estimation
3d-human-reconstruction 3d-human-shape-and-pose-estimation computer-graphics computer-vision human-pose-estimation
Last synced: 05 Apr 2025
https://github.com/rapidsai/cucim
cuCIM - RAPIDS GPU-accelerated image processing library
computer-vision cuda digital-pathology gpu image-analysis image-data image-processing medical-imaging microscopy multidimensional-image-processing nvidia segmentation
Last synced: 11 Apr 2025
https://github.com/allenai/procthor
๐๏ธ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses
ai2-thor computer-vision embodied-ai procedural-generation robotics
Last synced: 24 Oct 2025
https://github.com/ShirAmir/dino-vit-features
Official implementation for the paper "Deep ViT Features as Dense Visual Descriptors".
co-segmentation computer-vision deep-learning dino part-segmentation pytorch semantic-correspondence vision-transformers
Last synced: 03 Apr 2025
https://github.com/eddyhkchiu/mahalanobis_3d_multi_object_tracking
[NeurIPS Workshop 2019] Official code of the paper "Probabilistic 3D Multi-Object Tracking for Autonomous Driving." First Place of the First NuScenes Tracking Challenge in the AI Driving Olympics Workshop of NeurIPS.
3d-multi-object-tracking autonomous-driving computer-vision kalman-filter robotics
Last synced: 20 Mar 2025