Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2026-06-30 00:06:16 UTC
- JSON Representation
https://github.com/iboing/CliqueNet
Convolutional Neural Networks with Alternately Updated Clique (to appear in CVPR 2018)
computer-vision deep-learning image-recognition
Last synced: 20 Mar 2025
https://github.com/clovaai/assembled-cnn
Tensorflow implementation of "Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network"
computer-vision convolutional-neural-networks deep-learning food-101 image-classification imagenet inference-throughput mce robustness tensorflow transfer-learning
Last synced: 01 Aug 2025
https://github.com/frgfm/Holocron
PyTorch implementations of recent Computer Vision tricks (ReXNet, RepVGG, Unet3p, YOLOv4, CIoU loss, AdaBelief, PolyLoss, MobileOne). Other additions: AdEMAMix
ademamix computer-vision cspdarknet53 darknet deep-learning object-detection pytorch resnet rexnet tridentnet unet-image-segmentation yolo yolov4
Last synced: 05 Apr 2025
https://github.com/neuralmagic/sparsify
ML model optimization product to accelerate inference.
automl computer-vision deep-learning-accelerator image-classification inference-performance keras object-detection onnx pruning pytorch quantization smaller-models sparsification-recipe sparsify tensorflow
Last synced: 12 Apr 2025
https://github.com/brainhubeu/react-native-opencv-tutorial
๐ฉโ๐ซFully working example of the OpenCV library used together with React Native
camera computer-vision java javascript jest objective-c objective-c-plus-plus opencv react react-native react-native-camera reactnative
Last synced: 05 Apr 2025
https://github.com/IBM/powerai-counting-cars
Run a Jupyter Notebook to detect, track, and count cars in a video using Maximo Visual Insights (formerly PowerAI Vision) and OpenCV
computer-vision ibmcode jupyter-notebook mp4-video opencv powerai powerai-vision video
Last synced: 10 Apr 2025
https://github.com/gabrielarchanjo/marvinj
Javascript Image Processing Framework based on Marvin Framework
computer-vision image-processing javascript javascript-library
Last synced: 10 Jul 2025
https://github.com/vzhou842/cnn-from-scratch
A Convolutional Neural Network implemented from scratch (using only numpy) in Python.
computer-vision convolutional-neural-networks guide machine-learning neural-network python
Last synced: 09 Sep 2025
https://github.com/ishandutta0098/mukh
A comprehensive face analysis library that provides unified APIs for various face-related tasks
ai computer-vision deepfake-detection face-analysis face-detection face-reenactment facial-landmarks machine-learning opencv pip python torch
Last synced: 09 Oct 2025
https://github.com/khanhnamle1994/fashion-recommendation
A clothing retrieval and visual recommendation model for fashion images.
computer-vision deepfashion fashion-recommendation floydhub numpy pandas python3 resnet tensorflow
Last synced: 09 Apr 2025
https://github.com/z-x-yang/cfbi
The official implementation of CFBI(+): Collaborative Video Object Segmentation by (Multi-scale) Foreground-Background Integration.
computer-vision pytorch-implementation video-object-segmentation
Last synced: 25 Aug 2025
https://github.com/autoai-org/AID
One-Stop System for Machine Learning.
computer-vision deep-learning deep-learning-framework hacktoberfest hacktoberfest2021 machine-learning
Last synced: 03 May 2025
https://github.com/kobiso/Computer-Vision-Leaderboard
Comparison of famous convolutional neural network models
cars196 cnn computer-vision convolutional-neural-network cub200 image-retrieval imagenet in-shop sop sota
Last synced: 01 May 2025
https://github.com/Sierkinhane/mtcnn-pytorch
A face detection algorithm
computer-vision facedetector pytorch
Last synced: 20 Mar 2025
https://github.com/google-research/rigl
End-to-end training of sparse deep neural networks with little-to-no performance loss.
computer-vision machine-learning neural-networks sparse-training
Last synced: 06 Apr 2025
https://github.com/sierkinhane/mtcnn-pytorch
A face detection algorithm
computer-vision facedetector pytorch
Last synced: 07 Apr 2025
https://github.com/explosion/lightnet
๐ Bringing pjreddie's DarkNet out of the shadows #yolo
ai artificial-intelligence c computer-vision cython cython-wrapper darknet-image-classification image-classification machine-learning neural-network neural-networks object-detection python yolo
Last synced: 27 Sep 2025
https://github.com/rizwanmunawar/yolov8-object-tracking
YOLOv8 Object Tracking Using PyTorch, OpenCV and Ultralytics
computer-vision deep-learning object-detection object-tracking yolov8
Last synced: 16 May 2025
https://github.com/alexandre01/UltimateLabeling
A multi-purpose Video Labeling GUI in Python with integrated SOTA detector and tracker
computer-vision labeling labeling-tool object-detection object-tracking python pytorch video
Last synced: 23 Apr 2025
https://github.com/pylabel-project/pylabel
Python library for computer vision labeling tasks. The core functionality is to translate bounding box annotations between different formats-for example, from coco to yolo.
annotation-tool bounding-boxes coco computer-vision dataset object-detection yolov5
Last synced: 05 Apr 2025
https://github.com/LMD0311/PointMamba
PointMamba: A Simple State Space Model for Point Cloud Analysis
3d-point-clouds computer-vision mamba state-space-model
Last synced: 20 Mar 2025
https://github.com/BMW-InnovationLab/BMW-Labeltool-Lite
This repository provides you with an easy-to-use labeling tool for State-of-the-art Deep Learning training purposes. It supports Auto-Labeling.
annotaion auto-label autolabeling bounding-box boundingbox computer-vision deep-learning docker image-annotation inference label labeling-tool labeltool neural-network object-detection smart-labeling synthetic-data tensorflow voc yolov4
Last synced: 07 May 2025
https://github.com/fcakyon/labelme2coco
A lightweight package for converting your labelme annotations into COCO object detection format.
annotation-conversion coco computer-vision deep-learning image-classification instance-segmentation labelme linux machine-learning macos object-detection python semantic-segmentation windows
Last synced: 14 Jan 2026
https://github.com/yqyao/FCOS_PLUS
Some improvements (center sample) about FCOS (FCOS: Fully Convolutional One-Stage Object Detection).
anchor-free computer-vision fcos object-detection one-stage pytorch
Last synced: 13 Jul 2025
https://github.com/unity-technologies/robotics-object-pose-estimation
A complete end-to-end demonstration in which we collect training data in Unity and use that data to train a deep neural network to predict the pose of a cube. This model is then deployed in a simulated robotic pick-and-place task.
autonomy computer-vision deep-learning machine-learning manipulation model-training motion-planning perception physics-simulation pose-estimation robotics robotics-simulation ros simulation synthetic-data trajectory-generation tutorial unity ur3-robot-arm urdf
Last synced: 06 Apr 2025
https://github.com/YapaLab/yolo-face
YOLO Face ๐ in PyTorch
computer-vision detection face face-detection object-detection pytorch yolov10 yolov10-face yolov11 yolov11-face yolov8 yolov8-drone yolov8-face yolov8-football yolov8-parking
Last synced: 01 Oct 2025
https://github.com/BMW-InnovationLab/BMW-TensorFlow-Inference-API-GPU
This is a repository for an object detection inference API using the Tensorflow framework.
api computer-vision deep-learning deep-neural-networks detection-inference-api docker docker-swarm dockerfile gpu inference neural-network nvidia object-detection rest-api tensorflow tensorflow-framework tensorflow-models
Last synced: 07 Apr 2025
https://github.com/bmw-innovationlab/bmw-tensorflow-inference-api-gpu
This is a repository for an object detection inference API using the Tensorflow framework.
api computer-vision deep-learning deep-neural-networks detection-inference-api docker docker-swarm dockerfile gpu inference neural-network nvidia object-detection rest-api tensorflow tensorflow-framework tensorflow-models
Last synced: 16 May 2025
https://github.com/unity-technologies/peoplesanspeople
Unity's privacy-preserving human-centric synthetic data generator
applied-ml-research billing-5160 computer-vision deep-learning human-activity-recognition human-centric-ml human-pose-estimation icml-2022 labeling object-detection owner-machine-learning perception pose-estimation synthetic-data synthetic-data-generation synthetic-dataset-generation synthetic-datasets transfer-learning unity unity3d
Last synced: 06 Apr 2025
https://github.com/Unity-Technologies/PeopleSansPeople
Unity's privacy-preserving human-centric synthetic data generator
applied-ml-research billing-5160 computer-vision deep-learning human-activity-recognition human-centric-ml human-pose-estimation icml-2022 labeling object-detection owner-machine-learning perception pose-estimation synthetic-data synthetic-data-generation synthetic-dataset-generation synthetic-datasets transfer-learning unity unity3d
Last synced: 24 Apr 2025
https://github.com/0ssamaak0/dlta-ai
Data Labeling, Tracking and Annotation with AI
annotation computer-vision datasets-preparation pytorch
Last synced: 02 May 2026
https://github.com/andyzeng/arc-robot-vision
MIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.
3d amazon-robotics-challenge artificial-intelligence computer-vision deep-learning grasping manipulation mit-princeton rgbd vision
Last synced: 09 Apr 2025
https://github.com/linkedai/flip
Synthetic Image generation with Flip. Generate thousands of new 2D images from a small batch of objects and backgrounds.
computer-vision deep-learning generated-labels hacktoberfest object-detection synthetic-data
Last synced: 02 Apr 2026
https://github.com/andyzeng/apc-vision-toolbox
MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object segmentation and 6D object pose estimation.
3d amazon-picking-challenge artificial-intelligence computer-vision deep-learning marvin mit-princeton rgbd ros segmentation vision
Last synced: 09 Apr 2025
https://github.com/bfortuner/pytorch_tiramisu
FC-DenseNet in PyTorch for Semantic Segmentation
computer-vision deep-learning neural-network pytorch semantic-segmentation
Last synced: 09 Apr 2025
https://github.com/zubair-irshad/centersnap
Pytorch code for ICRA'22 paper: "Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation"
3d-computer-vision 3d-reconstruction 6d-pose-estimation 6dof-pose artificial-intelligence auto-encoder computer-vision deep-learning detection manipulation pose-estimation pytorch robotics shape-reconstruction single-shot-detection single-shot-reconstruction
Last synced: 06 Apr 2025
https://github.com/alexeyab/yolo2_light
Light version of convolutional neural network Yolo v3 & v2 for objects detection with a minimum of dependencies (INT8-inference, BIT1-XNOR-inference)
bit1-xnor-inference computer-vision machine-learning neural-network object-detection
Last synced: 09 Apr 2025
https://github.com/mantasu/cs231n
Shortest solutions for CS231n 2021-2024
computer-vision convolutional-neural-networks cs231n deep-learning pytorch tensorflow visual-recognition
Last synced: 06 Apr 2025
https://github.com/ayush1997/Xvision
Chest Xray image analysis using Deep learning !
chest-xray-images computer-vision deep-learning medical-imaging python tensorflow transfer-learning vggnet
Last synced: 19 Jul 2025
https://github.com/robustsam/RobustSAM
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
artificial-intelligence computer-vision cvpr cvpr2024 deep-learning eccv iccv image-processing sam segment-anything segment-anything-meta segment-anything-model segmentation zero-shot-segmentation
Last synced: 24 Jul 2025
https://github.com/AlexeyAB/yolo2_light
Light version of convolutional neural network Yolo v3 & v2 for objects detection with a minimum of dependencies (INT8-inference, BIT1-XNOR-inference)
bit1-xnor-inference computer-vision machine-learning neural-network object-detection
Last synced: 20 Apr 2025
https://github.com/ContrastiveSR/Contrastive_Learning_Papers
A list of contrastive Learning papers
computer-vision contrastive-learning deep-learning graph natural-language-processing natural-language-understanding research-paper self-supervised-learning
Last synced: 29 Mar 2025
https://github.com/jankrepl/pychubby
Automated face warping tool.
cli computer-vision dlib-face-detection dlib-face-recognition face face-recognition facial-detection facial-features landmark-detection machine-learning morphing transformation warping
Last synced: 10 Apr 2026
https://github.com/maciejczyzewski/neural-chessboard
โ An Extremely Efficient Chess-board Detection for Non-trivial Photos โ
chess chess-board computer-vision machine-learning
Last synced: 25 Jul 2025
https://github.com/bamwani/car-counting-and-speed-estimation-yolo-sort-python
This project imlements the following tasks in the project: 1. Vehicle counting, 2. Lane detection. 3.Lane change detection and 4.speed estimation
car car-counting computer-vision lane-change-detection lane-detection lane-segmentation object-detection opencv python3 sort sort-tracking speed-detection speed-estimation vehicle-counting vehicle-tracking video yolo
Last synced: 21 Apr 2025
https://github.com/WisconsinAIVision/few-shot-gan-adaptation
[CVPR '21] Official repository for Few-shot Image Generation via Cross-domain Correspondence
computer-vision few-shot-learning gans
Last synced: 08 May 2025
https://github.com/wisconsinaivision/few-shot-gan-adaptation
[CVPR '21] Official repository for Few-shot Image Generation via Cross-domain Correspondence
computer-vision few-shot-learning gans
Last synced: 09 Apr 2025
https://github.com/fcakyon/yolov5-pip
Packaged version of ultralytics/yolov5 + many extra features
aws cli coco computer-vision deep-learning machine-learning neptune neptune-ai object-detection pip pypi python pytorch s3 ultralytics yolo yolov3 yolov4 yolov5
Last synced: 14 May 2025
https://github.com/Anciukevicius/RenderDiffusion
computer-graphics computer-vision generative-model machine-learning
Last synced: 03 Apr 2025
https://github.com/mbeyeler/opencv-python-blueprints
M. Beyeler (2015). OpenCV with Python Blueprints: Design and develop advanced computer vision projects using OpenCV with Python, Packt Publishing Ltd., ISBN 978-178528269-0.
computer-vision image-processing machine-learning opencv python
Last synced: 02 Sep 2025
https://github.com/ZhangGongjie/SAM-DETR
[CVPR'2022] SAM-DETR & SAM-DETR++: Official PyTorch Implementation
computer-vision cvpr cvpr2022 deep-learning detection detr machine-learning object-detection pytorch transformer vision vision-transformer
Last synced: 20 Mar 2025
https://github.com/ZumoLabs/zpy
Synthetic data for computer vision. An open source toolkit using Blender and Python.
ai blender blender-addon computer-vision data deep-learning ml python synthetic synthetic-data
Last synced: 11 May 2025
https://github.com/luxonis/depthai-ros
Official ROS Driver for DepthAI Sensors.
camera camera-driver computer-vision cv depthai robotics ros
Last synced: 21 Jun 2025
https://github.com/zhkkke/pixelssl
A PyTorch-based Semi-Supervised Learning (SSL) Codebase for Pixel-wise (Pixel) Vision Tasks [ECCV 2020]
computer-vision pixel-wise-task pytorch-implementation semi-supervised toolbox
Last synced: 08 Apr 2025
https://github.com/ZHKKKe/PixelSSL
A PyTorch-based Semi-Supervised Learning (SSL) Codebase for Pixel-wise (Pixel) Vision Tasks [ECCV 2020]
computer-vision pixel-wise-task pytorch-implementation semi-supervised toolbox
Last synced: 02 Apr 2025
https://github.com/paddlecv-sig/paddlelabel
้ฃๆกจๆบ่ฝๆ ๆณจ๏ผ่ฎฉๆ ๆณจๅฟซไบบไธๆญฅ
annotations classification computer-vision deep-learning image-annotation instance-segmentation python semantic-segmentation
Last synced: 14 Dec 2025
https://github.com/mkalten/reactivision
computer vision framework for tangible interactive surfaces
blob-tracking camera computer-vision cross-platform fiducial-markers fiducial-symbols multi-touch open-sound-control reactivision tuio tuio-tracker
Last synced: 10 Sep 2025
https://github.com/stereolabs/zed-unity
ZED SDK Unity plugin
3d-reconstruction computer-vision csharp depth-camera human-pose-estimation machine-learning motion-capture point-cloud stereo-vision stereolabs unity unity3d vr zed zed-camera zed2
Last synced: 22 Feb 2026
https://github.com/nvidia-ai-iot/deepstream_pose_estimation
This is a DeepStream application to demonstrate a human pose estimation pipeline.
computer-vision deepstream human-pose-estimation jetson real-time tensorrt tesla
Last synced: 28 Jun 2025
https://github.com/manideep2510/eye-in-the-sky
Satellite Image Classification using semantic segmentation methods in deep learning
artificial-intelligence computer-vision deep-learning keras machine-learning pspnet remote-sensing satellite-image-classification satellite-images semantic-segmentation tensorflow unet
Last synced: 18 Jan 2026
https://github.com/ethereon/lycon
A minimal and fast image library for Python and C++
computer-vision cpp image-processing python
Last synced: 05 Apr 2025
https://github.com/dwctod/eccv2022-papers-with-code-demo
ๆถ้ ECCV ๆๆฐ็ๆๆ๏ผๅ ๆฌ่ฎบๆใไปฃ็ ๅdemo่ง้ข็ญ๏ผๆฌข่ฟๅคงๅฎถๆจ่๏ผ
ai computer-vision cv dataset diffusion eccv eccv2022 face-recognition image-segmentation multimodal-deep-learning nerf objection-detection vision-transformer
Last synced: 27 Jan 2026
https://github.com/DLR-RM/SingleViewReconstruction
Official Code: 3D Scene Reconstruction from a Single Viewport
computer-vision deep-learning eccv2020 open-source paper reconstruction singleview
Last synced: 14 Mar 2025
https://github.com/VlSomers/native-opencv-android-template
A tutorial for setting up OpenCV 4.6.0 (and other 4.x.y version) for Android in Android Studio with Native Development Kit (NDK) support for C++ development.
android android-ndk android-studio computer-vision cpp java java-native-interface jni kotlin kotlin-android ndk opencv opencv-android-release opencv-android-sdk opencv4android
Last synced: 02 Apr 2025
https://github.com/hsankesara/deepresearch
This repository is the collection of research papers in Deep learning, computer vision and NLP.
computer-vision deep-learning keras machine-learning nlp nueral-networks python3 research-paper
Last synced: 09 Apr 2025
https://github.com/VIAME/VIAME
Video and Image Analytics for Multiple Environments
annotation-framework artificial-intelligence computer-vision conservation deep-learning ecology image-annotation image-processing machine-learning marine-biology object-detection oceanography open-source video-analysis video-analytics video-annotation video-search
Last synced: 07 Apr 2025
https://github.com/amanovishnu/ineuron-full-stack-data-science-assignments
this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.
computer-vision data-science datascience deep-learning exploratory-data-analysis linear-regression machine-learning natural-language-processing python recommender-system sql statistics
Last synced: 08 Apr 2025
https://github.com/debkbanerji/lego-art-remix
Powerful computer vision assisted Lego mosaic creator ยท Over 1 million images created (so far!)
computer-vision custom-pictures deep-neural-network javascript lego lego-art lego-art-remix lego-mosaic onnx
Last synced: 31 Mar 2025
https://github.com/postech-cvlab/fastpointtransformer
Official source code of Fast Point Transformer, CVPR 2022
3d-vision computer-vision cvpr2022 point-cloud transformer
Last synced: 09 Apr 2025
https://github.com/POSTECH-CVLab/FastPointTransformer
Official source code of Fast Point Transformer, CVPR 2022
3d-vision computer-vision cvpr2022 point-cloud transformer
Last synced: 20 Mar 2025
https://github.com/fregu856/3DOD_thesis
3D Object Detection for Autonomous Driving in PyTorch, trained on the KITTI dataset.
3d-deep-learning 3d-detection autonomous-driving computer-vision deep-learning machine-learning object-detection pytorch
Last synced: 19 Jul 2025
https://github.com/hoangcuong2011/Good-Papers
I try my best to keep updated cutting-edge knowledge in Machine Learning/Deep Learning and Natural Language Processing. These are my notes on some good papers
computer-vision convolutional-networks deep-kernel-learning deep-learning gaussian-processes kernel-learning machine-learning natural-language-processing neural-machine-translation neural-network representation-learning semi-supervised-learning variational-autoencoders variational-inference
Last synced: 19 Jul 2025
https://github.com/preddy5/Im2Vec
[CVPR 2021 Oral] Im2Vec Synthesizing Vector Graphics without Vector Supervision
computer-graphics computer-vision cvpr2021 vector-graphics
Last synced: 03 Apr 2025
https://github.com/visipedia/annotation_tools
Visipedia Annotation Tools
annotations computer-vision datasets
Last synced: 04 Apr 2025
https://github.com/RizwanMunawar/yolov8-object-tracking
YOLOv8 Object Tracking Using PyTorch, OpenCV and Ultralytics
computer-vision deep-learning object-detection object-tracking yolov8
Last synced: 17 Apr 2025
https://github.com/amanovishnu/ineuron-full-stack-data-science-assignment-collection
this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.
computer-vision data-science datascience deep-learning exploratory-data-analysis linear-regression machine-learning natural-language-processing python recommender-system sql statistics
Last synced: 28 Feb 2025
https://github.com/anilbas/3dmmasstn
MatConvNet implementation for incorporating a 3D Morphable Model (3DMM) into a Spatial Transformer Network (STN)
3d-morphable-models 3dmm basel-face-model computer-vision convolutional-neural-networks dagnn deep-learning deep-neural-networks face machine-learning matconvnet matlab siamese-network spatial-transformer-network stn vgg-face-matconvnet
Last synced: 13 Apr 2025
https://github.com/bmw-innovationlab/bmw-yolov4-inference-api-gpu
This is a repository for an nocode object detection inference API using the Yolov3 and Yolov4 Darknet framework.
alexeyab-darknet api bounding-boxes computer-vision deep-learning deeplearning detection-inference-api docker dockerfile gpu inference inference-gui inference-server neural-network no-code rest-api yolo yolo-gui yolov3 yolov4
Last synced: 02 Jul 2025
https://github.com/gsurma/style_transfer
CNN image style transfer ๐จ.
cnn computer-vision convolutional-neural-networks deep-learning jupyter-notebook keras machine-learning neural-network notebook python style-transfer
Last synced: 09 Apr 2025
https://github.com/gszfwsb/NCFM
Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function" (NCFM) in CVPR 2025.
computer-vision data-centric-ai dataset-distillation synthetic-data
Last synced: 01 Apr 2025
https://github.com/kkanshul/finegan
FineGAN: Unsupervised Hierarchical Disentanglement for Fine-grained Object Generation and Discovery
computer-vision deep-learning disentangled-representations fine-grained gans generative-adversarial-network image-generation image-manipulation pytorch
Last synced: 08 May 2025
https://github.com/bharathsudharsan/tinyml-cam
Code for MobiCom paper 'TinyML-CAM: 80 FPS Image Recognition in 1 Kb RAM'
arduino-library c-plus-plus computer-vision edge-analytics edge-computing edgeimpulse embedded-c embedded-systems esp32-cam image-recognition iot-device tinyml
Last synced: 10 Apr 2025
https://github.com/DerrickXuNu/v2x-vit
[ECCV2022] Official Implementation of paper "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer"
3d-object-detection autonomous-driving collaborative-perception computer-vision deep-learning machine-learning multi-agent-system pytorch simulation v2x vehicle-to-everything vision-transformer
Last synced: 20 Mar 2025
https://github.com/lucasjinreal/thor
thor: C++ helper library, for deep learning purpose
computer-vision deeplearning object-detection tracking
Last synced: 07 Apr 2025
https://github.com/Haiyang-W/UniTR
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Birdโs-Eye-View Representation"
3d 3d-object-detection 3d-segmentation backbone bev camera computer-vision iccv2023 lidar multi-modal multi-view point-cloud transformer unified
Last synced: 20 Mar 2025
https://github.com/nianticlabs/map-free-reloc
[ECCV 2022] Map-free Visual Relocalization: Metric Pose Relative to a Single Image
benchmark computer-vision dataset eccv2022 pose-estimation pytorch relocalization visual-relocalization
Last synced: 06 Apr 2025
https://github.com/joaolages/diffusers-interpret
Diffusers-Interpret ๐ค๐งจ๐ต๏ธโโ๏ธ: Model explainability for ๐ค Diffusers. Get explanations for your generated images.
computer-vision deep-learning diffusers diffusion explainable-ai image-generation interpretability model-explainability primary-attributions pytorch text2image transformers
Last synced: 05 Apr 2025
https://github.com/jackpopc/ailearnnotes
Artificial Intelligence Learning Notes.
activation-function alexnet cnn computer-vision deep-learning dpm dropout harris hog image-augmentation image-denoising image-enhancement image-processing image-segmentation lenet machine-learning reinforcement-learning sift tensorflow vgg
Last synced: 24 Oct 2025
https://github.com/HusseinYoussef/Arabic-OCR
OCR system for Arabic language that converts images of typed text to machine-encoded text.
arabic character-segmentation computer-vision dataset image-processing machine-learning neural-network ocr opencv-python scikit-learn segmentation
Last synced: 02 Apr 2025
https://github.com/prasunroy/stefann
:fire: [CVPR 2020] STEFANN: Scene Text Editor using Font Adaptive Neural Network (official code).
color-transfer colornet computer-vision cvpr cvpr2020 deep-learning fannet font-generation scene-text-editor stefann
Last synced: 15 Jul 2025
https://github.com/riju/webcamera
Camera controls for the Web
camera-controls computer-vision demos getusermedia hdr hdr-image opencv opencv-js webrtc
Last synced: 07 Apr 2025
https://github.com/bnosac/image
Computer Vision and Image Recognition algorithms for R users
canny-edge-detection computer-vision contours darknet dlib f9 harris-corners harris-interest-point-detector hog-features image-algorithms image-recognition openpano otsu r r-package surf
Last synced: 04 Apr 2025
https://github.com/ducha-aiki/affnet
Code and weights for local feature affine shape estimation paper "Repeatability Is Not Enough: Learning Discriminative Affine Regions via Discriminability"
affine-shape-estimator computer-vision convolutional-networks convolutional-neural-networks deep-learning hessian image-matching image-retrieval local-features pytorch
Last synced: 09 Apr 2025
https://github.com/servicenow/highres-net
Pytorch implementation of HighRes-net, a neural network for multi-frame super-resolution, trained and tested on the European Space Agencyโs Kelvin competition. This is a ServiceNow Research project that was started at Element AI.
computer-vision deep-learning esa highres-net mfsr multi-frame proba-v satellite-imagery super-resolution
Last synced: 09 Oct 2025
https://github.com/iago-suarez/elsed
ELSED: Enhanced Line SEgment Drawing
computer-vision image-processing line-segment-detector local-features
Last synced: 06 Apr 2025
https://github.com/natanielruiz/disrupting-deepfakes
๐ฅ๐ฅDefending Against Deepfakes Using Adversarial Attacks on Conditional Image Translation Networks
adversarial-attacks computer-vision deep-learning deepfake-detection deepfakes defending defending-deepfakes disrupting-deepfakes face-swap faceswap fake-news machine-learning
Last synced: 04 Nov 2025
https://github.com/lannguyen0910/food-recognition
๐๐๐ Food analysis baseline with Theseus. Integrate object detection, image classification and multi-class semantic segmentation ๐๐๐
computer-vision deep-learning efficientnet food-analysis food-classification food-detection food-segmentation image-classification object-detection pytorch semantic-segmentation unetplusplus yolov5 yolov8
Last synced: 21 Apr 2025
https://github.com/ali-design/gan_steerability
On the "steerability" of generative adversarial networks
computer-vision deep-learning gan generative-adversarial-network
Last synced: 08 May 2025
https://github.com/eric-canas/qreader
Robust and Straight-Forward solution for reading difficult and tricky QR codes within images in Python. Powered by YOLOv8
computer-vision easy-to-use image-processing object-detection pip python pytorch pyzbar qr qrcode qrcode-reader qrcode-scanner yolov8
Last synced: 15 May 2025
https://github.com/roboflow/roboflow-100-benchmark
Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets
computer-vision dataset deep-learning machine-learning object-detection pytorch
Last synced: 16 May 2025