Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2026-07-01 00:06:02 UTC
- JSON Representation
https://github.com/cansik/deep-vision-processing
Deep computer-vision algorithms for the Processing framework.
classification computer-vision cuda-support deep-neural-networks inference-engine machine-learning pose-estimation processing
Last synced: 15 Apr 2025
https://github.com/visioncortex/visioncortex
Core library for computer graphics & vision applications
computer-graphics computer-vision rust
Last synced: 06 Apr 2025
https://github.com/kwan3854/ceb_econ
AI based all-in-one character generator Blender plug-in. This project contains unofficial updates for the CEB_ECON Blender add-on.
3d-graphics 3d-modeling 3d-reconstruction blender-addon character-generator computer-graphics computer-vision game-development metaverse model-generator pifu stable-diffusion texture-generation virtual-humans
Last synced: 27 Sep 2025
https://github.com/nasa-jpl-memex/image_space
Interactive Image similarity and Visual Search and Retrieval application
alexnet computer-vision deep-learning image-analysis image-recognition image-similarity image-viewer jpl kitware machine-learning python tika tika-python
Last synced: 10 Mar 2026
https://github.com/jaisayush/Fatigue-Detection-System-Based-On-Behavioural-Characteristics-Of-Driver
A computer vision system made with the help of opencv that can automatically detect driver drowsiness in a real-time video stream and then play an alarm if the driver appears to be drowsy.
anaconda-environment anaconda3 computer-vision cv cv2 drowsiness-detection drowsy-driver-warning-system imutils miniproject numpy opencv opencv-python opencv2 playsound python python3 scipy
Last synced: 10 Apr 2025
https://github.com/interdigitalinc/compressai-vision
CompressAI-Vision helps you design, test and compare Video Compression for Machines pipelines. Compression methods can be either pulled from custom AI-based modules from CompressAI or traditional codecs such as H.266/VVC.
compression computer-vision deep-learning machine-to-machine-communication pytorch video-compression
Last synced: 10 Oct 2025
https://github.com/libmir/dcv
Computer Vision Library for D Programming Language
computer-vision image-processing
Last synced: 04 Jan 2026
https://github.com/zhanghang1989/torch-encoding-layer
Deep Texture Encoding Network
computer-vision deep-learning deep-neural-networks encoding texture
Last synced: 15 Jun 2025
https://github.com/imoonlab/hyper-yolov1.1
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
computer-vision deep-learning hypergraph hypergraph-neural-networks object-detection yolo
Last synced: 07 Apr 2025
https://github.com/ternaus/check_orientation
Model to check if image was rotated by 90, 180, 270 degrees.
computer-vision deep-learning image-classification orientation-detection
Last synced: 04 Jul 2025
https://github.com/distant-viewing/dvt
Distant Viewing Toolkit for the Analysis of Visual Culture
computer-vision cultural-analytics digital-humanities
Last synced: 17 Mar 2025
https://github.com/kiyoon/nvim-hand-gesture
Write programs with hand gestures
action-recognition computer-vision gesture-recognition neovim nvim nvim-plugin vim
Last synced: 25 Oct 2025
https://github.com/mdsrqbl/omnihuman
AI model that understands text & humanoids.
computer-vision generative-ai nlp pose-synthesis pytorch transformers
Last synced: 15 Apr 2025
https://github.com/cambridgeltl/visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
computer-vision multimodal-deep-learning nlp vision-and-language
Last synced: 03 Apr 2025
https://github.com/kevalmorabia97/cova-web-object-detection
A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!
attention computer-vision convolutional-neural-networks deep-learning graph-attention-networks graph-convolutional-networks information-extraction multimodal-learning object-detection pytorch visual-attention
Last synced: 07 Apr 2025
https://github.com/AIVFI/Video-Frame-Interpolation-Rankings-and-Video-Deblurring-Rankings
Rankings include: ABME AdaFNIO ALANET AMT BiT BVFI CDFI CtxSyn DBVI DeMFI DQBC DRVI EAFI EBME EDC EDENVFI EDSC EMA-VFI FGDCN FILM FLAVR H-VFI IFRNet IQ-VFI JNMR LADDER M2M MA-GCSPA NCM PerVFI PRF ProBoost-Net RIFE RN-VFI SoftSplat SSR ST-MFNet Swin-VFI TDPNet TTVFI UGFI UPR-Net UTI-VFI VFIformer VFIFT VFIMamba VFIT VIDUE VRT
artificial-intelligence computer-vision deblurring deep-learning frame-interpolation interpolation low-level-vision machine-learning motion-blur motion-deblurring ncnn restoration rife tensorrt vapoursynth video-deblurring video-frame-interpolation video-interpolation video-processing video-restoration
Last synced: 06 Mar 2025
https://github.com/saizk/deep-learning-for-solar-panel-recognition
CNN models for Solar Panel Detection and Segmentation in Aerial Images.
cnn computer-vision deep-learning google-maps image-segmentation object-detection pv-systems solar-panels
Last synced: 19 Aug 2025
https://github.com/robert80203/HuPR-A-Benchmark-for-Human-Pose-Estimation-Using-Millimeter-Wave-Radar
The official implementation of HuPR: A Benchmark for Human Pose Estimation Using Millimeter Wave Radar
computer-vision deep-learning millimeter-wave radar
Last synced: 18 Nov 2025
https://github.com/gantman/rps_tfjs_demo
Training a Rock Paper Scissors model in the browser via TFJS - Learn along style
computer-vision machine-learning machinelearning react rock-paper-scissors rps tensorflow tensorflow-js tensorflowjs
Last synced: 06 Apr 2025
https://github.com/scut-aitcm/Competitive-Inner-Imaging-SENet
Source code of paper: (not available now)
attention-mechanism computer-vision deep-learning mxnet residual-networks
Last synced: 16 Apr 2025
https://github.com/jmmanley/vgg-multiple-view-geometry
A set of MATLAB utilities for multiple view geometry, provided alongside Hartley & Zisserman's "Multiple View Geometry in Computer Vision, Second Edition" (2004). Obtained from http://www.robots.ox.ac.uk/~vgg/hzbook/code/.
3d-reconstruction camera-calibration computer-vision multiple-view-geometry
Last synced: 05 Mar 2025
https://github.com/joydeepmedhi/anchor-boxes-with-kmeans
How to initialize Anchors in Faster RCNN for custom dataset?
anchor anchor-box aspect-ratio bounding-boxes clusters computer-vision custom-dataset detection distance-metric faster-rcnn hyperparameters iou kmeans object-detection python tensorflow-models
Last synced: 02 May 2025
https://github.com/bytedeco/sbt-javacv
Start using OpenCV in your JVM project in just 1 line, no separate compiling, installing OpenCV, or fussing with your system required
computer-vision javacv opencv-java sbt-plugin scala
Last synced: 13 Apr 2025
https://github.com/khanhnamle1994/deep-learning
Assignmends done for Udacity's Deep Learning MOOC with Vincent Vanhoucke
computer-vision convolutional-neural-networks deep-learning natural-language-processing recurrent-neural-networks tensorflow
Last synced: 25 Apr 2025
https://github.com/eutropicai/final2x-core
core for Final2x
ccrestoration computer-vision cross-platform image-processing low-level-vision python3 pytorch super-resolution
Last synced: 08 Oct 2025
https://github.com/qdata/lamp
ECML 2019: Graph Neural Networks for Multi-Label Classification
computer-vision graph-attention-networks graph-neural-networks multi-label-classification nlp transformers
Last synced: 13 Apr 2025
https://github.com/abcSup/NotEnoughSleepAI
Implementation of the Multi-Task Multi-Sensor Fusion for 3D Object Detection paper by Uber
computer-vision deep-learning neural-networks
Last synced: 20 Mar 2025
https://github.com/ndrplz/dreyeve
[TPAMI 2018] Predicting the Driverโs Focus of Attention: the DR(eye)VE Project. A deep neural network learnt to reproduce the human driver focus of attention (FoA) in a variety of real-world driving scenarios.
adas attention automotive autonomous-driving autonomous-vehicles computer-vision convolutional-neural-networks deep-learning driver-focus
Last synced: 10 Jul 2025
https://github.com/datarootsio/tutorial-face-mask-detection
In this project, we develop a pipeline to detect unmasked faces in images. This can, for example, be used to alert people that do not wear a mask when entering a building.
classification computer-vision deep-learning face-mask-detection face-mask-detector facemaskdetection keras opencv python-3 tensorflow tutorial vggface2
Last synced: 11 Apr 2025
https://github.com/sayakpaul/dreambooth-keras
Implementation of DreamBooth in KerasCV and TensorFlow.
computer-vision dreambooth generative-ai keras keras-cv stable-diffusion tensorflow
Last synced: 19 Apr 2025
https://github.com/EutropicAI/Final2x-core
core for Final2x
ccrestoration computer-vision cross-platform image-processing low-level-vision python3 pytorch super-resolution
Last synced: 06 Sep 2025
https://github.com/bluemirrors/cvu
Computer Vision deployment tools for dummies and experts. CVU aims at making CV pipelines easier to build and consistent around platforms, devices, and models.
computer-vision object-detection onnxruntime pytorch tensorflow2 tensorrt yolov5
Last synced: 08 May 2025
https://github.com/sahagobinda/GPM
Official [ICLR] Code Repository for "Gradient Projection Memory for Continual Learning"
computer-vision continual-learning deep-learning optimizaiton pytorch
Last synced: 08 May 2025
https://github.com/ashishpatel26/tensorflow-advanced-techniques-specialization
Tensorflow Advanced Technique Specialization
computer-vision coursera coursera-assignment cousera-projects cousera-specialization deep-learning deeplearning-ai image-processing object-detection semantic-segmentation specialization tensorflow-tutorials tensorflow2
Last synced: 23 Oct 2025
https://github.com/kuanghuei/clean-net
Tensorflow source code for "CleanNet: Transfer Learning for Scalable Image Classifier Training with Label Noise" (CVPR 2018)
computer-vision deep-learning label-noise large-scale-learning tensorflow transfer-learning
Last synced: 01 Mar 2026
https://github.com/khrylx/egopose
[ICCV 2019] Official PyTorch Implementation of "Ego-Pose Estimation and Forecasting as Real-Time PD Control". ICCV 2019.
character-controller computer-vision physics-based pose-estimation pose-forecasting reinforcement-learning
Last synced: 05 Mar 2026
https://github.com/funkelab/gunpowder
A library to facilitate machine learning on multi-dimensional images.
computer-vision gunpowder machine-learning multidimensional pipeline
Last synced: 06 Apr 2026
https://github.com/cvg/scrstudio
[CVPR 2025] A unified framework for Scene Coordinate Regression-based visual localization
3d-reconstruction 3d-vision computer-vision deep-learning pose-estimation scene-coordinate-regression visual-localization
Last synced: 11 Oct 2025
https://github.com/norlab-ulaval/PercepTreeV1
Implementation of Grondin et al. 2022 "Tree Detection and Diameter Estimation Based on Deep Learning". Also includes datasets and some of the pretrained models.
computer-vision datasets forestry
Last synced: 09 May 2025
https://github.com/wgcban/spin_roadmapper
Official implementation of our ICRA'22 paper: SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and Interaction Space Graph Reasoning for Autonomous Driving
aerial-imagery autonomous-driving computer-vision road-detection satellite-imagery segmentation
Last synced: 26 Apr 2025
https://github.com/nikolasent/road-semantic-segmentation
Udacity Self-Driving Car Engineer Nanodegree. Project: Road Semantic Segmentation
carnd classification computer-vision deep-learning fcn self-driving-car semantic-segmentation tensorflow
Last synced: 30 Apr 2025
https://github.com/lancopku/label-embedding-network
Label Embedding Network
cifar10 cifar100 computer-vision deep-learning label-embedding label-representation mnist natural-language-processing
Last synced: 25 Jan 2026
https://github.com/skalskip/som
Unofficial implementation and experiments related to Set-of-Mark (SoM) ๐๏ธ
chatgpt computer-vision gpt-4 multimodal
Last synced: 22 Apr 2025
https://github.com/M-Nauta/ProtoTree
ProtoTrees: Neural Prototype Trees for Interpretable Fine-grained Image Recognition, published at CVPR2021
computer-vision cvpr2021 decision-trees deep-neural-networks explainability explainable-ai explainable-ml fine-grained-classification fine-grained-visual-categorization interpretability interpretable-deep-learning interpretable-machine-learning pytorch
Last synced: 08 May 2025
https://github.com/cjekel/tindetheus
Build personalized machine learning models for Tinder using Python
classification-model computer-vision database facenet facenet-models machine-learning machinelearning python tinder
Last synced: 20 Aug 2025
https://github.com/tensoraws/final2x-core
core for Final2x
ccrestoration computer-vision cross-platform image-processing low-level-vision python3 pytorch super-resolution
Last synced: 13 Aug 2025
https://github.com/ultralytics/notebooks
Ultralytics Notebooks ๐
computer-vision datasets deep-learning examples examples-python instance-segmentation llm manufacturing medical-imaging ml notebooks notebooks-jupyter object-detection object-tracking ultralytics yolo yolo11 youtube
Last synced: 18 Jun 2025
https://github.com/hkchengrex/scribble-to-mask
[CVPR 2021] MiVOS - Scribble to Mask module
computer-vision cvpr2021 deep-learning interactive-segmentation pytorch segmentation
Last synced: 22 Apr 2025
https://github.com/locaal-ai/obs-detect
Object Detection in OBS, real-time, local, GPU optional
computer-vision live-streaming livestreaming object-detection object-tracking obs-studio obs-studio-plugin obsproject video-object-tracking yolo
Last synced: 11 May 2025
https://github.com/r3gm/insightsolver-colab
InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.
ai-ops aiops autogpt colab-notebook colorization computer-vision deep-learning llama-2 llama-cpp llm machine-learning object-detection stable-diffusion text-to-speech
Last synced: 12 Oct 2025
https://github.com/kitware/dive
Media annotation and analysis tools for web and desktop. Get started at https://viame.kitware.com
annotation computer-vision docker image-annotation machine-learning marine-biology object-detection video video-analytics video-annotation
Last synced: 05 Apr 2025
https://github.com/dipanjans/adversarial-learning-robustness
Contains materials for workshops pertaining to adversarial robustness in deep learning.
adversarial-attacks adversarial-learning computer-vision deep-learning python tensorflow
Last synced: 30 Apr 2025
https://github.com/google/localized-narratives
Localized Narratives
computer-vision image-captioning speech-analysis
Last synced: 13 Apr 2026
https://github.com/guaishou74851/pcnet
(TPAMI 2024) Practical Compact Deep Compressed Sensing [PyTorch]
algorithm-unrolling compressed-sensing compressive-sampling compressive-sensing computational-imaging computer-vision deep-learning deep-neural-networks deep-unfolding image-reconstruction image-restoration python pytorch sampling-matrix single-pixel-imaging structural-reparameterization
Last synced: 10 Apr 2025
https://github.com/miteshputhran/image-caption-generator
The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)
bleu-score caption-generator computer-vision flickr image-captioning jupyter-notebook lstm-model machine-learning natural-language-processing python
Last synced: 14 Apr 2025
https://github.com/aitorzip/keras-icnet
Keras implementation of Real-Time Semantic Segmentation on High-Resolution Images
autonomous-driving computer-vision deep-learning fully-convolutional-networks image-processing image-segmentation keras semantic-segmentation tensorflow
Last synced: 07 Apr 2025
https://github.com/kartik4949/tensorpipe
High Performance Tensorflow Data Pipeline with State of Art Augmentations and low level optimizations.
computer-vision datapipeline tensorflow
Last synced: 26 Oct 2025
https://github.com/devinterview-io/computer-vision-interview-questions
๐ฃ Computer Vision interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews computer-vision computer-vision-interview-questions computer-vision-questions computer-vision-tech-interview data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 14 Feb 2026
https://github.com/yahoo/graphkit
A lightweight Python module for creating and running ordered graphs of computations.
computer-vision machine-learning python
Last synced: 13 Apr 2025
https://github.com/fabioperez/skin-data-augmentation
Source code for the paper 'Data Augmentation for Skin Lesion Analysis' โ ๐ Best Paper Award at the ISIC Skin Image Analysis Workshop @ MICCAI 2018
computer-vision data-augmentation deep-learning machine-learning melanoma melanoma-classification pytorch skin-lesion-analysis
Last synced: 06 Apr 2025
https://github.com/moygcc/hpcwild
Human Performance Capture from Monocular Video in the Wild (3DV2021)
3d-vision 3dv2021 computer-vision
Last synced: 29 Sep 2025
https://github.com/tensoraws/finalrip
a distributed AI video processing tool
asynq computer-vision docker ffmpeg finalrip gin go golang vapoursynth video-processing
Last synced: 12 Jan 2026
https://github.com/zjunlp/mkg_analogy
[ICLR 2023] Multimodal Analogical Reasoning over Knowledge Graphs
analogical-reasoning analogy computer-vision dataset iclr iclr2023 kg knowledge-graph language-model markg mars multimodal natural-language-processing pre-trained-language-models prompt reasoning
Last synced: 24 Jul 2025
https://github.com/sayakpaul/adaptive-gradient-clipping
Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.
colab-notebook computer-vision deep-neural-networks normalization-free-training tensorflow2
Last synced: 30 Apr 2025
https://github.com/ShuweiShao/IEBins
[NeurIPS2023] IEBins: Iterative Elastic Bins for Monocular Depth Estimation
computer-vision deep-learning depth-estimation
Last synced: 05 May 2025
https://github.com/wellflat/cat-fancier
cat detection, recognition, classifier engine :cat:
computer-vision machine-learning python
Last synced: 21 Mar 2025
https://github.com/thushv89/manning_tf2_in_action
The official code repository for "TensorFlow in Action" by Manning.
computer-vision deep-learning machine-learning nlp notebook python tensorflow tensorflow2 tf tf2
Last synced: 18 Mar 2025
https://github.com/l1997i/LiM3D
๐ฅ(CVPR 2023) Less is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation
complexity computer-vision cvpr cvpr2023 deep-learning lidar point-cloud semantic-segmentation
Last synced: 20 Mar 2025
https://github.com/unity-technologies/datasetinsights
Synthetic Dataset Insights
computer-vision machine-learning pytorch synthetic-datasets unity-simulation
Last synced: 30 Aug 2025
https://github.com/magamig/duplicate-images-finder
๐๐ Find and Delete Duplicate Images / Photos
computer-vision duplicate-images image-recognition lowe sift
Last synced: 18 Jan 2026
https://github.com/iotJumpway/RPI-Examples
Examples of using the IoT JumpWay with the Raspberry Pi 3.
computer-vision iot iot-jumpway mqtt python raspberry-pi
Last synced: 17 Jul 2025
https://github.com/iotjumpway/rpi-examples
Examples of using the IoT JumpWay with the Raspberry Pi 3.
computer-vision iot iot-jumpway mqtt python raspberry-pi
Last synced: 14 Apr 2025
https://github.com/tohrusky/final2x-core
core for Final2x
artificial-intelligence ccrestoration computer-vision cross-platform image-processing low-level-vision ncnn opencv python3 pytorch super-resolution
Last synced: 22 Jun 2025
https://github.com/richardaecn/cvpr18-caption-eval
Learning to Evaluate Image Captioning. CVPR 2018
caption computer-vision cvpr2018 deep-learning evaluation-metrics image-captioning tensorflow
Last synced: 01 Mar 2026
https://github.com/ahmdtaha/knowledge_evolution
(CVPR-Oral 2021) PyTorch implementation of Knowledge Evolution approach and Split-Nets
classification computer-vision deep-learning deep-neural-networks few-shot-learning knowledge-distillation knowledge-evolution machine-learning python3 pytorch slim-networks
Last synced: 24 Jul 2025
https://github.com/Tohrusky/Final2x-core
core for Final2x
artificial-intelligence ccrestoration computer-vision cross-platform image-processing low-level-vision ncnn opencv python3 pytorch super-resolution
Last synced: 05 Mar 2025
https://github.com/svishnu88/TGS-SaltNet
Kaggle | 21st place solution for TGS Salt Identification Challenge
computer-vision deep-learning fastai kaggle-competition pytorch
Last synced: 19 Jul 2025
https://google.github.io/localized-narratives/
Localized Narratives
computer-vision image-captioning speech-analysis
Last synced: 16 Mar 2025
https://github.com/hongyehu/RG-Flow
This is project page for the paper "RG-Flow: a hierarchical and explainable flow model based on renormalization group and sparse prior". Paper link: https://arxiv.org/abs/2010.00029
computer-vision generative-model hierarchical-models normalizing-flow renormalization-group representation-learning sparse-coding
Last synced: 15 Jul 2025
https://github.com/lddl/go-darknet
Go bindings for Darknet (YOLO v4 / v7-tiny / v3)
computer-vision darknet darknet-bindings hacktoberfest neural-network object-detection yolo yolov2 yolov2-tiny yolov3 yolov3-tiny yolov4 yolov7 yolov7-tiny
Last synced: 10 Jul 2025
https://github.com/fabioperez/transito-cv
[DEPRECATED] Traffic sign detector and classifier that uses dlib and its implementation of the Felzenszwalb's version of the Histogram of Oriented Gradients (HoG) detector
computer-vision dlib hog traffic-signs
Last synced: 10 Oct 2025
https://github.com/gofynd/mildnet
Visual Similarity research at Fynd. Contains code to reproduce 2 of our research papers.
ai artificial-intelligence computer-vision deep-learning ecommerce feature-extraction image-retrieval keras machine-learning machinelearning ml recommendation-system recommender-system tensorflow visual-recommendations visual-search
Last synced: 07 May 2025
https://github.com/MiraPurkrabek/BBoxMaskPose
[ICCV 25] The official repository of paper 'Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle'
computer-vision human-pose-estimation iccv iccv2025 keypoint-detection pose-estimation research-paper
Last synced: 30 Jan 2026
https://github.com/s32x/gamedetect
:space_invader: Game Detection API using Tensorflow and Go
computer-vision docker gaming golang tensorflow
Last synced: 23 Apr 2025
https://github.com/LdDl/go-darknet
Go bindings for Darknet (YOLO v4 / v7-tiny / v3)
computer-vision darknet darknet-bindings hacktoberfest neural-network object-detection yolo yolov2 yolov2-tiny yolov3 yolov3-tiny yolov4 yolov7 yolov7-tiny
Last synced: 20 Apr 2025
https://github.com/theos-ai/easy-yolov7
This a clean and easy-to-use implementation of YOLOv7 in PyTorch, made with โค๏ธ by Theos AI.
computer-vision deep-learning machine-learning neural-networks object-detection python pytorch yolov5 yolov7
Last synced: 20 Apr 2025
https://github.com/Uehwan/3D-Scene-Graph
3D scene graph generator implemented in Pytorch.
3d-models 3d-scene-graph computer-vision deep-learning deeplearning environmental-modelling intelligence pytorch robot robotics scene-graph scenegraph
Last synced: 16 Apr 2025
https://github.com/Hallway-Inc/AvatarWebKit
Web-first SDK that provides real-time ARKit-compatible 52 blend shapes from a camera feed, video or image at 60 FPS using ML models.
animation ar augmented-reality avatar avatars blendshapes computer-vision face-det face-detection face-recognition face-tracking javascript javascript-library machine-learning metaverse metaverse-infrastructure motion-capture typescript typescript-library virtual-reality
Last synced: 01 Apr 2025
https://github.com/mukosame/aoda
Official implementation of "Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis"(WACV 2022/CVPRW 2021)
computer-vision cvpr cvpr2021 deep-learning gan gans image-generation image-manipulation image2image open-domain pytorch sketch wacv wacv2022
Last synced: 20 Oct 2025
https://github.com/vision-cair/3dcompat-v2
3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition
3d compositional-learning computer-vision deep-learning multimodal-deep-learning
Last synced: 15 Apr 2025
https://github.com/amazon-science/semimtr-text-recognition
Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)
computer-vision consistency-regularization contrastive-learning deep-learning ocr pytorch scene-text-recognition self-supervised-learning semi-supervised-learning text-recognition
Last synced: 14 Jun 2025
https://github.com/collidingscopes/threejs-handtracking-101
A threejs / WebGL / MediaPipe-powered interactive demo that allows you to control a 3D sphere using hand gestures.
3d computer-vision free gestures hand hand-tracking mediapipe open-source threejs tutorial webgl
Last synced: 14 Mar 2026
https://github.com/mkang315/RCS-YOLO
[MICCAI'23] Official implementation of "RCS-YOLO: A Fast and High-Accuracy Object Detector for Brain Tumor Detection".
br35h-brain-tumor-detection-2020 brain-tumor-detection computer-vision convolutional-neural-networks deep-learning deep-neural-networks medical-image-analysis medical-image-computing medical-image-dataset medical-image-detection medical-image-processing object-detection panet reparameterization repconv-model repvgg-model shufflenet vovnet yolo
Last synced: 20 Mar 2025
https://github.com/Suoivy/LPD-net
LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis, ICCV 2019, Seoul, Korea
computer-vision deep-learning place-recognition point-cloud
Last synced: 20 Mar 2025
https://github.com/thaoshibe/beautygan-pytorch-reimplementation
A re-implementation of BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network (ACM MM'18)
color-matching computer-vision computervision deep-learning gan makeup-transfer pytorch
Last synced: 07 Mar 2026
https://github.com/exadel-inc/compreface-javascript-sdk
JavaScript SDK for CompreFace - free and open-source face recognition system from Exadel
compreface compreface-sdk computer-vision face-detection face-recognition face-recognition-js face-verification facenet insightface javascript sdk
Last synced: 17 Mar 2025
https://github.com/marteinn/wagtail-alt-generator
Insert image description and tags with the help of computer vision
computer-vision django wagtail
Last synced: 10 Apr 2025
https://github.com/johnsmithm/multi-heads-attention-image-classification
Multi heads attention for image classification
attention-is-all-you-need attention-mechanisms computer-vision keras tensorflow
Last synced: 12 Apr 2025
https://github.com/harveyslash/patchmatch
GPU and CPU implementation of Patchmatch in Python!
computer-graphics computer-vision
Last synced: 08 Jul 2025
https://github.com/alemart/encantar-js
GPU-accelerated Augmented Reality for the web.
3d aframe ar augmented-reality babylonjs computer-vision image-tracking marker-detection marker-tracking mixed-reality orb threejs virtual-reality vr wasm webar webgl webxr xr
Last synced: 09 Apr 2025