Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan
- Aliases: machine-vision, computervision
- Last updated: 2026-07-01 00:06:02 UTC
https://github.com/agentmorris/megadetector
MegaDetector is an AI model that helps conservation folks spend less time doing boring things with camera trap images.
camera-traps cameratraps computer-vision conservation ecology machine-learning megadetector wildlife
Last synced: 16 May 2025
https://github.com/chengchunhsu/everypixelmatters
Implementation of ECCV 2020 paper "Every Pixel Matters: Center-aware Feature Alignment for Domain Adaptive Object Detector"
adversarial-learning anchor-free computer-vision domain-adaptation eccv eccv-2020 eccv2020 fcos object-detection pytorch transfer-learning unsupervised-domain-adaptation
Last synced: 30 Jan 2026
https://github.com/ahundt/grl
Robotics tools in C++11. Implements soft real time arm drivers for Kuka LBR iiwa plus V-REP, ROS, Constrained Optimization based planning, Hand Eye Calibration and Inverse Kinematics integration.
atracsys c-plus-plus calibration computer-vision constrained-optimization control driver grl hand-eye-calibration iiwa inverse-kinematics kuka kuka-lbr-iiwa optimization real-time robot robotics ros vrep vrep-plugin
Last synced: 12 Jul 2025
https://github.com/joelibaceta/video-keyframe-detector
A simple Python tool that extracts key frames from a video file using peak estimation on frame differences.
cli-tools computer-vision key-frame opencv peakutils python3 video
Last synced: 04 Apr 2025
https://github.com/lumiguide/haskell-opencv
Haskell binding to OpenCV-3.x
computer-vision haskell image-processing opencv
Last synced: 09 Apr 2025
https://github.com/Julian-Wyatt/AnoDDPM
CVPR Workshop paper - AnoDDPM: Anomaly Detection with Denoising Diffusion Probabilistic Models using Simplex Noise
anomaly-detection computer-vision cvpr ddpm dl medical segmentation
Last synced: 14 Mar 2025
https://github.com/vsymbol/CUTIE
CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)
computer-vision deep-learning text-extraction
Last synced: 02 Apr 2025
https://github.com/dr413677671/promptgallery-stable-diffusion-webui
A prompt cookbook that works as a stable-diffusion-webui extension.
computer-vision deep-learning gan gradio image-generation stable-diffusion stable-diffusion-webui stable-diffusion-webui-plugin vuejs
Last synced: 21 Aug 2025
https://github.com/dr413677671/PromptGallery-stable-diffusion-webui
A prompt cookbook that works as a stable-diffusion-webui extension.
computer-vision deep-learning gan gradio image-generation stable-diffusion stable-diffusion-webui stable-diffusion-webui-plugin vuejs
Last synced: 18 Jul 2025
https://github.com/imoonlab/hyper-yolo
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
computer-vision deep-learning hypergraph-neural-networks object-detection yolo
Last synced: 05 Apr 2025
https://github.com/jonathanraiman/tensorflow-infogan
:dolls: InfoGAN: Interpretable Representation Learning
chairs-dataset computer-vision gan generative-adversarial-network infogan mnist neural-network python tensorflow
Last synced: 03 Jul 2025
https://github.com/thebabylonai/babylog
A lightweight logger for machine learning teams to log images and predictions in production.
computer-vision cvops data-science logger logging-library machine-learning ml mlops python python3
Last synced: 07 May 2025
https://github.com/customdiffusion360/custom-diffusion360
CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control
camera-pose-control computer-vision customization generative-model pytorch stable-diffusion text-to-image
Last synced: 27 Mar 2025
https://github.com/arthurdouillard/cvpr2021_plop
Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation
computer-vision continual-learning cvpr2021 deep-learning semantic-segmentation
Last synced: 21 Jun 2025
https://github.com/DarshanDeshpande/jax-models
Unofficial JAX implementations of deep learning research papers
artificial-intelligence computer-vision convolutional-neural-networks deep-learning flax jax machine-learning research-paper-implementation transformers
Last synced: 13 May 2025
https://github.com/htcr/sam_road
Segment Anything Model for large-scale, vectorized road network extraction from aerial imagery. CVPRW 2024
autonomous-driving computer-vision cvpr2024 graph graph-neural-networks graph-representation-learning mapping navigation remote-sensing scene-graph scene-graph-generation segment-anything segmentation-models transformers
Last synced: 12 Jan 2026
https://github.com/karan-malik/facemaskdetector
Real time face-mask detection using Deep Learning and OpenCV
cnn cnn-keras coding computer-vision convolutional-neural-networks coronavirus covid-19 covid19 deep-learning deep-neural-networks face-mask-detection opencv project python python3
Last synced: 09 Apr 2025
https://github.com/viplix3/BoTSORT-cpp
C++ implementation of BoT-SORT MOT algorithm with Re-ID and Camera Motion Compensation
computer-vision cpp kalman-filter multi-object-tracking reid
Last synced: 18 Mar 2025
https://github.com/sanyam5/arc-pytorch
The first public PyTorch implementation of Attentive Recurrent Comparators
computer-vision deep-learning deep-neural-networks machine-learning pytorch recurrent-neural-networks
Last synced: 19 Jul 2025
https://github.com/grib0ed0v/face_recognition.pytorch
Deep Face Recognition in PyTorch
am-softmax arcface center-loss computer-vision cosface deep-face-recognition deep-learning face-recognition focal-loss landmark-detection lfw metric-learning mobilefacenet pytorch sphereface sv-softmax vggface2
Last synced: 20 Nov 2025
https://github.com/devrimcavusoglu/pybboxes
Lightweight toolkit for bounding boxes, providing conversion between bounding box types and simple computations.
computer-vision deep-learning image-recognition machine-learning numpy python pytorch tensorflow
Last synced: 08 Apr 2025
https://github.com/ai4ce/disconet
[NeurIPS2021] Learning Distilled Collaboration Graph for Multi-Agent Perception
3d-object-detection 3d-scene-understanding autonomous-driving collaborative-learning communication-networks computer-vision deep-learning graph graph-learning knowledge-distillation multi-agent-learning multi-agent-perception multi-agent-system point-cloud-processing v2v v2x-communication
Last synced: 24 Feb 2026
https://github.com/hzwer/wacv2024-safa
WACV2024 - Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution
computer-vision deep-learning super-resolution video-processing
Last synced: 05 Apr 2025
https://github.com/skalskip/sketchy-vision
Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV stick around!
computer-vision convolutional-neural-networks deep-learning neural-network non-maximum-suppression numpy object-detection
Last synced: 14 Apr 2025
https://github.com/neural-nuts/image-caption-generator
[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow
artificial-intelligence captioning-images computer-vision convolutional-neural-networks image-captioning lstm lstm-neural-networks natural-language-generation neural-network recurrent-neural-networks tensorflow
Last synced: 14 May 2025
https://github.com/jevois/jevois
JeVois smart machine vision framework
arm computer-vision cpp edgetpu hailo linux machine-learning myriadx neon neural-networks python
Last synced: 15 Mar 2025
https://github.com/erogol/raspisecurity
Home Surveillance for Raspberry Pi
computer-vision opencv raspberry-pi surveillance
Last synced: 28 Oct 2025
https://github.com/amazon-science/video-contrastive-learning
Video Contrastive Learning with Global Context, ICCVW 2021
computer-vision contrastive-learning iccv-2021 self-supervised-learning video-understanding
Last synced: 03 May 2025
https://github.com/1adrianb/binary-face-alignment
Real time face alignment
binary-convolutions computer-vision convolutional-neural-networks deep-learning face-alignment torch7
Last synced: 25 Jun 2025
https://github.com/umbertogriffo/fast-near-duplicate-image-search
Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.
computer-vision duplicate-detection duplicates-removed hashing image-deduplication image-processing kd-tree kdtree near-duplicate nearest-neighbor-search nearest-neighbors perceptual-hashing phash t-sne
Last synced: 06 Mar 2025
https://github.com/line/lighthouse
[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
audio audio-moment-retrieval audio-processing computer-vision highlight-detection moment-retrieval multimodal natural-language-processing video video-moment-retrieval video-processing
Last synced: 04 Jul 2025
https://github.com/ofa-sys/ofasys
OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models
audio computer-vision deep-learning motion multimodal-learning multitask-learning nlp pretrained-models pytorch transformers vision-and-language
Last synced: 24 Oct 2025
https://github.com/ai4ce/DiscoNet
[NeurIPS2021] Learning Distilled Collaboration Graph for Multi-Agent Perception
3d-object-detection 3d-scene-understanding autonomous-driving collaborative-learning communication-networks computer-vision deep-learning graph graph-learning knowledge-distillation multi-agent-learning multi-agent-perception multi-agent-system point-cloud-processing v2v v2x-communication
Last synced: 20 Mar 2025
https://github.com/patrikhuber/eos-model-viewer
3D model viewer for the eos Morphable Model library
3d-face 3d-morphable-face-model computer-vision face-models
Last synced: 21 Jul 2025
https://github.com/stephanecharette/DarkHelp
C++ wrapper library for Darknet
computer-vision cpp darknet machine-learning neural-network yolo
Last synced: 20 Apr 2025
https://github.com/aydinnyunus/facerecognitionsecurity
Face Recognition Security
computer-vision cv2 cyber-security cybersecurity face-detect face-detection face-detection-using-opencv face-recognition face-recognition-application face-recognition-python face-recognizer opencv opencv-python opencv2 protection python python3 security security-tools
Last synced: 29 Apr 2025
https://github.com/eddieoz/youtube-clips-automator
MARCELO: an AI powered bot to automate the editing and thumbnail creation for your Youtube clips channel
ai audio-processing automation bot clip computer-vision editing thumbnail video video-processing youtube
Last synced: 20 Oct 2025
https://github.com/ai-forever/ru-clip
CLIP implementation for Russian language
Last synced: 20 Jun 2025
https://github.com/aakashjhawar/face-recognition-using-deep-learning
Identify faces from video and images using OpenCV and Deep Learning
computer-vision cv2 deep-learning face face-classification face-detection face-embeddings face-recognition face-recognition-python image-processing live-face-recognition machine-learning neural-network opencv openface python svm
Last synced: 12 Apr 2025
https://github.com/oxedom/parker
Parking detection and monitoring webapp that runs entirely in the browser
computer-vision object-detection parking-management tensorflowjs webrtc yolov7
Last synced: 02 May 2025
https://github.com/addy1997/robotics-resources
List of commonly used robotics libraries and packages
cognition collision-checking computer-vision international-conference mobile-robotics motion-planning perception robot-perception robot-vision robotics-libraries robotics-resources slam visualization
Last synced: 09 Nov 2025
https://github.com/VedankPurohit/LiveRecall
Welcome to **LiveRecall**, the open-source alternative to Microsoft's Recall. LiveRecall captures snapshots of your screen and allows you to recall them using natural language queries, leveraging semantic search technology. For added security, all images are encrypted.
computer-vision memory productivity python recall sementic
Last synced: 11 Sep 2025
https://github.com/lit26/streamlit-img-label
streamlit-img-label is a graphical image annotation tool using streamlit. Annotations are saved as XML files in PASCAL VOC format.
computer-vision image-annotation image-processing
Last synced: 27 Jun 2025
https://github.com/LeapLabTHU/Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
computer-vision cvpr2022 deep-learning multimodal-deep-learning pytorch vision-and-language visual-grounding
Last synced: 03 Apr 2025
https://github.com/astra-vision/rain-rendering
Rain Rendering for Evaluating and Improving Robustness to Bad Weather (Tremblay et al., 2020) (S. S. Halder et al., 2019)
computer-vision fog gan particles-simulation physically-based-rendering rain
Last synced: 06 Feb 2026
https://github.com/rayryeng/xiaohuluvpdetection
This is a Python + OpenCV implementation of the Vanishing Point algorithm by Xiaohu Lu et al. - http://xiaohulugo.github.io/papers/Vanishing_Point_Detection_WACV2017.pdf
computer-vision image-processing line-detection opencv python rotation-matrix vanishing-points
Last synced: 27 Oct 2025
https://github.com/pyxu-org/pyxu
Modular and scalable computational imaging in Python with GPU/out-of-core computing.
array-api array-computing artificial-intelligence bayesian-methods computational-imaging computer-vision foundation-models image-enhancement image-processing image-reconstruction image-restoration inverse-problem machine-learning matrix-free modular python-hpc python-library scalable sensing sparse-learning
Last synced: 17 Mar 2026
https://github.com/megvii-basedetection/autoassign
Pytorch implementation of "AutoAssign: Differentiable Label Assignment for Dense Object Detection"
coco-dataset computer-vision object-detection
Last synced: 09 Oct 2025
https://github.com/zekunhao1995/DualSDF
DualSDF: Semantic Shape Manipulation using a Two-Level Representation. CVPR'20
computer-vision machine-learning
Last synced: 20 Mar 2025
https://github.com/esimov/forensic
Copy-move image forgery detection library.
computer-vision dct digital-signal-processing forgery-detection golang image-forensics image-processing
Last synced: 17 Mar 2025
https://github.com/Megvii-BaseDetection/AutoAssign
Pytorch implementation of "AutoAssign: Differentiable Label Assignment for Dense Object Detection"
coco-dataset computer-vision object-detection
Last synced: 14 Mar 2025
https://github.com/taipingeric/yolo-v4-tf.keras
A simple tf.keras implementation of YOLO v4
computer-vision keras keras-model object-detection python tensorflow tensorflow2 yolo yolov4
Last synced: 18 Jan 2026
https://github.com/adityaoberai/alt-text-generator
Generate alt text for your images!
a11y accessibility azure-openai-service computer-vision hacktoberfest javascript serverless svelte vercel vite
Last synced: 22 Aug 2025
https://github.com/zhmiao/opencompounddomainadaptation-ocda
Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)
computer-vision cvpr2020 deep-learning domain-adaptation ocda open-compound-domain-adaptation pytorch-implementation
Last synced: 23 Jul 2025
https://github.com/sayakpaul/maxim-tf
Implementation of MAXIM in TensorFlow.
all-mlp computer-vision conv deblurring dehazing denoising deraining enhancement gmlp keras tensorflow
Last synced: 14 Apr 2025
https://github.com/gopala-kr/summary
summaries of all the papers I read
arxiv blockchain-technology computer-vision deep-learning information-retrieval machine-learning neural-network nlp-resources quantum-computing reinforcement-learning speech-recognition
Last synced: 07 Apr 2025
https://github.com/gjy3035/gcc-cl
GCC dataset Collector and Labeler (GCC CL) [CVPR2019]
computer-vision crowd-analysis crowd-counting crowd-segmentation cvpr2019 gtav
Last synced: 10 Apr 2025
https://github.com/allenai/dnw
Discovering Neural Wirings (https://arxiv.org/abs/1906.00586)
computer-vision machine-learning neural-networks
Last synced: 24 Oct 2025
https://github.com/MCG-NJU/MeMOTR
[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
computer-vision deep-learning multi-object-tracking tracking
Last synced: 20 Mar 2025
https://github.com/sign-language-translator/sign-language-translator
Python library & framework to build custom translators for the hearing-impaired and translate between Sign Language & Text using Artificial Intelligence.
artificial-intelligence attention-mechanism computer-vision deep-learning encoder-decoder language-models machine-learning motion-transfer nlp pose-estimation python pytorch rule-based-nlp sign-language sign-language-recognition sign-language-translation sign-to-text text-to-sign transformers translation
Last synced: 06 Apr 2025
https://github.com/microsoft/snca.pytorch
Improving Generalization via Scalable Neighborhood Component Analysis
computer-vision deep-learning eccv-2018 few-shot-learning nearest-neighbors transfer-learning visual-recognition
Last synced: 24 Oct 2025
https://github.com/ProGamerGov/neural-dream
PyTorch implementation of DeepDream algorithm
computer-vision deep-dream deepdream googlenet inception neural-dream nin pytorch pytorch-deepdream resnet tiling vgg visualization
Last synced: 07 May 2025
https://github.com/arthurdouillard/dytox
Dynamic Token Expansion with Continual Transformers, accepted at CVPR 2022
cifar100 computer-vision continual-learning dynamic-network imagenet transformer
Last synced: 16 Mar 2025
https://github.com/w-m/3dgs-compression-survey
An open survey on 3D Gaussian Splatting compression methods
3d-gaussian-splatting 3d-reconstruction compression computer-vision gaussian-splatting paper-survey
Last synced: 05 Apr 2025
https://github.com/giannisdaras/ylg
[CVPR 2020] Official Implementation: "Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models".
attention-mechanism computer-vision cvpr cvpr20 cvpr2020 gans generative-adversarial-networks inverse-problems machine-learning
Last synced: 17 Mar 2025
https://github.com/souvikmajumder26/land-cover-semantic-segmentation-pytorch
Building an end-to-end Promptable Semantic Segmentation (Computer Vision) project from training to inferencing a model on LandCover.ai data (Satellite Imagery).
ai-project ai-projects artificial-intelligence computer-vision deep-learning end-to-end image-segmentation landcover machine-learning-project machine-learning-projects ml-project ml-projects neural-network promptable promptable-segmentation pytorch satellite-imagery segmentation-models-pytorch semantic-segmentation unet
Last synced: 22 Apr 2025
https://github.com/stoic1979/robovision
AI and machine learning-based computer vision for a robot
artificial-intelligence classification computer-vision face-detection face-recognition haar-cascade-classifier machine-learning opencv pyqt5 surveillance
Last synced: 19 Jul 2025
https://github.com/VSainteuf/utae-paps
PyTorch implementation of U-TAE and PaPs for satellite image time series panoptic segmentation.
agriculture computer-vision deep-learning iccv iccv2021 panoptic-segmentation satellite-imagery self-attention semantic-segmentation sentinel-2
Last synced: 08 May 2025
https://github.com/bytedance/sptsv2
The official implementation of SPTS v2: Single-Point Text Spotting
artificial-intelligence computer-vision deep-learning ocr research
Last synced: 18 Aug 2025
https://github.com/digantamisra98/echo
Python package containing all custom layers used in Neural Networks (Compatible with PyTorch, TensorFlow and MegEngine)
algorithms computer-vision deep-learning deep-learning-algorithms deep-neural-networks deeplearning functions gitbook machine-learning machine-learning-algorithms mathematics megengine nlp python pytorch tensorflow tensorflow2
Last synced: 09 Apr 2025
https://github.com/dlpbc/keras-kinetics-i3d
Keras implementation of Inflated 3D (I3D) from the Quo Vadis paper, plus weights
action-recognition computer-vision deep-learning pretrained-kinetics-model
Last synced: 26 Jul 2025
https://github.com/juniorxsound/threadeddepthcleaner
Threaded depth-map cleaning and inpainting using OpenCV
computer-vision depth-camera depth-map image-filters image-processing librealsense2 opencv
Last synced: 02 Mar 2026
https://github.com/arthurdouillard/deepcourse
Learn Deep Learning for Computer Vision in three steps: theory from the basics to SotA, code in PyTorch, and spaced repetition with Anki
anki computer-vision course deep-learning pytorch research
Last synced: 30 Oct 2025
https://github.com/likojack/bnv_fusion
This repository implements our CVPR2022 paper "BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion"
3d-reconstruction computer-vision
Last synced: 29 Mar 2025
https://github.com/ai4ce/occ4cast
Occ4cast: LiDAR-based 4D Occupancy Completion and Forecasting
3d-perception 3d-scene-understanding 3d-to-4d artificial-intelligence autonomous-driving autonomous-vehicles computer-vision dataset lidar machine-learning occupancy-grid-map occupancy-prediction point-cloud spatial-temporal-forecasting temporal-perception
Last synced: 12 Jul 2025
https://github.com/hv0905/nekoimagegallery
An AI-powered natural language & reverse Image Search Engine powered by CLIP & qdrant.
clip computer-vision image-search image-search-engine search-engine transformers
Last synced: 06 Apr 2025
https://github.com/sayakpaul/mirnet-tflite-trt
TensorFlow Lite models for MIRNet for low-light image enhancement.
computer-vision tensorflow2 tflite tinyml
Last synced: 15 Mar 2026
https://github.com/ibaigorordo/sapiens-pytorch-inference
Minimal code and examples for running inference with Sapiens foundation human models in PyTorch
computer-vision deep-learning depth-estimation inference pose-estimation python pytorch semantic-segmentation
Last synced: 04 Apr 2025
https://github.com/Lucid1ty/Yolov5ForCSGO
CSGO agent detection and auto aim
aimbot auto-aim autoaim chinese computer-vision csgo detection english object-detection python python3 pytorch yolov5
Last synced: 21 Apr 2025
https://github.com/wtct-hungary/UnityVision-iOS
This native plugin enables Unity to take advantage of specific features of Core-ML and Vision Framework on the iOS platform.
c-sharp computer-vision core-ml coreml ios machine-learning unity unity3d
Last synced: 08 May 2025
https://github.com/z-x-yang/aot
Associating Objects with Transformers for Video Object Segmentation
computer-vision video-object-segmentation
Last synced: 12 Jan 2026
https://github.com/qengineering/rpi-image
Raspberry Pi 4 Buster 64-bit OS with deep learning examples
aarch64 armv8 computer-vision cpp deep-learning face-recognition mnn ncnn opencv paddle-lite pose-estimation raspberry-pi-4 raspberry-pi-64-os raspberry-pi-image sd-card-image ssd tensorflow tensorflow-lite
Last synced: 21 Feb 2026
https://github.com/ternaus/people_segmentation
Code for a model that segments people in images
computer-vision deep-learning image-segmentation people-segmentation python semantic-segmentation
Last synced: 10 Apr 2025
https://github.com/videokit-ai/videokit
Cross-platform, low-code media SDK for Unity Engine.
computer-vision natml unity3d user-generated-content video-editing video-effects video-filter video-recording
Last synced: 13 Apr 2025
https://github.com/salvatorera/ml-news-of-the-week
A collection of the best ML and AI news every week (research, news, resources)
agents ai artificial-intelligence computer-vision deeplearning llms machine-learning nlp python rag retrieval-augmented-generation transformer
Last synced: 05 Apr 2025
https://github.com/sayakpaul/supervised-contrastive-learning-in-tensorflow-2
Implements the ideas presented in https://arxiv.org/pdf/2004.11362v1.pdf by Khosla et al.
computer-vision contrastive-learning deep-learning keras representation-learning supervised-contrastive-learning tensorflow
Last synced: 15 Apr 2025
https://github.com/jac99/MinkLoc3D
MinkLoc3D: Point Cloud Based Large-Scale Place Recognition
3d-convolutional-network 3d-vision computer-vision deep-learning metric-learning minkowski-engine place-recognition point-cloud
Last synced: 20 Mar 2025
https://github.com/lingdong-/chinese-hershey-font
Convert Chinese Characters to Single-Line Fonts using Computer Vision
chinese computer-vision convert font hershey-fonts typography
Last synced: 14 May 2025
https://github.com/decisionforce/pgdrive
PGDrive: an open-ended driving simulator with infinite scenes from procedural generation
autonomous-driving computer-vision deep-learning generalization imitation-learning machine-learning multi-agent panda3d procedural-generation reinforcement-learning simulation simulator
Last synced: 24 Dec 2025
https://github.com/exadel-inc/compreface-python-sdk
Python SDK for CompreFace - free and open-source face recognition system from Exadel
compreface compreface-sdk computer-vision face-detection face-recognition face-recognition-python face-verification facenet insightface python sdk
Last synced: 07 Apr 2025
https://github.com/Ground-A-Video/Ground-A-Video
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)
computer-vision diffusion-models iclr stable-diffusion text-to-video video-editing
Last synced: 28 Mar 2025
https://github.com/geekquad/pixel-processing
This repository focuses on implementing various OpenCV features in Python. The aim is to have a minimal implementation of all OpenCV features together, under one roof.
computer-vision hacktoberfest hacktoberfest2021 image-processing implementation jupyter-notebook machine-learning open-source opencv opencv-python python scratch-implementation
Last synced: 15 Mar 2025
https://github.com/xilaili/AOGNet
Code for CVPR 2019 paper: "Learning Deep Compositional Grammatical Architectures for Visual Recognition"
aognet cifar10 cifar100 computer-vision deep-learning deep-neural-networks image-classification imagenet mxnet
Last synced: 16 Apr 2025
https://github.com/see2sound/see2sound
Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
audio-processing computer-vision see-2-sound
Last synced: 07 Apr 2026
https://github.com/hkchengrex/mask-propagation
[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.
computer-vision cvpr2021 deep-learning pytorch segmentation video-object-segmentation video-segmentation
Last synced: 22 Apr 2025
https://github.com/choyingw/cross-modal-perceptionist
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
3d 3d-models 3dmm biometrics cognitive-science computer-vision cross-modal-learning cvpr cvpr2022 deep-learning machine-learning pytorch speech speech-synthesis speech-to-face
Last synced: 05 Apr 2025
https://github.com/vita-group/orthogonality-in-cnns
[NeurIPS '18] "Can We Gain More from Orthogonality Regularizations in Training Deep CNNs?" Official Implementation.
cnns computer-vision deep-learning orthogonal
Last synced: 06 Mar 2026
https://github.com/amazon-science/crossnorm-selfnorm
CrossNorm and SelfNorm for Generalization under Distribution Shifts, ICCV 2021
computer-vision domain-adaptation domain-generalization iccv-2021 model-robustness natural-language-processing normalization
Last synced: 03 May 2025