Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2026-07-01 00:06:02 UTC
- JSON Representation
https://github.com/longxiang-ai/human101
The official implementation of "Human101: Training 100+FPS Human Gaussians in 100s from 1 View".
3d 3d-reconstruction 3d-vision computer-vision digital-human gaussian gaussian-splatting human-reconstruction monocular real-time real-time-rendering view-synthesis
Last synced: 03 Jan 2026
https://github.com/hal-42/alchemycat
Alchemy Cat โโ ๐ฅConfig System for SOTA
auto-tuning computer-vision config deep-learning machine-learning parameter-tuning
Last synced: 14 Jan 2026
https://github.com/pooya-mohammadi/deep_utils
An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!
augmentation coco computer-vision cutmix deep-learning face-detection face-recognition machine-learning modelcheckpoint nlp object-detection python pytorch senet tensorflow utils vggface2 yolov5
Last synced: 16 May 2025
https://github.com/li-plus/seam-carving
A super-fast Python implementation of seam carving algorithm for intelligent image resizing.
computer-graphics computer-vision image-processing image-resizing python seam-carving
Last synced: 04 Mar 2026
https://github.com/ravisvi/super-resolution-videos
Applying SRGAN technique implemented in https://github.com/zsdonghao/SRGAN on videos to super resolve them.
computer-vision srgan super-resolution tensorlayer
Last synced: 13 Apr 2025
https://github.com/nrl-ai/daisykit
DaisyKit is an easy AI toolkit with face mask detection, pose detection, background matting, barcode detection, face recognition and more. - with NCNN, OpenCV, Python wrappers
background-matting barcode-detection computer-vision cpp deep-learning deployment embedded face-detection face-mask-detection hand-pose inference-engine machine-learning mobile ncnn neural-network no-code object-detection python vulkan
Last synced: 09 Mar 2026
https://github.com/jishnujayakumar/MV-Tractus
A simple tool to extract motion vectors from h264 encoded videos.
artificial-intelligence artificial-neural-networks compression computer-science computer-vision extract-motion-vectors h264 machine-learning motion-vectors mpeg-video mv-tractus video
Last synced: 18 Mar 2025
https://github.com/orrzohar/PROB
[CVPR 2023] Official Pytorch code for PROB: Probabilistic Objectness for Open World Object Detection
computer-vision object-detection open-set-object-detection open-world-object-detection owod
Last synced: 15 Jun 2025
https://github.com/ChargedMonk/Social-Distancing-using-YOLOv5
Classifying people as high risk and low risk based on their distance to other people.
computer-vision deep-learning opencv python pytorch social-distance-monitoring social-distancing social-distancing-detection yolo
Last synced: 21 Apr 2025
https://github.com/ai4ce/msg
[NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)
computer-vision deep-learning multiview neurips-2024 scene-graph scene-representations visual-navigation
Last synced: 11 Oct 2025
https://github.com/kenshohara/3d-resnets
3D ResNets for Action Recognition
action-recognition computer-vision deep-learning lua torch7 video-recognition
Last synced: 17 Mar 2026
https://github.com/soubhiksanyal/now_evaluation
This is the official repository for evaluation on the NoW Benchmark Dataset. The goal of the NoW benchmark is to introduce a standard evaluation metric to measure the accuracy and robustness of 3D face reconstruction methods from a single image under variations in viewing angle, lighting, and common occlusions.
2d-3d 3d-data 3d-face-alignment 3d-landmarks 3d-mesh 3d-reconstruction benchmark-datasets computer-vision face face-alignment face-reconstruction face-reconstruction-challenge flame flame-model now-evaluation python python3 single-image-reconstruction triplet-loss
Last synced: 16 Jan 2026
https://github.com/karolzak/boxdetect
BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on scanned forms.
bounding-boxes box-detection boxes checkbox checkboxes computer-vision cv2 documents forms handwritten-character-recognition handwritten-characters handwritten-documents handwritten-forms opencv opencv-python rectangle-detection scanned-documents scanned-image-pdfs scanned-images
Last synced: 07 Apr 2025
https://github.com/henzler/neuraltexture
Learning a Neural 3D Texture Space from 2D Exemplars [CVPR 2020]
computer-graphics computer-vision deep-learning machine-learning texture-synthesis
Last synced: 10 Oct 2025
https://github.com/RizwanMunawar/yolov5-object-tracking
YOLOv5 Object Tracking + Detection + Object Blurring + Streamlit Dashboard Using OpenCV, PyTorch and Streamlit
computer-vision object-detection object-tracking streamlit-dashboard yolov5
Last synced: 21 Apr 2025
https://github.com/ai4ce/V2X-Sim
[RA-L2022] V2X-Sim Dataset and Benchmark
benchmark collaborative-perception computer-vision dataset deep-learning machine-learning multi-robot-systems pytorch simulation v2x vehicle-to-everything
Last synced: 20 Mar 2025
https://github.com/rese1f/aurora
[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark
benchmark computer-vision deploy llm multimodal-large-language-models transformers video-understanding
Last synced: 19 Aug 2025
https://github.com/karayaman/lichess-with-a-real-board
Lichess.org client for real life chess boards.
Last synced: 15 Apr 2025
https://github.com/yihongXU/TransCenter
This is the official implementation of TransCenter (TPAMI). The code and pretrained models are now available here: https://gitlab.inria.fr/yixu/TransCenter_official.
computer-vision deep-learning multiple-object-tracking pytorch transformers
Last synced: 05 May 2025
https://github.com/furkan-gulsen/sport-with-ai
The human body is detected with the help of the Mediapipe library. Then, using the mathematical methods applied, it is determined how much the exercise count is done.
ai artificial-intelligence computer-vision deep-learning image-processing keras machine-learning mediapipe python python3 sport tensorflow
Last synced: 16 Oct 2025
https://github.com/MIT-SPARK/CertifiablyRobustPerception
Certifiable Outlier-Robust Geometric Perception
autonomous-driving computer-vision convex-optimization mixed-integer-programming outliers-detection point-cloud-registration robotics robust-estimation semidefinite-programming
Last synced: 20 Mar 2025
https://github.com/fasttrackorg/fasttrack
FastTrack is a cross-platform application designed to track multiple objects in video recording.
computer-vision image-processing linux macos opencv opencv4 qt qt6 research science tracking windows
Last synced: 05 Apr 2025
https://github.com/openvisionapi/ova-server
OpenVisionAPI server
api computer-vision deep-learning flask machine-learning object-detection python tensorflow tensorflow-lite yolov4
Last synced: 17 Apr 2025
https://github.com/the-ai-summer/learn-deep-learning
AI Summer's complete catalog of articles
computer-vision data-science deep-learning deep-neural-networks machine-learning natural-language-processing
Last synced: 09 Jul 2025
https://github.com/tonylianlong/crossmae
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
computer-vision deep-learning mae masked-autoencoder self-supervised-learning
Last synced: 15 Sep 2025
https://github.com/by-sabbir/headposeestimation
Head Pose Estimation using OpenCV solving PNP.
computer-vision headpose-estimation opencv
Last synced: 05 Mar 2025
https://github.com/MichiganCOG/M-PACT
A one stop shop for all of your activity recognition needs.
action-recognition activity-recognition c3d computer-vision convnet deep-learning i3d inception lrcn lrcn-network m-pact mpact resnet resnet-50 temporal-segment-networks tensorflow tfrecords tsn video
Last synced: 16 Nov 2025
https://github.com/QingyongHu/SQN
SQN in Tensorflow (ECCV'2022)
annotations computer-vision deep-learning point-cloud weakly-supervised-segmentation
Last synced: 20 Mar 2025
https://github.com/xiaosu-zhu/mcquic
Repository of CVPR'22 paper "Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression"
computer-vision cvpr2022 image-compression image-processing pytorch
Last synced: 12 Oct 2025
https://github.com/khronosgroup/openvx-samples
OpenVX Samples to use with any conformant implementation of OpenVX
api bubble-pop canny canny-edge-detection canny-live computer-vision cross-platform khronos-openvx khronosgroup open-source open-standard opencv openvx openvx-conformant openvx-graph openvx-sample openvx-samples royalty-free skin-tone-detect skin-tone-sample
Last synced: 13 Jun 2025
https://github.com/clovaai/frostnet
FrostNet: Towards Quantization-Aware Network Architecture Search
classification computer-vision deep-learning int8-quantization network-architecture object-detection optimizers post-quantization pytorch quantization quantization-aware-training quantization-efficient-network semantic-segmentation style-transfer
Last synced: 06 Oct 2025
https://github.com/dthuerck/mapmap_cpu
A high-performance general-purpose MRF MAP solver, heavily exploiting SIMD instructions.
algorithm computer-vision dynamic-programming feedback-vertex-set markov-random-field massively-parallel mrf simd simd-programming vectorization
Last synced: 21 Apr 2025
https://github.com/ZichengDuan/EZIGen
[BMVC 2025] Official implementation for paper EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance
aigc computer-vision deep-learning diffusers diffusion-models generative-ai
Last synced: 14 Sep 2025
https://github.com/connervieira/Predator
The ultimate customizable dash-cam platform, with ALPR and object recognition capabilities
alpr computer-vision dashcam data-analysis gps gpx-files license-plate-detection license-plate-recognition object-detection object-recognition openalpr opencv-python realtime security tensorflow video
Last synced: 18 Jul 2025
https://github.com/twke18/SPML
Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning
computer-vision deep-learning metric-learning semantic-segmentation weakly-supervised-learning
Last synced: 26 Mar 2025
https://github.com/JunlinHan/YOCO
Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut, ICML 2022.
cifar10 cifar100 classification computer-vision contrastive-learning data-augmentation data-augmentation-strategies data-augmentations icml imagenet imagenet-classifier instance-segmentation object-detection pytorch rain-removal super-resolution
Last synced: 15 Aug 2025
https://github.com/dmitryryumin/neweraai-papers
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!
artificial-intelligence computer-vision cvpr deep-learning emnlp icassp iccv image-processing interspeech ismir mashine-learning natural-language-processing neural-networks signal-processing text-classification video-processing
Last synced: 12 Apr 2025
https://github.com/jhu-lcsr/good_robot
"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer; and โGood Robot!โ: Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer
computer-vision deep-learning deep-q-network deep-reinforcement-learning grasping learning-from-demonstration manipulation multi-step-dqn multi-step-learning reinforcement-learning robotic-manipulation robotics simulation task-to-task-transfer vrep-simulator zero-shot-learning
Last synced: 03 Oct 2025
https://github.com/giulioz/laser-scanning
๐ท๐ฆ๐ญ A 3D Scanner using Laser Structured Light, written in Python using OpenCV and NumPy.
3d-reconstruction 3d-scanning computer-vision opencv structured-light
Last synced: 09 Mar 2026
https://github.com/young-geng/m3ae_public
Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
computer-vision flax jax natural-language-processing transformers
Last synced: 06 Apr 2025
https://github.com/ccextractor/rekognition
Free and Open Source alternative to Amazon's Rekognition service. CCExtractor Development | Poor Man's Rekognition
computer-vision deep-learning django django-rest-framework docker face-detection gsoc image-processing machine-learning machinelearning opencv python rest rest-api tensorflow tensorflow-serving video-processing
Last synced: 21 Aug 2025
https://github.com/alkasm/colorfilters
Image thresholding in multiple colorspaces.
computer-vision image-processing opencv
Last synced: 26 Oct 2025
https://github.com/amazon-science/exponential-moving-average-normalization
PyTorch implementation of EMAN for self-supervised and semi-supervised learning: https://arxiv.org/abs/2101.08482
computer-vision normalization self-supervised-learning semi-supervised-learning
Last synced: 03 May 2025
https://github.com/andri27-ts/1-year-machinelearning-journey
An advanced program in Machine Learning and Deep Learning
computer-vision deep-learning deep-reinforcement-learning machine-learning nlp reinforcement-learning
Last synced: 17 Mar 2026
https://github.com/georg-wolflein/chesscog
Determining chess game state from an image.
chess chess-recognition computer-vision deep-learning pytorch
Last synced: 16 Jan 2026
https://github.com/kyegomez/rt-x
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
artificial-intelligence attention-is-all-you-need attention-model computer-vision gpt4 gpt4all multimodal vision vision-transformer
Last synced: 06 May 2025
https://github.com/nvidia/nvimagecodec
A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
computer-vision cpp cuda dali data-processing deep-learning fast-data-pipeline gpu image-processing machine-learning nvidia python pytorch
Last synced: 12 Jan 2026
https://github.com/nikitadurasov/masksembles
Official repository for the paper "Masksembles for Uncertainty Estimation" (CVPR 2021).
computer-vision deep-learning out-of-distribution-detection paper tensorflow torch uncertainty-estimation uncertainty-neural-networks uncertainty-quantification
Last synced: 16 Jan 2026
https://github.com/tatsuyah/Lane-Lines-Detection-Python-OpenCV
Lane Lines Detection using Python and OpenCV for self-driving car
computer-vision self-driving-car
Last synced: 02 Apr 2025
https://github.com/qingyonghu/sqn
SQN in Tensorflow (ECCV'2022)
annotations computer-vision deep-learning point-cloud weakly-supervised-segmentation
Last synced: 12 Jul 2025
https://github.com/mdietrichstein/yolo-tiny-v1-mobile
Yolo for Android and iOS - Mobile Deep Learning Object Detection in Realtime written in Kotlin and Swift
android computer-vision deep-learning deep-neural-networks ios keras keras-tensorflow mobile tensorflow
Last synced: 10 Apr 2025
https://github.com/anupsarkar-dev/exovisix
Auto Attendance System Using Real Time Face Recognition With Various Computer Vision & Machine Learning Tools
bio-metric-attendance-system computer-vision eye-detection face-recognition gesture-detection haar-training javacv javafx mechine-learing motion-detection mysql-database ocr-recognition opencv
Last synced: 26 Oct 2025
https://github.com/etherealengine/digital-beings
A platform for letting researchers connect an intelligent AI directly to real time communication networks and 3D worlds. Your AI, Anywhere.
ai artificial-intelligence bot computer-vision cv digital-beings digital-humans machine-learning ml nlp telegram
Last synced: 11 Mar 2026
https://github.com/eliphatfs/calibur
Conversion between different conventions of camera matrices and transform matrices.
computer-graphics computer-vision developer-productivity lightweight
Last synced: 21 Aug 2025
https://github.com/microsoft/orbit-dataset
The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. The dataset is presented with a teachable object recognition benchmark task which aims to drive few-shot learning on challenging real-world data.
benchmark classification computer-vision dataset few-shot-learning machine-learning meta-learning microsoft object-recognition video
Last synced: 26 Oct 2025
https://github.com/plstcharles/litiv
C++ implementation pool for computer vision R&D projects.
algorithms computer-vision image-processing library litiv opencv segmentation
Last synced: 18 Mar 2025
https://github.com/yuanze-lin/revive
[NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
computer-vision deep-learning gpt-3 knowledge-based multimodal-deep-learning neurips2022 ok-vqa pytorch question-answering vision-and-languge vqa
Last synced: 09 Apr 2025
https://github.com/shi-labs/semi-supervised-transfer-learning
[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
computer-vision semi-supervised-learning transfer-learning
Last synced: 12 Sep 2025
https://github.com/bytedance/twist
Official codes: Self-Supervised Learning by Estimating Twin Class Distribution
computer-vision deep-learning pretraining research self-supervised-learning twist
Last synced: 13 Apr 2025
https://github.com/skylab-tech/ffhqr-dataset
FFHQR -- the first large-scale retouching dataset for computer vision research.
computer-vision dataset deep-learning high-resolution large-scale retouching
Last synced: 25 Jan 2026
https://github.com/SHI-Labs/Semi-Supervised-Transfer-Learning
[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
computer-vision semi-supervised-learning transfer-learning
Last synced: 08 May 2025
https://github.com/Z-Zheng/FreeNet
FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification (TGRS 2020) https://ieeexplore.ieee.org/document/9007624
computer-vision convolutional-neural-network deep-learning geospatial hyperspectral-image-classification remote-sensing
Last synced: 29 Mar 2025
https://github.com/sayakpaul/convnext-tf
Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.
computer-vision convolutional-neural-networks efficient-models imagenet-1k imagenet-21k keras tensorflow
Last synced: 16 Apr 2025
https://github.com/nalepae/bounding-box
Bounding Box is a library to plot pretty bounding boxes with a simple Python API.
bounding-boxes computer-vision python visualization
Last synced: 10 Apr 2025
https://github.com/fcakyon/balanced-loss
Easy to use class balanced cross entropy and focal loss implementation for Pytorch
balanced-loss binary-crossentropy class-balanced-loss computer-vision cross-entropy cvpr deep-learning focal-loss image-classification loss-functions machine-learning pip pypi python pytorch
Last synced: 06 Oct 2025
https://github.com/abhshkdz/neural-vqa-attention
:question: Attention-based Visual Question Answering in Torch
computer-vision deep-learning natural-language-processing torch
Last synced: 15 Apr 2025
https://github.com/imatge-upc/sentiment-2017-imavis
From Pixels to Sentiment: Fine-tuning CNNs for Visual Sentiment Prediction
affective-computing cnn computer-vision deep-learning fine-tuning-cnns sentiment sentiment-maps
Last synced: 11 May 2025
https://github.com/michaelzhang-ai/Speech2Video
Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"
3d accv aigc body computer-vision face gan generative-ai gesture hand pose rnn simulation speech speech2video-synthesis talking-head talking-heads video
Last synced: 10 May 2025
https://github.com/serengil/deepface-react-ui
Deep Face Recognition UI With ReactJS
computer-vision deep-learning deepface face-recognition facial-recognition machine-learning
Last synced: 04 Apr 2025
https://github.com/wl-zhao/diml
[ICCV 2021] Towards Interpretable Deep Metric Learning with Structural Matching
computer-vision deep-learning deep-metric-learning iccv2021 interpretable-deep-learning
Last synced: 15 May 2025
https://github.com/alan-turing-institute/scivision
scivision: a framework for scientific image analysis
computer-vision data-science hut23 hut23-1205 image-processing machine-learning scientific-research
Last synced: 16 May 2025
https://github.com/varunagrawal/bbox
Python library for 2D/3D bounding boxes
3d-bounding-boxes bounding-boxes computer-vision computer-vision-tools
Last synced: 24 Oct 2025
https://github.com/Evgeneus/Graph-Domain-Adaptaion
PyTorch code for the paper "Curriculum Graph Co-Teaching for Multi-target Domain Adaptation" (CVPR2021)
cdan computer-vision coteaching curriculum-learning domain-adaptation domainnet graph-convolutional-networks graph-domain-adaptation graph-neural-networks mtdap office pacs pytorch
Last synced: 08 May 2025
https://github.com/norlab-ulaval/perceptreev1
Implementation of Grondin et al. 2022 "Tree Detection and Diameter Estimation Based on Deep Learning". Also includes datasets and some of the pretrained models.
computer-vision datasets forestry
Last synced: 22 Jul 2025
https://github.com/karanchawla/monocular-visual-inertial-odometry
This contains the code(in development) for monocular visual odometry of a quadrotor. The visual data from the monocular camera is fused with onboard IMU to develop indoor control and navigation algorithms.
computer-vision fuse imu odometry opencv visual-odmetry
Last synced: 27 Jul 2025
https://github.com/jaisayush/fatigue-detection-system-based-on-behavioural-characteristics-of-driver
A computer vision system made with the help of opencv that can automatically detect driver drowsiness in a real-time video stream and then play an alarm if the driver appears to be drowsy.
anaconda-environment anaconda3 computer-vision cv cv2 drowsiness-detection drowsy-driver-warning-system imutils miniproject numpy opencv opencv-python opencv2 playsound python python3 scipy
Last synced: 12 Apr 2025
https://github.com/nearthlab/SiamMaskCpp
C++ Implementation of SiamMask
computer-vision deep-learning pytorch-cpp real-time video-object-segmentation visual-tracking
Last synced: 19 Mar 2025
https://github.com/juglab/FourierImageTransformer
Fourier Image Transformer (FIT) can solve relevant image analysis tasks in Fourier space.
computer-vision image-processing transformer
Last synced: 08 May 2025
https://github.com/adityap27/face-mask-detector
๐๐๐๐ฅ-๐๐ข๐ฆ๐ ๐ ๐๐๐ ๐ฆ๐๐ฌ๐ค ๐๐๐ญ๐๐๐ญ๐ข๐จ๐ง ๐ฎ๐ฌ๐ข๐ง๐ ๐๐๐๐ฉ๐ฅ๐๐๐ซ๐ง๐ข๐ง๐ ๐ฐ๐ข๐ญ๐ก ๐๐ฅ๐๐ซ๐ญ ๐ฌ๐ฒ๐ฌ๐ญ๐๐ฆ ๐ป๐
artificial-intelligence computer-vision deep-learning machine-learning neural-network yolo
Last synced: 21 Apr 2025
https://github.com/tuvovan/vision_transformer_keras
Keras Implementation of Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
computer-vision image-recognition keras tensorflow transformer vision
Last synced: 21 Jul 2025
https://github.com/lingdong-/visionosc
PoseOSC + FaceOSC + HandOSC + OcrOSC + CatOSC + DogOSC
computer-vision facial-landmarks-detection hand-tracking ocr osc pose-estimation vision-framework
Last synced: 29 Dec 2025
https://github.com/baidut/PaQ-2-PiQ
Source code for "From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality"
computer-vision image-processing image-quality image-quality-assessment picture-quality
Last synced: 02 Apr 2025
https://github.com/KempnerInstitute/overcomplete
๐ Overcomplete is a Vision-based SAE Toolbox
computer-vision deep-learning interpretability
Last synced: 27 Nov 2025
https://github.com/tensorush/awesome-graphics-programming
๐ ๐ง Collection of the most awesome learning resources on graphics programming in the form of videos, tutorials and books.
3d 3d-graphics augmented-reality awesome-list cg computer-graphics computer-vision game-development gamedev geometry-processing graphics graphics-programming learning opengl raytracing rendering shaders tutorials virtual-reality vulkan
Last synced: 08 Apr 2025
https://github.com/Placenote/PlacenoteSDK-iOS
PlacenoteSDK Sample app in native iOS using ARKit, written primarily in Swift
arkit augmented-reality computer-vision ios ios-swift scene-recognition scenekit slam
Last synced: 30 Jul 2025
https://github.com/opengvlab/piip
[NeurIPS 2024 Spotlight โญ๏ธ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)
computer-vision image-classification instance-segmentation multimodal-large-language-models object-detection semantic-segmentation vision-language-models vision-transformer
Last synced: 10 Oct 2025
https://github.com/eveningglow/multitask-CycleGAN
Pytorch implementation of multitask CycleGAN with auxiliary classification loss
computer-vision cyclegan deep-learning gan generative-adversarial-network image-translation pytorch
Last synced: 08 May 2025
https://github.com/aljosaosep/ciwt
This repository contains code for the tracking system as described in ''Combined Image- and World-Space Tracking in Traffic Scenes'', ICRA 2017.
3d-vision autonomous-driving autonomous-vehicles computer-vision multi-object-tracking research research-project stereo-vision tracking
Last synced: 18 Mar 2025
https://github.com/robertwgh/ezSIFT
ezSIFT: An easy-to-use standalone SIFT library written in C/C++
computer-vision cpp feature-detection feature-extraction feature-matching sift sift-algorithm sift-descriptors
Last synced: 08 May 2025
https://github.com/elijahcole/single-positive-multi-label
Multi-Label Learning from Single Positive Labels - CVPR 2021
computer-vision cvpr cvpr2021 deep-learning missing-labels multi-label-classification multilabel-classification
Last synced: 13 Apr 2025
https://github.com/rehanguha/brisque
Blind/Referenceless Image Spatial QUality Evaluator (BRISQUE)
brisque computer-vision cv image-processing python statistical-analysis
Last synced: 22 Feb 2026
https://github.com/pliang279/multiviz
[ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models
computer-vision machine-learning multimodal-learning natural-language-processing
Last synced: 12 Apr 2025
https://github.com/olonok69/llm_notebooks
Notebooks and Code about Generative Ai, LLMs, MLOPS, NLP , CV and Graph databases
azure-machine-learning-studio computer-vision databricks docker generative-ai langchain llamaindex llms machine-learning mlflow mlops neo4j nlp ocr prompt-engineering rag vertex-ai
Last synced: 06 Apr 2025
https://github.com/cansik/deep-vision-processing
Deep computer-vision algorithms for the Processing framework.
classification computer-vision cuda-support deep-neural-networks inference-engine machine-learning pose-estimation processing
Last synced: 15 Apr 2025
https://github.com/nobuotsukamoto/tensorrt-examples
TensorRT Examples (TensorRT, Jetson Nano, Python, C++)
computer-vision deep-learning jetson object-detection pose-estimation python segmentation super-resolution tensorrt
Last synced: 15 Apr 2025
https://github.com/yoonit-labs/nativescript-yoonit-camera
The most advanced and modern NativeScript Camera module for Android and iOS with a lot of awesome features
android camera camerax-api computer-vision hacktoberfest ios machine-learning mlkit nativescript vuejs-plugin
Last synced: 23 Oct 2025
https://github.com/tlatkowski/inpainting-gmcnn-keras
Keras implementation of "Image Inpainting via Generative Multi-column Convolutional Neural Networks" paper published at NIPS 2018
cnn computer-vision convolutional-neural-networks deep-learning deep-neural-networks gan generative-adversarial-network image-inpainting improved-wasserstein improved-wgan inpainting keras neurips-2018 nips-2018 nvidia places365 python3 tensorflow wasserstein-gan wgan-gp
Last synced: 14 Jun 2025
https://github.com/rssr25/computer-vision
Some computer vision projects written using openCV and python.
computer-vision opencv python python27
Last synced: 09 Apr 2025
https://github.com/lzhbrian/deepfashion2-kps-agg-finetune
1st place solution (Team StylingAI Inc. & PKU AIIC) for CVPR 2020 DeepFashion2 Clothes Landmark Detection Track. Aggregation and Finetuning for Clothes Landmark Detection
ai clothes computer-vision deepfashion2 fashion keypoints landmark
Last synced: 23 Jun 2025
https://github.com/r00tman/eventnerf
Neural Radiance Fields from a Single Colour Event Camera [CVPR 2023]
computer-vision cvpr2023 event-camera eventnerf mesh nerf neural-radiance-fields neural-rendering pytorch volume-rendering
Last synced: 13 Oct 2025