Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan
- Aliases: machine-vision, computervision
- Last updated: 2025-05-01 00:05:39 UTC
https://github.com/louisfb01/best_ai_paper_2020
A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code
2020 ai artificial-intelligence artificialintelligence computer-vision deep-learning deep-neural-networks deeplearning list machine-learning machinelearning paper paper-references papers sota sota-technique state-of-art state-of-art-general-segmentation state-of-the-art state-of-the-art-models
Last synced: 23 Mar 2025
https://github.com/louisfb01/Best_AI_paper_2020
A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code
2020 ai artificial-intelligence artificialintelligence computer-vision deep-learning deep-neural-networks deeplearning list machine-learning machinelearning paper paper-references papers sota sota-technique state-of-art state-of-art-general-segmentation state-of-the-art state-of-the-art-models
Last synced: 14 Mar 2025
https://github.com/autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
auto-labeling computer-vision deep-learning foundation-models grounding-dino image-annotation image-classification instance-segmentation labeling-tool machine-learning model-distillation multimodal object-detection pytorch segment-anything yolov5 yolov8
Last synced: 25 Apr 2025
https://github.com/andrewssobral/bgslibrary
A C++ background subtraction library with wrappers for Python, MATLAB, and Java, and a GUI on Qt
background-subtraction bgs computer-vision foreground-detection moving-object-detection opencv pybgs
Last synced: 09 Apr 2025
https://github.com/google-research-datasets/Objectron
Objectron is a dataset of short, object-centric video clips. The videos also contain AR session metadata, including camera poses, sparse point clouds, and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box that describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes.
3d 3d-reconstruction 3d-vision ai augmented-reality computer-vision dataset deep-learning machine-learning neural-network python pytorch tensorflow
Last synced: 20 Mar 2025
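The Objectron entry above says a 3D bounding box encodes position, orientation, and dimensions. As a rough illustration of how those three quantities determine a box, the sketch below computes the eight corners of a box in plain Python. The function name box_corners, the (width, height, depth) ordering, and the restriction to a single yaw rotation about the vertical axis are assumptions made for this example; they are not taken from the Objectron annotation format, which may use a full 3D rotation and a different layout.

```python
import math
from itertools import product


def box_corners(center, yaw, dims):
    """Return the 8 corners of a 3D box as (x, y, z) tuples.

    center: (x, y, z) position of the box centre.
    yaw:    rotation about the vertical (y) axis in radians
            (a simplifying assumption; real annotations may store a full rotation).
    dims:   (width, height, depth), the full extents along the box's local x, y, z axes.
    """
    cx, cy, cz = center
    w, h, d = dims
    cos_t, sin_t = math.cos(yaw), math.sin(yaw)

    corners = []
    for sx, sy, sz in product((-0.5, 0.5), repeat=3):
        # Corner expressed in the box's own (local) frame.
        lx, ly, lz = sx * w, sy * h, sz * d
        # Rotate about the y axis, then translate to the box centre.
        wx = cos_t * lx + sin_t * lz + cx
        wy = ly + cy
        wz = -sin_t * lx + cos_t * lz + cz
        corners.append((wx, wy, wz))
    return corners


if __name__ == "__main__":
    for corner in box_corners(center=(1.0, 2.0, 3.0), yaw=0.0, dims=(2.0, 4.0, 6.0)):
        print(corner)
```

With yaw set to 0 the box is axis-aligned, so the corners are simply the centre plus or minus half of each dimension. A non-zero yaw turns the footprint of the box in the x-z plane and leaves its height unchanged.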
https://github.com/google-research/uda
Unsupervised Data Augmentation (UDA)
computer-vision cv natural-language-processing nlp semi-supervised-learning tensorflow
Last synced: 07 Apr 2025
https://github.com/jsbroks/coco-annotator
✏️ Web-based image segmentation tool for object detection, localization, and keypoints
annotate-images coco coco-annotator coco-format computer-vision datasets deep-learning detection image-annotation image-labeling image-segmentation label machine-learning
Last synced: 13 Apr 2025
https://github.com/enpeizhao/CVprojects
Computer vision projects | Fun computer-vision AI projects (Python, C++, embedded systems)
computer-vision cpp cuda deep-learning embedded-systems machine-learning python tensorrt
Last synced: 20 Mar 2025
https://github.com/azavea/raster-vision
An open source library and framework for deep learning on satellite and aerial imagery.
classification computer-vision deep-learning geospatial machine-learning object-detection pytorch remote-sensing semantic-segmentation
Last synced: 23 Apr 2025
https://github.com/HypoX64/DeepMosaics
Automatically remove the mosaics in images and videos, or add mosaics to them.
computer-vision deep-learning mosaic pytorch video-inpainting
Last synced: 07 Apr 2025
https://github.com/emgucv/emgucv
Emgu CV is a cross-platform .NET wrapper for the OpenCV image processing library.
ai computer-vision dotnet emgu emgucv image-classification image-processing image-recognition machine-learning maui native-bindings nuget nuget-package nuget-packages opencv
Last synced: 10 Apr 2025
https://github.com/kingyiusuen/image-to-latex
Convert images of LaTeX math equations into LaTeX code.
computer-vision deep-learning encoder-decoder end-to-end-machine-learning latex pytorch streamlit-webapp transformer
Last synced: 08 Apr 2025
https://github.com/amusi/eccv2024-papers-with-code
A collection of ECCV 2024 papers and open-source projects. Everyone is welcome to submit issues sharing ECCV 2024 papers and open-source projects.
computer-vision deep-learning eccv eccv-2020 eccv2020 eccv2022 eccv2024 image-classification image-segmentation neural-network object-detection
Last synced: 26 Mar 2025
https://github.com/idealo/image-quality-assessment
Convolutional Neural Networks to predict the aesthetic and technical quality of images.
aws computer-vision convolutional-neural-networks deep-learning e-commerce idealo image-quality-assessment keras machine-learning mobilenet neural-network nima tensorflow
Last synced: 10 Dec 2024
https://github.com/ajbrock/Neural-Photo-Editor
A simple interface for editing natural photos with generative neural networks.
computer-vision convolutional-neural-networks deep-learning gans interfaces machine-learning
Last synced: 15 Mar 2025
https://github.com/ajbrock/neural-photo-editor
A simple interface for editing natural photos with generative neural networks.
computer-vision convolutional-neural-networks deep-learning gans interfaces machine-learning
Last synced: 08 Apr 2025
https://github.com/bgshih/crnn
Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.
computer-vision machine-learning ocr sequence-recognition torch7
Last synced: 08 Apr 2025
https://github.com/OpenStitching/stitching
A Python package for fast and robust Image Stitching
computer-vision image-stitching opencv-python panorama python
Last synced: 20 Mar 2025
https://github.com/strasdat/Sophus
C++ implementation of Lie Groups using Eigen.
2d 3d c-plus-plus computer-vision geometry graphics math python robotics
Last synced: 15 Mar 2025
https://github.com/strasdat/sophus
C++ implementation of Lie Groups using Eigen.
2d 3d c-plus-plus computer-vision geometry graphics math python robotics
Last synced: 09 Apr 2025
https://github.com/sharpai/deepcamera
Open-Source AI Camera. Empower any camera/CCTV with state-of-the-art AI, including facial recognition, person recognition (RE-ID), car detection, fall detection, and more
ai camera computer-vision deep-learning edge face-detection face-recognition home-assistant machine-learning nvidia-jetson-nano object-detection python raspberry-pi tensorflow tf-lite video-surveillance
Last synced: 13 Apr 2025
https://github.com/kha-white/manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
comics computer-vision deep-learning japanese manga ocr transformers
Last synced: 28 Apr 2025
https://github.com/pkhungurn/talking-head-anime-demo
Demo for the "Talking Head Anime from a Single Image."
ai anime computer-graphics computer-vision deep-learning machine-learning python pytorch vtuber
Last synced: 08 Apr 2025
https://github.com/SharpAI/DeepCamera
Open-Source AI Camera. Empower any camera/CCTV with state-of-the-art AI, including facial recognition, person recognition (RE-ID), car detection, fall detection, and more
ai camera computer-vision deep-learning edge face-detection face-recognition home-assistant machine-learning nvidia-jetson-nano object-detection python raspberry-pi tensorflow tf-lite video-surveillance
Last synced: 20 Mar 2025
https://github.com/mukosame/anime2sketch
A sketch extractor for anime/illustration.
anime comic computer-vision deep-learning gan gans generative-adversarial-network gradio image-generation manga pytorch sketch wacv
Last synced: 07 Apr 2025
https://github.com/gaparmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
computer-vision deep-learning generative-adversarial-network generative-art stable-diffusion
Last synced: 12 Apr 2025
https://github.com/lucasb-eyer/pydensecrf
Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs with gaussian edge potentials.
computer-vision crf cython eigen machine-learning pairwise-potentials unary-potentials
Last synced: 11 Apr 2025
https://github.com/Mukosame/Anime2Sketch
A sketch extractor for anime/illustration.
anime comic computer-vision deep-learning gan gans generative-adversarial-network gradio image-generation manga pytorch sketch wacv
Last synced: 15 Nov 2024
https://github.com/universaldatatool/universal-data-tool
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
annotate-images annotation-tool classification computer-vision csv dataset deep-learning desktop entity-recognition hacktoberfest image-annotation image-labeling-tool image-segmentation labeling labeling-tool machine-learning named-entity-recognition semantic-segmentation text-annotation text-labeling
Last synced: 07 Apr 2025
https://github.com/UniversalDataTool/universal-data-tool
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
annotate-images annotation-tool classification computer-vision csv dataset deep-learning desktop entity-recognition hacktoberfest image-annotation image-labeling-tool image-segmentation labeling labeling-tool machine-learning named-entity-recognition semantic-segmentation text-annotation text-labeling
Last synced: 14 Mar 2025
https://github.com/mrpt/mrpt
⚡ The Mobile Robot Programming Toolkit (MRPT)
autonomous-driving c-plus-plus computer-vision maps mobile-robotics mobile-robots mrpt particle-filter robot-framework robot-motion-estimate robot-programming robotics slam
Last synced: 10 Apr 2025
https://github.com/MRPT/mrpt
⚡ The Mobile Robot Programming Toolkit (MRPT)
autonomous-driving c-plus-plus computer-vision maps mobile-robotics mobile-robots mrpt particle-filter robot-framework robot-motion-estimate robot-programming robotics slam
Last synced: 13 Mar 2025
https://github.com/patrikhuber/eos
A lightweight 3D Morphable Face Model library in modern C++
3d-face 3d-face-reconstruction 3dmm c-plus-plus computer-vision cpp14 cpp17 cross-platform face-models image-processing machine-learning modern-cpp python
Last synced: 27 Apr 2025
https://github.com/scannet/scannet
3d-reconstruction computer-graphics computer-vision deep-learning rgbd
Last synced: 14 Apr 2025
https://github.com/adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
computer-vision customization diffusion-models few-shot fine-tuning pytorch text-to-image-generation
Last synced: 07 Apr 2025
https://github.com/ScanNet/ScanNet
3d-reconstruction computer-graphics computer-vision deep-learning rgbd
Last synced: 18 Mar 2025
https://github.com/unrealcv/unrealcv
UnrealCV: Connecting Computer Vision to Unreal Engine
computer-vision embodied-ai machine-learning simulation synthetic-data ue4 virtual-worlds
Last synced: 11 Apr 2025
https://github.com/jyjblrd/low-cost-mocap
Low cost motion capture system for room scale tracking
autonomous-robots bundle-adjustment computer-vision epipolar-geometry esp32 motion-capture motion-tracking quadcopter
Last synced: 14 Apr 2025
https://github.com/jyjblrd/Low-Cost-Mocap
Low cost motion capture system for room scale tracking
autonomous-robots bundle-adjustment computer-vision epipolar-geometry esp32 motion-capture motion-tracking quadcopter
Last synced: 02 Apr 2025
https://github.com/MIT-SPARK/Kimera
Index repo for Kimera code
3d-reconstruction computer-vision robotics semantics slam visual-inertial-odometry
Last synced: 01 Apr 2025
https://github.com/cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
chatbot clip computer-vision dino instruction-tuning large-language-models llms mllm multimodal-large-language-models representation-learning
Last synced: 11 Apr 2025
https://github.com/pierluigiferrari/ssd_keras
A Keras port of Single Shot MultiBox Detector
computer-vision deep-learning fcn fully-convolutional-networks keras keras-models object-detection single-shot-multibox-detector ssd ssd-model
Last synced: 07 Apr 2025
https://github.com/mit-spark/kimera
Index repo for Kimera code
3d-reconstruction computer-vision robotics semantics slam visual-inertial-odometry
Last synced: 23 Mar 2025
https://github.com/facebookresearch/theseus
A library for differentiable nonlinear optimization
bilevel-optimization computer-vision deep-learning differentiable-optimization embodied-ai gauss-newton implicit-differentiation levenberg-marquardt nonlinear-least-squares pytorch robotics
Last synced: 28 Apr 2025
https://github.com/astorfi/lip-reading-deeplearning
🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
3d-convolutional-network computer-vision deep-learning speech-recognition tensorflow
Last synced: 08 Apr 2025
https://github.com/haitongli/knowledge-distillation-pytorch
A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility
cifar10 computer-vision dark-knowledge deep-neural-networks knowledge-distillation model-compression pytorch
Last synced: 07 Apr 2025
https://github.com/apple/ml-cvnets
CVNets: A library for training computer vision networks
ade20k classification computer-vision deep-learning detection imagenet machine-learning mscoco pascal-voc pytorch segmentation
Last synced: 07 Apr 2025
https://github.com/junshutang/make-it-3d
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf
Last synced: 07 Apr 2025
https://github.com/hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
computer-vision deep-learning eccv-2022 eccv2022 pytorch segmentation video-object-segmentation video-segmentation
Last synced: 18 Apr 2025
https://github.com/hkchengrex/xmem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
computer-vision deep-learning eccv-2022 eccv2022 pytorch segmentation video-object-segmentation video-segmentation
Last synced: 11 Apr 2025
https://github.com/gauravbh1010tt/deeplearn
Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow, and scikit-learn.
audio-processing computer-vision deep-learning nlp
Last synced: 08 Apr 2025
https://github.com/GauravBh1010tt/DeepLearn
Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow, and scikit-learn.
audio-processing computer-vision deep-learning nlp
Last synced: 14 Mar 2025
https://github.com/probcomp/gen.jl
A general-purpose probabilistic programming system with programmable inference
bayesian computer-vision deep-learning differentiable-programming gen julia-language machine-learning probabilistic-programming robotics
Last synced: 10 Apr 2025
https://github.com/probcomp/Gen.jl
A general-purpose probabilistic programming system with programmable inference
bayesian computer-vision deep-learning differentiable-programming gen julia-language machine-learning probabilistic-programming robotics
Last synced: 15 Mar 2025
https://probcomp.github.io/Gen
A general-purpose probabilistic programming system with programmable inference
bayesian computer-vision deep-learning differentiable-programming gen julia-language machine-learning probabilistic-programming robotics
Last synced: 26 Nov 2024
https://github.com/alibaba/EasyCV
An all-in-one toolkit for computer vision
classification computer-vision object-detection pytorch self-supervised-learning transformers vision-transformer
Last synced: 14 Mar 2025
https://github.com/alibaba/easycv
An all-in-one toolkit for computer vision
classification computer-vision object-detection pytorch self-supervised-learning transformers vision-transformer
Last synced: 09 Apr 2025
https://github.com/rmurai0610/mast3r-slam
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
computer-vision cvpr2025 robotics slam
Last synced: 14 Apr 2025
https://github.com/junshutang/Make-It-3D
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf
Last synced: 14 Nov 2024
https://github.com/symisc/sod
An Embedded Computer Vision & Machine Learning Library (CPU Optimized & IoT Capable)
c computer-vision convolutional-neural-networks cpu deep-learning detection embedded face-detection facial-landmarks image-analysis image-processing image-recognition iot iot-device library machine-learning-algorithms object-detection real-time vision-framework webassembly
Last synced: 08 Apr 2025
https://github.com/wkentaro/pytorch-fcn
PyTorch Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)
computer-vision convolutional-networks deep-learning fcn fcn8s pytorch semantic-segmentation
Last synced: 17 Jan 2025
https://github.com/longcw/faster_rcnn_pytorch
Faster RCNN with PyTorch
computer-vision detection faster-rcnn pytorch
Last synced: 08 Apr 2025
https://github.com/paddlepaddle/research
Novel deep learning research work with PaddlePaddle
computer-vision data-mining deep-learning knowledge-graph nlp spatial-temporal
Last synced: 11 Apr 2025
https://github.com/laboshinl/loam_velodyne
Laser Odometry and Mapping (Loam) is a realtime method for state estimation and mapping using a 3D lidar.
3d computer-vision lidar loam loam-velodyne mapping pcl pointcloud ros slam velodyne
Last synced: 15 Apr 2025
https://github.com/spla-tam/splatam
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)
computer-vision cvpr2024 gaussian-splatting robotics slam
Last synced: 07 Apr 2025
https://github.com/xinshuoweng/ab3dmot
(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
2d-mot-evaluation 3d-mot 3d-multi 3d-multi-object-tracking 3d-tracking computer-vision evaluation evaluation-metrics kitti kitti-3d machine-learning multi-object-tracking real-time robotics tracking
Last synced: 07 Apr 2025
https://github.com/dog-qiuqiu/mobilenet-yolo
MobileNetV2-YoloV3-Nano: 0.5 BFLOPs, 3 MB, HUAWEI P40: 6 ms/img; YoloFace-500k: 0.1 BFLOPs, 420 KB 🔥🔥🔥
cnn computer-vision cv darknet deep-learning face-detection landmark landmark-detection mnn mnn-framework mobilenet-yolo mobilenetv2 ncnn ncnn-model object-detection yolo yolov3
Last synced: 08 Apr 2025
https://github.com/dog-qiuqiu/MobileNet-Yolo
MobileNetV2-YoloV3-Nano: 0.5 BFLOPs, 3 MB, HUAWEI P40: 6 ms/img; YoloFace-500k: 0.1 BFLOPs, 420 KB 🔥🔥🔥
cnn computer-vision cv darknet deep-learning face-detection landmark landmark-detection mnn mnn-framework mobilenet-yolo mobilenetv2 ncnn ncnn-model object-detection yolo yolov3
Last synced: 15 Mar 2025
https://github.com/xinshuoweng/AB3DMOT
(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
2d-mot-evaluation 3d-mot 3d-multi 3d-multi-object-tracking 3d-tracking computer-vision evaluation evaluation-metrics kitti kitti-3d machine-learning multi-object-tracking real-time robotics tracking
Last synced: 20 Mar 2025
https://github.com/Future-Scholars/paperlib
An open-source academic paper management tool.
academic academic-paper computer-science computer-vision management math neural-language-processing paper-management references science
Last synced: 14 Apr 2025
https://github.com/PaddlePaddle/Research
Novel deep learning research work with PaddlePaddle
computer-vision data-mining deep-learning knowledge-graph nlp spatial-temporal
Last synced: 30 Mar 2025
https://github.com/aiff22/dped
Software and pre-trained models for automatic photo quality enhancement using Deep Convolutional Networks
computer-vision convolutional-neural-networks deep-learning dped gan generative-adversarial-networks image-enhancement image-processing
Last synced: 08 Apr 2025
https://github.com/aiff22/DPED
Software and pre-trained models for automatic photo quality enhancement using Deep Convolutional Networks
computer-vision convolutional-neural-networks deep-learning dped gan generative-adversarial-networks image-enhancement image-processing
Last synced: 07 Apr 2025
https://github.com/nvlabs/pwc-net
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume, CVPR 2018 (Oral)
caffe computer-vision cvpr2018 deeplearning optical-flow pwc-net pytorch
Last synced: 08 Apr 2025
https://github.com/adobe/antialiased-cnns
pip install antialiased-cnns to improve stability and accuracy
antialiasing artificial-intelligence cnns computer-vision convolutional-neural-networks icml icml-2019 shift-equivariant shift-invariant
Last synced: 12 Apr 2025
https://github.com/victordibia/handtracking
Building a real-time hand detector using neural networks (SSD) on TensorFlow
computer-vision detector hand-detection hand-detector neural-network ssd tensorflow
Last synced: 08 Apr 2025
https://github.com/opendatacam/opendatacam
An open source tool to quantify the world
camera computer-vision darknet dataviz iot jetson jetson-nano jetson-tx2 jetson-xavier smart-city yolo
Last synced: 13 Apr 2025
https://github.com/yuliangxiu/icon
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
3d-reconstruction animation avatar-generator cloth-simulation computer-graphics computer-vision human-pose-estimation implicit-functions mesh-deformation metaverse normal-maps pifu pifuhd pose-estimation pytorch smpl smpl-body smpl-model smplx virtual-humans
Last synced: 08 Apr 2025
https://github.com/muskie82/monogs
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
computer-vision cvpr2024 gaussian-splatting robotics slam
Last synced: 07 Apr 2025
https://github.com/tannerhelland/photodemon
A free portable photo editor focused on pro-grade features, high performance, and maximum usability.
computer-vision image-editor image-filters image-processing paint photo-editor vb6 win32
Last synced: 10 Apr 2025
https://github.com/roboflow/inference
Turn any computer or edge device into a command center for your computer vision projects.
agents classification computer-vision deployment docker inference inference-api inference-server instance-segmentation jetson machine-learning object-detection onnx python tensorrt vit yolo11 yolov12 yolov5 yolov8
Last synced: 23 Apr 2025
https://github.com/YuliangXiu/ICON
[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals
3d-reconstruction animation avatar-generator cloth-simulation computer-graphics computer-vision human-pose-estimation implicit-functions mesh-deformation metaverse normal-maps pifu pifuhd pose-estimation pytorch smpl smpl-body smpl-model smplx virtual-humans
Last synced: 20 Nov 2024
https://github.com/kakaobrain/fast-autoaugment
Official Implementation of 'Fast AutoAugment' in PyTorch.
augmentation automated-machine-learning automl cnn computer-vision convolutional-neural-networks deep-learning distributed image-classification pytorch
Last synced: 08 Apr 2025
https://github.com/yassouali/pytorch-segmentation
🎨 Semantic segmentation models, datasets and losses implemented in PyTorch.
computer-vision deep-learning semantic-segmentation
Last synced: 07 Apr 2025
https://github.com/aws-samples/aws-machine-learning-university-accelerated-cv
Machine Learning University: Accelerated Computer Vision Class
computer-vision deep-learning gluon gluoncv machine-learning mxnet python
Last synced: 11 Apr 2025
https://github.com/anything-of-anything/anything-3d
Segment-Anything + 3D. Let's lift anything to 3D.
3d computer-vision reconstruction segment segment-anything
Last synced: 08 Apr 2025
https://github.com/Anything-of-anything/Anything-3D
Segment-Anything + 3D. Let's lift anything to 3D.
3d computer-vision reconstruction segment segment-anything
Last synced: 20 Mar 2025
https://github.com/invictus717/metatransformer
Meta-Transformer for Unified Multimodal Learning
artificial-intelligence computer-vision foundationmodel machine-learning multimedia multimodal transformers
Last synced: 08 Apr 2025
https://github.com/rmurai0610/MASt3R-SLAM
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
computer-vision cvpr2025 robotics slam
Last synced: 26 Mar 2025
https://github.com/GaParmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
computer-vision deep-learning generative-adversarial-network generative-art stable-diffusion
Last synced: 28 Mar 2025
https://github.com/artivis/manif
A small C++11 header-only library for Lie theory.
2d 3d c-plus-plus computer-vision cpp11 geometry header-only lie-groups lie-theory python3 robotics slam state-estimation
Last synced: 11 Apr 2025
https://github.com/chandrikadeb7/face-mask-detection
Face mask detection system based on computer vision and deep learning using OpenCV and TensorFlow/Keras
bing-search caffe computer-vision covid-19 deep-learning face-mask face-mask-detection facemask-detection facemaskdetect hacktoberfest keras keras-tensorflow machine-learning mask-detection mask-detection-system mobilenetv2 python python-3 python3 ssd-mobilenet
Last synced: 14 Apr 2025
https://github.com/lufficc/ssd
High quality, fast, modular reference implementation of SSD in PyTorch
computer-vision deep-learning object-detection pytorch ssd
Last synced: 14 Apr 2025
https://github.com/snavely/bundler_sfm
Bundler Structure from Motion Toolkit
3d-reconstruction 3d-vision bundle-adjustment computer-vision computer-vision-tools structure-from-motion
Last synced: 08 Apr 2025
https://github.com/lufficc/SSD
High quality, fast, modular reference implementation of SSD in PyTorch
computer-vision deep-learning object-detection pytorch ssd
Last synced: 20 Mar 2025
https://github.com/MaaXYZ/MaaFramework
An automation black-box testing framework based on image recognition
black-box-testing computer-vision
Last synced: 18 Feb 2025
https://github.com/whitphx/streamlit-webrtc
Real-time video and audio processing on Streamlit
computer-vision hacktoberfest pypi python streamlit streamlit-webrtc video-processing video-streaming webrtc
Last synced: 28 Apr 2025
https://github.com/future-scholars/paperlib
An open-source academic paper management tool.
academic academic-paper computer-science computer-vision management math neural-language-processing paper-management references science
Last synced: 12 Apr 2025
https://github.com/lucidrains/lambda-networks
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
artificial-intelligence attention attention-mechanism computer-vision deep-learning
Last synced: 14 Apr 2025
https://github.com/AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
artificial-intelligence computer-vision document document-analysis document-intelligence document-recognition document-understanding documentai end-to-end-ocr multimodal multimodal-deep-learning ocr scene-text-detection scene-text-detection-recognition scene-text-recognition text-detection text-recognition vision-language vision-language-model vision-language-transformer
Last synced: 10 Apr 2025