Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2026-06-29 00:06:16 UTC
- JSON Representation
https://github.com/chandrikadeb7/face-mask-detection
Face Mask Detection system based on computer vision and deep learning using OpenCV and Tensorflow/Keras
bing-search caffe computer-vision covid-19 deep-learning face-mask face-mask-detection facemask-detection facemaskdetect hacktoberfest keras keras-tensorflow machine-learning mask-detection mask-detection-system mobilenetv2 python python-3 python3 ssd-mobilenet
Last synced: 09 Jan 2026
https://github.com/Kobaayyy/Awesome-CVPR2026-CVPR2025-CVPR2024-CVPR2021-CVPR2020-Low-Level-Vision
A Collection of Papers and Codes for CVPR2026/CVPR2025/CVPR2024/CVPR2021/CVPR2020 Low Level Vision
c-v-p-r computer-vision cvpr2020 cvpr2021 cvpr2024 cvpr2025 cvpr2026 frame-interpolation image-compression image-deblurring image-dehazing image-demoireing image-denoising image-deraining image-enhancement image-processing image-quality-assessment image-restoration low-level-vision super-resolution
Last synced: 28 Feb 2026
https://github.com/snavely/bundler_sfm
Bundler Structure from Motion Toolkit
3d-reconstruction 3d-vision bundle-adjustment computer-vision computer-vision-tools structure-from-motion
Last synced: 15 May 2025
https://github.com/lufficc/ssd
High quality, fast, modular reference implementation of SSD in PyTorch
computer-vision deep-learning object-detection pytorch ssd
Last synced: 15 May 2025
https://github.com/lufficc/SSD
High quality, fast, modular reference implementation of SSD in PyTorch
computer-vision deep-learning object-detection pytorch ssd
Last synced: 20 Mar 2025
https://github.com/MaaXYZ/MaaFramework
基于图像识别的自动化黑盒测试框架 | An automation black-box testing framework based on image recognition
black-box-testing computer-vision
Last synced: 08 Nov 2025
https://github.com/yatenglg/isat_with_segment_anything
Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具
annotation-tool computer-vision labeling labeling-tool sam sam2 segment-anything segment-anything-2 video-segmentation
Last synced: 24 Dec 2025
https://github.com/lucidrains/lambda-networks
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
artificial-intelligence attention attention-mechanism computer-vision deep-learning
Last synced: 15 May 2025
https://github.com/alibabaresearch/advancedliteratemachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
artificial-intelligence computer-vision document document-analysis document-intelligence document-recognition document-understanding documentai end-to-end-ocr multimodal multimodal-deep-learning ocr scene-text-detection scene-text-detection-recognition scene-text-recognition text-detection text-recognition vision-language vision-language-model vision-language-transformer
Last synced: 28 Mar 2025
https://github.com/AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
artificial-intelligence computer-vision document document-analysis document-intelligence document-recognition document-understanding documentai end-to-end-ocr multimodal multimodal-deep-learning ocr scene-text-detection scene-text-detection-recognition scene-text-recognition text-detection text-recognition vision-language vision-language-model vision-language-transformer
Last synced: 10 Apr 2025
https://github.com/symforce-org/symforce
Fast symbolic computation, code generation, and nonlinear optimization for robotics
autonomous-vehicles code-generation computer-vision cpp motion-planning optimization python robotics slam structure-from-motion symbolic-computation
Last synced: 06 Sep 2025
https://github.com/spla-tam/SplaTAM
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)
computer-vision cvpr2024 gaussian-splatting robotics slam
Last synced: 20 Mar 2025
https://github.com/invictus717/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
artificial-intelligence computer-vision foundationmodel machine-learning multimedia multimodal transformers
Last synced: 10 Apr 2025
https://github.com/QIN2DIM/hcaptcha-challenger
🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
clip computer-vision hcaptcha hcaptcha-solver image-segmentation multi-modal multi-modal-learning object-detection onnx onnx-models onnxruntime opencv-python playwright solver yolo yolov5 zero-shot-classification
Last synced: 28 Mar 2025
https://github.com/robertknight/ocrs
Rust library and CLI tool for OCR (extracting text from images)
computer-vision machine-learning ocr
Last synced: 13 May 2025
https://github.com/kritiksoman/gimp-ml
AI for GNU Image Manipulation Program
coloring-image computer-vision computervision deblurring deep-learning dehazing enlightenment gimp gimp-plugin gimp-plugins image-manipulation linux machine-learning machine-learning-algorithms macos photo-editing python3 ubuntu
Last synced: 15 May 2025
https://github.com/pixeltable/pixeltable
Pixeltable — Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
ai artificial-intelligence chatbot computer-vision data-science database feature-engineering feature-store genai llm machine-learning ml mlops multimodal vector-database
Last synced: 01 May 2026
https://github.com/mathworks/matlab-simulink-challenge-project-hub
This MATLAB and Simulink Challenge Project Hub contains a list of research and design project ideas. These projects will help you gain practical experience and insight into technology trends and industry directions.
ai autonomous capstone capstone-project computer-vision deep-learning drones energy final-project final-year-project master-thesis matlab project-ideas robotics senior-design senior-project simulink student-project students thesis
Last synced: 11 Apr 2025
https://github.com/google-deepmind/tapnet
Tracking Any Point (TAP)
benchmark computer-vision deep-learning point-tracking robotics
Last synced: 14 May 2025
https://github.com/chainer/chainercv
ChainerCV: a Library for Deep Learning in Computer Vision
chainer chainercv computer-vision cupy deep-learning neural-network python
Last synced: 28 Sep 2025
https://github.com/kritiksoman/GIMP-ML
AI for GNU Image Manipulation Program
coloring-image computer-vision computervision deblurring deep-learning dehazing enlightenment gimp gimp-plugin gimp-plugins image-manipulation linux machine-learning machine-learning-algorithms macos photo-editing python3 ubuntu
Last synced: 05 Apr 2025
https://github.com/hkchengrex/mmaudio
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
audio audio-synthesis computer-vision deep-learning text-to-audio video-to-audio
Last synced: 08 May 2025
https://github.com/microsoft/semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
audio-classification classification computer-vision deep-learning low-resource machine-learning natural-language-processing pytorch semi-supervised-learning semisupervised-learning transformer
Last synced: 14 May 2025
https://github.com/toandaominh1997/efficientdet.pytorch
Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch
coco computer-vision demo detection efficientdet-d0 efficientnet focalloss multibox nms object-detection pascal-voc pytorch
Last synced: 14 Aug 2025
https://github.com/toandaominh1997/EfficientDet.Pytorch
Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch
coco computer-vision demo detection efficientdet-d0 efficientnet focalloss multibox nms object-detection pascal-voc pytorch
Last synced: 19 Jul 2025
https://github.com/una-dinosauria/3d-pose-baseline
A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.
3d-vision baseline computer-vision iccv-17 iccv-2017 tensorflow
Last synced: 16 May 2025
https://github.com/abreheret/pixelannotationtool
Annotate quickly images.
annotation annotation-tool computer-vision deep-learning ground-truth image-labeling image-seg-tool labeling-tool opencv pixel pixel-wise qt5 tool
Last synced: 15 May 2025
https://github.com/microsoft/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
audio-classification classification computer-vision deep-learning low-resource machine-learning natural-language-processing pytorch semi-supervised-learning semisupervised-learning transformer
Last synced: 26 Mar 2025
https://github.com/minivision-ai/silent-face-anti-spoofing
静默活体检测(Silent-Face-Anti-Spoofing)
android-app computer-vision deep-learning face-anti-spoofing sdk
Last synced: 08 Apr 2025
https://github.com/rese1f/stablevideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
aigc computer-vision controlnet diffusion-model video-editing
Last synced: 16 May 2025
https://github.com/abreheret/PixelAnnotationTool
Annotate quickly images.
annotation annotation-tool computer-vision deep-learning ground-truth image-labeling image-seg-tool labeling-tool opencv pixel pixel-wise qt5 tool
Last synced: 05 Apr 2025
https://github.com/RQLuo/MixTeX-Latex-OCR
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
computer-vision deep-learning latex machine-learning ocr onnx python
Last synced: 30 Aug 2025
https://github.com/CSAILVision/LabelMeAnnotationTool
Source code for the LabelMe annotation tool.
Last synced: 27 Apr 2025
https://github.com/csailvision/labelmeannotationtool
Source code for the LabelMe annotation tool.
Last synced: 15 May 2025
https://github.com/ai-forever/ghost
A new one shot face swap approach for image and video domains
computer-vision deep-face-swap deep-learning deepfake face-swap faceswap ghost ghost-faceswap ghost-swap pytorch
Last synced: 14 May 2025
https://github.com/torchgan/torchgan
Research Framework for easy and efficient training of GANs based on Pytorch
computer-vision deep-learning gans generative-adversarial-networks generative-model machine-learning neural-networks python python3 pytorch
Last synced: 15 May 2025
https://github.com/fendouai/opencvtutorials
OpenCV-Python4.1 中文文档
computer-vision deep-learning machine-learning object-detection opencv opencv-python python
Last synced: 07 Oct 2025
https://github.com/BachiLi/redner
Differentiable rendering without approximation.
computer-graphics computer-vision differentiable-rendering monte-carlo-ray-tracing pytorch rendering tensorflow
Last synced: 26 Mar 2025
https://github.com/qingyonghu/randla-net
🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)
3d-vision computer-vision s3dis semantic-segmentation semantic3d semantickitti
Last synced: 16 May 2025
https://github.com/tannerhelland/PhotoDemon
A free portable photo editor focused on pro-grade features, high performance, and maximum usability.
computer-vision image-editor image-filters image-processing paint photo-editor vb6 win32
Last synced: 03 Apr 2025
https://github.com/skalskip/ilearndeeplearning.py
This repository contains small projects related to Neural Networks and Deep Learning in general. Subjects are closely linekd with articles I publish on Medium. I encourage you both to read as well as to check how the code works in the action.
computer-vision deep-learning deep-learning-tutorial neural-network numpy visualizations
Last synced: 08 Apr 2025
https://github.com/SkalskiP/ILearnDeepLearning.py
This repository contains small projects related to Neural Networks and Deep Learning in general. Subjects are closely linekd with articles I publish on Medium. I encourage you both to read as well as to check how the code works in the action.
computer-vision deep-learning deep-learning-tutorial neural-network numpy visualizations
Last synced: 01 May 2025
https://github.com/QingyongHu/RandLA-Net
🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)
3d-vision computer-vision s3dis semantic-segmentation semantic3d semantickitti
Last synced: 20 Mar 2025
https://github.com/allenai/ai2thor
An open-source platform for Visual AI.
artificial-intelligence computer-vision interaction physics-engine visual-ai
Last synced: 13 May 2025
https://github.com/nianticlabs/simplerecon
[ECCV 2022] SimpleRecon: 3D Reconstruction Without 3D Convolutions
computer-vision cost-volume depth depth-estimation eccv2022 multi-view-stereo mvs pytorch scannet visualization
Last synced: 16 May 2025
https://github.com/OPHoperHPO/image-background-remove-tool
✂️ Automated high-quality background removal framework for an image using neural networks. ✂️
background-removal background-removal-js basnet computer-vision deeplabv3 docker fastapi machine-learning matting python pytorch remove-background salient-object-detection segmentation torch transfer-learning trimap u2net
Last synced: 14 Mar 2025
https://github.com/ophoperhpo/image-background-remove-tool
✂️ Automated high-quality background removal framework for an image using neural networks. ✂️
background-removal background-removal-js basnet computer-vision deeplabv3 docker fastapi machine-learning matting python pytorch remove-background salient-object-detection segmentation torch transfer-learning trimap u2net
Last synced: 02 Oct 2025
https://github.com/torchssl/torchssl
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
codebase computer-vision deep-learning machine-learning pytorch self-supervised-learning semi-supervised-learning toolkit
Last synced: 05 Oct 2025
https://github.com/dwctod/cvpr2024-papers-with-code-demo
收集 CVPR 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations from everyone!
computer-vision cvpr cvpr2021 cvpr2022 cvpr2023 cvpr2024 llm multimodal-deep-learning object-detection segment-anything segmentation
Last synced: 26 Jan 2026
https://github.com/DWCTOD/CVPR2024-Papers-with-Code-Demo
收集 CVPR 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations from everyone!
computer-vision cvpr cvpr2021 cvpr2022 cvpr2023 cvpr2024 llm multimodal-deep-learning object-detection segment-anything segmentation
Last synced: 29 Mar 2025
https://github.com/cdpierse/transformers-interpret
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.
captum computer-vision deep-learning explainable-ai interpretability machine-learning model-explainability natural-language-processing neural-network nlp transformers transformers-model
Last synced: 15 May 2025
https://github.com/dirtyharrylyl/transformer-in-vision
Recent Transformer-based CV and related works.
computer-vision deep-learning multi-modal paper self-attention transformer vision-transformers visual-language
Last synced: 28 Jan 2026
https://github.com/ahmetozlu/tensorflow_object_counting_api
🚀 The TensorFlow Object Counting API is an open source framework built on top of TensorFlow and Keras that makes it easy to develop object counting systems!
computer-vision data-science deep-learning deep-neural-networks image-processing machine-learning object-counting object-counting-api object-detection object-detection-api object-detection-label object-detection-pipelines opencv pedestrian-counting shelf-management shelf-navigation tensorflow tensorflow-api tensorflow-object-detection-api vehicle-counting
Last synced: 15 May 2025
https://github.com/muskie82/MonoGS
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
computer-vision cvpr2024 gaussian-splatting robotics slam
Last synced: 20 Mar 2025
https://github.com/DirtyHarryLYL/Transformer-in-Vision
Recent Transformer-based CV and related works.
computer-vision deep-learning multi-modal paper self-attention transformer vision-transformers visual-language
Last synced: 20 Mar 2025
https://github.com/om-ai-lab/omdet
Real-time and accurate open-vocabulary end-to-end object detection
coco computer-vision lvis object-detection open-vocabulary real-time vision-and-language zero-shot zero-shot-object-detection
Last synced: 13 Apr 2025
https://github.com/kaiyangzhou/dassl.pytorch
A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.
artificial-intelligence benchmark-datasets computer-vision deep-learning deep-neural-networks domain-adaptation domain-generalization machine-learning pytorch semi-supervised-learning
Last synced: 16 May 2025
https://github.com/TorchSSL/TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
codebase computer-vision deep-learning machine-learning pytorch self-supervised-learning semi-supervised-learning toolkit
Last synced: 01 May 2025
https://github.com/digantamisra98/mish
Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]
activation-functions bmvc bmvc20 computer-vision deep-learning image-classification mathematics neural-networks object-detection
Last synced: 17 Oct 2025
https://github.com/piddnad/DDColor
[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
computer-vision image-colorization pytorch
Last synced: 24 Aug 2025
https://github.com/aitorzip/pytorch-cyclegan
A clean and readable Pytorch implementation of CycleGAN
artificial-intelligence computer-graphics computer-vision cyclegan deep-learning generative-adversarial-network image-generation image-processing pytorch
Last synced: 16 May 2025
https://github.com/piddnad/ddcolor
[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
computer-vision image-colorization pytorch
Last synced: 14 May 2025
https://github.com/mathworks/MATLAB-Simulink-Challenge-Project-Hub
This MATLAB and Simulink Challenge Project Hub contains a list of research and design project ideas. These projects will help you gain practical experience and insight into technology trends and industry directions.
ai autonomous capstone capstone-project computer-vision deep-learning drones energy final-project final-year-project master-thesis matlab project-ideas robotics senior-design senior-project simulink student-project students thesis
Last synced: 09 May 2025
https://github.com/digantamisra98/Mish
Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]
activation-functions bmvc bmvc20 computer-vision deep-learning image-classification mathematics neural-networks object-detection
Last synced: 20 Mar 2025
https://github.com/swz30/mprnet
[CVPR 2021] Multi-Stage Progressive Image Restoration. SOTA results for Image deblurring, deraining, and denoising.
computer-vision cvpr-2021 cvpr2021 cvpr21 image-deblurring image-denoising image-deraining image-restoration low-level-vision multistage-network progressive-restoration pytorch
Last synced: 16 May 2025
https://github.com/aitorzip/PyTorch-CycleGAN
A clean and readable Pytorch implementation of CycleGAN
artificial-intelligence computer-graphics computer-vision cyclegan deep-learning generative-adversarial-network image-generation image-processing pytorch
Last synced: 07 Apr 2025
https://github.com/poloclub/diffusiondb
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
ai-art computer-vision image-generation prompt-engineering stable-diffusion
Last synced: 16 May 2025
https://github.com/yatengLG/ISAT_with_segment_anything
Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具
annotation-tool computer-vision labeling labeling-tool sam sam2 segment-anything segment-anything-2 video-segmentation
Last synced: 09 Mar 2025
https://github.com/gnes-ai/gnes
GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.
cloud-native computer-vision database deep-learning distributed-systems dnn docker-swarm elasticsearch gnes grpc machine-learning microservices neural-network nlp python pytorch search-engine semantic-search tensorflow video-processing
Last synced: 05 Oct 2025
https://poloclub.github.io/diffusiondb/
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
ai-art computer-vision image-generation prompt-engineering stable-diffusion
Last synced: 07 May 2025
https://github.com/open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
ai computer-vision deep-learning machine-learning python pytorch
Last synced: 13 May 2025
https://github.com/tensorlayer/hyperpose
Library for Fast and Flexible Human Pose Estimation
computer-vision distributed-training mobilenet neural-networks openpose pose-estimation tensorflow tensorlayer tensorrt
Last synced: 12 Jan 2026
https://github.com/tensorlayer/HyperPose
Library for Fast and Flexible Human Pose Estimation
computer-vision distributed-training mobilenet neural-networks openpose pose-estimation tensorflow tensorlayer tensorrt
Last synced: 13 Apr 2025
https://github.com/atulapra/emotion-detection
Real-time Facial Emotion Detection using deep learning
computer-vision deep-learning emotion-detection emotion-recognition haar-cascade opencv opencv-python tflearn
Last synced: 15 May 2025
https://github.com/Yangzhangcst/Transformer-in-Computer-Vision
A paper list of some recent Transformer-based CV works.
awesome computer-vision deep-learning detr papers transformer transformer-awesome transformer-cv vit
Last synced: 06 May 2025
https://github.com/amaiya/ktrain
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
computer-vision deep-learning graph-neural-networks keras machine-learning nlp python tabular-data tensorflow
Last synced: 29 Apr 2025
https://github.com/lingdong-/qiji-font
齊伋體 - typeface from Ming Dynasty woodblock printed books
chinese classical-chinese computer-vision font typeface typography woodcutting
Last synced: 12 Apr 2025
https://github.com/swz30/MPRNet
[CVPR 2021] Multi-Stage Progressive Image Restoration. SOTA results for Image deblurring, deraining, and denoising.
computer-vision cvpr-2021 cvpr2021 cvpr21 image-deblurring image-denoising image-deraining image-restoration low-level-vision multistage-network progressive-restoration pytorch
Last synced: 07 Apr 2025
https://github.com/LingDong-/qiji-font
齊伋體 - typeface from Ming Dynasty woodblock printed books
chinese classical-chinese computer-vision font typeface typography woodcutting
Last synced: 07 May 2025
https://github.com/open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa
Last synced: 20 Jul 2025
https://github.com/imagej/imagej2
Open scientific N-dimensional image processing :microscope: :sparkler:
computer-vision image-processing
Last synced: 14 May 2025
https://github.com/captain1986/captainblackboard
船长关于机器学习、计算机视觉和工程技术的总结和分享
computer-vision convolutional-neural-networks deep-learning optimization-algorithms summary
Last synced: 16 May 2025
https://github.com/pprp/simplecvreproduction
Replication of simple CV Projects including attention, classification, detection, keypoint detection, etc.
attention classification computer-vision cv demo face-detection landmark object-detection paper-reproduction pytorch
Last synced: 16 May 2025
https://github.com/br-idl/paddlevit
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
classification computer-vision cv deep-learning detection encoder-decoder gan mlp object-detection paddlepaddle segmentation semantic-segmentation transformer vit
Last synced: 14 Apr 2025
https://github.com/huoyijie/AdvancedEAST
AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car
advancedeast advancedeast-network-arch algorithm bellow computer-vision deep-learning east icpr keras machine-learning python scene tensorflow text-detect text-predictions tian-chi tianchi
Last synced: 02 Apr 2025
https://github.com/huoyijie/advancedeast
AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car
advancedeast advancedeast-network-arch algorithm bellow computer-vision deep-learning east icpr keras machine-learning python scene tensorflow text-detect text-predictions tian-chi tianchi
Last synced: 16 May 2025
https://github.com/nachifur/mulimgviewer
MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.
computer-vision deep-learning image-comparison image-stitching image-viewer multiple-image-comparison multiple-images multiple-imageview opencas parallel picture-viewer python3 ubuntu viewer windows10
Last synced: 14 May 2025
https://github.com/rqluo/mixtex-latex-ocr
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
computer-vision deep-learning latex machine-learning ocr onnx python
Last synced: 14 May 2025
https://github.com/KaiyangZhou/Dassl.pytorch
A PyTorch toolbox for domain generalization, domain adaptation and semi-supervised learning.
artificial-intelligence benchmark-datasets computer-vision deep-learning deep-neural-networks domain-adaptation domain-generalization machine-learning pytorch semi-supervised-learning
Last synced: 08 May 2025
https://github.com/xvjiarui/gcnet
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
computer-vision deep-learning instance-segmentation object-detection
Last synced: 16 May 2025
https://github.com/xvjiarui/GCNet
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
computer-vision deep-learning instance-segmentation object-detection
Last synced: 19 Jul 2025
https://github.com/utiasstars/pykitti
Python tools for working with KITTI data.
computer-vision kitti-dataset python robotics
Last synced: 15 May 2025
https://github.com/openpifpaf/openpifpaf
Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.
composite-fields computer-vision deep-learning human-pose-estimation keypoint-estimation pose-estimation
Last synced: 14 May 2025
https://github.com/HyperGAN/HyperGAN
Composable GAN framework with api and user interface
artificial-intelligence computer-vision gan generative-adversarial-network hypergan machine-learning machine-learning-api online-learning python pytorch sponsors unsupervised-learning
Last synced: 12 Apr 2025
https://github.com/utiasSTARS/pykitti
Python tools for working with KITTI data.
computer-vision kitti-dataset python robotics
Last synced: 20 Mar 2025
https://github.com/hypergan/hypergan
Composable GAN framework with api and user interface
artificial-intelligence computer-vision gan generative-adversarial-network hypergan machine-learning machine-learning-api online-learning python pytorch sponsors unsupervised-learning
Last synced: 16 May 2025
https://github.com/atulapra/Emotion-detection
Real-time Facial Emotion Detection using deep learning
computer-vision deep-learning emotion-detection emotion-recognition haar-cascade opencv opencv-python tflearn
Last synced: 22 Jul 2025
https://github.com/fantasy-studio/paint-by-example
Paint by Example: Exemplar-based Image Editing with Diffusion Models
computer-vision deep-learning diffusion-models image-editing image-generation image-manipulation paint-by-example pytorch stable-diffusion
Last synced: 16 May 2025
https://github.com/patrick-llgc/learning-deep-learning
Paper reading notes on Deep Learning and Machine Learning
3d-object-detection 3d-object-recognition cnn computer-vision deep-learning literature-review machine-learning medical medical-imaging paper paper-reading paper-review point-cloud reinforcement-learning
Last synced: 14 May 2025
https://github.com/geekalexis/fastmot
High-performance multiple object tracking based on YOLO, Deep SORT, and KLT 🚀
computer-vision deep-learning deep-sort edge-computing jetson lucas-kanade multi-object-tracking object-detection people-counter real-time reid scaledyolov4 ssd tensorrt video-analysis yolov3 yolov4
Last synced: 08 Apr 2025