An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/spla-tam/SplaTAM

SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)

computer-vision cvpr2024 gaussian-splatting robotics slam

Last synced: 20 Mar 2025

https://github.com/mathworks/matlab-simulink-challenge-project-hub

This MATLAB and Simulink Challenge Project Hub contains a list of research and design project ideas. These projects will help you gain practical experience and insight into technology trends and industry directions.

ai autonomous capstone capstone-project computer-vision deep-learning drones energy final-project final-year-project master-thesis matlab project-ideas robotics senior-design senior-project simulink student-project students thesis

Last synced: 11 Apr 2025

https://github.com/chainer/chainercv

ChainerCV: a Library for Deep Learning in Computer Vision

chainer chainercv computer-vision cupy deep-learning neural-network python

Last synced: 18 Jan 2025

https://github.com/robertknight/ocrs

Rust library and CLI tool for OCR (extracting text from images)

computer-vision machine-learning ocr

Last synced: 28 Apr 2025

https://github.com/toandaominh1997/efficientdet.pytorch

Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch

coco computer-vision demo detection efficientdet-d0 efficientnet focalloss multibox nms object-detection pascal-voc pytorch

Last synced: 15 Dec 2024

https://github.com/toandaominh1997/EfficientDet.Pytorch

Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch

coco computer-vision demo detection efficientdet-d0 efficientnet focalloss multibox nms object-detection pascal-voc pytorch

Last synced: 27 Nov 2024

https://github.com/minivision-ai/silent-face-anti-spoofing

静默活体检测(Silent-Face-Anti-Spoofing)

android-app computer-vision deep-learning face-anti-spoofing sdk

Last synced: 08 Apr 2025

https://github.com/una-dinosauria/3d-pose-baseline

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.

3d-vision baseline computer-vision iccv-17 iccv-2017 tensorflow

Last synced: 08 Apr 2025

https://github.com/symforce-org/symforce

Fast symbolic computation, code generation, and nonlinear optimization for robotics

autonomous-vehicles code-generation computer-vision cpp motion-planning optimization python robotics slam structure-from-motion symbolic-computation

Last synced: 01 May 2025

https://github.com/CSAILVision/LabelMeAnnotationTool

Source code for the LabelMe annotation tool.

annotation computer-vision

Last synced: 27 Apr 2025

https://github.com/csailvision/labelmeannotationtool

Source code for the LabelMe annotation tool.

annotation computer-vision

Last synced: 07 Apr 2025

https://github.com/rese1f/stablevideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

aigc computer-vision controlnet diffusion-model video-editing

Last synced: 08 Apr 2025

https://github.com/torchgan/torchgan

Research Framework for easy and efficient training of GANs based on Pytorch

computer-vision deep-learning gans generative-adversarial-networks generative-model machine-learning neural-networks python python3 pytorch

Last synced: 14 Apr 2025

https://github.com/tannerhelland/PhotoDemon

A free portable photo editor focused on pro-grade features, high performance, and maximum usability.

computer-vision image-editor image-filters image-processing paint photo-editor vb6 win32

Last synced: 03 Apr 2025

https://github.com/ai-forever/ghost

A new one shot face swap approach for image and video domains

computer-vision deep-face-swap deep-learning deepfake face-swap faceswap ghost ghost-faceswap ghost-swap pytorch

Last synced: 15 Apr 2025

https://github.com/qingyonghu/randla-net

🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)

3d-vision computer-vision s3dis semantic-segmentation semantic3d semantickitti

Last synced: 08 Apr 2025

https://github.com/skalskip/ilearndeeplearning.py

This repository contains small projects related to Neural Networks and Deep Learning in general. Subjects are closely linekd with articles I publish on Medium. I encourage you both to read as well as to check how the code works in the action.

computer-vision deep-learning deep-learning-tutorial neural-network numpy visualizations

Last synced: 08 Apr 2025

https://github.com/SkalskiP/ILearnDeepLearning.py

This repository contains small projects related to Neural Networks and Deep Learning in general. Subjects are closely linekd with articles I publish on Medium. I encourage you both to read as well as to check how the code works in the action.

computer-vision deep-learning deep-learning-tutorial neural-network numpy visualizations

Last synced: 01 May 2025

https://github.com/QingyongHu/RandLA-Net

🔥RandLA-Net in Tensorflow (CVPR 2020, Oral & IEEE TPAMI 2021)

3d-vision computer-vision s3dis semantic-segmentation semantic3d semantickitti

Last synced: 20 Mar 2025

https://github.com/nianticlabs/simplerecon

[ECCV 2022] SimpleRecon: 3D Reconstruction Without 3D Convolutions

computer-vision cost-volume depth depth-estimation eccv2022 multi-view-stereo mvs pytorch scannet visualization

Last synced: 09 Apr 2025

https://github.com/dwctod/cvpr2024-papers-with-code-demo

收集 CVPR 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations from everyone!

computer-vision cvpr cvpr2021 cvpr2022 cvpr2023 cvpr2024 llm multimodal-deep-learning object-detection segment-anything segmentation

Last synced: 23 Mar 2025

https://github.com/DWCTOD/CVPR2024-Papers-with-Code-Demo

收集 CVPR 最新的成果,包括论文、代码和demo视频等,欢迎大家推荐!Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations from everyone!

computer-vision cvpr cvpr2021 cvpr2022 cvpr2023 cvpr2024 llm multimodal-deep-learning object-detection segment-anything segmentation

Last synced: 29 Mar 2025

https://github.com/cdpierse/transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

captum computer-vision deep-learning explainable-ai interpretability machine-learning model-explainability natural-language-processing neural-network nlp transformers transformers-model

Last synced: 14 Apr 2025

https://github.com/roboflow/rf-detr

RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.

computer-vision detr machine-learning object-detection rf-detr

Last synced: 01 Apr 2025

https://github.com/muskie82/MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

computer-vision cvpr2024 gaussian-splatting robotics slam

Last synced: 20 Mar 2025

https://github.com/om-ai-lab/omdet

Real-time and accurate open-vocabulary end-to-end object detection

coco computer-vision lvis object-detection open-vocabulary real-time vision-and-language zero-shot zero-shot-object-detection

Last synced: 13 Apr 2025

https://github.com/digantamisra98/mish

Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]

activation-functions bmvc bmvc20 computer-vision deep-learning image-classification mathematics neural-networks object-detection

Last synced: 13 Apr 2025

https://github.com/mathworks/MATLAB-Simulink-Challenge-Project-Hub

This MATLAB and Simulink Challenge Project Hub contains a list of research and design project ideas. These projects will help you gain practical experience and insight into technology trends and industry directions.

ai autonomous capstone capstone-project computer-vision deep-learning drones energy final-project final-year-project master-thesis matlab project-ideas robotics senior-design senior-project simulink student-project students thesis

Last synced: 15 Nov 2024

https://github.com/digantamisra98/Mish

Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]

activation-functions bmvc bmvc20 computer-vision deep-learning image-classification mathematics neural-networks object-detection

Last synced: 20 Mar 2025

https://github.com/yatengLG/ISAT_with_segment_anything

Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具

annotation-tool computer-vision labeling labeling-tool sam sam2 segment-anything segment-anything-2 video-segmentation

Last synced: 09 Mar 2025

https://github.com/yatenglg/isat_with_segment_anything

Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具

annotation-tool computer-vision labeling labeling-tool sam sam2 segment-anything segment-anything-2 video-segmentation

Last synced: 10 Apr 2025

https://github.com/open-mmlab/mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

ai computer-vision deep-learning machine-learning python pytorch

Last synced: 23 Apr 2025

https://github.com/poloclub/diffusiondb

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

ai-art computer-vision image-generation prompt-engineering stable-diffusion

Last synced: 08 Apr 2025

https://github.com/swz30/mprnet

[CVPR 2021] Multi-Stage Progressive Image Restoration. SOTA results for Image deblurring, deraining, and denoising.

computer-vision cvpr-2021 cvpr2021 cvpr21 image-deblurring image-denoising image-deraining image-restoration low-level-vision multistage-network progressive-restoration pytorch

Last synced: 08 Apr 2025

https://github.com/amaiya/ktrain

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

computer-vision deep-learning graph-neural-networks keras machine-learning nlp python tabular-data tensorflow

Last synced: 29 Apr 2025

https://github.com/lingdong-/qiji-font

齊伋體 - typeface from Ming Dynasty woodblock printed books

chinese classical-chinese computer-vision font typeface typography woodcutting

Last synced: 12 Apr 2025

https://github.com/swz30/MPRNet

[CVPR 2021] Multi-Stage Progressive Image Restoration. SOTA results for Image deblurring, deraining, and denoising.

computer-vision cvpr-2021 cvpr2021 cvpr21 image-deblurring image-denoising image-deraining image-restoration low-level-vision multistage-network progressive-restoration pytorch

Last synced: 07 Apr 2025

https://github.com/LingDong-/qiji-font

齊伋體 - typeface from Ming Dynasty woodblock printed books

chinese classical-chinese computer-vision font typeface typography woodcutting

Last synced: 14 Nov 2024

https://github.com/open-compass/VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 28 Nov 2024

https://github.com/imagej/imagej2

Open scientific N-dimensional image processing :microscope: :sparkler:

computer-vision image-processing

Last synced: 11 Apr 2025

https://github.com/captain1986/captainblackboard

船长关于机器学习、计算机视觉和工程技术的总结和分享

computer-vision convolutional-neural-networks deep-learning optimization-algorithms summary

Last synced: 09 Apr 2025

https://github.com/pprp/simplecvreproduction

Replication of simple CV Projects including attention, classification, detection, keypoint detection, etc.

attention classification computer-vision cv demo face-detection landmark object-detection paper-reproduction pytorch

Last synced: 08 Apr 2025

https://github.com/br-idl/paddlevit

:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

classification computer-vision cv deep-learning detection encoder-decoder gan mlp object-detection paddlepaddle segmentation semantic-segmentation transformer vit

Last synced: 14 Apr 2025

https://github.com/huoyijie/advancedeast

AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car

advancedeast advancedeast-network-arch algorithm bellow computer-vision deep-learning east icpr keras machine-learning python scene tensorflow text-detect text-predictions tian-chi tianchi

Last synced: 08 Apr 2025

https://github.com/huoyijie/AdvancedEAST

AdvancedEAST is an algorithm used for Scene image text detect, which is primarily based on EAST, and the significant improvement was also made, which make long text predictions more accurate.https://github.com/huoyijie/raspberrypi-car

advancedeast advancedeast-network-arch algorithm bellow computer-vision deep-learning east icpr keras machine-learning python scene tensorflow text-detect text-predictions tian-chi tianchi

Last synced: 02 Apr 2025

https://poloclub.github.io/diffusiondb/

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

ai-art computer-vision image-generation prompt-engineering stable-diffusion

Last synced: 14 Nov 2024

https://github.com/xvjiarui/gcnet

GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

computer-vision deep-learning instance-segmentation object-detection

Last synced: 08 Apr 2025

https://github.com/nachifur/mulimgviewer

MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.

computer-vision deep-learning image-comparison image-stitching image-viewer multiple-image-comparison multiple-images multiple-imageview opencas parallel picture-viewer python3 ubuntu viewer windows10

Last synced: 11 Apr 2025

https://github.com/rqluo/mixtex-latex-ocr

MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.

computer-vision deep-learning latex machine-learning ocr onnx python

Last synced: 13 Apr 2025

https://github.com/utiasstars/pykitti

Python tools for working with KITTI data.

computer-vision kitti-dataset python robotics

Last synced: 14 Apr 2025

https://github.com/xvjiarui/GCNet

GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

computer-vision deep-learning instance-segmentation object-detection

Last synced: 27 Nov 2024

https://github.com/openpifpaf/openpifpaf

Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.

composite-fields computer-vision deep-learning human-pose-estimation keypoint-estimation pose-estimation

Last synced: 10 Apr 2025

https://github.com/utiasSTARS/pykitti

Python tools for working with KITTI data.

computer-vision kitti-dataset python robotics

Last synced: 20 Mar 2025

https://github.com/lightaime/deep_gcns_torch

Pytorch Repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000(ICML'2021): https://www.deepgcns.org

3d-point-clouds bioinformatics cheminformatics computer-vision data-mining deep-gcns deep-learning geometric-deep-learning graph-convolutional-networks graph-neural-networks pytorch science-research social-network

Last synced: 09 Apr 2025

https://github.com/tatsuyah/vehicle-detection

Vehicle detection using machine learning and computer vision techniques for Udacity's Self-Driving Car Engineer Nanodegree.

computer-vision hog-features machine-learning self-driving-car sliding-windows svm svm-classifier udacity

Last synced: 15 Mar 2025

https://github.com/yuliangxiu/econ

[CVPR'23, Highlight] ECON: Explicit Clothed humans Optimized via Normal integration

3d-reconstruction avatar-generator computer-graphics computer-vision digital-twins metaverse normal-maps pifu pifuhd smpl-body smplx virtual-humans

Last synced: 06 Apr 2025

https://github.com/nachifur/MulimgViewer

MulimgViewer is a multi-image viewer that can open multiple images in one interface, which is convenient for image comparison and image stitching.

computer-vision deep-learning image-comparison image-stitching image-viewer multiple-image-comparison multiple-images multiple-imageview opencas parallel picture-viewer python3 ubuntu viewer windows10

Last synced: 25 Nov 2024

https://github.com/bendangnuksung/Image-OutPainting

🏖 Keras Implementation of Painting outside the box

computer-vision deep-learning gans image-outpainting keras python tensorflow

Last synced: 27 Nov 2024

https://github.com/bendangnuksung/image-outpainting

🏖 Keras Implementation of Painting outside the box

computer-vision deep-learning gans image-outpainting keras python tensorflow

Last synced: 09 Apr 2025

https://github.com/pkhungurn/talking-head-anime-2-demo

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

ai anime computer-graphics computer-vision deep-learning machine-learning python pytorch vtuber

Last synced: 12 Apr 2025

https://github.com/TimoBolkart/voca

This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.

3d-face 3d-models animation-sequence computer-graphics computer-vision face-animation machine-learning morphable-model python python3 tensorflow voca

Last synced: 15 Nov 2024

https://github.com/open-mmlab/openmmlabcourse

OpenMMLab course index and stuff

computer-vision tutorials

Last synced: 08 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 704 awesome-multimodal-ml 480 awesome-self-supervised-learning 453 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 460 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 awesome-industrial-anomaly-detection 769 Awesome-Federated-Learning 558 Awesome-pytorch-list-CNVersion 692 Awesome-FL 2,522 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 190 iOS_ML 39 awesome-tensorflow-lite 110 CV-pretrained-model 103 awesome-attention-mechanism-in-cv 195 awesome-grounding 154 Awesome-Image-Colorization 126 awesome-capsule-networks 67 Awesome-World-Model 265 Awesome-Open-Vocabulary 161 awesome-ai-awesomeness 236 openstl 43 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-scene-understanding 199 awesome-multi-task-learning 229 awesome-open-data-centric-ai 56 awesome-robotics-3d 106 Awesome-Skeleton-based-Action-Recognition 104 awesome-photogrammetry 68 awesome-holistic-3d 129 Awesome-3D-Object-Detection 169 awesome-data-annotation 93 awesome-panoptic-segmentation 45 awesome-optical-flow 110 Awesome-Parameter-Efficient-Transfer-Learning 121 awesome-computer-vision-models 189 awesome-ai-data-guided-projects 56 awesome-nerf-editing 419 awesome-state-of-depth-completion 71 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 awesome-robotics-datasets 79 awesome_computer_science 118 Awesome-Monocular-3D-detection 97