An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/open-mmlab/OpenMMLabCourse

OpenMMLab course index and stuff

computer-vision tutorials

Last synced: 20 Mar 2025

https://github.com/lzhbrian/image-to-image-papers

🦓<->🦒 🌃<->🌆 A collection of image to image papers with code (constantly updating)

code computer-vision curated-list gan generative-adversarial-network image-manipulation image-to-image image-to-image-translation image-translation papers

Last synced: 26 Mar 2025

https://github.com/YuliangXiu/ECON

[CVPR'23, Highlight] ECON: Explicit Clothed humans Optimized via Normal integration

3d-reconstruction avatar-generator computer-graphics computer-vision digital-twins metaverse normal-maps pifu pifuhd smpl-body smplx virtual-humans

Last synced: 18 Apr 2025

https://github.com/rykov8/ssd_keras

Port of Single Shot MultiBox Detector to Keras

computer-vision deep-learning keras-models object-detection

Last synced: 17 Jan 2025

https://github.com/youyuge34/anime-inpainting

An application tool of edge-connect, which can do anime inpainting and drawing. Automatic restoration of anime character images: mosaic removal, filling, and blemish removal.

anime computer-vision cv deep-learning gan generative-adversarial-network inpainting mosaic neural-network opencv python pytorch

Last synced: 04 Apr 2025

https://github.com/youyuge34/Anime-InPainting

An application tool of edge-connect, which can do anime inpainting and drawing. Automatic restoration of anime character images: mosaic removal, filling, and blemish removal.

anime computer-vision cv deep-learning gan generative-adversarial-network inpainting mosaic neural-network opencv python pytorch

Last synced: 26 Mar 2025

https://github.com/piddnad/ddcolor

[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

computer-vision image-colorization pytorch

Last synced: 13 Apr 2025

https://github.com/piddnad/DDColor

[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

computer-vision image-colorization pytorch

Last synced: 22 Dec 2024

https://github.com/3dtopia/openlrm

An open-source impl. of Large Reconstruction Models

3d aigc computer-vision generation

Last synced: 12 Apr 2025

https://github.com/jeeliz/jeelizWeboji

JavaScript/WebGL real-time face tracking and expression detection library. Build your own emoticons animated in real time in the browser! SVG and THREE.js integration demos are provided.

animated augmented-reality camera computer-vision deep-learning emoticon face face-detection face-expression face-tracking javascript morph real-time threejs tracking webar webcam webgl weboji

Last synced: 11 Apr 2025

https://github.com/jeeliz/jeelizweboji

JavaScript/WebGL real-time face tracking and expression detection library. Build your own emoticons animated in real time in the browser! SVG and THREE.js integration demos are provided.

animated augmented-reality camera computer-vision deep-learning emoticon face face-detection face-expression face-tracking javascript morph real-time threejs tracking webar webcam webgl weboji

Last synced: 08 Apr 2025

https://github.com/skalskip/vlms-zero-to-hero

This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.

bert-model clip computer-vision embeddings gpt gpt-2 lora natural-language-processing seq2seq vision-language-model word2vec

Last synced: 22 Apr 2025

https://github.com/huggingface/pytorch-pretrained-biggan

🦋A PyTorch implementation of BigGAN with pretrained weights and conversion scripts.

artificial-intelligence biggan computer-vision gan generative-adversarial-network neural-network pytorch

Last synced: 08 Apr 2025

https://github.com/nvidia-ai-iot/redtail

Perception and AI components for autonomous mobile robotics.

ai artificial-intelligence computer-vision deep-learning drones jetson robotics

Last synced: 12 Apr 2025

https://github.com/NVIDIA-AI-IOT/redtail

Perception and AI components for autonomous mobile robotics.

ai artificial-intelligence computer-vision deep-learning drones jetson robotics

Last synced: 05 Apr 2025

https://github.com/huggingface/pytorch-pretrained-BigGAN

🦋A PyTorch implementation of BigGAN with pretrained weights and conversion scripts.

artificial-intelligence biggan computer-vision gan generative-adversarial-network neural-network pytorch

Last synced: 27 Nov 2024

https://github.com/unrealcv/synthetic-computer-vision

A list of synthetic datasets and tools for computer vision

computer-vision dataset synthetic-images virtual-worlds

Last synced: 09 Apr 2025

https://github.com/google-research/maxim

[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.

architecture computer-vision deblurring dehazing denoising deraining enhancement image image-enhancement image-processing image-restoration low-level-vision mlp restoration retouching transformer

Last synced: 24 Nov 2024

https://github.com/wpeebles/gangealing

Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)

alignment computer-vision correspondence cvpr cvpr2022 deep-learning gan generative image-manipulation pytorch stylegan stylegan2 tracking

Last synced: 12 Apr 2025

https://github.com/IBM/MAX-Image-Resolution-Enhancer

Upscale an image by a factor of 4, while generating photo-realistic details.

ai codait computer-vision docker-image ibm machine-learning machine-learning-models neural-network tensorflow

Last synced: 14 Mar 2025

https://github.com/ibm/max-image-resolution-enhancer

Upscale an image by a factor of 4, while generating photo-realistic details.

ai codait computer-vision docker-image ibm machine-learning machine-learning-models neural-network tensorflow

Last synced: 09 Apr 2025

https://github.com/kylemcdonald/FaceTracker

Real time deformable face tracking in C++ with OpenCV 3.

computer-vision facetracker opencv realtime

Last synced: 15 Mar 2025

https://github.com/kylemcdonald/facetracker

Real time deformable face tracking in C++ with OpenCV 3.

computer-vision facetracker opencv realtime

Last synced: 18 Jan 2025

https://github.com/calciferzh/minimal-hand

A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run.

3d-hand-pose-estimation computer-vision deep-learning hand-motion-capture hand-tracking

Last synced: 12 Apr 2025

https://github.com/Sharpiless/Yolov5-Deepsort

Latest-version YOLOv5 + DeepSORT object detection and tracking. Displays object classes, supports version 5.0, and can be trained on your own dataset.

computer-vision deepsort object-detection object-tracking pytorch yolov5

Last synced: 21 Apr 2025

https://github.com/mrnerf/gaussian-splatting-cuda

3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!

computer-graphics computer-vision cuda gaussian-splatting nerf optimization

Last synced: 06 Apr 2025

https://github.com/pkhungurn/talking-head-anime-3-demo

Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project

ai anime computer-graphics computer-vision machine-learning python pytorch vtuber

Last synced: 12 Apr 2025

https://github.com/haltakov/natural-language-image-search

Search photos on Unsplash using natural language

clip computer-vision image-search machine-learning photos unsplash

Last synced: 01 Apr 2025

https://github.com/andyzeng/visual-pushing-grasping

Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.

3d artificial-intelligence computer-vision deep-learning deep-reinforcement-learning grasping manipulation pushing robotics vision

Last synced: 12 Apr 2025

https://github.com/CalciferZh/minimal-hand

A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run.

3d-hand-pose-estimation computer-vision deep-learning hand-motion-capture hand-tracking

Last synced: 01 May 2025

https://github.com/rmislam/PythonSIFT

A clean and concise Python implementation of SIFT (Scale-Invariant Feature Transform)

computer-vision feature-matching image-processing opencv python sift template-matching

Last synced: 23 Apr 2025

https://github.com/rmislam/pythonsift

A clean and concise Python implementation of SIFT (Scale-Invariant Feature Transform)

computer-vision feature-matching image-processing opencv python sift template-matching

Last synced: 12 Apr 2025

https://github.com/RQLuo/MixTeX-Latex-OCR

MixTeX: multimodal LaTeX, ZhEn (Chinese-English), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.

computer-vision deep-learning latex machine-learning ocr onnx python

Last synced: 27 Dec 2024

https://github.com/1adrianb/2d-and-3d-face-alignment

This repository implements a demo of the networks described in "How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)" paper.

3d-face-alignment computer-vision deeplearning face-alignment torch7

Last synced: 04 Apr 2025

https://github.com/BMW-InnovationLab/BMW-TensorFlow-Training-GUI

This repository lets you get started with GUI-based training of a state-of-the-art deep learning model with little to no configuration needed. No-code training with TensorFlow has never been so easy.

computer-vision computervision deep-learning deep-neural-networks deeplearning detection-api docker gui inference-api machine-learning neural-network no-code object-detection objectdetection resnet rest-api tensorboard tensorflow tensorflow-gui tensorflow2

Last synced: 07 Apr 2025

https://github.com/dmitryryumin/iccv-2023-papers

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

3d-graphics 3d-reconstruction biometrics computer-vision datasets deep-learning explainable-ai face-recognition gesture-recognition iccv iccv2023 image-processing image-synthesis multimodal-learning pattern-recognition photogrammetry pose-estimation robotics transfer-learning video-synthesis

Last synced: 08 Apr 2025

https://github.com/GoGoDuck912/Self-Correction-Human-Parsing

An out-of-box human parsing representation extractor.

computer-vision deep-learning human-parsing semantic-segmentation

Last synced: 16 Mar 2025

https://github.com/nv-tlabs/GSCNN

Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

computer-vision deep-learning iccv2019 nv-tlabs pytorch semantic-boundaries semantic-segmentation

Last synced: 28 Mar 2025

https://github.com/nv-tlabs/gscnn

Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

computer-vision deep-learning iccv2019 nv-tlabs pytorch semantic-boundaries semantic-segmentation

Last synced: 09 Apr 2025

https://github.com/haltakov/natural-language-youtube-search

Search inside YouTube videos using natural language

clip computer-vision machine-learning search youtube

Last synced: 15 Mar 2025

https://github.com/xiaoyufenfei/Efficient-Segmentation-Networks

Lightweight models for real-time semantic segmentation on PyTorch (including SQNet, LinkNet, SegNet, UNet, ENet, ERFNet, EDANet, ESPNet, ESPNetv2, LEDNet, ESNet, FSSNet, CGNet, DABNet, Fast-SCNN, ContextNet, FPENet, etc.)

camvid cityscapes computer-vision driving-scene-understanding efficient-segmentation-networks image-segmentation lightweight-semantic-segmentation neural-networks pytorch real-time-semantic-segmentation scene-understanding segmentation semantic-segmentation semantic-segmentation-models

Last synced: 28 Mar 2025

https://github.com/MrNeRF/gaussian-splatting-cuda

3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!

computer-graphics computer-vision cuda gaussian-splatting nerf optimization

Last synced: 11 Apr 2025

https://github.com/google-research/pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

computer-vision deep-learning object-detection pix2seq tensorflow2 vision-language

Last synced: 22 Jan 2025

https://github.com/anupamchugh/iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI additions

alamofire arkit computer-vision coreml coremltools ios keras ml-kit natural-language-processing nlp realitykit swift swiftui vision vision-framework

Last synced: 29 Nov 2024

https://github.com/allenai/objaverse-xl

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

3d blender computer-vision image-to-3d objaverse python text-to-3d

Last synced: 07 Apr 2025

https://github.com/hustvl/yolos

[NeurIPS 2021] You Only Look at One Sequence

computer-vision object-detection transformer vision-transformer

Last synced: 13 Apr 2025

https://github.com/hustvl/YOLOS

[NeurIPS 2021] You Only Look at One Sequence

computer-vision object-detection transformer vision-transformer

Last synced: 20 Apr 2025

https://github.com/andyzeng/3dmatch-toolbox

3DMatch - a 3D ConvNet-based local geometric descriptor for aligning 3D meshes and point clouds.

3d 3d-deep-learning 3dmatch artificial-intelligence computer-vision deep-learning geometry-processing point-cloud rgbd vision

Last synced: 12 Apr 2025

https://github.com/cvg/glue-factory

Training library for local feature detection and matching

computer-vision deep-learning iccv2023 image-matching

Last synced: 12 Apr 2025

https://github.com/FORTH-ModelBasedTracker/MocapNET

We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance

2d-to-3d 3d-animation 3d-pose-estimation bvh bvh-format computer-vision demo ensemble gesture-recognition mocap neural-network pose-estimation real-time rgb-images tensorflow webcam

Last synced: 05 Apr 2025

https://github.com/microsoft/cameratraps

PyTorch Wildlife: a Collaborative Deep Learning Framework for Conservation.

camera-traps computer-vision conservation machine-learning megadetector pytorch pytorch-wildlife wildlife

Last synced: 10 Apr 2025

https://github.com/zhmiao/OpenLongTailRecognition-OLTR

Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)

computer-vision cvpr2019 deep-learning long-tail oltr open-long-tail-recognition open-set pytorch-implementation

Last synced: 26 Mar 2025

https://github.com/airctic/icevision

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

ai annotation-parsers coco-dataset coco-parser computer-vision deep-learning effecientdet fastai faster-rcnn mask-rcnn object-detection pycocotools python pytorch pytorch-lightning tutorials voc-dataset voc-parser

Last synced: 11 Apr 2025

https://github.com/zhmiao/openlongtailrecognition-oltr

Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)

computer-vision cvpr2019 deep-learning long-tail oltr open-long-tail-recognition open-set pytorch-implementation

Last synced: 13 Apr 2025

https://github.com/hkchengrex/cascadepsp

[CVPR 2020] CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement

computer-vision cvpr2020 deep-learning high-resolution pytorch refinement-network segmentation segmentation-refinement

Last synced: 07 Apr 2025

https://github.com/wasidennis/AdaptSegNet

Learning to Adapt Structured Output Space for Semantic Segmentation, CVPR 2018 (spotlight)

adversarial-learning computer-vision deep-learning domain-adaptation generative-adversarial-network pytorch semantic-segmentation

Last synced: 22 Nov 2024

https://github.com/rust-cv/cv

Rust CV mono-repo. Contains pure-Rust dependencies which attempt to encapsulate the capability of OpenCV, OpenMVG, and vSLAM frameworks in a cohesive set of APIs.

algorithms computer-vision crates rust-cv

Last synced: 19 Apr 2025

https://github.com/Dovyski/cvui

A (very) simple UI lib built on top of OpenCV drawing primitives

computer-vision cpp gui imgui opencv opencv-drawing-primitives python ui

Last synced: 15 Mar 2025

https://github.com/dovyski/cvui

A (very) simple UI lib built on top of OpenCV drawing primitives

computer-vision cpp gui imgui opencv opencv-drawing-primitives python ui

Last synced: 08 Apr 2025

https://github.com/VladimirYugay/Gaussian-SLAM

Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting

3d-reconstruction computer-vision gaussian-splatting robotics slam

Last synced: 20 Mar 2025

https://github.com/haosulab/ManiSkill

SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark

3d-computer-vision computer-vision embodied-ai reinforcement-learning robot-learning robot-manipulation robotics robotics-simulation simulation-environment

Last synced: 13 Nov 2024

https://github.com/nettlep/magic

Scanner for decks of cards with bar codes printed on card edges

computer-vision magic swift

Last synced: 04 Apr 2025

https://github.com/hkchengrex/cutie

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

computer-vision cvpr2024 deep-learning pytorch segmentation video-editing video-object-segmentation video-segmentation

Last synced: 14 Apr 2025

https://github.com/Machine-Learning-Tokyo/papers-with-annotations

Research papers with annotations, illustrations and explanations

computer-vision deep-learning machine-learning

Last synced: 23 Apr 2025

https://github.com/machine-learning-tokyo/papers-with-annotations

Research papers with annotations, illustrations and explanations

computer-vision deep-learning machine-learning

Last synced: 22 Feb 2025

Computer vision Awesome Lists
Awesome-pytorch-list 704
awesome-multimodal-ml 480
awesome-self-supervised-learning 453
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-hand-pose-estimation 497
awesome-image-classification 220
Awesome-Crowd-Counting 460
awesome-human-pose-estimation 89
awesome-autonomous-vehicles 296
awesome-industrial-anomaly-detection 770
Awesome-Federated-Learning 558
Awesome-pytorch-list-CNVersion 692
Awesome-FL 2,522
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-Implicit-NeRF-Robotics 190
iOS_ML 39
awesome-tensorflow-lite 110
CV-pretrained-model 103
awesome-attention-mechanism-in-cv 195
awesome-grounding 154
Awesome-Image-Colorization 126
awesome-capsule-networks 67
Awesome-World-Model 268
Awesome-Open-Vocabulary 161
awesome-ai-awesomeness 236
openstl 43
awesome-autonomous-vehicle 181
awesome-6d-object 600
awesome-scene-understanding 199
awesome-multi-task-learning 229
awesome-open-data-centric-ai 56
awesome-robotics-3d 106
Awesome-Skeleton-based-Action-Recognition 104
awesome-photogrammetry 68
awesome-holistic-3d 129
Awesome-3D-Object-Detection 169
awesome-data-annotation 93
awesome-panoptic-segmentation 45
awesome-optical-flow 110
Awesome-Parameter-Efficient-Transfer-Learning 121
awesome-computer-vision-models 189
awesome-ai-data-guided-projects 56
awesome-nerf-editing 419
awesome-state-of-depth-completion 71
awesome-image-alignment-and-stitching 108
Awesome-Distributed-Deep-Learning 44
awesome-robotics-datasets 79
awesome_computer_science 118
Awesome-Monocular-3D-detection 97