Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/rykov8/ssd_keras

Port of Single Shot MultiBox Detector to Keras

computer-vision deep-learning keras-models object-detection

Last synced: 07 Aug 2024

https://github.com/swz30/MPRNet

[CVPR 2021] Multi-Stage Progressive Image Restoration. SOTA results for Image deblurring, deraining, and denoising.

computer-vision cvpr-2021 cvpr2021 cvpr21 image-deblurring image-denoising image-deraining image-restoration low-level-vision multistage-network progressive-restoration pytorch

Last synced: 01 Aug 2024

https://github.com/mediar-ai/screenpipe

Library to build personalized AI powered by what you've seen, said, or heard. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.

ai computer-vision llm machine-learning ml multimodal vision

Last synced: 27 Aug 2024

https://github.com/youyuge34/Anime-InPainting

An application tool built on edge-connect that can do anime inpainting and drawing. Automatically repairs anime character images: mosaic removal, filling, and blemish removal.

anime computer-vision cv deep-learning gan generative-adversarial-network inpainting mosaic neural-network opencv python pytorch

Last synced: 31 Jul 2024

https://github.com/jeeliz/jeelizWeboji

JavaScript/WebGL real-time face tracking and expression detection library. Build your own emoticons animated in real time in the browser! SVG and THREE.js integration demos are provided.

animated augmented-reality camera computer-vision deep-learning emoticon face face-detection face-expression face-tracking javascript morph real-time threejs tracking webar webcam webgl weboji

Last synced: 01 Aug 2024

https://github.com/YuliangXiu/ECON

[CVPR'23, Highlight] ECON: Explicit Clothed humans Optimized via Normal integration

3d-reconstruction avatar-generator computer-graphics computer-vision digital-twins metaverse normal-maps pifu pifuhd smpl-body smplx virtual-humans

Last synced: 01 Aug 2024

https://github.com/huggingface/pytorch-pretrained-BigGAN

🦋A PyTorch implementation of BigGAN with pretrained weights and conversion scripts.

artificial-intelligence biggan computer-vision gan generative-adversarial-network neural-network pytorch

Last synced: 07 Aug 2024

https://github.com/wpeebles/gangealing

Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)

alignment computer-vision correspondence cvpr cvpr2022 deep-learning gan generative image-manipulation pytorch stylegan stylegan2 tracking

Last synced: 01 Aug 2024

https://github.com/NVIDIA-AI-IOT/redtail

Perception and AI components for autonomous mobile robotics.

ai artificial-intelligence computer-vision deep-learning drones jetson robotics

Last synced: 01 Aug 2024

https://github.com/piddnad/DDColor

[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

computer-vision image-colorization pytorch

Last synced: 29 Aug 2024

https://github.com/kylemcdonald/FaceTracker

Real time deformable face tracking in C++ with OpenCV 3.

computer-vision facetracker opencv realtime

Last synced: 30 Jul 2024

https://github.com/unrealcv/synthetic-computer-vision

A list of synthetic datasets and tools for computer vision

computer-vision dataset synthetic-images virtual-worlds

Last synced: 09 Aug 2024

https://github.com/IBM/MAX-Image-Resolution-Enhancer

Upscale an image by a factor of 4, while generating photo-realistic details.

ai codait computer-vision docker-image ibm machine-learning machine-learning-models neural-network tensorflow

Last synced: 30 Jul 2024

https://github.com/open-mmlab/OpenMMLabCourse

OpenMMLab course index and stuff

computer-vision tutorials

Last synced: 31 Jul 2024

https://github.com/CalciferZh/minimal-hand

A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run.

3d-hand-pose-estimation computer-vision deep-learning hand-motion-capture hand-tracking

Last synced: 02 Aug 2024

https://github.com/BMW-InnovationLab/BMW-TensorFlow-Training-GUI

This repository lets you get started with GUI-based training of a state-of-the-art deep learning model with little to no configuration needed. No-code training with TensorFlow has never been so easy.

computer-vision computervision deep-learning deep-neural-networks deeplearning detection-api docker gui inference-api machine-learning neural-network no-code object-detection objectdetection resnet rest-api tensorboard tensorflow tensorflow-gui tensorflow2

Last synced: 01 Aug 2024

https://github.com/google-research/maxim

[CVPR 2022 Oral] Official repository for "MAXIM: Multi-Axis MLP for Image Processing". SOTA for denoising, deblurring, deraining, dehazing, and enhancement.

architecture computer-vision deblurring dehazing denoising deraining enhancement image image-enhancement image-processing image-restoration low-level-vision mlp restoration retouching transformer

Last synced: 02 Aug 2024

https://github.com/haltakov/natural-language-image-search

Search photos on Unsplash using natural language

clip computer-vision image-search machine-learning photos unsplash

Last synced: 01 Aug 2024

https://github.com/PeikeLi/Self-Correction-Human-Parsing

An out-of-box human parsing representation extractor.

computer-vision deep-learning human-parsing semantic-segmentation

Last synced: 31 Jul 2024

https://github.com/nv-tlabs/GSCNN

Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

computer-vision deep-learning iccv2019 nv-tlabs pytorch semantic-boundaries semantic-segmentation

Last synced: 31 Jul 2024

https://github.com/Sharpiless/Yolov5-Deepsort

Latest-version yolov5 + deepsort object detection and tracking. Can display object classes; supports version 5.0 and training on your own dataset.

computer-vision deepsort object-detection object-tracking pytorch yolov5

Last synced: 02 Aug 2024

https://github.com/muskie82/MonoGS

[CVPR'24 Highlight] Gaussian Splatting SLAM

computer-vision cvpr2024 gaussian-splatting robotics slam

Last synced: 31 Jul 2024

https://github.com/haltakov/natural-language-youtube-search

Search inside YouTube videos using natural language

clip computer-vision machine-learning search youtube

Last synced: 31 Jul 2024

https://github.com/rmislam/PythonSIFT

A clean and concise Python implementation of SIFT (Scale-Invariant Feature Transform)

computer-vision feature-matching image-processing opencv python sift template-matching

Last synced: 02 Aug 2024

https://github.com/xiaoyufenfei/Efficient-Segmentation-Networks

Lightweight models for real-time semantic segmentation in PyTorch (including SQNet, LinkNet, SegNet, UNet, ENet, ERFNet, EDANet, ESPNet, ESPNetv2, LEDNet, ESNet, FSSNet, CGNet, DABNet, Fast-SCNN, ContextNet, FPENet, etc.)

camvid cityscapes computer-vision driving-scene-understanding efficient-segmentation-networks image-segmentation lightweight-semantic-segmentation neural-networks pytorch real-time-semantic-segmentation scene-understanding segmentation semantic-segmentation semantic-segmentation-models

Last synced: 31 Jul 2024

https://github.com/anupamchugh/iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI additions

alamofire arkit computer-vision coreml coremltools ios keras ml-kit natural-language-processing nlp realitykit swift swiftui vision vision-framework

Last synced: 09 Aug 2024

https://github.com/andyzeng/visual-pushing-grasping

Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.

3d artificial-intelligence computer-vision deep-learning deep-reinforcement-learning grasping manipulation pushing robotics vision

Last synced: 02 Aug 2024

https://github.com/robertknight/ocrs

Rust library and CLI tool for OCR (extracting text from images)

computer-vision machine-learning ocr

Last synced: 01 Aug 2024

https://github.com/airctic/icevision

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

ai annotation-parsers coco-dataset coco-parser computer-vision deep-learning effecientdet fastai faster-rcnn mask-rcnn object-detection pycocotools python pytorch pytorch-lightning tutorials voc-dataset voc-parser

Last synced: 03 Aug 2024

https://github.com/wasidennis/AdaptSegNet

Learning to Adapt Structured Output Space for Semantic Segmentation, CVPR 2018 (spotlight)

adversarial-learning computer-vision deep-learning domain-adaptation generative-adversarial-network pytorch semantic-segmentation

Last synced: 05 Aug 2024

https://github.com/MrNeRF/gaussian-splatting-cuda

3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!

computer-graphics computer-vision cuda gaussian-splatting nerf optimization

Last synced: 01 Aug 2024

https://github.com/google-research/pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

computer-vision deep-learning object-detection pix2seq tensorflow2 vision-language

Last synced: 02 Aug 2024

https://github.com/zhmiao/OpenLongTailRecognition-OLTR

Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)

computer-vision cvpr2019 deep-learning long-tail oltr open-long-tail-recognition open-set pytorch-implementation

Last synced: 31 Jul 2024

https://github.com/Machine-Learning-Tokyo/papers-with-annotations

Research papers with annotations, illustrations and explanations

computer-vision deep-learning machine-learning

Last synced: 02 Aug 2024

https://github.com/nettlep/magic

Scanner for decks of cards with bar codes printed on card edges

computer-vision magic swift

Last synced: 01 Aug 2024

https://github.com/FORTH-ModelBasedTracker/MocapNET

We present MocapNET, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance

2d-to-3d 3d-animation 3d-pose-estimation bvh bvh-format computer-vision demo ensemble gesture-recognition mocap neural-network pose-estimation real-time rgb-images tensorflow webcam

Last synced: 01 Aug 2024

https://github.com/andyzeng/3dmatch-toolbox

3DMatch - a 3D ConvNet-based local geometric descriptor for aligning 3D meshes and point clouds.

3d 3d-deep-learning 3dmatch artificial-intelligence computer-vision deep-learning geometry-processing point-cloud rgbd vision

Last synced: 31 Jul 2024

https://github.com/Dovyski/cvui

A (very) simple UI lib built on top of OpenCV drawing primitives

computer-vision cpp gui imgui opencv opencv-drawing-primitives python ui

Last synced: 30 Jul 2024

https://github.com/hustvl/YOLOS

[NeurIPS 2021] You Only Look at One Sequence

computer-vision object-detection transformer vision-transformer

Last synced: 02 Aug 2024

https://github.com/megvii-research/IJCAI2023-CoNR

IJCAI2023 - Collaborative Neural Rendering using Anime Character Sheets

anime computer-graphics computer-vision deep-learning pytorch

Last synced: 01 Aug 2024

https://github.com/rust-cv/cv

Rust CV mono-repo. Contains pure-Rust dependencies which attempt to encapsulate the capability of OpenCV, OpenMVG, and vSLAM frameworks in a cohesive set of APIs.

algorithms computer-vision crates rust-cv

Last synced: 02 Aug 2024

https://github.com/patrikhuber/4dface

Real-time 3D face tracking and reconstruction from 2D video

computer-vision face-models modern-cpp video-processing

Last synced: 01 Aug 2024

https://github.com/vandit15/Class-balanced-loss-pytorch

Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"

computer-vision cvpr2019 deep-learning loss-functions pytorch

Last synced: 03 Aug 2024

https://github.com/vacancy/PreciseRoIPooling

Precise RoI Pooling with coordinate gradient support, proposed in the paper "Acquisition of Localization Confidence for Accurate Object Detection" (https://arxiv.org/abs/1807.11590).

computer-vision object-detection

Last synced: 02 Aug 2024

https://github.com/VladimirYugay/Gaussian-SLAM

Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting

3d-reconstruction computer-vision gaussian-splatting robotics slam

Last synced: 31 Jul 2024

https://github.com/Jumpat/SegmentAnythingin3D

Segment Anything in 3D with NeRFs (NeurIPS 2023)

3d 3d-segmentation computer-vision deep-learning nerf segment-anything segmentation

Last synced: 31 Jul 2024

https://github.com/SimonVandenhende/Multi-Task-Learning-PyTorch

PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).

computer-vision eccv2020 multi-task-learning nyud pascal pytorch scene-understanding segmentation

Last synced: 02 Aug 2024

https://github.com/fregu856/deeplabv3

PyTorch implementation of DeepLabV3, trained on the Cityscapes dataset.

autonomous-driving computer-vision deep-learning machine-learning pytorch semantic-segmentation

Last synced: 07 Aug 2024

https://github.com/xtreme1-io/xtreme1

Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training and supports 3D LiDAR point cloud, image, and LLM.

3d-annotation annotation annotation-tool computer-vision image-annotation image-classification image-labelling-tool labeling-tool multimodal point-cloud rlhf

Last synced: 31 Jul 2024

https://github.com/louis030195/screen-pipe

Library to build personalized AI powered by what you've seen, said, or heard. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.

ai computer-vision llm machine-learning ml multimodal vision

Last synced: 01 Aug 2024

https://github.com/dog-qiuqiu/FastestDet

⚡ A newly designed ultra-lightweight, anchor-free target detection algorithm with only 250K parameters. It reduces time consumption by 10% compared with yolo-fastest, and its post-processing is simpler.

computer-vision deep-learning object-detection

Last synced: 30 Jul 2024

https://github.com/quic/sense

Enhance your application with the ability to see and interact with humans using any RGB camera.

activity-recognition calorie-estimation computer-vision deep-learning fitness-app gesture-recognition neural-networks pytorch video

Last synced: 05 Aug 2024

https://github.com/bgshih/aster

Recognizing cropped text in natural images.

computer-vision ocr recognition scene-text

Last synced: 01 Aug 2024

https://github.com/Guanghan/lighttrack

LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking

artificial-intelligence computer-vision pose-estimation pose-tracking visual-object-tracking

Last synced: 07 Aug 2024

https://github.com/MarekKowalski/FaceSwap

3D face swapping implemented in Python

3d-models computer-vision face-alignment face-swap optimization

Last synced: 03 Aug 2024

https://github.com/PeterWang512/GANSketching

Sketch Your Own GAN: Customizing a GAN model with hand-drawn sketches.

computer-graphics computer-vision deep-learning gans hci

Last synced: 01 Aug 2024

https://github.com/microsoft/CameraTraps

PyTorch Wildlife: a Collaborative Deep Learning Framework for Conservation.

camera-traps computer-vision conservation machine-learning megadetector pytorch pytorch-wildlife wildlife

Last synced: 08 Aug 2024

https://github.com/cvondrick/videogan

Generating Videos with Scene Dynamics. NIPS 2016.

computer-vision deep-learning generative-adversarial-network video

Last synced: 31 Jul 2024

https://github.com/onepanelio/onepanel

The open source, end-to-end computer vision platform. Label, build, train, tune, deploy and automate in a unified platform that runs on any cloud and on-premises.

ai aiops annotation computer-vision deeplearning etl hyperparameter-tuning inference jupyterlab labeling machinelearning mlops pipelines pytorch tensorboard tensorflow training workflows

Last synced: 01 Aug 2024

https://github.com/stevenygd/PointFlow

PointFlow : 3D Point Cloud Generation with Continuous Normalizing Flows

3d-point-clouds computer-vision continuous-normalizing-flows machine-learning pytorch shapes

Last synced: 05 Aug 2024

https://github.com/gjy3035/C-3-Framework

Open-source PyTorch code for crowd counting

computer-vision crowd-analysis crowd-counting deep-learning

Last synced: 03 Aug 2024

https://github.com/mhamilton723/STEGO

Unsupervised Semantic Segmentation by Distilling Feature Correspondences

computer-vision deep-learning iclr2022 pytorch semantic-segmentation unsupervised-learning

Last synced: 03 Aug 2024

https://github.com/gigwegbe/tinyml-papers-and-projects

This is a list of interesting papers and projects about TinyML.

computer-vision embedded-systems machine-learning neural-architecture-search tinyml wake-word

Last synced: 03 Aug 2024

https://github.com/visipedia/inat_comp

iNaturalist competition details

competition computer-vision dataset inaturalist

Last synced: 01 Aug 2024

https://github.com/johnolafenwa/DeepStack

The World's Leading Cross Platform AI Engine for Edge Devices

ai-engine computer-vision deepstack face-detection face-recognition object-detection scene-recognition

Last synced: 01 Aug 2024

https://github.com/ika-rwth-aachen/Cam2BEV

TensorFlow Implementation for Computing a Semantically Segmented Bird's Eye View (BEV) Image Given the Images of Multiple Vehicle-Mounted Cameras.

autonomous-vehicles birds-eye-view computer-vision deep-learning ipm machine-learning segmentation sim2real simulation

Last synced: 31 Jul 2024

Computer vision Awesome Lists
Awesome-pytorch-list 704
awesome-self-supervised-learning 453
awesome-multimodal-ml 480
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-hand-pose-estimation 477
awesome-image-classification 220
awesome-human-pose-estimation 89
Awesome-Crowd-Counting 450
awesome-autonomous-vehicles 296
Awesome-Federated-Learning 558
Awesome-pytorch-list-CNVersion 692
iOS_ML 39
Awesome-Interaction-aware-Trajectory-Prediction 560
Awesome-FL 1,916
awesome-low-light-image-enhancement 195
CV-pretrained-model 103
awesome-tensorflow-lite 110
Awesome-Implicit-NeRF-Robotics 176
awesome-capsule-networks 67
awesome-grounding 154
awesome-attention-mechanism-in-cv 195
awesome-industrial-anomaly-detection 620
Awesome-Image-Colorization 113
awesome-ai-awesomeness 236
awesome-autonomous-vehicle 181
Awesome-Open-Vocabulary 156
awesome-open-data-centric-ai 56
Awesome-Skeleton-based-Action-Recognition 104
awesome-6d-object 600
awesome-scene-understanding 195
awesome-multi-task-learning 225
awesome-holistic-3d 129
awesome-data-annotation 93
awesome-panoptic-segmentation 45
awesome-photogrammetry 68
awesome-llm-and-aigc 1,134
Awesome-3D-Object-Detection 169
awesome-computer-vision-models 189
Awesome-Parameter-Efficient-Transfer-Learning 121
awesome-state-of-depth-completion 59
Awesome-Distributed-Deep-Learning 44
awesome-image-alignment-and-stitching 108
Awesome-Multi-Task-Learning 80
Computer-Vision-Video-Lectures 67
awesome-segment-anything-extensions 120
Awesome-Monocular-3D-detection 93
awesome-robotics-datasets 79
awesome-nerf-editing 370
awesome_computer_science 115