Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2025-01-10 00:05:25 UTC
- JSON Representation
https://github.com/ashishpatel26/365-Days-Computer-Vision-Learning-Linkedin-Post
365 Days Computer Vision Learning Linkedin Post
computer-vision cvpr cvpr2018 cvpr2019 cvpr2020 deep-learning eccv eccv-2018 eccv2019 eccv2020 iclr iclr2018 iclr2019 iclr2020 iclr2021 jmlr linkedin
Last synced: 27 Nov 2024
https://github.com/ahmedfgad/numpycnn
Building Convolutional Neural Networks From Scratch using NumPy
cnn computer-vision conv-layer convnet convolution convolutional-neural-networks data-science filter numpy pygad python relu relu-layer
Last synced: 06 Jan 2025
https://github.com/mit-han-lab/spvnas
[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
3d-deep-learning architecture-search computer-vision deep-learning efficiency lidar point-cloud pytorch semantickitti
Last synced: 28 Oct 2024
https://github.com/cvg/gluestick
Joint Deep Matcher for Points and Lines 🖼️💥🖼️ (ICCV 2023)
computer-vision deep-learning graph-neural-networks image-matching local-features machine-learning torch
Last synced: 06 Jan 2025
https://github.com/BYU-PCCL/holodeck
High Fidelity Simulator for Reinforcement Learning and Robotics Research.
ai computer-vision drones reinforcement-learning reinforcement-learning-environments research robotics simulator unreal-engine
Last synced: 02 Nov 2024
https://github.com/Confusezius/Deep-Metric-Learning-Baselines
PyTorch Implementation for Deep Metric Learning Pipelines
cars196 computer-vision cub200 deep-learning deep-metric-learning distance-sampling metric-learning neural-networks pku-vehicle pytorch shop-clothes
Last synced: 13 Nov 2024
https://github.com/nvidia/dataset_synthesizer
NVIDIA Deep learning Dataset Synthesizer (NDDS)
computer-vision deep-learning domain-randomization object-detection pose-estimation synthetic-dataset-generation
Last synced: 05 Jan 2025
https://github.com/Bartzi/see
Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"
chainer cnn computer-vision deep-learning scene-text-recognition semi-supervised-learning
Last synced: 03 Nov 2024
https://github.com/hihozhou/php-opencv
PHP extensions for OpenCV
computer-vision face-detection face-recognition opencv php-extension php-facedetect php-opencv php7 phpopencv
Last synced: 05 Jan 2025
https://github.com/abhshkdz/papers
:paperclip: Summaries of papers on deep learning
artificial-intelligence computer-vision deep-learning deep-neural-networks machine-learning
Last synced: 31 Dec 2024
https://github.com/torrvision/siamfc-tf
SiamFC tracking in TensorFlow.
computer-vision deep-learning object-tracking siamese-network tensorflow tracking
Last synced: 06 Jan 2025
https://github.com/sergeytulyakov/mocogan
MoCoGAN: Decomposing Motion and Content for Video Generation
computer-vision machine-learning video-generator
Last synced: 29 Nov 2024
https://github.com/underneathall/pinferencia
Python + Inference - Model Deployment library in Python. Simplest model inference server ever.
ai artificial-intelligence computer-vision data-science deep-learning huggingface inference inference-server machine-learning model-deployment model-serving modelserver nlp paddlepaddle predict python pytorch serving tensorflow transformers
Last synced: 05 Jan 2025
https://github.com/mv-lab/swin2sr
[ECCV] Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop ECCV 2022. Try it out! over 3.3M runs https://replicate.com/mv-lab/swin2sr
compression compression-artifact-reduction computer-vision deblocking deep-learning denoising eccv2022 image-denoising image-processing image-restoration image-sr image-super-resolution jpeg low-level-vision ntire super-resolution swin2sr swinir transformer vision-transformer
Last synced: 06 Nov 2024
https://github.com/yuxumin/PoinTr
[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers
3dvision computer-vision deep-learning iccv2021 pointcloud-completion vision-transformers
Last synced: 28 Oct 2024
https://github.com/imagej/ImageJ
Public domain software for processing and analyzing scientific images
computer-vision image-processing
Last synced: 14 Nov 2024
https://github.com/robotlocomotion/pytorch-dense-correspondence
Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"
3d artificial-intelligence computer-vision deep-learning manipulation pytorch robotics self-supervised-learning vision
Last synced: 05 Jan 2025
https://github.com/NVIDIA/Dataset_Synthesizer
NVIDIA Deep learning Dataset Synthesizer (NDDS)
computer-vision deep-learning domain-randomization object-detection pose-estimation synthetic-dataset-generation
Last synced: 26 Oct 2024
https://github.com/rnchg/apt
AI Productivity Tool - Free and open-source, enhancing user productivity while ensuring privacy and data security. It provides efficient and convenient AI solutions, including but not limited to: built-in exclusive ChatGPT, one-click batch intelligent processing of images and videos, and more.
ai ai-framework aigc computer-vision deep-learning image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing
Last synced: 03 Dec 2024
https://github.com/yassouali/ml-paper-notes
:notebook: Notes and summaries of various ML, Computer Vision & NLP papers.
computer-vision deep-learning machine-learning natural-language-processing nlp summary
Last synced: 24 Nov 2024
https://github.com/insight-platform/Savant
Python Computer Vision & Video Analytics Framework With Batteries Included
computer-vision cuda deep-learning deepstream edge-computing inference-engine instance-segmentation machine-learning nvidia nvidia-deepstream-sdk object-detection opencv peoplenet tensorrt video yolo yolov5-face yolov8 yolov8-face
Last synced: 09 Nov 2024
https://github.com/xverse-engine/XV3DGS-UEPlugin
A Unreal Engine 5 (UE5) based plugin aiming to provide real-time visulization, management, editing, and scalable hybrid rendering of Guassian Splatting model.
computer-graphics computer-vision radiance-field unreal-engine-5
Last synced: 19 Nov 2024
https://github.com/yassouali/ML-paper-notes
:notebook: Notes and summaries of various ML, Computer Vision & NLP papers.
computer-vision deep-learning machine-learning natural-language-processing nlp summary
Last synced: 04 Nov 2024
https://github.com/RobotLocomotion/pytorch-dense-correspondence
Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"
3d artificial-intelligence computer-vision deep-learning manipulation pytorch robotics self-supervised-learning vision
Last synced: 14 Nov 2024
https://github.com/andreibarsan/dynslam
Master's Thesis on Simultaneous Localization and Mapping in dynamic environments. Separately reconstructs both the static environment and the dynamic objects from it, such as cars.
autonomous-vehicles computer-vision deep-learning dense eth-zurich master-thesis slam
Last synced: 04 Jan 2025
https://github.com/fatescript/centernet-better
An easy to understand and better performance version of CenterNet
computer-vision deep-learning object-detection
Last synced: 05 Jan 2025
https://github.com/iitzco/faced
🚀 😏 Near Real Time CPU Face detection using deep learning
computer-vision convolutional-neural-networks deep-learning face-detection fully-convolutional-networks python python-library tensorflow
Last synced: 05 Jan 2025
https://github.com/FateScript/CenterNet-better
An easy to understand and better performance version of CenterNet
computer-vision deep-learning object-detection
Last synced: 27 Nov 2024
https://github.com/hkchengrex/stcn
[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
computer-vision deep-learning neurips-2021 pytorch segmentation video-object-segmentation video-segmentation
Last synced: 04 Jan 2025
https://github.com/ternaus/TernausNetV2
TernausNetV2: Fully Convolutional Network for Instance Segmentation
computer-vision deep-learning image-segmentation python pytorch satellite-imagery
Last synced: 18 Nov 2024
https://github.com/raoyongming/DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
computer-vision deep-learning image-classification vision-transformers
Last synced: 15 Nov 2024
https://github.com/bertjiazheng/structured3d
[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
3d-reconstruction annotations computer-graphics computer-vision dataset deep-learning eccv house-designs room-layout scene-understanding structure-annotations
Last synced: 05 Jan 2025
https://github.com/MadryLab/photoguard
Raising the Cost of Malicious AI-Powered Image Editing
adversarial-attacks adversarial-examples computer-vision deep-learning deepfakes robustness stable-diffusion
Last synced: 04 Nov 2024
https://github.com/ternaus/ternausnetv2
TernausNetV2: Fully Convolutional Network for Instance Segmentation
computer-vision deep-learning image-segmentation python pytorch satellite-imagery
Last synced: 07 Jan 2025
https://github.com/omriav/blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
computer-vision deep-learning diffusion diffusion-models generative-model image-generation multimodal multimodal-deep-learning pytorch text-driven-editing text-guided-manipulation text-to-image text-to-image-synthesis
Last synced: 31 Oct 2024
https://github.com/semperai/amica
Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.
ai assistant-chat-bots computer-vision llm speech-recognition tts
Last synced: 28 Oct 2024
https://github.com/tobybreckon/fire-detection-cnn
real-time fire detection in video imagery using a convolutional neural network (deep learning) - from our ICIP 2018 paper (Dunnings / Breckon) + ICMLA 2019 paper (Samarth / Bhowmik / Breckon)
cnn-architecture cnn-for-visual-recognition computer-vision convolutional-networks convolutional-neural-networks deep-learning deep-learning-algorithms deep-neural-networks fire-detection fire-detector firenet inceptionv1-onfire machine-learning network-architecture object-detection robot-vision superpixel-segment superpixels tensorflow tensorflow-examples
Last synced: 04 Jan 2025
https://github.com/10up/classifai
Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence.
ai artificial-intelligence azure classification computer-vision content-tagging hacktoberfest ibm-watson language-processing machine-learning microsoft-azure ml wordpress wordpress-plugin
Last synced: 16 Nov 2024
https://github.com/farukalamai/advanced-machine-learning-engineer-roadmap-2024
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing, model development, deployment, and maintenance.
aws computer-vision data-analysis data-science data-visualization deep-learning git-github machine-learning machine-learning-roadmap mlops natural-language-processing neural-network nlp opencv pandas python pytorch statistics tensorflow yolo
Last synced: 05 Jan 2025
https://github.com/l3p-cv/lost
Label Objects and Save Time (LOST) - Design your own smart Image Annotation process in a web-based environment.
annotation-framework annotation-process annotation-tool bounding-boxes computer-vision image-annotation machine-learning machine-vision polygon-annotations
Last synced: 17 Nov 2024
https://github.com/kcg2015/Vehicle-Detection-and-Tracking
Computer vision based vehicle detection and tracking using Tensorflow Object Detection API and Kalman-filtering
bayesian-filter bounding-boxes computer-vision detection hungarian-algorithm kalman-filtering keras linear-assignment-problem mobilenet-ssd object-detection occlusion single-shot-multibox-detector tensorflow-object-detection-api tracking
Last synced: 31 Oct 2024
https://github.com/zubair-irshad/awesome-robotics-3d
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
3d benchmarks computer-vision diffusion-models foundation-models gaussian-splatting grasping llm manipulation navigation nerf pointclouds policy-learning pretraining robotics scene-graph simulations vision-language-model vlm
Last synced: 16 Nov 2024
https://github.com/ashishpatel26/365-days-computer-vision-learning-linkedin-post
365 Days Computer Vision Learning Linkedin Post
computer-vision cvpr cvpr2018 cvpr2019 cvpr2020 deep-learning eccv eccv-2018 eccv2019 eccv2020 iclr iclr2018 iclr2019 iclr2020 iclr2021 jmlr linkedin
Last synced: 19 Nov 2024
https://github.com/ankitaggarwal011/PyCNN
Image Processing with Cellular Neural Networks in Python
cellular cnn cnn-processors computer-science computer-vision control corner-detection cross-platform edge-detection feedback image-processing library neural-network paper python
Last synced: 27 Nov 2024
https://github.com/ModelDepot/tfjs-yolo-tiny
In-Browser Object Detection using Tiny YOLO on Tensorflow.js
browser computer-vision deep-learning detection machine-learning npm-package object-detection tensorflow tensorflow-js tfjs yolo yolo-models
Last synced: 09 Nov 2024
https://github.com/modeldepot/tfjs-yolo-tiny
In-Browser Object Detection using Tiny YOLO on Tensorflow.js
browser computer-vision deep-learning detection machine-learning npm-package object-detection tensorflow tensorflow-js tfjs yolo yolo-models
Last synced: 04 Jan 2025
https://github.com/philferriere/tfoptflow
Optical Flow Prediction with TensorFlow. Implements "PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume," by Deqing Sun et al. (CVPR 2018)
computer-vision cvpr2018 deep-learning flying-chairs kitti-dataset motion-estimation mpi-sintel optical-flow pwc-net tensorflow
Last synced: 05 Jan 2025
https://github.com/pterhoer/FaceImageQuality
Code and information for face image quality assessment with SER-FIQ
biometrics computer-vision face face-recognition machine-learning quality quality-estimation quality-measures quality-metrics robustness
Last synced: 06 Nov 2024
https://github.com/nianticlabs/mickey
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
computer-vision cvpr2024 dsac metric-correspondences mickey pose-estimation ransac ransac-algorithm
Last synced: 05 Jan 2025
https://github.com/microsoft/CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
classification computer-vision cvt deep-learning imagenet
Last synced: 14 Nov 2024
https://github.com/lingdong-/skeleton-tracing
A new algorithm for retrieving topological skeleton as a set of polylines from binary images
algorithm computational-geometry computer-vision polylines skeletonization
Last synced: 05 Jan 2025
https://rese1f.github.io/MovieChat/
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models
Last synced: 13 Nov 2024
https://github.com/hassony2/useful-computer-vision-phd-resources
Lists of resources useful for my PhD in computer vision
computer-vision list phd resources
Last synced: 13 Nov 2024
https://github.com/rese1f/MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models
Last synced: 09 Nov 2024
https://github.com/rese1f/moviechat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models
Last synced: 10 Jan 2025
https://github.com/microsoft/styleswin
[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation
computer-vision deep-learning deep-neural-networks gans generative-adversarial-network image-generation image-synthesis pytorch styleswin transformer
Last synced: 10 Jan 2025
https://github.com/swz30/cycleisp
[CVPR 2020--Oral] CycleISP: Real Image Restoration via Improved Data Synthesis
camera-imaging-pipeline computer-vision cvpr2020 cycleisp data-synthesis image-denoising image-restoration low-level-vision pytorch raw2rgb rgb2raw
Last synced: 04 Jan 2025
https://github.com/aimagelab/dress-code
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
artificial-intelligence computer-vision deep-learning dress-code eccv2022 virtual-try-on
Last synced: 03 Jan 2025
https://github.com/jsn5/dancenet
DanceNet -💃💃Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)
autoencoder computer-vision generative-model keras lstm mixture-density-networks
Last synced: 27 Nov 2024
https://github.com/Justin-Tan/generative-compression
TensorFlow Implementation of Generative Adversarial Networks for Extreme Learned Image Compression
computer-vision gan generative-adversarial-network image-compression tensorflow
Last synced: 27 Nov 2024
https://github.com/xiaoyufenfei/LEDNet
LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation
cityscape-dataset computer-vision lednet pytorch real-time semantic-segmentation
Last synced: 14 Nov 2024
https://github.com/DagnyT/hardnet
Hardnet descriptor model - "Working hard to know your neighbor's margins: Local descriptor learning loss"
computer-vision convolutional-neural-networks deep-learning image-matching image-retrieval metric-learning nips-2017 pytorch
Last synced: 04 Nov 2024
https://github.com/LingDong-/skeleton-tracing
A new algorithm for retrieving topological skeleton as a set of polylines from binary images
algorithm computational-geometry computer-vision polylines skeletonization
Last synced: 06 Nov 2024
https://github.com/NVlabs/pacnet
Pixel-Adaptive Convolutional Neural Networks (CVPR '19)
computer-vision deep-learning machine-learning
Last synced: 27 Nov 2024
https://github.com/rgeirhos/stylized-imagenet
Code to create Stylized-ImageNet, a stylized version of standard ImageNet (ICLR 2019 Oral)
computer-vision deep-learning human-vision shape-bias style-transfer texture-bias
Last synced: 04 Jan 2025
https://github.com/baidut/OpenCE
Contrast Enhancement Techniques for low-light images
computer-vision contrast contrast-enhancement image-enhancment image-manipulation image-processing image-restoration low-light
Last synced: 06 Nov 2024
https://github.com/nerox8664/awesome-computer-vision-models
A list of popular deep learning models related to classification, segmentation and detection problems
awesome awesome-list awesome-lists computer-vision computer-vision-algorithms deep-learning densenet diracnet efficientnet image-classification machine-learning machine-learning-algorithms machine-learning-models mixnet nasnet object-detection proxylessnas resnet semantic-segmentation sknet
Last synced: 21 Dec 2024
https://github.com/hfslyc/AdvSemiSeg
Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018
adversarial-learning computer-vision deep-learning pytorch semantic-segmentation semi-supervised-learning
Last synced: 28 Oct 2024
https://github.com/wellflat/imageprocessing-labs
computer vision, image processing and machine learning on the web browser or node.
computer-vision image-processing javascript machine-learning
Last synced: 02 Dec 2024
https://github.com/limuloo/MIGC
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
aigc computer-vision cvpr cvpr2024 stable-diffusion text-to-image
Last synced: 31 Oct 2024
https://github.com/AkshitIreddy/Interactive-LLM-Powered-NPCs
Interactive LLM Powered NPCs, is an open-source project that completely transforms your interaction with non-player characters (NPCs) in any game! 🎮🤖🚀
ai artificial-intelligence autonomous-agents computer-vision langchain language-model llm-agent python video-game
Last synced: 14 Nov 2024
https://github.com/swz30/CycleISP
[CVPR 2020--Oral] CycleISP: Real Image Restoration via Improved Data Synthesis
camera-imaging-pipeline computer-vision cvpr2020 cycleisp data-synthesis image-denoising image-restoration low-level-vision pytorch raw2rgb rgb2raw
Last synced: 03 Nov 2024
https://github.com/khanhnamle1994/computer-vision
Programming Assignments and Lectures for Stanford's CS 231: Convolutional Neural Networks for Visual Recognition
computer-vision convolutional-neural-networks deep-learning image-classification imagenet visual-recognition
Last synced: 05 Jan 2025
https://github.com/DIYer22/boxx
Tool-box for efficient build and debug in Python. Especially for Scientific Computing and Computer Vision.
awesome-python computer-vision debug debugging deep-learning hack pytorch pytorch-debug scientific-computing toolbox
Last synced: 27 Nov 2024
https://github.com/Megvii-BaseDetection/DeFCN
End-to-End Object Detection with Fully Convolutional Network
computer-vision object-detection
Last synced: 26 Oct 2024
https://github.com/pliang279/MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
computer-vision deep-learning healthcare machine-learning multimodal-learning natural-language-processing representation-learning robotics speech-processing
Last synced: 15 Nov 2024
https://github.com/abhshkdz/neural-vqa
:grey_question: Visual Question Answering in Torch
computer-vision deep-learning natural-language-processing torch
Last synced: 06 Jan 2025
https://github.com/deepVector/geospatial-machine-learning
A curated list of resources focused on Machine Learning in Geospatial Data Science.
classification computer-vision convolutional-neural-networks deep-learning geoscience geospatial geospatial-machine-learning gis image-segmentation keras landsat machine-learning remote-sensing satellite-imagery satellite-images semantic-segmentation tensorflow
Last synced: 04 Nov 2024
https://github.com/lxe/llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
ai artificial-intelligence computer-vision llama llamacpp llm local-llm machine-learning multimodal webapp
Last synced: 08 Jan 2025
https://github.com/skalskip/sports
Cool experiments at the intersection of Computer Vision and Sports ⚽🏃
computer-vision deep-learning deep-neural-networks gpt-4 gpt-4-vision object-detection prompt-engineering pytorch sports-analytics tutorial yolov5 yolov7
Last synced: 05 Jan 2025
https://github.com/wvangansbeke/sparse-depth-completion
Predict dense depth maps from sparse and noisy LiDAR frames guided by RGB images. (Ranked 1st place on KITTI) [MVA 2019]
computer-vision deep-learning depth-completion depth-prediction kitti lidar noisy-data pytorch sensor-fusion
Last synced: 05 Jan 2025
https://github.com/amirassov/kaggle-imaterialist
The First Place Solution of Kaggle iMaterialist (Fashion) 2019 at FGVC6
computer-vision deep-learning hybrid-task-cascade imaterialist instance-segmentation kaggle kaggle-imaterialist mmdetection object-detection pytorch
Last synced: 31 Oct 2024
https://github.com/liuziwei7/fashion-detection
Fashion Detection in the Wild (Deep Clothes Detector)
clothes-detection clothes-detector computer-vision deep-learning vision-for-fashion
Last synced: 18 Nov 2024
https://github.com/zju3dv/manhattan_sdf
Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral
3d-reconstruction 3d-vision computer-vision cvpr2022
Last synced: 05 Jan 2025
https://zju3dv.github.io/manhattan_sdf/
Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral
3d-reconstruction 3d-vision computer-vision cvpr2022
Last synced: 17 Nov 2024
https://github.com/SkalskiP/sports
Cool experiments at the intersection of Computer Vision and Sports ⚽🏃
computer-vision deep-learning deep-neural-networks gpt-4 gpt-4-vision object-detection prompt-engineering pytorch sports-analytics tutorial yolov5 yolov7
Last synced: 06 Nov 2024
https://github.com/isarandi/metrabs
Estimate absolute 3D human poses from RGB images.
3d-human-pose computer-vision deep-learning human-pose-estimation machine-learning motion-capture tensorflow tensorflow2
Last synced: 05 Jan 2025
https://github.com/openvinotoolkit/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
coco computer-vision dataset datasets deep-learning format-converter imagenet neural-networks openvino-toolkit pascal-voc yolo
Last synced: 13 Nov 2024
https://github.com/explosion/prodigy-recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 04 Jan 2025
https://github.com/microsoft/xpretrain
Multi-modality pre-training
computer-vision multimedia multimodal-learning nlp pre-training
Last synced: 04 Jan 2025
https://github.com/mtli/photosketch
Code for Photo-Sketching: Inferring Contour Drawings from Images :dog:
ai computer-vision graphics sketch
Last synced: 07 Jan 2025
https://github.com/kaiyangzhou/pytorch-vsumm-reinforce
Unsupervised video summarization with deep reinforcement learning (AAAI'18)
computer-vision deep-learning machine-learning policy-network reinforcement-learning unsupervised-learning video-summarization
Last synced: 04 Jan 2025
https://github.com/xavierpuigf/virtualhome
API to run VirtualHome, a Multi-Agent Household Simulator
computer-vision deep-learning graph multi-agent reinforcement-learning simulator unity
Last synced: 03 Nov 2024
https://github.com/Alpha-VL/ConvMAE
ConvMAE: Masked Convolution Meets Masked Autoencoders
backbone computer-vision mae masked-image-modeling object-detection semantic-segmentation
Last synced: 15 Nov 2024
https://github.com/dmitryryumin/aaai-2024-papers
AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. ⭐ experience the forefront of progress in artificial intelligence with this repository!
aaai aaai2024 application-domains artificial-intelligence cognitive-systems computer-vision computer-vison deep-learning expert-systems human-computer-interaction knowledge-representation machine-learning multi-agent-systems neural-networks reinforcement-learning sentiment-analysis
Last synced: 05 Jan 2025
https://github.com/tri-ml/dd3d
Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.
computer-vision deep-learning pytorch
Last synced: 05 Jan 2025
https://github.com/amazon-research/siam-mot
SiamMOT: Siamese Multi-Object Tracking
computer-vision multi-object-tracking video-analysis
Last synced: 20 Dec 2024
https://github.com/thp/psmoveapi
Cross-platform library for 6DoF tracking of the PS Move Motion Controller. Sensor fusion, computer vision, ambient display (LED orb).
6dof computer-vision controller hid sensors tracking
Last synced: 03 Jan 2025