Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Computer vision
Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.
- GitHub: https://github.com/topics/computer-vision
- Wikipedia: https://en.wikipedia.org/wiki/Computer_vision
- Related Topics: vision, deep-learning, machine-learning, opencv, gan,
- Aliases: machine-vision, computervision,
- Last updated: 2024-11-15 00:05:48 UTC
- JSON Representation
https://github.com/ankitaggarwal011/PyCNN
Image Processing with Cellular Neural Networks in Python
cellular cnn cnn-processors computer-science computer-vision control corner-detection cross-platform edge-detection feedback image-processing library neural-network paper python
Last synced: 07 Aug 2024
https://github.com/modeldepot/tfjs-yolo-tiny
In-Browser Object Detection using Tiny YOLO on Tensorflow.js
browser computer-vision deep-learning detection machine-learning npm-package object-detection tensorflow tensorflow-js tfjs yolo yolo-models
Last synced: 13 Nov 2024
https://github.com/ModelDepot/tfjs-yolo-tiny
In-Browser Object Detection using Tiny YOLO on Tensorflow.js
browser computer-vision deep-learning detection machine-learning npm-package object-detection tensorflow tensorflow-js tfjs yolo yolo-models
Last synced: 09 Nov 2024
https://github.com/philferriere/tfoptflow
Optical Flow Prediction with TensorFlow. Implements "PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume," by Deqing Sun et al. (CVPR 2018)
computer-vision cvpr2018 deep-learning flying-chairs kitti-dataset motion-estimation mpi-sintel optical-flow pwc-net tensorflow
Last synced: 13 Nov 2024
https://github.com/pterhoer/FaceImageQuality
Code and information for face image quality assessment with SER-FIQ
biometrics computer-vision face face-recognition machine-learning quality quality-estimation quality-measures quality-metrics robustness
Last synced: 06 Nov 2024
https://github.com/microsoft/CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
classification computer-vision cvt deep-learning imagenet
Last synced: 14 Nov 2024
https://github.com/hkchengrex/stcn
[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
computer-vision deep-learning neurips-2021 pytorch segmentation video-object-segmentation video-segmentation
Last synced: 10 Nov 2024
https://rese1f.github.io/MovieChat/
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models
Last synced: 13 Nov 2024
https://github.com/rese1f/MovieChat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models
Last synced: 09 Nov 2024
https://github.com/hassony2/useful-computer-vision-phd-resources
Lists of resources useful for my PhD in computer vision
computer-vision list phd resources
Last synced: 13 Nov 2024
https://github.com/jsn5/dancenet
DanceNet -💃💃Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)
autoencoder computer-vision generative-model keras lstm mixture-density-networks
Last synced: 07 Aug 2024
https://github.com/xiaoyufenfei/LEDNet
LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation
cityscape-dataset computer-vision lednet pytorch real-time semantic-segmentation
Last synced: 14 Nov 2024
https://github.com/DagnyT/hardnet
Hardnet descriptor model - "Working hard to know your neighbor's margins: Local descriptor learning loss"
computer-vision convolutional-neural-networks deep-learning image-matching image-retrieval metric-learning nips-2017 pytorch
Last synced: 04 Nov 2024
https://github.com/Justin-Tan/generative-compression
TensorFlow Implementation of Generative Adversarial Networks for Extreme Learned Image Compression
computer-vision gan generative-adversarial-network image-compression tensorflow
Last synced: 07 Aug 2024
https://github.com/LingDong-/skeleton-tracing
A new algorithm for retrieving topological skeleton as a set of polylines from binary images
algorithm computational-geometry computer-vision polylines skeletonization
Last synced: 06 Nov 2024
https://github.com/aimagelab/dress-code
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
artificial-intelligence computer-vision deep-learning dress-code eccv2022 virtual-try-on
Last synced: 14 Nov 2024
https://github.com/NVlabs/pacnet
Pixel-Adaptive Convolutional Neural Networks (CVPR '19)
computer-vision deep-learning machine-learning
Last synced: 07 Aug 2024
https://github.com/rese1f/moviechat
[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models
Last synced: 13 Nov 2024
https://github.com/baidut/OpenCE
Contrast Enhancement Techniques for low-light images
computer-vision contrast contrast-enhancement image-enhancment image-manipulation image-processing image-restoration low-light
Last synced: 06 Nov 2024
https://github.com/rgeirhos/stylized-imagenet
Code to create Stylized-ImageNet, a stylized version of standard ImageNet (ICLR 2019 Oral)
computer-vision deep-learning human-vision shape-bias style-transfer texture-bias
Last synced: 22 Oct 2024
https://github.com/hfslyc/AdvSemiSeg
Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018
adversarial-learning computer-vision deep-learning pytorch semantic-segmentation semi-supervised-learning
Last synced: 28 Oct 2024
https://github.com/limuloo/MIGC
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
aigc computer-vision cvpr cvpr2024 stable-diffusion text-to-image
Last synced: 31 Oct 2024
https://github.com/AkshitIreddy/Interactive-LLM-Powered-NPCs
Interactive LLM Powered NPCs, is an open-source project that completely transforms your interaction with non-player characters (NPCs) in any game! 🎮🤖🚀
ai artificial-intelligence autonomous-agents computer-vision langchain language-model llm-agent python video-game
Last synced: 14 Nov 2024
https://github.com/wellflat/imageprocessing-labs
computer vision, image processing and machine learning on the web browser or node.
computer-vision image-processing javascript machine-learning
Last synced: 12 Aug 2024
https://github.com/swz30/CycleISP
[CVPR 2020--Oral] CycleISP: Real Image Restoration via Improved Data Synthesis
camera-imaging-pipeline computer-vision cvpr2020 cycleisp data-synthesis image-denoising image-restoration low-level-vision pytorch raw2rgb rgb2raw
Last synced: 03 Nov 2024
https://github.com/ashishpatel26/365-Days-Computer-Vision-Learning-Linkedin-Post
365 Days Computer Vision Learning Linkedin Post
computer-vision cvpr cvpr2018 cvpr2019 cvpr2020 deep-learning eccv eccv-2018 eccv2019 eccv2020 iclr iclr2018 iclr2019 iclr2020 iclr2021 jmlr linkedin
Last synced: 07 Aug 2024
https://github.com/RQLuo/MixTeX-Latex-OCR
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
computer-vision deep-learning latex machine-learning ocr onnx python
Last synced: 03 Sep 2024
https://github.com/Megvii-BaseDetection/DeFCN
End-to-End Object Detection with Fully Convolutional Network
computer-vision object-detection
Last synced: 26 Oct 2024
https://github.com/pliang279/MultiBench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
computer-vision deep-learning healthcare machine-learning multimodal-learning natural-language-processing representation-learning robotics speech-processing
Last synced: 15 Nov 2024
https://github.com/deepVector/geospatial-machine-learning
A curated list of resources focused on Machine Learning in Geospatial Data Science.
classification computer-vision convolutional-neural-networks deep-learning geoscience geospatial geospatial-machine-learning gis image-segmentation keras landsat machine-learning remote-sensing satellite-imagery satellite-images semantic-segmentation tensorflow
Last synced: 04 Nov 2024
https://github.com/abhshkdz/neural-vqa
:grey_question: Visual Question Answering in Torch
computer-vision deep-learning natural-language-processing torch
Last synced: 15 Nov 2024
https://github.com/amirassov/kaggle-imaterialist
The First Place Solution of Kaggle iMaterialist (Fashion) 2019 at FGVC6
computer-vision deep-learning hybrid-task-cascade imaterialist instance-segmentation kaggle kaggle-imaterialist mmdetection object-detection pytorch
Last synced: 31 Oct 2024
https://zju3dv.github.io/manhattan_sdf/
Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral
3d-reconstruction 3d-vision computer-vision cvpr2022
Last synced: 03 Aug 2024
https://github.com/skalskip/sports
Cool experiments at the intersection of Computer Vision and Sports ⚽🏃
computer-vision deep-learning deep-neural-networks gpt-4 gpt-4-vision object-detection prompt-engineering pytorch sports-analytics tutorial yolov5 yolov7
Last synced: 13 Nov 2024
https://github.com/SkalskiP/sports
Cool experiments at the intersection of Computer Vision and Sports ⚽🏃
computer-vision deep-learning deep-neural-networks gpt-4 gpt-4-vision object-detection prompt-engineering pytorch sports-analytics tutorial yolov5 yolov7
Last synced: 06 Nov 2024
https://github.com/lxe/llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
ai artificial-intelligence computer-vision llama llamacpp llm local-llm machine-learning multimodal webapp
Last synced: 10 Oct 2024
https://github.com/wvangansbeke/sparse-depth-completion
Predict dense depth maps from sparse and noisy LiDAR frames guided by RGB images. (Ranked 1st place on KITTI) [MVA 2019]
computer-vision deep-learning depth-completion depth-prediction kitti lidar noisy-data pytorch sensor-fusion
Last synced: 14 Nov 2024
https://github.com/openvinotoolkit/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
coco computer-vision dataset datasets deep-learning format-converter imagenet neural-networks openvino-toolkit pascal-voc yolo
Last synced: 13 Nov 2024
https://github.com/liuziwei7/fashion-detection
Fashion Detection in the Wild (Deep Clothes Detector)
clothes-detection clothes-detector computer-vision deep-learning vision-for-fashion
Last synced: 03 Aug 2024
https://github.com/explosion/prodigy-recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 07 Oct 2024
https://github.com/khanhnamle1994/computer-vision
Programming Assignments and Lectures for Stanford's CS 231: Convolutional Neural Networks for Visual Recognition
computer-vision convolutional-neural-networks deep-learning image-classification imagenet visual-recognition
Last synced: 10 Nov 2024
https://github.com/xavierpuigf/virtualhome
API to run VirtualHome, a Multi-Agent Household Simulator
computer-vision deep-learning graph multi-agent reinforcement-learning simulator unity
Last synced: 03 Nov 2024
https://github.com/Alpha-VL/ConvMAE
ConvMAE: Masked Convolution Meets Masked Autoencoders
backbone computer-vision mae masked-image-modeling object-detection semantic-segmentation
Last synced: 15 Nov 2024
https://github.com/amazon-science/siam-mot
SiamMOT: Siamese Multi-Object Tracking
computer-vision multi-object-tracking video-analysis
Last synced: 12 Nov 2024
https://github.com/kohjingyu/fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
computer-vision large-language-models machine-learning natural-language-processing
Last synced: 05 Nov 2024
https://github.com/microsoft/xpretrain
Multi-modality pre-training
computer-vision multimedia multimodal-learning nlp pre-training
Last synced: 05 Nov 2024
https://github.com/hzwer/awesome-optical-flow
This is a list of awesome paper about optical flow and related work.
computer-vision deep-learning optical-flow
Last synced: 15 Nov 2024
https://github.com/luyanger1799/Amazing-Semantic-Segmentation
Amazing Semantic Segmentation on Tensorflow && Keras (include FCN, UNet, SegNet, PSPNet, PAN, RefineNet, DeepLabV3, DeepLabV3+, DenseASPP, BiSegNet)
computer-vision deep-learning keras-tensorflow semantic-segmentation tensorflow
Last synced: 02 Nov 2024
https://github.com/tri-ml/dd3d
Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.
computer-vision deep-learning pytorch
Last synced: 13 Nov 2024
https://github.com/postech-cvlab/scnerf
[ICCV21] Self-Calibrating Neural Radiance Fields
calibration computer-vision deep-learning implicit-representions nerf pytorch self-calibration
Last synced: 14 Nov 2024
https://github.com/thp/psmoveapi
Cross-platform library for 6DoF tracking of the PS Move Motion Controller. Sensor fusion, computer vision, ambient display (LED orb).
6dof computer-vision controller hid sensors tracking
Last synced: 13 Nov 2024
https://github.com/POSTECH-CVLab/SCNeRF
[ICCV21] Self-Calibrating Neural Radiance Fields
calibration computer-vision deep-learning implicit-representions nerf pytorch self-calibration
Last synced: 08 Aug 2024
https://github.com/jo-m/trainbot
Watches a piece of train track, detects trains, and stitches together images of them.
bot computer-vision go golang stitching trains
Last synced: 03 Aug 2024
https://github.com/pliang279/multibench
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
computer-vision deep-learning healthcare machine-learning multimodal-learning natural-language-processing representation-learning robotics speech-processing
Last synced: 13 Nov 2024
https://github.com/ahangchen/torch_base
Quickly bring up your PyTorch project(a skeleton)
computer-vision deep-learning machine-learning pytorch
Last synced: 11 Nov 2024
https://github.com/developmentseed/label-maker
Data Preparation for Satellite Machine Learning
computer-vision data-preparation deep-learning keras remote-sensing satellite-imagery
Last synced: 13 Nov 2024
https://github.com/cvondrick/soundnet
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
computer-vision deep-learning sound
Last synced: 27 Oct 2024
https://github.com/TRI-ML/dd3d
Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.
computer-vision deep-learning pytorch
Last synced: 28 Oct 2024
https://github.com/microsoft/XPretrain
Multi-modality pre-training
computer-vision multimedia multimodal-learning nlp pre-training
Last synced: 04 Nov 2024
https://github.com/Ha0Tang/SelectionGAN
[CVPR 2019 Oral] Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation
adversarial-learning computer-graphics computer-vision cross-view cvpr-2019 cvpr19 cvpr2019 cvusa-dataset dayton deep-learning gans generative-adversarial-network image-generation image-manipulation image-to-image-translation image-translation pytorch semantic-maps
Last synced: 06 Aug 2024
https://github.com/farukalamai/advanced-machine-learning-engineer-roadmap-2024
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing, model development, deployment, and maintenance.
aws computer-vision data-analysis data-science data-visualization deep-learning git-github machine-learning machine-learning-roadmap mlops natural-language-processing neural-network nlp opencv pandas python pytorch statistics tensorflow yolo
Last synced: 14 Nov 2024
https://github.com/kscottz/PythonFromSpace
Python Examples for Remote Sensing
computer-vision gis image-processing imaging jupyter-notebook python satellite vision
Last synced: 15 Nov 2024
https://github.com/synthesiaresearch/humanrf
Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"
3d-reconstruction computer-graphics computer-vision machine-learning nerf siggraph2023
Last synced: 06 Nov 2024
https://github.com/NVlabs/intrinsic3d
Intrinsic3D - High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting (ICCV 2017)
3d 3d-reconstruction computer-vision geometry iccv iccv-2017 iccv2017 intrinsic3d keyframes lighting nvidia rgbd sdf shape-from-shading spherical-harmonics surface-reconstruction tum
Last synced: 13 Nov 2024
https://github.com/JunweiLiang/Object_Detection_Tracking
Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20
activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking
Last synced: 06 Nov 2024
https://github.com/mv-lab/InstructIR
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
computer-vision deblurring deep-learning dehazing denoising image-enhancement image-restoration inverse-problems language-model low-light-image-enhancement multi-task multimodal neural-network photography prompt pytorch super-resolution
Last synced: 03 Nov 2024
https://github.com/junweiliang/object_detection_tracking
Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20
activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking
Last synced: 08 Nov 2024
https://github.com/hkchengrex/mivos
[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!
computer-vision cvpr2021 deep-learning interactive-segmentation pytorch segmentation video-object-segmentation video-segmentation
Last synced: 10 Nov 2024
https://github.com/vlgiitr/DL_Topics
List of DL topics and resources essential for cracking interviews
computer-vision deep-learning generative-models linear-algebra natural-language-processing probability-statistics
Last synced: 12 Nov 2024
https://github.com/anmspro/traffic-signal-violation-detection-system
A Computer Vision based Traffic Signal Violation Detection System from video footage using YOLOv3 & Tkinter. (GUI Included)
computer-vision hacktoberfest object-detection opencv python tensorflow tkinter traffic-signal traffic-violation-detection yolov3
Last synced: 13 Nov 2024
https://github.com/pykale/pykale
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem. ⭐ Star to support our work!
computer-vision data-science deep-learning domain-adaptation graph-analysis knowledge-aware-learning machine-learning medical-image-analysis meta-learning multimodal multimodal-learning python pytorch transfer-learning
Last synced: 11 Oct 2024
https://github.com/ruiminshen/yolo2-pytorch
PyTorch implementation of the YOLO (You Only Look Once) v2
caffe2 computer-vision deep-learning deep-neural-networks object-detection onnx python pytorch yolo2
Last synced: 04 Nov 2024
https://github.com/francescosaveriozuppichini/glasses
High-quality Neural Networks for Computer Vision 😎
classification computer-vision convolutional-networks deep-learning deep-neural-networks machine-learning neural-network pretrained-models pytorch
Last synced: 13 Nov 2024
https://github.com/kaiyangzhou/pytorch-vsumm-reinforce
Unsupervised video summarization with deep reinforcement learning (AAAI'18)
computer-vision deep-learning machine-learning policy-network reinforcement-learning unsupervised-learning video-summarization
Last synced: 29 Oct 2024
https://github.com/FrancescoSaverioZuppichini/glasses
High-quality Neural Networks for Computer Vision 😎
classification computer-vision convolutional-networks deep-learning deep-neural-networks machine-learning neural-network pretrained-models pytorch
Last synced: 30 Oct 2024
https://github.com/weiyithu/NerfingMVS
[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo
3d-reconstruction 3dvision computer-vision deep-learning multi-view-stereo neural-radiance-fields
Last synced: 07 Nov 2024
https://github.com/mtli/PhotoSketch
Code for Photo-Sketching: Inferring Contour Drawings from Images :dog:
ai computer-vision graphics sketch
Last synced: 14 Oct 2024
https://github.com/skhadem/3D-BoundingBox
PyTorch implementation for 3D Bounding Box Estimation Using Deep Learning and Geometry
3d computer-vision localization pytorch
Last synced: 28 Oct 2024
https://github.com/dmitryryumin/aaai-2024-papers
AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. ⭐ experience the forefront of progress in artificial intelligence with this repository!
aaai aaai2024 application-domains artificial-intelligence cognitive-systems computer-vision computer-vison deep-learning expert-systems human-computer-interaction knowledge-representation machine-learning multi-agent-systems neural-networks reinforcement-learning sentiment-analysis
Last synced: 15 Nov 2024
https://github.com/tanjeffreyz/auto-maple
Artificial intelligence software for MapleStory that uses various machine learning and computer vision techniques to navigate challenging in-game environments
ai bot computer-vision deep-learning maplestory python
Last synced: 07 Nov 2024
https://github.com/BrokenSource/DepthFlow
🌊 Images to → 2.5D Parallax Effect Video. A Free and Open Source ImmersityAI alternative
computer-vision depth-map depth-maps depth-prediction depthflow depthy gradio image-parallax image-to-video immersity immersityai leiapix monocular-depth monocular-depth-estimation parallax parallax-effect shaderflow shadertoy
Last synced: 08 Nov 2024
https://github.com/vita-epfl/monoloco
A 3D vision library from 2D keypoints: monocular and stereo 3D detection for humans, social distancing, and body orientation.
3d-deep-learning 3d-detection 3d-object-detection 3d-vision computer-vision covid-19 deep-learning human-pose-estimation iccv2019 icra2021 kitti-dataset machine-learning object-detection openpifpaf pifpaf pose-estimation pytorch uncertainty
Last synced: 14 Nov 2024
https://github.com/KevinMusgrave/powerful-benchmarker
A library for ML benchmarking. It's powerful.
benchmarking computer-vision deep-learning domain-adaptation machine-learning metric-learning pytorch transfer-learning
Last synced: 15 Nov 2024
https://github.com/kevinmusgrave/powerful-benchmarker
A library for ML benchmarking. It's powerful.
benchmarking computer-vision deep-learning domain-adaptation machine-learning metric-learning pytorch transfer-learning
Last synced: 01 Nov 2024
https://github.com/nicholaskajoh/ivy
Video-based object counting software.
computer-vision object-counter object-counting video-analysis video-processing
Last synced: 05 Aug 2024
https://github.com/alturosdestinations/alturos.yolo
C# Yolo Darknet Wrapper (real-time object detection)
computer-vision csharp dotnet image-classification neural-network object-detection objectdetection visual-studio yolo yolo2 yolo3 yolov2 yolov3
Last synced: 10 Nov 2024
https://github.com/AlturosDestinations/Alturos.Yolo
C# Yolo Darknet Wrapper (real-time object detection)
computer-vision csharp dotnet image-classification neural-network object-detection objectdetection visual-studio yolo yolo2 yolo3 yolov2 yolov3
Last synced: 09 Nov 2024
https://github.com/open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks
chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa
Last synced: 08 Aug 2024
https://github.com/Haochen-Wang409/U2PL
[CVPR'22] Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels
cityscapes computer-vision contrastive-learning cvpr2022 deep-learning mechine-learning pascal-voc pytorch semantic-segmentation semi-supervised-learning
Last synced: 03 Nov 2024
https://actors-hq.com
Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"
3d-reconstruction computer-graphics computer-vision machine-learning nerf siggraph2023
Last synced: 30 Oct 2024
https://github.com/LSH9832/edgeyolo
an edge-real-time anchor-free object detector with decent performance
anchor-free ascend computer-vision deep-learning edge-computing mnn object-detection object-detector onnx pytorch rk3588 rk3588s rknn tensorrt yolo
Last synced: 28 Oct 2024
https://github.com/eliahuhorwitz/deepsim
Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)
computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image
Last synced: 10 Nov 2024
https://github.com/google-research/maxvit
[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
architecture classification cnn computer-vision image image-processing mlp object-detection resnet segmentation transformer transformer-architecture vision-transformer
Last synced: 10 Nov 2024
https://github.com/eliahuhorwitz/DeepSIM
Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)
computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image
Last synced: 07 Aug 2024
https://github.com/davidcaron/pclpy
Python bindings for the Point Cloud Library (PCL)
computer-vision lidar pcl point-cloud python
Last synced: 13 Nov 2024
https://github.com/OpenDriveLab/PersFormer_3DLane
[ECCV2022 Oral] Perspective Transformer on 3D Lane Detection
3d-lane-detection autonomous-driving computer-vision deep-learning lane-detection
Last synced: 28 Oct 2024
https://github.com/omerbsezer/fast-pytorch
Pytorch Tutorial, Pytorch with Google Colab, Pytorch Implementations: CNN, RNN, DCGAN, Transfer Learning, Chatbot, Pytorch Sample Codes
cnn cnn-pytorch colab-notebook colab-tutorial colaboratory computer-vision deep-learning machine-learning mlp python pytorch pytorch-implementation pytorch-tutorial rnn rnn-pytorch transfer-learning
Last synced: 11 Nov 2024
https://github.com/danini/graph-cut-ransac
The Graph-Cut RANSAC algorithm proposed in paper: Daniel Barath and Jiri Matas; Graph-Cut RANSAC, Conference on Computer Vision and Pattern Recognition, 2018. It is available at http://openaccess.thecvf.com/content_cvpr_2018/papers/Barath_Graph-Cut_RANSAC_CVPR_2018_paper.pdf
computer-vision essential-matrix fundamental-matrix graph-cut-ransac homography pattern-recognition ransac robust robust-estimators
Last synced: 27 Oct 2024