Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/philferriere/tfoptflow

Optical Flow Prediction with TensorFlow. Implements "PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume," by Deqing Sun et al. (CVPR 2018)

computer-vision cvpr2018 deep-learning flying-chairs kitti-dataset motion-estimation mpi-sintel optical-flow pwc-net tensorflow

Last synced: 13 Nov 2024

https://github.com/accumulatemore/opencv

✔(已完结)最全面的 OpenCV 笔记【咕泡唐宇迪】

computer-vision opencv python

Last synced: 10 Nov 2024

https://github.com/microsoft/CvT

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

classification computer-vision cvt deep-learning imagenet

Last synced: 14 Nov 2024

https://github.com/hkchengrex/stcn

[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

computer-vision deep-learning neurips-2021 pytorch segmentation video-object-segmentation video-segmentation

Last synced: 10 Nov 2024

https://rese1f.github.io/MovieChat/

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models

Last synced: 13 Nov 2024

https://github.com/hassony2/useful-computer-vision-phd-resources

Lists of resources useful for my PhD in computer vision

computer-vision list phd resources

Last synced: 13 Nov 2024

https://github.com/rese1f/MovieChat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models

Last synced: 09 Nov 2024

https://github.com/jsn5/dancenet

DanceNet -💃💃Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)

autoencoder computer-vision generative-model keras lstm mixture-density-networks

Last synced: 07 Aug 2024

https://github.com/DagnyT/hardnet

Hardnet descriptor model - "Working hard to know your neighbor's margins: Local descriptor learning loss"

computer-vision convolutional-neural-networks deep-learning image-matching image-retrieval metric-learning nips-2017 pytorch

Last synced: 04 Nov 2024

https://github.com/xiaoyufenfei/LEDNet

LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation

cityscape-dataset computer-vision lednet pytorch real-time semantic-segmentation

Last synced: 14 Nov 2024

https://github.com/LingDong-/skeleton-tracing

A new algorithm for retrieving topological skeleton as a set of polylines from binary images

algorithm computational-geometry computer-vision polylines skeletonization

Last synced: 06 Nov 2024

https://github.com/Justin-Tan/generative-compression

TensorFlow Implementation of Generative Adversarial Networks for Extreme Learned Image Compression

computer-vision gan generative-adversarial-network image-compression tensorflow

Last synced: 07 Aug 2024

https://github.com/aimagelab/dress-code

Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022

artificial-intelligence computer-vision deep-learning dress-code eccv2022 virtual-try-on

Last synced: 14 Nov 2024

https://github.com/NVlabs/pacnet

Pixel-Adaptive Convolutional Neural Networks (CVPR '19)

computer-vision deep-learning machine-learning

Last synced: 07 Aug 2024

https://github.com/rese1f/moviechat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models

Last synced: 13 Nov 2024

https://github.com/rgeirhos/stylized-imagenet

Code to create Stylized-ImageNet, a stylized version of standard ImageNet (ICLR 2019 Oral)

computer-vision deep-learning human-vision shape-bias style-transfer texture-bias

Last synced: 22 Oct 2024

https://github.com/hfslyc/AdvSemiSeg

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

adversarial-learning computer-vision deep-learning pytorch semantic-segmentation semi-supervised-learning

Last synced: 28 Oct 2024

https://github.com/limuloo/MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

aigc computer-vision cvpr cvpr2024 stable-diffusion text-to-image

Last synced: 31 Oct 2024

https://github.com/wellflat/imageprocessing-labs

computer vision, image processing and machine learning on the web browser or node.

computer-vision image-processing javascript machine-learning

Last synced: 12 Aug 2024

https://github.com/AkshitIreddy/Interactive-LLM-Powered-NPCs

Interactive LLM Powered NPCs, is an open-source project that completely transforms your interaction with non-player characters (NPCs) in any game! 🎮🤖🚀

ai artificial-intelligence autonomous-agents computer-vision langchain language-model llm-agent python video-game

Last synced: 14 Nov 2024

https://github.com/swz30/CycleISP

[CVPR 2020--Oral] CycleISP: Real Image Restoration via Improved Data Synthesis

camera-imaging-pipeline computer-vision cvpr2020 cycleisp data-synthesis image-denoising image-restoration low-level-vision pytorch raw2rgb rgb2raw

Last synced: 03 Nov 2024

https://github.com/willard-yuan/pcv-book-code

:book:Python计算机视觉中译本实例代码

computer-vision python

Last synced: 12 Nov 2024

https://github.com/RQLuo/MixTeX-Latex-OCR

MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.

computer-vision deep-learning latex machine-learning ocr onnx python

Last synced: 03 Sep 2024

https://github.com/Megvii-BaseDetection/DeFCN

End-to-End Object Detection with Fully Convolutional Network

computer-vision object-detection

Last synced: 26 Oct 2024

https://github.com/abhshkdz/neural-vqa

:grey_question: Visual Question Answering in Torch

computer-vision deep-learning natural-language-processing torch

Last synced: 08 Nov 2024

https://zju3dv.github.io/manhattan_sdf/

Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral

3d-reconstruction 3d-vision computer-vision cvpr2022

Last synced: 03 Aug 2024

https://github.com/lxe/llavavision

A simple "Be My Eyes" web app with a llama.cpp/llava backend

ai artificial-intelligence computer-vision llama llamacpp llm local-llm machine-learning multimodal webapp

Last synced: 10 Oct 2024

https://github.com/openvinotoolkit/datumaro

Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.

coco computer-vision dataset datasets deep-learning format-converter imagenet neural-networks openvino-toolkit pascal-voc yolo

Last synced: 13 Nov 2024

https://github.com/wvangansbeke/sparse-depth-completion

Predict dense depth maps from sparse and noisy LiDAR frames guided by RGB images. (Ranked 1st place on KITTI) [MVA 2019]

computer-vision deep-learning depth-completion depth-prediction kitti lidar noisy-data pytorch sensor-fusion

Last synced: 14 Nov 2024

https://github.com/liuziwei7/fashion-detection

Fashion Detection in the Wild (Deep Clothes Detector)

clothes-detection clothes-detector computer-vision deep-learning vision-for-fashion

Last synced: 03 Aug 2024

https://github.com/khanhnamle1994/computer-vision

Programming Assignments and Lectures for Stanford's CS 231: Convolutional Neural Networks for Visual Recognition

computer-vision convolutional-neural-networks deep-learning image-classification imagenet visual-recognition

Last synced: 10 Nov 2024

https://github.com/xavierpuigf/virtualhome

API to run VirtualHome, a Multi-Agent Household Simulator

computer-vision deep-learning graph multi-agent reinforcement-learning simulator unity

Last synced: 03 Nov 2024

https://github.com/amazon-science/siam-mot

SiamMOT: Siamese Multi-Object Tracking

computer-vision multi-object-tracking video-analysis

Last synced: 12 Nov 2024

https://github.com/kohjingyu/fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

computer-vision large-language-models machine-learning natural-language-processing

Last synced: 05 Nov 2024

https://github.com/luyanger1799/Amazing-Semantic-Segmentation

Amazing Semantic Segmentation on Tensorflow && Keras (include FCN, UNet, SegNet, PSPNet, PAN, RefineNet, DeepLabV3, DeepLabV3+, DenseASPP, BiSegNet)

computer-vision deep-learning keras-tensorflow semantic-segmentation tensorflow

Last synced: 02 Nov 2024

https://github.com/tri-ml/dd3d

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

computer-vision deep-learning pytorch

Last synced: 13 Nov 2024

https://github.com/thp/psmoveapi

Cross-platform library for 6DoF tracking of the PS Move Motion Controller. Sensor fusion, computer vision, ambient display (LED orb).

6dof computer-vision controller hid sensors tracking

Last synced: 13 Nov 2024

https://github.com/jo-m/trainbot

Watches a piece of train track, detects trains, and stitches together images of them.

bot computer-vision go golang stitching trains

Last synced: 03 Aug 2024

https://github.com/ahangchen/torch_base

Quickly bring up your PyTorch project(a skeleton)

computer-vision deep-learning machine-learning pytorch

Last synced: 11 Nov 2024

https://github.com/cvondrick/soundnet

SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016

computer-vision deep-learning sound

Last synced: 27 Oct 2024

https://github.com/TRI-ML/dd3d

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

computer-vision deep-learning pytorch

Last synced: 28 Oct 2024

https://github.com/farukalamai/advanced-machine-learning-engineer-roadmap-2024

A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing, model development, deployment, and maintenance.

aws computer-vision data-analysis data-science data-visualization deep-learning git-github machine-learning machine-learning-roadmap mlops natural-language-processing neural-network nlp opencv pandas python pytorch statistics tensorflow yolo

Last synced: 14 Nov 2024

https://github.com/Alpha-VL/ConvMAE

ConvMAE: Masked Convolution Meets Masked Autoencoders

backbone computer-vision mae masked-image-modeling object-detection semantic-segmentation

Last synced: 03 Aug 2024

https://github.com/synthesiaresearch/humanrf

Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"

3d-reconstruction computer-graphics computer-vision machine-learning nerf siggraph2023

Last synced: 06 Nov 2024

https://github.com/NVlabs/intrinsic3d

Intrinsic3D - High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting (ICCV 2017)

3d 3d-reconstruction computer-vision geometry iccv iccv-2017 iccv2017 intrinsic3d keyframes lighting nvidia rgbd sdf shape-from-shading spherical-harmonics surface-reconstruction tum

Last synced: 13 Nov 2024

https://github.com/JunweiLiang/Object_Detection_Tracking

Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20

activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking

Last synced: 06 Nov 2024

https://github.com/mv-lab/InstructIR

[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR

computer-vision deblurring deep-learning dehazing denoising image-enhancement image-restoration inverse-problems language-model low-light-image-enhancement multi-task multimodal neural-network photography prompt pytorch super-resolution

Last synced: 03 Nov 2024

https://github.com/junweiliang/object_detection_tracking

Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20

activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking

Last synced: 08 Nov 2024

https://github.com/hkchengrex/mivos

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!

computer-vision cvpr2021 deep-learning interactive-segmentation pytorch segmentation video-object-segmentation video-segmentation

Last synced: 10 Nov 2024

https://github.com/vlgiitr/DL_Topics

List of DL topics and resources essential for cracking interviews

computer-vision deep-learning generative-models linear-algebra natural-language-processing probability-statistics

Last synced: 12 Nov 2024

https://github.com/anmspro/traffic-signal-violation-detection-system

A Computer Vision based Traffic Signal Violation Detection System from video footage using YOLOv3 & Tkinter. (GUI Included)

computer-vision hacktoberfest object-detection opencv python tensorflow tkinter traffic-signal traffic-violation-detection yolov3

Last synced: 13 Nov 2024

https://github.com/pykale/pykale

Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem. ⭐ Star to support our work!

computer-vision data-science deep-learning domain-adaptation graph-analysis knowledge-aware-learning machine-learning medical-image-analysis meta-learning multimodal multimodal-learning python pytorch transfer-learning

Last synced: 11 Oct 2024

https://github.com/ruiminshen/yolo2-pytorch

PyTorch implementation of the YOLO (You Only Look Once) v2

caffe2 computer-vision deep-learning deep-neural-networks object-detection onnx python pytorch yolo2

Last synced: 04 Nov 2024

https://github.com/weiyithu/NerfingMVS

[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

3d-reconstruction 3dvision computer-vision deep-learning multi-view-stereo neural-radiance-fields

Last synced: 07 Nov 2024

https://github.com/mtli/PhotoSketch

Code for Photo-Sketching: Inferring Contour Drawings from Images :dog:

ai computer-vision graphics sketch

Last synced: 14 Oct 2024

https://github.com/skhadem/3D-BoundingBox

PyTorch implementation for 3D Bounding Box Estimation Using Deep Learning and Geometry

3d computer-vision localization pytorch

Last synced: 28 Oct 2024

https://github.com/tanjeffreyz/auto-maple

Artificial intelligence software for MapleStory that uses various machine learning and computer vision techniques to navigate challenging in-game environments

ai bot computer-vision deep-learning maplestory python

Last synced: 07 Nov 2024

https://github.com/open-compass/VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 08 Aug 2024

https://actors-hq.com

Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"

3d-reconstruction computer-graphics computer-vision machine-learning nerf siggraph2023

Last synced: 30 Oct 2024

https://github.com/eliahuhorwitz/deepsim

Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)

computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image

Last synced: 10 Nov 2024

https://github.com/eliahuhorwitz/DeepSIM

Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)

computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image

Last synced: 07 Aug 2024

https://github.com/google-research/maxvit

[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...

architecture classification cnn computer-vision image image-processing mlp object-detection resnet segmentation transformer transformer-architecture vision-transformer

Last synced: 10 Nov 2024

https://github.com/davidcaron/pclpy

Python bindings for the Point Cloud Library (PCL)

computer-vision lidar pcl point-cloud python

Last synced: 13 Nov 2024

https://github.com/OpenDriveLab/PersFormer_3DLane

[ECCV2022 Oral] Perspective Transformer on 3D Lane Detection

3d-lane-detection autonomous-driving computer-vision deep-learning lane-detection

Last synced: 28 Oct 2024

https://github.com/dmitryryumin/aaai-2024-papers

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. ⭐ experience the forefront of progress in artificial intelligence with this repository!

aaai aaai2024 application-domains artificial-intelligence cognitive-systems computer-vision computer-vison deep-learning expert-systems human-computer-interaction knowledge-representation machine-learning multi-agent-systems neural-networks reinforcement-learning sentiment-analysis

Last synced: 08 Nov 2024

https://github.com/danini/graph-cut-ransac

The Graph-Cut RANSAC algorithm proposed in paper: Daniel Barath and Jiri Matas; Graph-Cut RANSAC, Conference on Computer Vision and Pattern Recognition, 2018. It is available at http://openaccess.thecvf.com/content_cvpr_2018/papers/Barath_Graph-Cut_RANSAC_CVPR_2018_paper.pdf

computer-vision essential-matrix fundamental-matrix graph-cut-ransac homography pattern-recognition ransac robust robust-estimators

Last synced: 27 Oct 2024

https://github.com/omerbsezer/fast-pytorch

Pytorch Tutorial, Pytorch with Google Colab, Pytorch Implementations: CNN, RNN, DCGAN, Transfer Learning, Chatbot, Pytorch Sample Codes

cnn cnn-pytorch colab-notebook colab-tutorial colaboratory computer-vision deep-learning machine-learning mlp python pytorch pytorch-implementation pytorch-tutorial rnn rnn-pytorch transfer-learning

Last synced: 11 Nov 2024

Computer vision Awesome Lists
Awesome-pytorch-list 704 awesome-self-supervised-learning 453 awesome-multimodal-ml 480 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-hand-pose-estimation 493 awesome-image-classification 220 awesome-human-pose-estimation 89 Awesome-Crowd-Counting 451 awesome-autonomous-vehicles 296 Awesome-Federated-Learning 558 Awesome-pytorch-list-CNVersion 692 awesome-industrial-anomaly-detection 657 iOS_ML 39 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-FL 2,098 awesome-low-light-image-enhancement 197 Awesome-Implicit-NeRF-Robotics 184 CV-pretrained-model 103 awesome-tensorflow-lite 110 awesome-capsule-networks 67 awesome-attention-mechanism-in-cv 195 awesome-grounding 154 Awesome-Image-Colorization 116 awesome-ai-awesomeness 236 openstl 43 awesome-autonomous-vehicle 181 Awesome-Open-Vocabulary 158 awesome-open-data-centric-ai 56 Awesome-Skeleton-based-Action-Recognition 104 awesome-6d-object 600 awesome-scene-understanding 197 awesome-multi-task-learning 226 awesome-holistic-3d 129 awesome-data-annotation 93 awesome-panoptic-segmentation 45 awesome-photogrammetry 68 awesome-llm-and-aigc 1,134 Awesome-3D-Object-Detection 169 awesome-computer-vision-models 189 Awesome-Parameter-Efficient-Transfer-Learning 121 awesome-state-of-depth-completion 59 Awesome-Distributed-Deep-Learning 44 awesome-image-alignment-and-stitching 108 Awesome-Monocular-3D-detection 93 Awesome-Multi-Task-Learning 80 Computer-Vision-Video-Lectures 67 awesome-segment-anything-extensions 120 awesome-robotics-datasets 79 awesome_computer_science 116