Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/rese1f/MovieChat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models

Last synced: 09 Nov 2024

https://github.com/hassony2/useful-computer-vision-phd-resources

Lists of resources useful for my PhD in computer vision

computer-vision list phd resources

Last synced: 02 Aug 2024

https://github.com/jsn5/dancenet

DanceNet -๐Ÿ’ƒ๐Ÿ’ƒDance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)

autoencoder computer-vision generative-model keras lstm mixture-density-networks

Last synced: 07 Aug 2024

https://github.com/baudm/parseq

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

computer-vision eccv eccv2022 ocr optical-character-recognition scene-text-recognition text-recognition vision-transformer

Last synced: 07 Nov 2024

https://github.com/DagnyT/hardnet

Hardnet descriptor model - "Working hard to know your neighbor's margins: Local descriptor learning loss"

computer-vision convolutional-neural-networks deep-learning image-matching image-retrieval metric-learning nips-2017 pytorch

Last synced: 04 Nov 2024

https://github.com/xiaoyufenfei/LEDNet

LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation

cityscape-dataset computer-vision lednet pytorch real-time semantic-segmentation

Last synced: 03 Aug 2024

https://github.com/Justin-Tan/generative-compression

TensorFlow Implementation of Generative Adversarial Networks for Extreme Learned Image Compression

computer-vision gan generative-adversarial-network image-compression tensorflow

Last synced: 07 Aug 2024

https://github.com/LingDong-/skeleton-tracing

A new algorithm for retrieving topological skeleton as a set of polylines from binary images

algorithm computational-geometry computer-vision polylines skeletonization

Last synced: 06 Nov 2024

https://github.com/NVlabs/pacnet

Pixel-Adaptive Convolutional Neural Networks (CVPR '19)

computer-vision deep-learning machine-learning

Last synced: 07 Aug 2024

https://github.com/rese1f/moviechat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models

Last synced: 30 Oct 2024

https://github.com/rgeirhos/stylized-imagenet

Code to create Stylized-ImageNet, a stylized version of standard ImageNet (ICLR 2019 Oral)

computer-vision deep-learning human-vision shape-bias style-transfer texture-bias

Last synced: 22 Oct 2024

https://github.com/hfslyc/AdvSemiSeg

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

adversarial-learning computer-vision deep-learning pytorch semantic-segmentation semi-supervised-learning

Last synced: 28 Oct 2024

https://github.com/limuloo/MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

aigc computer-vision cvpr cvpr2024 stable-diffusion text-to-image

Last synced: 31 Oct 2024

https://github.com/wellflat/imageprocessing-labs

computer vision, image processing and machine learning on the web browser or node.

computer-vision image-processing javascript machine-learning

Last synced: 12 Aug 2024

https://github.com/swz30/CycleISP

[CVPR 2020--Oral] CycleISP: Real Image Restoration via Improved Data Synthesis

camera-imaging-pipeline computer-vision cvpr2020 cycleisp data-synthesis image-denoising image-restoration low-level-vision pytorch raw2rgb rgb2raw

Last synced: 03 Nov 2024

https://github.com/RQLuo/MixTeX-Latex-OCR

MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.

computer-vision deep-learning latex machine-learning ocr onnx python

Last synced: 03 Sep 2024

https://github.com/Megvii-BaseDetection/DeFCN

End-to-End Object Detection with Fully Convolutional Network

computer-vision object-detection

Last synced: 26 Oct 2024

https://github.com/abhshkdz/neural-vqa

:grey_question: Visual Question Answering in Torch

computer-vision deep-learning natural-language-processing torch

Last synced: 08 Nov 2024

https://zju3dv.github.io/manhattan_sdf/

Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral

3d-reconstruction 3d-vision computer-vision cvpr2022

Last synced: 03 Aug 2024

https://github.com/lxe/llavavision

A simple "Be My Eyes" web app with a llama.cpp/llava backend

ai artificial-intelligence computer-vision llama llamacpp llm local-llm machine-learning multimodal webapp

Last synced: 10 Oct 2024

https://github.com/SkalskiP/sports

Cool experiments at the intersection of Computer Vision and Sports โšฝ๐Ÿƒ

computer-vision deep-learning deep-neural-networks gpt-4 gpt-4-vision object-detection prompt-engineering pytorch sports-analytics tutorial yolov5 yolov7

Last synced: 06 Nov 2024

https://github.com/openvinotoolkit/datumaro

Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.

coco computer-vision dataset datasets deep-learning format-converter imagenet neural-networks openvino-toolkit pascal-voc yolo

Last synced: 02 Aug 2024

https://github.com/skalskip/sports

Cool experiments at the intersection of Computer Vision and Sports โšฝ๐Ÿƒ

computer-vision deep-learning deep-neural-networks gpt-4 gpt-4-vision object-detection prompt-engineering pytorch sports-analytics tutorial yolov5 yolov7

Last synced: 30 Oct 2024

https://github.com/liuziwei7/fashion-detection

Fashion Detection in the Wild (Deep Clothes Detector)

clothes-detection clothes-detector computer-vision deep-learning vision-for-fashion

Last synced: 03 Aug 2024

https://github.com/AkshitIreddy/Interactive-LLM-Powered-NPCs

Interactive LLM Powered NPCs, is an open-source project that completely transforms your interaction with non-player characters (NPCs) in any game! ๐ŸŽฎ๐Ÿค–๐Ÿš€

ai artificial-intelligence autonomous-agents computer-vision langchain language-model llm-agent python video-game

Last synced: 03 Aug 2024

https://github.com/khanhnamle1994/computer-vision

Programming Assignments and Lectures for Stanford's CS 231: Convolutional Neural Networks for Visual Recognition

computer-vision convolutional-neural-networks deep-learning image-classification imagenet visual-recognition

Last synced: 10 Nov 2024

https://github.com/xavierpuigf/virtualhome

API to run VirtualHome, a Multi-Agent Household Simulator

computer-vision deep-learning graph multi-agent reinforcement-learning simulator unity

Last synced: 03 Nov 2024

https://github.com/amazon-research/siam-mot

SiamMOT: Siamese Multi-Object Tracking

computer-vision multi-object-tracking video-analysis

Last synced: 02 Aug 2024

https://rese1f.github.io/MovieChat/

[CVPR 2024] ๐ŸŽฌ๐Ÿ’ญ chat with over 10K frames of video!

computer-vision dataset large-language-models llama long-video-understanding multimodal-large-language-models

Last synced: 02 Aug 2024

https://github.com/aimagelab/dress-code

Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022

artificial-intelligence computer-vision deep-learning dress-code eccv2022 virtual-try-on

Last synced: 07 Nov 2024

https://github.com/kohjingyu/fromage

๐Ÿง€ Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

computer-vision large-language-models machine-learning natural-language-processing

Last synced: 05 Nov 2024

https://github.com/luyanger1799/Amazing-Semantic-Segmentation

Amazing Semantic Segmentation on Tensorflow && Keras (include FCN, UNet, SegNet, PSPNet, PAN, RefineNet, DeepLabV3, DeepLabV3+, DenseASPP, BiSegNet)

computer-vision deep-learning keras-tensorflow semantic-segmentation tensorflow

Last synced: 02 Nov 2024

https://github.com/thp/psmoveapi

Cross-platform library for 6DoF tracking of the PS Move Motion Controller. Sensor fusion, computer vision, ambient display (LED orb).

6dof computer-vision controller hid sensors tracking

Last synced: 30 Oct 2024

https://github.com/jo-m/trainbot

Watches a piece of train track, detects trains, and stitches together images of them.

bot computer-vision go golang stitching trains

Last synced: 03 Aug 2024

https://github.com/cvondrick/soundnet

SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016

computer-vision deep-learning sound

Last synced: 27 Oct 2024

https://github.com/TRI-ML/dd3d

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

computer-vision deep-learning pytorch

Last synced: 28 Oct 2024

https://github.com/Alpha-VL/ConvMAE

ConvMAE: Masked Convolution Meets Masked Autoencoders

backbone computer-vision mae masked-image-modeling object-detection semantic-segmentation

Last synced: 03 Aug 2024

https://github.com/wvangansbeke/sparse-depth-completion

Predict dense depth maps from sparse and noisy LiDAR frames guided by RGB images. (Ranked 1st place on KITTI) [2019]

computer-vision deep-learning depth-completion depth-prediction kitti lidar noisy-data pytorch sensor-fusion

Last synced: 07 Nov 2024

https://github.com/synthesiaresearch/humanrf

Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"

3d-reconstruction computer-graphics computer-vision machine-learning nerf siggraph2023

Last synced: 06 Nov 2024

https://github.com/junweiliang/object_detection_tracking

Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20

activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking

Last synced: 08 Nov 2024

https://github.com/mv-lab/InstructIR

[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR

computer-vision deblurring deep-learning dehazing denoising image-enhancement image-restoration inverse-problems language-model low-light-image-enhancement multi-task multimodal neural-network photography prompt pytorch super-resolution

Last synced: 03 Nov 2024

https://github.com/NVlabs/intrinsic3d

Intrinsic3D - High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting (ICCV 2017)

3d 3d-reconstruction computer-vision geometry iccv iccv-2017 iccv2017 intrinsic3d keyframes lighting nvidia rgbd sdf shape-from-shading spherical-harmonics surface-reconstruction tum

Last synced: 02 Aug 2024

https://github.com/JunweiLiang/Object_Detection_Tracking

Out-of-the-box code and models for CMU's object detection and tracking system for multi-camera surveillance videos. Speed optimized Faster-RCNN model. Tensorflow based. Also supports EfficientDet. WACVW'20

activity-detection computer-vision deep-learning detection-tracking efficientdet maskrcnn multi-camera multi-camera-tracking multi-camera-vehicle-reid object-detection reid surveillance-videos tracking tracking-detection video-object-detection video-object-tracking

Last synced: 06 Nov 2024

https://github.com/hkchengrex/mivos

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!

computer-vision cvpr2021 deep-learning interactive-segmentation pytorch segmentation video-object-segmentation video-segmentation

Last synced: 10 Nov 2024

https://github.com/lpiccinelli-eth/unidepth

Universal Monocular Metric Depth Estimation

3d-reconstruction computer-vision depth-estimation

Last synced: 02 Aug 2024

https://github.com/pykale/pykale

Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the ๐Ÿ”ฅPyTorch ecosystem. โญ Star to support our work!

computer-vision data-science deep-learning domain-adaptation graph-analysis knowledge-aware-learning machine-learning medical-image-analysis meta-learning multimodal multimodal-learning python pytorch transfer-learning

Last synced: 11 Oct 2024

https://github.com/ruiminshen/yolo2-pytorch

PyTorch implementation of the YOLO (You Only Look Once) v2

caffe2 computer-vision deep-learning deep-neural-networks object-detection onnx python pytorch yolo2

Last synced: 04 Nov 2024

https://github.com/anmspro/traffic-signal-violation-detection-system

A Computer Vision based Traffic Signal Violation Detection System from video footage using YOLOv3 & Tkinter. (GUI Included)

computer-vision hacktoberfest object-detection opencv python tensorflow tkinter traffic-signal traffic-violation-detection yolov3

Last synced: 30 Oct 2024

https://github.com/vlgiitr/DL_Topics

List of DL topics and resources essential for cracking interviews

computer-vision deep-learning generative-models linear-algebra natural-language-processing probability-statistics

Last synced: 02 Aug 2024

https://github.com/weiyithu/NerfingMVS

[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

3d-reconstruction 3dvision computer-vision deep-learning multi-view-stereo neural-radiance-fields

Last synced: 07 Nov 2024

https://github.com/mtli/PhotoSketch

Code for Photo-Sketching: Inferring Contour Drawings from Images :dog:

ai computer-vision graphics sketch

Last synced: 14 Oct 2024

https://github.com/skhadem/3D-BoundingBox

PyTorch implementation for 3D Bounding Box Estimation Using Deep Learning and Geometry

3d computer-vision localization pytorch

Last synced: 28 Oct 2024

https://github.com/tanjeffreyz/auto-maple

Artificial intelligence software for MapleStory that uses various machine learning and computer vision techniques to navigate challenging in-game environments

ai bot computer-vision deep-learning maplestory python

Last synced: 07 Nov 2024

https://github.com/open-compass/VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 08 Aug 2024

https://actors-hq.com

Official code for "HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion"

3d-reconstruction computer-graphics computer-vision machine-learning nerf siggraph2023

Last synced: 30 Oct 2024

https://github.com/eliahuhorwitz/deepsim

Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)

computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image

Last synced: 10 Nov 2024

https://github.com/google-research/maxvit

[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...

architecture classification cnn computer-vision image image-processing mlp object-detection resnet segmentation transformer transformer-architecture vision-transformer

Last synced: 10 Nov 2024

https://github.com/eliahuhorwitz/DeepSIM

Official PyTorch implementation of the paper: "DeepSIM: Image Shape Manipulation from a Single Augmented Training Sample" (ICCV 2021 Oral)

computer-graphics computer-vision deep-learning deep-neural-networks edge-to-image generative-adversarial-network iccv2021 image-editing image-to-image-translation pytorch segmantation-to-image single-image

Last synced: 07 Aug 2024

https://github.com/davidcaron/pclpy

Python bindings for the Point Cloud Library (PCL)

computer-vision lidar pcl point-cloud python

Last synced: 06 Nov 2024

https://github.com/OpenDriveLab/PersFormer_3DLane

[ECCV2022 Oral] Perspective Transformer on 3D Lane Detection

3d-lane-detection autonomous-driving computer-vision deep-learning lane-detection

Last synced: 28 Oct 2024

https://github.com/dmitryryumin/aaai-2024-papers

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. โญ experience the forefront of progress in artificial intelligence with this repository!

aaai aaai2024 application-domains artificial-intelligence cognitive-systems computer-vision computer-vison deep-learning expert-systems human-computer-interaction knowledge-representation machine-learning multi-agent-systems neural-networks reinforcement-learning sentiment-analysis

Last synced: 08 Nov 2024

https://github.com/danini/graph-cut-ransac

The Graph-Cut RANSAC algorithm proposed in paper: Daniel Barath and Jiri Matas; Graph-Cut RANSAC, Conference on Computer Vision and Pattern Recognition, 2018. It is available at http://openaccess.thecvf.com/content_cvpr_2018/papers/Barath_Graph-Cut_RANSAC_CVPR_2018_paper.pdf

computer-vision essential-matrix fundamental-matrix graph-cut-ransac homography pattern-recognition ransac robust robust-estimators

Last synced: 27 Oct 2024

https://github.com/xverse-engine/XV3DGS-UEPlugin

A Unreal Engine 5 (UE5) based plugin aiming to provide real-time visulization, management, editing, and scalable hybrid rendering of Guassian Splatting model.

computer-graphics computer-vision radiance-field unreal-engine-5

Last synced: 04 Aug 2024

https://github.com/hysts/anime-face-detector

Anime Face Detector using mmdet and mmpose

anime computer-vision face-detection face-landmark-detection pytorch

Last synced: 30 Oct 2024

https://github.com/alicevision/popsift

PopSift is an implementation of the SIFT algorithm in CUDA.

computer-vision cuda feature-extraction gpu image-processing sift

Last synced: 05 Nov 2024

https://github.com/bchao1/anime-face-dataset

๐Ÿ–ผ A collection of high-quality anime faces.

anime computer-vision dataset gan image-generation machine-learning

Last synced: 31 Oct 2024

https://github.com/Ank-Cha/Social-Distancing-Analyser-COVID-19

A social distancing analyzer AI tool to regulate social distancing protocol using video surveillance of CCTV cameras and drones. Social Distancing Analyser to prevent COVID19

artificial-intelligence computer-vision covid-19 covid19 python social-distancing stay-home yolo

Last synced: 09 Nov 2024

https://github.com/bchao1/Anime-Face-Dataset

๐Ÿ–ผ A collection of high-quality anime faces.

anime computer-vision dataset gan image-generation machine-learning

Last synced: 02 Aug 2024

https://github.com/Sha-Lab/FEAT

The code repository for "Few-Shot Learning via Embedding Adaptation with Set-to-Set Functions"

computer-vision cvpr2020 few-shot few-shot-learning machine-learning meta-learning

Last synced: 26 Oct 2024

https://github.com/vincentfung13/MINE

Code and models for our ICCV 2021 paper "MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis"

3d-reconstruction 3d-vision computer-vision deep-learning depth-estimation nerf novel-view-synthesis

Last synced: 28 Oct 2024

https://github.com/patrikhuber/superviseddescent

C++11 implementation of the supervised descent optimisation method

computer-vision landmark-detection machine-learning modern-cpp

Last synced: 31 Oct 2024

https://github.com/lvis-dataset/lvis-api

Python API for LVIS Dataset

computer-vision object-detection

Last synced: 04 Nov 2024

Computer vision Awesome Lists
Awesome-pytorch-list 704 awesome-self-supervised-learning 453 awesome-multimodal-ml 480 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-hand-pose-estimation 493 awesome-image-classification 220 awesome-human-pose-estimation 89 Awesome-Crowd-Counting 451 awesome-autonomous-vehicles 296 Awesome-Federated-Learning 558 Awesome-pytorch-list-CNVersion 692 awesome-industrial-anomaly-detection 656 iOS_ML 39 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-FL 2,098 awesome-low-light-image-enhancement 197 CV-pretrained-model 103 awesome-tensorflow-lite 110 Awesome-Implicit-NeRF-Robotics 183 awesome-capsule-networks 67 awesome-attention-mechanism-in-cv 195 awesome-grounding 154 Awesome-Image-Colorization 115 awesome-ai-awesomeness 236 openstl 43 awesome-autonomous-vehicle 181 Awesome-Open-Vocabulary 158 awesome-open-data-centric-ai 56 Awesome-Skeleton-based-Action-Recognition 104 awesome-6d-object 600 awesome-scene-understanding 196 awesome-multi-task-learning 226 awesome-holistic-3d 129 awesome-data-annotation 93 awesome-panoptic-segmentation 45 awesome-photogrammetry 68 awesome-llm-and-aigc 1,134 Awesome-3D-Object-Detection 169 awesome-computer-vision-models 189 Awesome-Parameter-Efficient-Transfer-Learning 121 awesome-state-of-depth-completion 59 Awesome-Distributed-Deep-Learning 44 awesome-image-alignment-and-stitching 108 Awesome-Monocular-3D-detection 93 Awesome-Multi-Task-Learning 80 Computer-Vision-Video-Lectures 67 awesome-segment-anything-extensions 120 awesome-robotics-datasets 79 awesome-nerf-editing 380