Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/yuweihao/kern

Code for Knowledge-Embedded Routing Network for Scene Graph Generation (CVPR 2019)

computer-vision graph-neural-network pytorch scene-graph scene-graph-generation

Last synced: 06 Nov 2024

https://github.com/dahyun-kang/ifsl

[CVPR'22] Official PyTorch implementation of Integrative Few-Shot Learning for Classification and Segmentation

computer-vision cvpr2022 deep-learning few-shot-classification few-shot-learning few-shot-segmentation pytorch pytorch-lightning

Last synced: 03 Aug 2024

https://github.com/seva100/optic-nerve-cnn

Code repository for the paper "Optic Disc and Cup Segmentation Methods for Glaucoma Detection with Modification of U-Net Convolutional Neural Network"

computer-vision cup-segmentation-methods glaucoma-detection ipynb medical-imaging optic-disc paper

Last synced: 11 Oct 2024

https://github.com/capjamesg/cv-book-svg

Turn an image of a bookshelf into an interactive SVG.

books computer-vision image-analysis library personal-library

Last synced: 22 Oct 2024

https://github.com/hugozanini/realtime-semantic-segmentation

Implementation of RefineNet to perform real-time instance segmentation in the browser using TensorFlow.js

computer-vision real-time-machine-learning semantic-segmentation tensorflow tensorflowjs

Last synced: 08 Aug 2024

https://github.com/dnth/x.infer

Framework-agnostic computer vision inference.

computer-vision inference-api ollama pytorch-image-models transformers ultralytics vllm

Last synced: 08 Nov 2024

https://github.com/olafenwamoses/deepstack_exdark

A DeepStack custom model for detecting common objects in dark/night images and videos.

ai cctv computer-vision deep-learning deepstack machine-learning night-vision night-vision-camera object-detection yolo yolov3 yolov5

Last synced: 27 Oct 2024

https://github.com/ihhub/penguinv

Computer vision library with focus on heterogeneous systems

avx computer-vision cpp cuda gpu hacktoberfest heterogeneous-systems image-processing opencl python simd sse thread-pool

Last synced: 31 Oct 2024

https://github.com/viplix3/BoTSORT-cpp

C++ implementation of BoT-SORT MOT algorithm with Re-ID and Camera Motion Compensation

computer-vision cpp kalman-filter multi-object-tracking reid

Last synced: 27 Oct 2024

https://github.com/ibaiGorordo/ONNX-CREStereo-Depth-Estimation

Python scripts performing stereo depth estimation using the CREStereo model in ONNX.

computer-vision crestereo depth-estimation onnx onnxruntime opencv python stereo-depth-estimation stereo-matching stereo-vision

Last synced: 28 Oct 2024

https://github.com/margaretmz/cartoonizer-with-tflite

How to create a Cartoonizer Android app with TensorFlow Lite models.

android computer-vision gans tensorflow-lite tflite

Last synced: 06 Nov 2024

https://github.com/see2sound/see2sound

Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

audio-processing computer-vision see-2-sound

Last synced: 14 Nov 2024

https://github.com/raoyongming/PointGLR

[CVPR 2020] Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds

3d-point-clouds computer-vision deep-learning metric-learning representation-learning unsupervised-learning

Last synced: 28 Oct 2024

https://github.com/lancedb/yoloexplorer

YOLOExplorer: Iterate on your YOLO / CV datasets using SQL, vector semantic search, and more within seconds

computer-vision object-detection yolov5 yolov8

Last synced: 06 Nov 2024

https://github.com/juniorxsound/threadeddepthcleaner

📷 Threaded depth-map cleaning and inpainting using OpenCV

computer-vision depth-camera depth-map image-filters image-processing librealsense2 opencv

Last synced: 11 Nov 2024

https://github.com/deep-diver/mlops-hf-tf-vision-models

MLOps for Vision Models (TensorFlow) from 🤗 Transformers with TensorFlow Extended (TFX)

computer-vision huggingface-transformers mlops tensorflow tensorflow-extended

Last synced: 01 Nov 2024

https://github.com/hellonico/origami

Lowest barrier of entry to Image Processing, Computer Vision and Neural Networks on the JavaVM

clojure computer-vision deep-learning dnn java kotlin opencv yolov8

Last synced: 10 Oct 2024

https://github.com/mybridge/learn-machine-learning

Learn to Build a Machine Learning Application from Top Articles

computer-vision data-science deep-learning machine-learning neural-networks

Last synced: 07 Nov 2024

https://github.com/z-x-yang/aot

Associating Objects with Transformers for Video Object Segmentation

computer-vision video-object-segmentation

Last synced: 07 Nov 2024

https://github.com/McGill-NLP/weblinx

WebLINX is a benchmark for building web navigation agents with conversational capabilities

agent agents computer-vision llm multimodal navigation nlp web

Last synced: 20 Oct 2024

https://github.com/ismail-mebsout/Parsing-PDFs-using-YOLOV3

Parsing pdf tables using YOLOV3

computer-vision pdf python

Last synced: 09 Nov 2024

https://github.com/101arrowz/scanner

Document scanning from scratch

computer-vision rust typescript wasm

Last synced: 27 Oct 2024

https://github.com/abd-shoumik/Social-distance-detection

Social distance detection: a deep learning computer vision project with YOLO object detection

computer-vision covid-19 dataset deeplearning deeplearning-ai socialdistancing yolo

Last synced: 09 Nov 2024

https://github.com/ravisvi/super-resolution-videos

Applying SRGAN technique implemented in https://github.com/zsdonghao/SRGAN on videos to super resolve them.

computer-vision srgan super-resolution tensorlayer

Last synced: 07 Nov 2024

https://github.com/pooya-mohammadi/deep_utils

An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!

augmentation coco computer-vision cutmix deep-learning face-detection face-recognition machine-learning modelcheckpoint nlp object-detection python pytorch senet tensorflow utils vggface2 yolov5

Last synced: 09 Nov 2024

https://github.com/agentmorris/megadetector

MegaDetector is an AI model that helps conservation folks spend less time doing boring things with camera trap images.

camera-traps cameratraps computer-vision conservation ecology machine-learning megadetector wildlife

Last synced: 14 Nov 2024

https://github.com/RizwanMunawar/yolov5-object-tracking

YOLOv5 Object Tracking + Detection + Object Blurring + Streamlit Dashboard Using OpenCV, PyTorch and Streamlit

computer-vision object-detection object-tracking streamlit-dashboard yolov5

Last synced: 09 Nov 2024

https://github.com/henzler/neuraltexture

Learning a Neural 3D Texture Space from 2D Exemplars [CVPR 2020]

computer-graphics computer-vision deep-learning machine-learning texture-synthesis

Last synced: 08 Nov 2024

https://github.com/yihongXU/TransCenter

This is the official implementation of TransCenter (TPAMI). The code and pretrained models are now available here: https://gitlab.inria.fr/yixu/TransCenter_official.

computer-vision deep-learning multiple-object-tracking pytorch transformers

Last synced: 13 Nov 2024

https://github.com/mindspore-courses/d2l-mindspore

MindSpore implementation of "Dive into Deep Learning" (《动手学深度学习》). Intended for MindSpore learners to use alongside Mu Li's course.

computer-vision deep-learning machine-learning mindspore natural-language-processing notebook

Last synced: 09 Nov 2024

https://github.com/xiaosu-zhu/mcquic

Repository of CVPR'22 paper "Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression"

computer-vision cvpr2022 image-compression image-processing pytorch

Last synced: 02 Oct 2024

https://github.com/mars885/doc-skanner

An Android application that makes it possible to automatically scan and digitize documents from photos.

android android-application computer-vision doc-scanner document-scanner opencv

Last synced: 08 Nov 2024

https://github.com/karayaman/lichess-with-a-real-board

Lichess.org client for real life chess boards.

computer-vision lichess

Last synced: 08 Nov 2024

https://github.com/dahyun-kang/renet

[ICCV'21] Official PyTorch implementation of Relational Embedding for Few-Shot Classification

computer-vision deep-learning few-shot-classifcation few-shot-learning iccv2021 neural-networks pytorch

Last synced: 03 Aug 2024

https://github.com/twke18/SPML

Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning

computer-vision deep-learning metric-learning semantic-segmentation weakly-supervised-learning

Last synced: 29 Oct 2024

https://github.com/jhu-lcsr/good_robot

"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer; and “Good Robot!”: Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer

computer-vision deep-learning deep-q-network deep-reinforcement-learning grasping learning-from-demonstration manipulation multi-step-dqn multi-step-learning reinforcement-learning robotic-manipulation robotics simulation task-to-task-transfer vrep-simulator zero-shot-learning

Last synced: 02 Nov 2024

https://github.com/UT-Austin-RPL/Ditto

Code for Ditto: Building Digital Twins of Articulated Objects from Interaction

3d-reconstruction computer-vision deep-learning digital-twin robotics

Last synced: 07 Nov 2024

https://github.com/videokit-ai/videokit

Low-code, cross-platform media SDK for Unity Engine. Register at https://videokit.ai

computer-vision natml unity3d user-generated-content video-editing video-effects video-filter video-recording

Last synced: 12 Nov 2024

https://github.com/dthuerck/mapmap_cpu

A high-performance general-purpose MRF MAP solver, heavily exploiting SIMD instructions.

algorithm computer-vision dynamic-programming feedback-vertex-set markov-random-field massively-parallel mrf simd simd-programming vectorization

Last synced: 09 Nov 2024

https://github.com/jmisilo/clip-gpt-captioning

CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.

computer-vision cv deep-learning image-caption image-caption-generator image-captioning machine-learning nlp python pytorch

Last synced: 04 Nov 2024

https://mcgill-nlp.github.io/weblinx/

WebLINX is a benchmark for building web navigation agents with conversational capabilities

agent agents computer-vision llm multimodal navigation nlp web

Last synced: 03 Aug 2024

https://github.com/kyegomez/rt-x

Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"

artificial-intelligence attention-is-all-you-need attention-model computer-vision gpt4 gpt4all multimodal vision vision-transformer

Last synced: 09 Nov 2024

https://github.com/n2cholas/jax-resnet

Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).

computer-vision deep-learning flax jax machine-learning neural-networks resnet

Last synced: 04 Aug 2024

https://github.com/Z-Zheng/FreeNet

FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification (TGRS 2020) https://ieeexplore.ieee.org/document/9007624

computer-vision convolutional-neural-network deep-learning geospatial hyperspectral-image-classification remote-sensing

Last synced: 31 Oct 2024

https://github.com/tatsuyah/Lane-Lines-Detection-Python-OpenCV

Lane Lines Detection using Python and OpenCV for self-driving car

computer-vision self-driving-car

Last synced: 03 Nov 2024

https://github.com/SHI-Labs/Semi-Supervised-Transfer-Learning

[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

computer-vision semi-supervised-learning transfer-learning

Last synced: 03 Aug 2024

https://github.com/sayakpaul/convnext-tf

Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.

computer-vision convolutional-neural-networks efficient-models imagenet-1k imagenet-21k keras tensorflow

Last synced: 01 Nov 2024

https://github.com/alkasm/colorfilters

Image thresholding in multiple colorspaces.

computer-vision image-processing opencv

Last synced: 27 Oct 2024

https://github.com/abhshkdz/neural-vqa-attention

Attention-based Visual Question Answering in Torch

computer-vision deep-learning natural-language-processing torch

Last synced: 08 Nov 2024

https://github.com/skalskip/fashion-assistant

Our idea is to combine the power of computer vision models and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from images. We pass the prompt, along with the extracted features, to an LLM, allowing for advanced image dataset queries.

computer-vision llm natural-language-processing openai openai-api yolo

Last synced: 01 Nov 2024

https://github.com/anupsarkar-dev/exovisix

Auto Attendance System Using Real Time Face Recognition With Various Computer Vision & Machine Learning Tools

bio-metric-attendance-system computer-vision eye-detection face-recognition gesture-detection haar-training javacv javafx mechine-learing motion-detection mysql-database ocr-recognition opencv

Last synced: 11 Oct 2024

https://github.com/etherealengine/digital-beings

A platform for letting researchers connect an intelligent AI directly to real time communication networks and 3D worlds. Your AI, Anywhere.

ai artificial-intelligence bot computer-vision cv digital-beings digital-humans machine-learning ml nlp telegram

Last synced: 12 Nov 2024

https://github.com/young-geng/m3ae_public

Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation

computer-vision flax jax natural-language-processing transformers

Last synced: 06 Nov 2024

https://github.com/eliphatfs/calibur

Conversion between different conventions of camera matrices and transform matrices.

computer-graphics computer-vision developer-productivity lightweight

Last synced: 13 Nov 2024

https://github.com/salvatorera/ml-news-of-the-week

A collection of the best ML and AI news every week (research, news, resources)

agents ai artificial-intelligence computer-vision llms machine-learning nlp python rag retrieval-augmented-generation transformer

Last synced: 26 Oct 2024

https://github.com/varunagrawal/bbox

Python library for 2D/3D bounding boxes

3d-bounding-boxes bounding-boxes computer-vision computer-vision-tools

Last synced: 14 Nov 2024

https://github.com/adityap27/face-mask-detector

Real-time face mask detection using deep learning with alert system 💻🔔

artificial-intelligence computer-vision deep-learning machine-learning neural-network yolo

Last synced: 09 Nov 2024

https://github.com/li-plus/seam-carving

A super-fast Python implementation of seam carving algorithm for intelligent image resizing.

computer-graphics computer-vision image-processing image-resizing python seam-carving

Last synced: 06 Nov 2024

https://github.com/imatge-upc/sentiment-2017-imavis

From Pixels to Sentiment: Fine-tuning CNNs for Visual Sentiment Prediction

affective-computing cnn computer-vision deep-learning fine-tuning-cnns sentiment sentiment-maps

Last synced: 03 Aug 2024

https://github.com/sibozhang/Speech2Video

Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"

3d accv aigc body computer-vision face gan generative-ai gesture hand pose rnn simulation speech speech2video-synthesis talking-head talking-heads video

Last synced: 03 Aug 2024

https://github.com/nalepae/bounding-box

Bounding Box is a library to plot pretty bounding boxes with a simple Python API.

bounding-boxes computer-vision python visualization

Last synced: 14 Nov 2024

https://github.com/microsoft/orbit-dataset

The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. The dataset is presented with a teachable object recognition benchmark task which aims to drive few-shot learning on challenging real-world data.

benchmark classification computer-vision dataset few-shot-learning machine-learning meta-learning microsoft object-recognition video

Last synced: 26 Oct 2024

https://github.com/utkarsh-deshmukh/single-image-dehazing-python

Python implementation of the paper: "Efficient Image Dehazing with Boundary Constraint and Contextual Regularization"

airlight-estimation computer-vision defogging fog-removal haze-removal haze-removal-algorithm ieee image-dehazing opencv python single-image-defogging single-image-dehazing

Last synced: 10 Oct 2024

https://github.com/fasttrackorg/fasttrack

FastTrack is a cross-platform application designed to track multiple objects in video recordings.

computer-vision image-processing linux macos opencv opencv4 qt qt6 research science tracking windows

Last synced: 12 Oct 2024

Computer vision Awesome Lists
Awesome-pytorch-list 704
awesome-self-supervised-learning 453
awesome-multimodal-ml 480
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-hand-pose-estimation 493
awesome-image-classification 220
awesome-human-pose-estimation 89
Awesome-Crowd-Counting 451
awesome-autonomous-vehicles 296
Awesome-Federated-Learning 558
Awesome-pytorch-list-CNVersion 692
awesome-industrial-anomaly-detection 657
iOS_ML 39
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-FL 2,098
awesome-low-light-image-enhancement 197
Awesome-Implicit-NeRF-Robotics 184
CV-pretrained-model 103
awesome-tensorflow-lite 110
awesome-capsule-networks 67
awesome-attention-mechanism-in-cv 195
awesome-grounding 154
Awesome-Image-Colorization 115
awesome-ai-awesomeness 236
openstl 43
awesome-autonomous-vehicle 181
Awesome-Open-Vocabulary 158
awesome-open-data-centric-ai 56
Awesome-Skeleton-based-Action-Recognition 104
awesome-6d-object 600
awesome-scene-understanding 197
awesome-multi-task-learning 226
awesome-holistic-3d 129
awesome-data-annotation 93
awesome-panoptic-segmentation 45
awesome-photogrammetry 68
awesome-llm-and-aigc 1,134
Awesome-3D-Object-Detection 169
awesome-computer-vision-models 189
Awesome-Parameter-Efficient-Transfer-Learning 121
awesome-state-of-depth-completion 59
Awesome-Distributed-Deep-Learning 44
awesome-image-alignment-and-stitching 108
Awesome-Monocular-3D-detection 93
Awesome-Multi-Task-Learning 80
Computer-Vision-Video-Lectures 67
awesome-segment-anything-extensions 120
awesome-robotics-datasets 79
awesome_computer_science 116