Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/mybridge/learn-machine-learning

Learn to Build a Machine Learning Application from Top Articles

computer-vision data-science deep-learning machine-learning neural-networks

Last synced: 19 Feb 2025

https://mcgill-nlp.github.io/weblinx/

WebLINX is a benchmark for building web navigation agents with conversational capabilities

agent agents computer-vision llm multimodal navigation nlp web

Last synced: 17 Nov 2024

https://github.com/McGill-NLP/weblinx

WebLINX is a benchmark for building web navigation agents with conversational capabilities

agent agents computer-vision llm multimodal navigation nlp web

Last synced: 20 Oct 2024

https://github.com/VedankPurohit/LiveRecall

Welcome to **LiveRecall**, the open-source alternative to Microsoft's Recall. LiveRecall captures snapshots of your screen and allows you to recall them using natural language queries, leveraging semantic search technology. For added security, all images are encrypted.

computer-vision memory productivity python recall sementic

Last synced: 06 Jan 2025

https://github.com/rizwanmunawar/yolov5-object-tracking

YOLOv5 Object Tracking + Detection + Object Blurring + Streamlit Dashboard Using OpenCV, PyTorch and Streamlit

computer-vision object-detection object-tracking streamlit-dashboard yolov5

Last synced: 17 Feb 2025

https://github.com/101arrowz/scanner

Document scanning from scratch

computer-vision rust typescript wasm

Last synced: 27 Oct 2024

https://github.com/abd-shoumik/Social-distance-detection

Social distance detection, a deep learning computer vision project with YOLO object detection

computer-vision covid-19 dataset deeplearning deeplearning-ai socialdistancing yolo

Last synced: 09 Nov 2024

https://github.com/ismail-mebsout/Parsing-PDFs-using-YOLOV3

Parsing PDF tables using YOLOv3

computer-vision pdf python

Last synced: 09 Nov 2024

https://github.com/filipbasara0/simple-diffusion

A minimal implementation of a denoising diffusion model in PyTorch.

attention-mechanism computer-vision deep-learning diffusion image-generation pytorch stable-diffusion unet

Last synced: 12 Feb 2025

https://github.com/into-ai/deeplearning2020

Course materials for Introduction to Deep Learning 2020

computer-vision course-materials deep-learning lab learning mooc openhpi tensorflow

Last synced: 13 Feb 2025

https://github.com/longxiang-ai/human101

The official implementation of "Human101: Training 100+FPS Human Gaussians in 100s from 1 View".

3d 3d-reconstruction 3d-vision computer-vision digital-human gaussian gaussian-splatting human-reconstruction monocular real-time real-time-rendering view-synthesis

Last synced: 26 Jan 2025

https://github.com/ravisvi/super-resolution-videos

Applies the SRGAN technique implemented in https://github.com/zsdonghao/SRGAN to videos to super-resolve them.

computer-vision srgan super-resolution tensorlayer

Last synced: 07 Nov 2024

https://github.com/dahyun-kang/renet

[ICCV'21] Official PyTorch implementation of Relational Embedding for Few-Shot Classification

computer-vision deep-learning few-shot-classifcation few-shot-learning iccv2021 neural-networks pytorch

Last synced: 15 Nov 2024

https://github.com/jmisilo/clip-gpt-captioning

CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.

computer-vision cv deep-learning image-caption image-caption-generator image-captioning machine-learning nlp python pytorch

Last synced: 17 Dec 2024

https://github.com/utkarsh-deshmukh/single-image-dehazing-python

Python implementation of the paper "Efficient Image Dehazing with Boundary Constraint and Contextual Regularization"

airlight-estimation computer-vision defogging fog-removal haze-removal haze-removal-algorithm ieee image-dehazing opencv python single-image-defogging single-image-dehazing

Last synced: 17 Feb 2025

https://github.com/rpng/android-camera-calibration

Updated (OpenCV 3 and Camera2 API) Android camera calibration application

android camera-calibration camera2-api computer-vision opencv

Last synced: 18 Nov 2024

https://github.com/pooya-mohammadi/deep_utils

An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!

augmentation coco computer-vision cutmix deep-learning face-detection face-recognition machine-learning modelcheckpoint nlp object-detection python pytorch senet tensorflow utils vggface2 yolov5

Last synced: 09 Nov 2024

https://github.com/bytedance/sptsv2

The official implementation of SPTS v2: Single-Point Text Spotting

artificial-intelligence computer-vision deep-learning ocr

Last synced: 15 Nov 2024

https://github.com/agentmorris/megadetector

MegaDetector is an AI model that helps conservation folks spend less time doing boring things with camera trap images.

camera-traps cameratraps computer-vision conservation ecology machine-learning megadetector wildlife

Last synced: 16 Feb 2025

https://github.com/RizwanMunawar/yolov5-object-tracking

YOLOv5 Object Tracking + Detection + Object Blurring + Streamlit Dashboard Using OpenCV, PyTorch and Streamlit

computer-vision object-detection object-tracking streamlit-dashboard yolov5

Last synced: 09 Nov 2024

https://github.com/henzler/neuraltexture

Learning a Neural 3D Texture Space from 2D Exemplars [CVPR 2020]

computer-graphics computer-vision deep-learning machine-learning texture-synthesis

Last synced: 27 Jan 2025

https://github.com/mindspore-courses/d2l-mindspore

MindSpore implementation of "Dive into Deep Learning" (动手学深度学习), intended for MindSpore learners to use alongside Mu Li's course.

computer-vision deep-learning machine-learning mindspore natural-language-processing notebook

Last synced: 16 Nov 2024

https://github.com/yihongXU/TransCenter

This is the official implementation of TransCenter (TPAMI). The code and pretrained models are now available here: https://gitlab.inria.fr/yixu/TransCenter_official.

computer-vision deep-learning multiple-object-tracking pytorch transformers

Last synced: 13 Nov 2024

https://github.com/furkan-gulsen/sport-with-ai

The human body is detected with the MediaPipe library. Mathematical methods are then applied to count how many repetitions of the exercise have been done.

ai artificial-intelligence computer-vision deep-learning image-processing keras machine-learning mediapipe python python3 sport tensorflow

Last synced: 02 Feb 2025

https://github.com/pollen-robotics/pollen-vision

Simple and unified interface to zero-shot computer vision models curated for robotics use cases.

computer-vision grasping object-detection object-segmentation robotics

Last synced: 13 Feb 2025

https://github.com/by-sabbir/headposeestimation

Head pose estimation using OpenCV by solving PnP.

computer-vision headpose-estimation opencv

Last synced: 19 Feb 2025

https://github.com/xiaosu-zhu/mcquic

Repository of CVPR'22 paper "Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression"

computer-vision cvpr2022 image-compression image-processing pytorch

Last synced: 29 Jan 2025

https://github.com/mars885/doc-skanner

An Android application that makes it possible to automatically scan and digitize documents from photos.

android android-application computer-vision doc-scanner document-scanner opencv

Last synced: 08 Nov 2024

https://github.com/karayaman/lichess-with-a-real-board

Lichess.org client for real life chess boards.

computer-vision lichess

Last synced: 08 Nov 2024

https://github.com/twke18/SPML

Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning

computer-vision deep-learning metric-learning semantic-segmentation weakly-supervised-learning

Last synced: 29 Oct 2024

https://github.com/dthuerck/mapmap_cpu

A high-performance general-purpose MRF MAP solver, heavily exploiting SIMD instructions.

algorithm computer-vision dynamic-programming feedback-vertex-set markov-random-field massively-parallel mrf simd simd-programming vectorization

Last synced: 09 Nov 2024

https://github.com/jhu-lcsr/good_robot

"Good Robot! Now Watch This!": Repurposing Reinforcement Learning for Task-to-Task Transfer; and "Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer

computer-vision deep-learning deep-q-network deep-reinforcement-learning grasping learning-from-demonstration manipulation multi-step-dqn multi-step-learning reinforcement-learning robotic-manipulation robotics simulation task-to-task-transfer vrep-simulator zero-shot-learning

Last synced: 02 Nov 2024

https://github.com/UT-Austin-RPL/Ditto

Code for Ditto: Building Digital Twins of Articulated Objects from Interaction

3d-reconstruction computer-vision deep-learning digital-twin robotics

Last synced: 07 Nov 2024

https://github.com/scrubbbbs/cbird

Command-line program for managing a media collection, with focus on Content-Based Image Retrieval (Computer Vision) methods for finding duplicates.

command-line-interface computer-vision content-based-image-retrieval duplicate-detection duplicate-files duplicates ffmpeg opencv qt6 similarity-search

Last synced: 13 Feb 2025

https://github.com/jeeliz/jeelizpupillometry

Real-time pupillometry in the web browser using a 4K webcam video feed processed by this WebGL/JavaScript library. Two demo experiments are included.

camera cctv computer-vision detection eye eyelink face iris javascript measurement pupil pupillometry radius real-time size tracking video webcam webgl

Last synced: 16 Nov 2024

https://github.com/kyegomez/rt-x

PyTorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"

artificial-intelligence attention-is-all-you-need attention-model computer-vision gpt4 gpt4all multimodal vision vision-transformer

Last synced: 27 Dec 2024

https://github.com/ccextractor/rekognition

Free and Open Source alternative to Amazon's Rekognition service. CCExtractor Development | Poor Man's Rekognition

computer-vision deep-learning django django-rest-framework docker face-detection gsoc image-processing machine-learning machinelearning opencv python rest rest-api tensorflow tensorflow-serving video-processing

Last synced: 20 Dec 2024

https://github.com/fasttrackorg/fasttrack

FastTrack is a cross-platform application designed to track multiple objects in video recordings.

computer-vision image-processing linux macos opencv opencv4 qt qt6 research science tracking windows

Last synced: 18 Feb 2025

https://github.com/ai4ce/msg

[NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)

computer-vision deep-learning multiview neurips-2024 scene-graph scene-representations visual-navigation

Last synced: 14 Feb 2025

https://github.com/n2cholas/jax-resnet

Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).

computer-vision deep-learning flax jax machine-learning neural-networks resnet

Last synced: 18 Nov 2024

https://github.com/anupsarkar-dev/exovisix

Auto Attendance System Using Real Time Face Recognition With Various Computer Vision & Machine Learning Tools

bio-metric-attendance-system computer-vision eye-detection face-recognition gesture-detection haar-training javacv javafx mechine-learing motion-detection mysql-database ocr-recognition opencv

Last synced: 11 Feb 2025

https://github.com/mdietrichstein/yolo-tiny-v1-mobile

Yolo for Android and iOS - Mobile Deep Learning Object Detection in Realtime written in Kotlin and Swift

android computer-vision deep-learning deep-neural-networks ios keras keras-tensorflow mobile tensorflow

Last synced: 07 Dec 2024

https://github.com/Z-Zheng/FreeNet

FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification (TGRS 2020) https://ieeexplore.ieee.org/document/9007624

computer-vision convolutional-neural-network deep-learning geospatial hyperspectral-image-classification remote-sensing

Last synced: 31 Oct 2024

https://github.com/tatsuyah/Lane-Lines-Detection-Python-OpenCV

Lane line detection using Python and OpenCV for self-driving cars

computer-vision self-driving-car

Last synced: 03 Nov 2024

https://github.com/shi-labs/semi-supervised-transfer-learning

[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

computer-vision semi-supervised-learning transfer-learning

Last synced: 08 Nov 2024

https://github.com/sayakpaul/convnext-tf

Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.

computer-vision convolutional-neural-networks efficient-models imagenet-1k imagenet-21k keras tensorflow

Last synced: 13 Jan 2025

https://github.com/alkasm/colorfilters

Image thresholding in multiple colorspaces.

computer-vision image-processing opencv

Last synced: 27 Oct 2024

https://github.com/microsoft/orbit-dataset

The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mobile phone. The dataset is presented with a teachable object recognition benchmark task which aims to drive few-shot learning on challenging real-world data.

benchmark classification computer-vision dataset few-shot-learning machine-learning meta-learning microsoft object-recognition video

Last synced: 11 Feb 2025

https://github.com/SHI-Labs/Semi-Supervised-Transfer-Learning

[CVPR 2021] Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

computer-vision semi-supervised-learning transfer-learning

Last synced: 15 Nov 2024

https://github.com/li-plus/seam-carving

A super-fast Python implementation of the seam carving algorithm for intelligent image resizing.

computer-graphics computer-vision image-processing image-resizing python seam-carving

Last synced: 18 Feb 2025

https://github.com/yuanze-lin/revive

[NeurIPS 2022] Official Code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering

computer-vision deep-learning gpt-3 knowledge-based multimodal-deep-learning neurips2022 ok-vqa pytorch question-answering vision-and-languge vqa

Last synced: 13 Feb 2025

https://github.com/abhshkdz/neural-vqa-attention

Attention-based Visual Question Answering in Torch

computer-vision deep-learning natural-language-processing torch

Last synced: 08 Nov 2024

https://github.com/sibozhang/Speech2Video

Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"

3d accv aigc body computer-vision face gan generative-ai gesture hand pose rnn simulation speech speech2video-synthesis talking-head talking-heads video

Last synced: 17 Nov 2024

https://github.com/etherealengine/digital-beings

A platform for letting researchers connect an intelligent AI directly to real time communication networks and 3D worlds. Your AI, Anywhere.

ai artificial-intelligence bot computer-vision cv digital-beings digital-humans machine-learning ml nlp telegram

Last synced: 12 Nov 2024

https://github.com/varunagrawal/bbox

Python library for 2D/3D bounding boxes

3d-bounding-boxes bounding-boxes computer-vision computer-vision-tools

Last synced: 19 Nov 2024

https://github.com/young-geng/m3ae_public

Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation

computer-vision flax jax natural-language-processing transformers

Last synced: 06 Nov 2024

https://github.com/skalskip/fashion-assistant

Our idea is to combine the power of computer vision models and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from images. We pass the prompt, along with the extracted features, to an LLM, allowing for advanced image dataset queries.

computer-vision llm natural-language-processing openai openai-api yolo

Last synced: 01 Nov 2024

https://github.com/hv0905/nekoimagegallery

An AI-powered natural language & reverse Image Search Engine powered by CLIP & qdrant.

clip computer-vision image-search image-search-engine search-engine transformers

Last synced: 13 Feb 2025

https://github.com/tensorush/awesome-graphics-programming

😎 🧊 Collection of the most awesome learning resources on graphics programming in the form of videos, tutorials and books.

3d 3d-graphics augmented-reality awesome-list cg computer-graphics computer-vision game-development gamedev geometry-processing graphics graphics-programming learning opengl raytracing rendering shaders tutorials virtual-reality vulkan

Last synced: 06 Nov 2024

https://github.com/lingdong-/visionosc

PoseOSC + FaceOSC + HandOSC + OcrOSC + CatOSC + DogOSC

computer-vision facial-landmarks-detection hand-tracking ocr osc pose-estimation vision-framework

Last synced: 19 Jan 2025

https://github.com/eliphatfs/calibur

Conversion between different conventions of camera matrices and transform matrices.

computer-graphics computer-vision developer-productivity lightweight

Last synced: 19 Dec 2024

https://github.com/imatge-upc/sentiment-2017-imavis

From Pixels to Sentiment: Fine-tuning CNNs for Visual Sentiment Prediction

affective-computing cnn computer-vision deep-learning fine-tuning-cnns sentiment sentiment-maps

Last synced: 17 Nov 2024

https://github.com/adityap27/face-mask-detector

Real-time face mask detection using deep learning with an alert system 💻🔔

artificial-intelligence computer-vision deep-learning machine-learning neural-network yolo

Last synced: 09 Nov 2024

https://github.com/Placenote/PlacenoteSDK-iOS

PlacenoteSDK Sample app in native iOS using ARKit, written primarily in Swift

arkit augmented-reality computer-vision ios ios-swift scene-recognition scenekit slam

Last synced: 04 Dec 2024

https://github.com/nalepae/bounding-box

Bounding Box is a library to plot pretty bounding boxes with a simple Python API.

bounding-boxes computer-vision python visualization

Last synced: 14 Nov 2024

https://github.com/bytedance/twist

Official code: Self-Supervised Learning by Estimating Twin Class Distribution

computer-vision deep-learning pretraining self-supervised-learning twist

Last synced: 15 Nov 2024

https://github.com/juglab/FourierImageTransformer

Fourier Image Transformer (FIT) can solve relevant image analysis tasks in Fourier space.

computer-vision image-processing transformer

Last synced: 15 Nov 2024

https://github.com/eora-ai/inferoxy

Service for quickly deploying and using Dockerized computer vision models

computer-vision machine-learning mlops pipelines python

Last synced: 27 Nov 2024

https://github.com/yoonit-labs/nativescript-yoonit-camera

The most advanced and modern NativeScript Camera module for Android and iOS with a lot of awesome features

android camera camerax-api computer-vision hacktoberfest ios machine-learning mlkit nativescript vuejs-plugin

Last synced: 08 Feb 2025

https://github.com/pliang279/multiviz

[ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models

computer-vision machine-learning multimodal-learning natural-language-processing

Last synced: 14 Feb 2025

https://github.com/robertwgh/ezSIFT

ezSIFT: An easy-to-use standalone SIFT library written in C/C++

computer-vision cpp feature-detection feature-extraction feature-matching sift sift-algorithm sift-descriptors

Last synced: 14 Nov 2024

https://github.com/eveningglow/multitask-CycleGAN

PyTorch implementation of multitask CycleGAN with auxiliary classification loss

computer-vision cyclegan deep-learning gan generative-adversarial-network image-translation pytorch

Last synced: 15 Nov 2024

https://github.com/rssr25/computer-vision

Some computer vision projects written using OpenCV and Python.

computer-vision opencv python python27

Last synced: 24 Jan 2025

https://github.com/baidut/PaQ-2-PiQ

Source code for "From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality"

computer-vision image-processing image-quality image-quality-assessment picture-quality

Last synced: 03 Nov 2024

Computer vision Awesome Lists

Awesome-pytorch-list 704
awesome-self-supervised-learning 453
awesome-multimodal-ml 480
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-hand-pose-estimation 495
awesome-image-classification 220
awesome-human-pose-estimation 89
Awesome-Crowd-Counting 454
awesome-autonomous-vehicles 296
Awesome-Federated-Learning 558
Awesome-pytorch-list-CNVersion 692
Awesome-FL 2,323
awesome-industrial-anomaly-detection 723
iOS_ML 39
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-Implicit-NeRF-Robotics 185
CV-pretrained-model 103
awesome-tensorflow-lite 110
awesome-capsule-networks 67
awesome-attention-mechanism-in-cv 195
awesome-grounding 154
Awesome-Image-Colorization 123
openstl 43
awesome-ai-awesomeness 236
awesome-autonomous-vehicle 181
Awesome-Open-Vocabulary 159
Awesome-Skeleton-based-Action-Recognition 104
awesome-open-data-centric-ai 56
awesome-6d-object 600
awesome-scene-understanding 197
awesome-multi-task-learning 229
awesome-robotics-3d 106
awesome-holistic-3d 129
awesome-llm-and-aigc 1,350
awesome-data-annotation 93
awesome-panoptic-segmentation 45
awesome-photogrammetry 68
Awesome-3D-Object-Detection 169
awesome-optical-flow 110
awesome-computer-vision-models 189
Awesome-Parameter-Efficient-Transfer-Learning 121
awesome-ai-data-guided-projects 56
awesome-state-of-depth-completion 59
Awesome-Distributed-Deep-Learning 44
awesome-image-alignment-and-stitching 108
awesome_computer_science 116
Awesome-Monocular-3D-detection 97
Awesome-Multi-Task-Learning 80
Computer-Vision-Video-Lectures 67