Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/ali-design/gan_steerability

On the "steerability" of generative adversarial networks

computer-vision deep-learning gan generative-adversarial-network

Last synced: 15 Nov 2024

https://github.com/POSTECH-CVLab/FastPointTransformer

Official source code of Fast Point Transformer, CVPR 2022

3d-vision computer-vision cvpr2022 point-cloud transformer

Last synced: 28 Oct 2024

https://github.com/davidstutz/superpixels-revisited

Library containing 7 state-of-the-art superpixel algorithms with a total of 9 implementations used for evaluation purposes in [1] utilizing an extended version of the Berkeley Segmentation Benchmark.

computer-vision image-processing opencv superpixel-algorithms superpixels

Last synced: 18 Feb 2025

https://github.com/angeladai/ScanComplete

[CVPR'18] ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans

3d-reconstruction autoregressive-neural-networks computer-graphics computer-vision deep-learning

Last synced: 07 Nov 2024

https://github.com/roboflow/webcamgpt

webcamGPT - chat with video stream 💬 + 📸

chatgpt computer-vision gpt-4

Last synced: 18 Feb 2025

https://github.com/roboflow/roboflow-100-benchmark

Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets

computer-vision dataset deep-learning machine-learning object-detection pytorch

Last synced: 17 Feb 2025

https://github.com/qdata/c-tran

General Multi-label Image Classification with Transformers

computer-vision multi-label-classification transformers

Last synced: 19 Feb 2025

https://github.com/zc-alexfan/hold

[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D hand-object training data.

3d-reconstruction ai artificial-intelligence augmented-reality computer-vision hand-object-interaction hand-object-reconstruction hand-tracking mano mixed-reality neural-networks pose-estimation pytorch virtual-reality

Last synced: 13 Nov 2024

https://github.com/roboflow/webcamGPT

webcamGPT - chat with video stream 💬 + 📸

chatgpt computer-vision gpt-4

Last synced: 06 Nov 2024

https://github.com/junweiliang/multiverse

Dataset, code, and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction" and for the ECCV'20 SimAug paper.

3d-simulation computer-vision trajectory-prediction trajectory-prediction-benchmark video-understanding

Last synced: 17 Feb 2025

https://github.com/HusseinYoussef/Arabic-OCR

OCR system for Arabic language that converts images of typed text to machine-encoded text.

arabic character-segmentation computer-vision dataset image-processing machine-learning neural-network ocr opencv-python scikit-learn segmentation

Last synced: 03 Nov 2024

https://github.com/lil-lab/nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

computer-vision corpus machine-learning natural-language-processing

Last synced: 12 Nov 2024

https://github.com/QData/C-Tran

General Multi-label Image Classification with Transformers

computer-vision multi-label-classification transformers

Last synced: 05 Nov 2024

https://github.com/staghado/vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

ai c computer-vision cpp cpu edge-computing ggml image-classification llamacpp vision-transformer whisper-cpp

Last synced: 14 Feb 2025

https://github.com/twitter-research/image-crop-analysis

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

bias computer-vision fairness fairness-ml image-processing machine-learning research

Last synced: 30 Oct 2024

https://github.com/fcakyon/craft-text-detector

Packaged, PyTorch-based, easy-to-use, cross-platform version of the CRAFT text detector

actions anaconda computer-vision craft deep-learning document hacktoberfest linux macos neural-network ocr pypi python pytorch text text-detection vision windows workflow

Last synced: 03 Nov 2024

https://github.com/megvii-research/RevCol

Official code for the papers "Reversible Column Networks" and "RevColv2"

cnn computer-vision iclr2023 mae pytorch transformer vit

Last synced: 28 Oct 2024

https://github.com/LZBUAV/K210

A collection of applications for the Kendryte K210 AI chip, including face detection, color detection, object detection and classification, QR code and AprilTag detection, and communication with the ArduPilot flight software. The applications have been deployed to UAV terminals to make drones more intelligent.

computer-vision face-detection k210 machine-vision yolov2

Last synced: 28 Oct 2024

https://github.com/mtli/html4vision

A simple HTML visualization tool for computer vision research 🛠️

computer-vision html visualization

Last synced: 19 Feb 2025

https://github.com/louisfb01/best_ai_papers_2023

A curated list of the latest breakthroughs in AI (in 2023) by release date with a clear video explanation, link to a more in-depth article, and code.

ai artificial-intelligence computer-vision machine-learning ml nlp paper papers python research state-of-the-art

Last synced: 18 Feb 2025

https://github.com/chunbolang/BAM

Official PyTorch Implementation of Learning What Not to Segment: A New Perspective on Few-Shot Segmentation (CVPR'22 Oral & TPAMI'23).

computer-vision few-shot-segmentation

Last synced: 15 Nov 2024

https://github.com/brentyi/jaxlie

Rigid transforms + Lie groups in JAX

computer-vision geometry jax lie-groups robotics

Last synced: 15 Feb 2025

https://github.com/milaan9/python_computer_vision_from_scratch

This repository explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos.

canny-edge-detection computer-vision dtw-algorithm eigenfaces feature-extraction hough-lines image-analysis image-manipulation image-processing image-recognition ipython-notebook machine-learning python4datascience python4everybody sobel-filter sobel-operator tutor-milaan9

Last synced: 13 Feb 2025

https://github.com/AaltoVision/ADVIO

An Authentic Dataset for Visual-Inertial Odometry

benchmarking computer-vision navigation visual-inertial-odometry

Last synced: 02 Nov 2024

https://github.com/mtli/HTML4Vision

A simple HTML visualization tool for computer vision research 🛠️

computer-vision html visualization

Last synced: 15 Nov 2024

https://github.com/idea-research/deepdataspace

The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.

collaborative-annotation computer-vision dataset-visualization intelligent-annotation labeling-tool model-analysis

Last synced: 16 Feb 2025

https://github.com/eric-canas/qreader

Robust and straightforward solution for reading difficult and tricky QR codes within images in Python. Powered by YOLOv8.

computer-vision easy-to-use image-processing object-detection pip python pytorch pyzbar qr qrcode qrcode-reader qrcode-scanner yolov8

Last synced: 15 Feb 2025

https://github.com/lannguyen0910/food-recognition

🍔🍟🍗 Food analysis baseline with Theseus. Integrate object detection, image classification and multi-class semantic segmentation 🍞🍖🍕

computer-vision deep-learning efficientnet food-analysis food-classification food-detection food-segmentation image-classification object-detection pytorch semantic-segmentation unetplusplus yolov5 yolov8

Last synced: 09 Nov 2024

https://github.com/mindspore-lab/mindcv

A toolbox of vision models and algorithms based on MindSpore

computer-vision cv deep-learning image-classification imagenet mindspore models resnet transformer

Last synced: 28 Oct 2024

https://github.com/vitae-transformer/vitae-transformer-matting

A comprehensive list [AIM@IJCAI'21, P3M@MM'21, GFM@IJCV'22, RIM@CVPR'23, P3MNet@IJCV'23] of our research works related to image matting, including papers, codes, datasets, demos, and citations. Note: The repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving" has been moved to: https://github.com/ViTAE-Transformer/P3M-Net

computer-vision deep-learning image-matting privacy-preserving survey vision-transformer

Last synced: 14 Jan 2025

https://github.com/nianticlabs/wavelet-monodepth

[CVPR 2021] Monocular depth estimation using wavelets for efficiency

computer-vision cvpr2021 depth-estimation kitti-dataset nyu-depth-v2 wavelets

Last synced: 18 Feb 2025

https://github.com/kwotsin/transfer_learning_tutorial

A guide to transfer learning with inception-resnet-v2.

computer-vision tensorflow tensorflow-tutorials transfer-learning

Last synced: 10 Feb 2025

https://github.com/voxel51/voxelgpt

AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions

artificial-intelligence chatgpt computer-vision data-science deep-learning fiftyone langchain llm machine-learning openai python

Last synced: 09 Nov 2024

https://github.com/daleroberts/bv

Quickly view satellite imagery, hyperspectral imagery, and machine learning image outputs directly in your iTerm2 terminal.

command-line computer-vision image iterm2 machine-learning python satellite satellite-imagery

Last synced: 20 Dec 2024

https://github.com/ternaus/cloths_segmentation

Code for binary segmentation of clothes

computer-vision deep-learning image-segmentation

Last synced: 13 Feb 2025

https://github.com/wkentaro/morefusion

MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion, CVPR 2020

artificial-intelligence computer-vision deep-learning machine-learning pose-estimation robotics ros

Last synced: 14 Nov 2024

https://github.com/zae-bayern/elpv-dataset

A dataset of functional and defective solar cells extracted from EL images of solar modules

computer-vision machine-learning photovoltaic solar-cells solar-energy

Last synced: 14 Nov 2024

https://github.com/baegwangbin/surface_normal_uncertainty

[ICCV 2021 Oral] Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation

3d-reconstruction computer-vision deep-learning iccv2021 surface-normal surface-normals surface-normals-estimation uncertainty uncertainty-estimation

Last synced: 13 Feb 2025

https://github.com/nianticlabs/footprints

[CVPR 2020] Estimation of the visible and hidden traversable space from a single color image

computer-vision deep-learning depth-estimation monodepth pytorch

Last synced: 18 Feb 2025

https://github.com/aangelopoulos/conformal_classification

Wrapper for a PyTorch classifier which allows it to output prediction sets. The sets are theoretically guaranteed to contain the true class with high probability (via conformal prediction).

artificial-intelligence classification classifier computer-vision conformal conformal-prediction deep-neural-networks distribution-free imagenet machine-learning neural-networks nonparametric nonparametric-statistics prediction-sets pytorch statistics uncertainty uncertainty-quantification vision

Last synced: 05 Nov 2024

https://github.com/wkentaro/fcn

Chainer Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)

chainer computer-vision convolutional-networks deep-learning fcn fcn8s segmentation semantic-segmentation

Last synced: 15 Feb 2025

https://github.com/baegwangbin/magnet

[CVPR 2022 Oral] Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

3d-reconstruction computer-vision cvpr2022 deep-learning depth-estimation multi-view-stereo multiview-geometry multiview-stereo uncertainty uncertainty-estimation

Last synced: 10 Feb 2025

https://github.com/vinjn/opencv-2-cookbook-src

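Companion code for the book 《OpenCV 2 计算机视觉编程手册》 (the Chinese edition of "OpenCV 2 Computer Vision Application Programming Cookbook"); supports OpenCV 3.x / 4.x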

computer-vision opencv opencv2 opencv3 opencv4

Last synced: 17 Feb 2025

https://github.com/compphoto/intrinsic

Repo for the papers "Intrinsic Image Decomposition via Ordinal Shading" (TOG 2023) and "Colorful Diffuse Intrinsic Image Decomposition in the Wild" (TOG 2024)

computer-graphics computer-vision image-processing intrinsic-decomposition inverse-rendering machine-learning multi-illumination

Last synced: 16 Feb 2025

https://github.com/haofanwang/natural-language-joint-query-search

Search photos on Unsplash using OpenAI's CLIP model; supports search with joint image+text queries and attention visualization.

attention clip computer-vision image-retrieval image-search multi-modal-search unsplash visualizations

Last synced: 19 Dec 2024

https://github.com/arefmalek/airdraw

A vision-based drawing application

computer-vision mediapipe opencv python python3

Last synced: 31 Oct 2024

https://github.com/1adrianb/binary-human-pose-estimation

This code implements a demo of the Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources paper by Adrian Bulat and Georgios Tzimiropoulos.

binary computer-vision deep-learning human-pose-estimation network-binarization torch7

Last synced: 10 Jan 2025

https://github.com/morvanzhou/tensorflow-computer-vision-tutorial

Tutorials of deep learning for computer vision.

cnn computer-vision deepdream googlenet lenet resnet

Last synced: 10 Feb 2025

https://github.com/Villavu/Simba

Simba is a program used to repeat certain (complicated) tasks. Typically these tasks involve using the mouse and keyboard. Simba is programmable, which means you can design your own logic and steps that Simba will follow, based upon certain input such as colors on the screen.

automation computer-vision fpc lape lazarus macro pascal simba villavu

Last synced: 04 Nov 2024

https://github.com/jiachens/ModelNet40-C

Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296

benchmark computer-vision corruption-robustness data-augmentation deep-learning ml-safety point-cloud-processing pytorch regularization robustness

Last synced: 28 Oct 2024

https://github.com/li-plus/dsnet

DSNet: A Flexible Detect-to-Summarize Network for Video Summarization

computer-vision detection machine-learning pytorch video-summarization

Last synced: 27 Dec 2024

https://github.com/angeladai/3DMV

[ECCV'18] 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation

computer-graphics computer-vision deep-learning scene-understanding

Last synced: 07 Nov 2024

https://github.com/kingyiusuen/clip-image-search

Search images with a text or image query, using OpenAI's pretrained CLIP model.

computer-vision deep-learning elasticsearch image-search image-search-engine reverse-image-search search-engine streamlit-webapp

Last synced: 19 Dec 2024

https://github.com/kwotsin/tensorflow-xception

TensorFlow implementation of the Xception Model by François Chollet

computer-vision deep-learning machine-learning tensorflow tensorflow-xception xception-model

Last synced: 19 Feb 2025

https://github.com/lijx10/DeepI2P

DeepI2P: Image-to-Point Cloud Registration via Deep Classification. CVPR 2021

3d-vision cnn computer-vision deep-learning image-processing localization neural-network point-cloud registration

Last synced: 27 Oct 2024

https://github.com/kwotsin/TensorFlow-Xception

TensorFlow implementation of the Xception Model by François Chollet

computer-vision deep-learning machine-learning tensorflow tensorflow-xception xception-model

Last synced: 31 Oct 2024

https://github.com/dmarnerides/pydlt

PyTorch based Deep Learning Toolbox

computer-vision deep-learning pytorch toolbox

Last synced: 14 Nov 2024

https://github.com/baegwangbin/MaGNet

[CVPR 2022 Oral] Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

3d-reconstruction computer-vision cvpr2022 deep-learning depth-estimation multi-view-stereo multiview-geometry multiview-stereo uncertainty uncertainty-estimation

Last synced: 04 Nov 2024

https://github.com/rish-16/sight

👁 Sightseer: TensorFlow library for state-of-the-art Computer Vision and Object Detection models

computer-vision object-detection state-of-the-art tensorflow

Last synced: 11 Nov 2024

https://github.com/imfing/sketch-to-art

🖼 Create artwork from your casual sketch with GAN and style transfer

computer-vision deep-learning gan pix2pix style-transfer

Last synced: 18 Feb 2025

https://github.com/junyanz/light-field-video

Light field video applications (e.g. video refocusing, focus tracking, changing aperture and view)

caffe computer-graphics computer-vision deep-learning lightfield

Last synced: 19 Dec 2024

https://github.com/maartengr/concept

Concept Modeling: Topic Modeling on Images and Text

computer-vision image-processing nlp topic-modeling

Last synced: 17 Feb 2025

https://github.com/albumentations-team/autoalbument

AutoML for image augmentation. AutoAlbument uses the Faster AutoAugment algorithm to find optimal augmentation policies. Documentation - https://albumentations.ai/docs/autoalbument/

augmentation automated-machine-learning automl computer-vision deep-learning image-augmentation machine-learning pytorch

Last synced: 14 Feb 2025

Computer vision Awesome Lists

Awesome-pytorch-list 704
awesome-self-supervised-learning 453
awesome-multimodal-ml 480
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-hand-pose-estimation 495
awesome-image-classification 220
awesome-human-pose-estimation 89
Awesome-Crowd-Counting 454
awesome-autonomous-vehicles 296
Awesome-Federated-Learning 558
Awesome-pytorch-list-CNVersion 692
Awesome-FL 2,323
awesome-industrial-anomaly-detection 723
iOS_ML 39
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-Implicit-NeRF-Robotics 185
CV-pretrained-model 103
awesome-tensorflow-lite 110
awesome-capsule-networks 67
awesome-attention-mechanism-in-cv 195
awesome-grounding 154
Awesome-Image-Colorization 123
openstl 43
awesome-ai-awesomeness 236
awesome-autonomous-vehicle 181
Awesome-Open-Vocabulary 159
Awesome-Skeleton-based-Action-Recognition 104
awesome-open-data-centric-ai 56
awesome-6d-object 600
awesome-scene-understanding 197
awesome-multi-task-learning 229
awesome-robotics-3d 106
awesome-holistic-3d 129
awesome-llm-and-aigc 1,350
awesome-data-annotation 93
awesome-panoptic-segmentation 45
awesome-photogrammetry 68
Awesome-3D-Object-Detection 169
awesome-optical-flow 110
awesome-computer-vision-models 189
Awesome-Parameter-Efficient-Transfer-Learning 121
awesome-ai-data-guided-projects 56
awesome-state-of-depth-completion 59
Awesome-Distributed-Deep-Learning 44
awesome-image-alignment-and-stitching 108
awesome_computer_science 116
Awesome-Monocular-3D-detection 97
Awesome-Multi-Task-Learning 80
Computer-Vision-Video-Lectures 67