Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/vzhou842/cnn-from-scratch

A Convolutional Neural Network implemented from scratch (using only numpy) in Python.

computer-vision convolutional-neural-networks guide machine-learning neural-network python

Last synced: 13 Nov 2024

https://github.com/photonvision/photonvision

PhotonVision is the free, fast, and easy-to-use computer vision solution for the FIRST Robotics Competition.

computer-vision frc java opencv vision vision-processing wpilib

Last synced: 13 Oct 2024

https://github.com/POSTECH-CVLab/FastPointTransformer

Official source code of Fast Point Transformer, CVPR 2022

3d-vision computer-vision cvpr2022 point-cloud transformer

Last synced: 28 Oct 2024

https://github.com/angeladai/ScanComplete

[CVPR'18] ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans

3d-reconstruction autoregressive-neural-networks computer-graphics computer-vision deep-learning

Last synced: 07 Nov 2024

https://github.com/zc-alexfan/hold

[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and 3D hand-object training data.

3d-reconstruction ai artificial-intelligence augmented-reality computer-vision hand-object-interaction hand-object-reconstruction hand-tracking mano mixed-reality neural-networks pose-estimation pytorch virtual-reality

Last synced: 13 Nov 2024

https://github.com/roboflow/webcamGPT

webcamGPT - chat with video stream 💬 + 📸

chatgpt computer-vision gpt-4

Last synced: 06 Nov 2024

https://github.com/HusseinYoussef/Arabic-OCR

OCR system for Arabic language that converts images of typed text to machine-encoded text.

arabic character-segmentation computer-vision dataset image-processing machine-learning neural-network ocr opencv-python scikit-learn segmentation

Last synced: 03 Nov 2024

https://github.com/lil-lab/nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

computer-vision corpus machine-learning natural-language-processing

Last synced: 12 Nov 2024

https://github.com/QData/C-Tran

General Multi-label Image Classification with Transformers

computer-vision multi-label-classification transformers

Last synced: 05 Nov 2024

https://github.com/davidstutz/superpixels-revisited

Library containing 7 state-of-the-art superpixel algorithms with a total of 9 implementations used for evaluation purposes in [1] utilizing an extended version of the Berkeley Segmentation Benchmark.

computer-vision image-processing opencv superpixel-algorithms superpixels

Last synced: 10 Nov 2024

https://github.com/twitter-research/image-crop-analysis

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

bias computer-vision fairness fairness-ml image-processing machine-learning research

Last synced: 30 Oct 2024

https://github.com/fcakyon/craft-text-detector

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

actions anaconda computer-vision craft deep-learning document hacktoberfest linux macos neural-network ocr pypi python pytorch text text-detection vision windows workflow

Last synced: 03 Nov 2024

https://github.com/megvii-research/RevCol

Official Code of Paper "Reversible Column Networks" "RevColv2"

cnn computer-vision iclr2023 mae pytorch transformer vit

Last synced: 28 Oct 2024

https://github.com/LZBUAV/K210

A collection of applications for the Kendryte K210 AI chip, including face detection, color detection, object detection and classification, QR code and AprilTag code detection, and communication with the ArduPilot flight software. These applications have been deployed to UAV terminals to make drones more intelligent.

computer-vision face-detection k210 machine-vision yolov2

Last synced: 28 Oct 2024

https://github.com/mdyao/Awesome-3D-AIGC

A curated list of papers and open-source resources focused on 3D AIGC.

3d-aigc 3d-gaussian-splatting 3d-generation aigc computer-graphics computer-vision generation nerf neural-rendering

Last synced: 12 Sep 2024

https://github.com/AaltoVision/ADVIO

An Authentic Dataset for Visual-Inertial Odometry

benchmarking computer-vision navigation visual-inertial-odometry

Last synced: 02 Nov 2024

https://github.com/chunbolang/BAM

Official PyTorch Implementation of Learning What Not to Segment: A New Perspective on Few-Shot Segmentation (CVPR'22 Oral & TPAMI'23).

computer-vision few-shot-segmentation

Last synced: 03 Aug 2024

https://github.com/milaan9/python_computer_vision_from_scratch

This repository explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos.

canny-edge-detection computer-vision dtw-algorithm eigenfaces feature-extraction hough-lines image-analysis image-manipulation image-processing image-recognition ipython-notebook machine-learning python4datascience python4everybody sobel-filter sobel-operator tutor-milaan9

Last synced: 11 Oct 2024

https://github.com/lannguyen0910/food-recognition

🍔🍟🍗 Food analysis baseline with Theseus. Integrate object detection, image classification and multi-class semantic segmentation 🍞🍖🍕

computer-vision deep-learning efficientnet food-analysis food-classification food-detection food-segmentation image-classification object-detection pytorch semantic-segmentation unetplusplus yolov5 yolov8

Last synced: 09 Nov 2024

https://github.com/junweiliang/multiverse

Dataset, code and model for the CVPR'20 paper "The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction" and for the ECCV'20 SimAug paper.

3d-simulation computer-vision trajectory-prediction trajectory-prediction-benchmark video-understanding

Last synced: 08 Nov 2024

https://github.com/mindspore-lab/mindcv

A toolbox of vision models and algorithms based on MindSpore

computer-vision cv deep-learning image-classification imagenet mindspore models resnet transformer

Last synced: 28 Oct 2024

https://github.com/kwotsin/transfer_learning_tutorial

A guide to transfer learning with inception-resnet-v2.

computer-vision tensorflow tensorflow-tutorials transfer-learning

Last synced: 12 Nov 2024

https://github.com/roboflow/roboflow-100-benchmark

Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets

computer-vision dataset deep-learning machine-learning object-detection pytorch

Last synced: 13 Nov 2024

https://github.com/mtli/HTML4Vision

A simple HTML visualization tool for computer vision research 🛠️

computer-vision html visualization

Last synced: 03 Aug 2024

https://github.com/voxel51/voxelgpt

AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions

artificial-intelligence chatgpt computer-vision data-science deep-learning fiftyone langchain llm machine-learning openai python

Last synced: 09 Nov 2024

https://github.com/daleroberts/bv

Quickly view satellite imagery, hyperspectral imagery, and machine learning image outputs directly in your iTerm2 terminal.

command-line computer-vision image iterm2 machine-learning python satellite satellite-imagery

Last synced: 07 Nov 2024

https://github.com/brentyi/jaxlie

Rigid transforms + Lie groups in JAX

computer-vision geometry jax lie-groups robotics

Last synced: 22 Oct 2024

https://github.com/wkentaro/morefusion

MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion, CVPR 2020

artificial-intelligence computer-vision deep-learning machine-learning pose-estimation robotics ros

Last synced: 14 Nov 2024

https://github.com/zae-bayern/elpv-dataset

A dataset of functional and defective solar cells extracted from EL images of solar modules

computer-vision machine-learning photovoltaic solar-cells solar-energy

Last synced: 14 Nov 2024

https://github.com/staghado/vit.cpp

Inference Vision Transformer (ViT) in plain C/C++ with ggml

ai c computer-vision cpp cpu edge-computing ggml image-classification llamacpp vision-transformer whisper-cpp

Last synced: 12 Nov 2024

https://github.com/ternaus/cloths_segmentation

Code for binary segmentation of cloths

computer-vision deep-learning image-segmentation

Last synced: 26 Oct 2024

https://github.com/aangelopoulos/conformal_classification

Wrapper for a PyTorch classifier which allows it to output prediction sets. The sets are theoretically guaranteed to contain the true class with high probability (via conformal prediction).

artificial-intelligence classification classifier computer-vision conformal conformal-prediction deep-neural-networks distribution-free imagenet machine-learning neural-networks nonparametric nonparametric-statistics prediction-sets pytorch statistics uncertainty uncertainty-quantification vision

Last synced: 05 Nov 2024

https://github.com/wkentaro/fcn

Chainer Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)

chainer computer-vision convolutional-networks deep-learning fcn fcn8s segmentation semantic-segmentation

Last synced: 13 Nov 2024

https://github.com/vitae-transformer/vitae-transformer-matting

A comprehensive list of our research works related to image matting, including papers, codes, datasets, demos, and citations. Note: The repo for [IJCV'23] "Rethinking Portrait Matting with Privacy Preserving" has been moved to: https://github.com/ViTAE-Transformer/P3M-Net

computer-vision deep-learning image-matting privacy-preserving survey vision-transformer

Last synced: 14 Nov 2024

https://github.com/vinjn/opencv-2-cookbook-src

Companion code for the book "OpenCV 2 Computer Vision Programming Cookbook" (《OpenCV 2 计算机视觉编程手册》); supports OpenCV 3.x / 4.x

computer-vision opencv opencv2 opencv3 opencv4

Last synced: 13 Nov 2024

https://github.com/arefmalek/airdraw

A vision-based drawing application

computer-vision mediapipe opencv python python3

Last synced: 31 Oct 2024

https://github.com/1adrianb/binary-human-pose-estimation

This code implements a demo of the Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources paper by Adrian Bulat and Georgios Tzimiropoulos.

binary computer-vision deep-learning human-pose-estimation network-binarization torch7

Last synced: 14 Nov 2024

https://github.com/morvanzhou/tensorflow-computer-vision-tutorial

Tutorials of deep learning for computer vision.

cnn computer-vision deepdream googlenet lenet resnet

Last synced: 06 Nov 2024

https://github.com/Villavu/Simba

Simba is a program used to repeat certain (complicated) tasks. Typically these tasks involve using the mouse and keyboard. Simba is programmable, which means you can design your own logic and steps that Simba will follow, based upon certain input such as colors on the screen.

automation computer-vision fpc lape lazarus macro pascal simba villavu

Last synced: 04 Nov 2024

https://github.com/jiachens/ModelNet40-C

Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296

benchmark computer-vision corruption-robustness data-augmentation deep-learning ml-safety point-cloud-processing pytorch regularization robustness

Last synced: 28 Oct 2024

https://github.com/haofanwang/natural-language-joint-query-search

Search photos on Unsplash using OpenAI's CLIP model, with support for joint image+text queries and attention visualization.

attention clip computer-vision image-retrieval image-search multi-modal-search unsplash visualizations

Last synced: 07 Nov 2024

https://github.com/angeladai/3DMV

[ECCV'18] 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation

computer-graphics computer-vision deep-learning scene-understanding

Last synced: 07 Nov 2024

https://github.com/lijx10/DeepI2P

DeepI2P: Image-to-Point Cloud Registration via Deep Classification. CVPR 2021

3d-vision cnn computer-vision deep-learning image-processing localization neural-network point-cloud registration

Last synced: 27 Oct 2024

https://github.com/kwotsin/TensorFlow-Xception

TensorFlow implementation of the Xception Model by François Chollet

computer-vision deep-learning machine-learning tensorflow tensorflow-xception xception-model

Last synced: 31 Oct 2024

https://github.com/rish-16/sight

👁 Sightseer: TensorFlow library for state-of-the-art Computer Vision and Object Detection models

computer-vision object-detection state-of-the-art tensorflow

Last synced: 11 Nov 2024

https://github.com/dmarnerides/pydlt

PyTorch based Deep Learning Toolbox

computer-vision deep-learning pytorch toolbox

Last synced: 14 Nov 2024

https://github.com/baegwangbin/MaGNet

[CVPR 2022 Oral] Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

3d-reconstruction computer-vision cvpr2022 deep-learning depth-estimation multi-view-stereo multiview-geometry multiview-stereo uncertainty uncertainty-estimation

Last synced: 04 Nov 2024

https://github.com/junyanz/light-field-video

Light field video applications (e.g. video refocusing, focus tracking, changing aperture and view)

caffe computer-graphics computer-vision deep-learning lightfield

Last synced: 31 Oct 2024

https://github.com/imfing/sketch-to-art

🖼 Create artwork from your casual sketch with GAN and style transfer

computer-vision deep-learning gan pix2pix style-transfer

Last synced: 09 Nov 2024

https://github.com/kaustubh-sadekar/VirtualCam

A virtual camera created using only OpenCV and NumPy. It simulates a camera whose intrinsic and extrinsic parameters can all be controlled, giving a better understanding of how each component of the camera projection matrix affects the final image of the object captured by the camera.

camera-calibration camera-control camera-distortion camera-matrix camera-parameters camera-projection-matrix camera-rotation camera-translation computer-graphics computer-vision computer-vision-funamentals extrinsic-parameters fundamentals gui image-formation rendering trackbars transformation-matrix virtual-camera

Last synced: 25 Oct 2024

https://github.com/jacobgil/pytorch-zssr

PyTorch implementation of 1712.06087 "Zero-Shot" Super-Resolution using Deep Internal Learning

computer-vision deep-learning pytorch super-resolution

Last synced: 31 Oct 2024

https://astra-vision.github.io/MaterialPalette/

[CVPR 2024] Official repository of "Material Palette: Extraction of Materials from a Single Real-world Image"

albedo computer-vision cvpr cvpr2024 generative-ai material normal roughness stable-diffusion

Last synced: 30 Oct 2024

https://github.com/astra-vision/MaterialPalette

[CVPR 2024] Official repository of "Material Palette: Extraction of Materials from a Single Real-world Image"

albedo computer-vision cvpr cvpr2024 generative-ai material normal roughness stable-diffusion

Last synced: 30 Oct 2024

https://github.com/ibaiGorordo/ONNX-YOLOv7-Object-Detection

Python scripts performing object detection using the YOLOv7 model in ONNX.

computer-vision deep-learning object-detection onnx opencv python yolo yolov7

Last synced: 09 Nov 2024

https://github.com/manila95/monolayout

MonoLayout: Amodal Scene Layout from a single image

autonomous-driving computer-vision robotic-vision scene-layout

Last synced: 28 Oct 2024

https://github.com/junyanz/facedemo

A simple 3D face alignment and warping demo.

3d-face computer-vision face-swap

Last synced: 31 Oct 2024

https://github.com/MaartenGr/Concept

Concept Modeling: Topic Modeling on Images and Text

computer-vision image-processing nlp topic-modeling

Last synced: 05 Nov 2024

https://github.com/LeapLabTHU/EfficientTrain

1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.

computer-vision deep-learning efficient-training machine-learning pytorch

Last synced: 05 Nov 2024

https://github.com/Gradiant/pyodi

Python Object Detection Insights

computer-vision deep-learning machine-learning object-detection

Last synced: 05 Nov 2024

https://github.com/olafenwamoses/idenprof

The IdenProf dataset is a collection of images of identifiable professionals. It has been collected to enable the development of AI systems that can identify people and the nature of their job simply by looking at an image, just as humans can.

ai-systems computer-vision datasets humans idenprof-dataset identifying-people image-classification image-recognition images machine-intelligence machine-learning machine-vision people professionals

Last synced: 27 Oct 2024

Computer vision Awesome Lists
Awesome-pytorch-list 704
awesome-self-supervised-learning 453
awesome-multimodal-ml 480
Awesome-Transformer-Attention 2,160
awesome_3DReconstruction_list 169
awesome-hand-pose-estimation 493
awesome-image-classification 220
awesome-human-pose-estimation 89
Awesome-Crowd-Counting 451
awesome-autonomous-vehicles 296
Awesome-Federated-Learning 558
Awesome-pytorch-list-CNVersion 692
awesome-industrial-anomaly-detection 657
iOS_ML 39
Awesome-Interaction-aware-Trajectory-Prediction 564
Awesome-FL 2,098
awesome-low-light-image-enhancement 197
Awesome-Implicit-NeRF-Robotics 184
CV-pretrained-model 103
awesome-tensorflow-lite 110
awesome-capsule-networks 67
awesome-attention-mechanism-in-cv 195
awesome-grounding 154
Awesome-Image-Colorization 115
awesome-ai-awesomeness 236
openstl 43
awesome-autonomous-vehicle 181
Awesome-Open-Vocabulary 158
awesome-open-data-centric-ai 56
Awesome-Skeleton-based-Action-Recognition 104
awesome-6d-object 600
awesome-scene-understanding 197
awesome-multi-task-learning 226
awesome-holistic-3d 129
awesome-data-annotation 93
awesome-panoptic-segmentation 45
awesome-photogrammetry 68
awesome-llm-and-aigc 1,134
Awesome-3D-Object-Detection 169
awesome-computer-vision-models 189
Awesome-Parameter-Efficient-Transfer-Learning 121
awesome-state-of-depth-completion 59
Awesome-Distributed-Deep-Learning 44
awesome-image-alignment-and-stitching 108
Awesome-Monocular-3D-detection 93
Awesome-Multi-Task-Learning 80
Computer-Vision-Video-Lectures 67
awesome-segment-anything-extensions 120
awesome-robotics-datasets 79
awesome_computer_science 116