Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/SimonVandenhende/Multi-Task-Learning-PyTorch

PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).

computer-vision eccv2020 multi-task-learning nyud pascal pytorch scene-understanding segmentation

Last synced: 03 Jul 2024

https://github.com/azad-academy/personalized-diffusion

A Tutorial on Customized Image Generation by Fine-Tuning the Stable Diffusion Models

computer-vision deep-learning diffusion-models machine-learning

Last synced: 03 Jul 2024

https://github.com/CommissarMa/Crowd_counting_from_scratch

This is an overview and tutorial about crowd counting. In this repository, you can learn how to estimate number of pedestrians in crowd scenes through computer vision and deep learning.

computer-vision crowd-counting deep-learning

Last synced: 03 Jul 2024

https://github.com/gjy3035/C-3-Framework

An open-source PyTorch code for crowd counting

computer-vision crowd-analysis crowd-counting deep-learning

Last synced: 03 Jul 2024

https://github.com/stevenygd/PointFlow

PointFlow : 3D Point Cloud Generation with Continuous Normalizing Flows

3d-point-clouds computer-vision continuous-normalizing-flows machine-learning pytorch shapes

Last synced: 03 Jul 2024

https://github.com/hongyehu/RG-Flow

This is project page for the paper "RG-Flow: a hierarchical and explainable flow model based on renormalization group and sparse prior". Paper link: https://arxiv.org/abs/2010.00029

computer-vision generative-model hierarchical-models normalizing-flow renormalization-group representation-learning sparse-coding

Last synced: 03 Jul 2024

https://github.com/amosgropp/IGR

Implicit Geometric Regularization for Learning Shapes

3d-reconstruction computer-vision deep-learning

Last synced: 03 Jul 2024

https://github.com/haltakov/natural-language-image-search

Search photos on Unsplash using natural language

clip computer-vision image-search machine-learning photos unsplash

Last synced: 03 Jul 2024

https://github.com/youyuge34/Anime-InPainting

An application tool of edge-connect, which can do anime inpainting and drawing. 动漫人物图片自动修复,去马赛克,填补,去瑕疵

anime computer-vision cv deep-learning gan generative-adversarial-network inpainting mosaic neural-network opencv python pytorch

Last synced: 02 Jul 2024

https://github.com/roboflow/zero-shot-object-tracking

Object tracking implemented with the Roboflow Inference API, DeepSort, and OpenAI CLIP.

computer-vision deep-sort object-detection object-tracking openai-clip

Last synced: 02 Jul 2024

https://github.com/airctic/icevision

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

ai annotation-parsers coco-dataset coco-parser computer-vision deep-learning effecientdet fastai faster-rcnn mask-rcnn object-detection pycocotools python pytorch pytorch-lightning tutorials voc-dataset voc-parser

Last synced: 02 Jul 2024

https://github.com/fisal77/ArtaEye

ArtaEye is web/mobile camera eye tracking to make beautiful drawing and artwork based on eye movements. In this project. We target people with disabilities to create an artwork. We try to translate their feelings through their eyes into digital drawings.

ai computer-vision eye-detection eye-tracking eye-tracking-mouse ml opencv webcam webcam-eyetracking

Last synced: 02 Jul 2024

https://github.com/gigwegbe/tinyml-papers-and-projects

This is a list of interesting papers and projects about TinyML.

computer-vision embedded-systems machine-learning neural-architecture-search tinyml wake-word

Last synced: 02 Jul 2024

https://github.com/mop/bier

Cleaned up reference implementation of BIER: Boosting Independent Embeddings Robustly.

cnn computer-vision embeddings

Last synced: 02 Jul 2024

https://github.com/hustvl/GaussianDreamer

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

aigc computer-vision cvpr2024 diffusion-models dreamfusion gaussian-splatting nerf radiance-field smpl text-to-3d

Last synced: 02 Jul 2024

https://github.com/Haiyang-W/UniTR

[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"

3d 3d-object-detection 3d-segmentation backbone bev camera computer-vision iccv2023 lidar multi-modal multi-view point-cloud transformer unified

Last synced: 01 Jul 2024

https://github.com/sanechips-multimedia/syenet

SYENet: A Simple Yet Effective Network for Multiple Low-Level Vision Tasks with Real-Time Performance on Mobile Device, in ICCV 2023

computer-vision deep-neural-networks

Last synced: 01 Jul 2024

https://github.com/Anything-of-anything/Anything-3D

Segment-Anything + 3D. Let's lift anything to 3D.

3d computer-vision reconstruction segment segment-anything

Last synced: 01 Jul 2024

https://github.com/skhadem/3D-BoundingBox

PyTorch implementation for 3D Bounding Box Estimation Using Deep Learning and Geometry

3d computer-vision localization pytorch

Last synced: 01 Jul 2024

https://github.com/uranus4ever/Advanced-Lane-Detection

Camera Calibration; Distortion Correction; Perspective transform ("bird-eye view"); Compute curvature and vehicle position.

camera-calibration computer-vision lane-curvature lane-detection perspective-transformation

Last synced: 01 Jul 2024

https://github.com/darrenjkt/SEE-MTDA

(RA-L 2022) See Eye to Eye: A Lidar-Agnostic 3D Detection Framework for Unsupervised Multi-Target Domain Adaptation.

3d-object-detection autonomous-driving computer-vision deep-learning object-detection surface-completion unsupervised-domain-adaptation

Last synced: 01 Jul 2024

https://github.com/Jumpat/SegmentAnythingin3D

Segment Anything in 3D with NeRFs (NeurIPS 2023)

3d 3d-segmentation computer-vision deep-learning nerf segment-anything segmentation

Last synced: 01 Jul 2024

https://github.com/qa276390/SearchTrack

[BMVC 2022] SearchTrack: Multiple Object Tracking with Object-Customized Search and Motion-Aware Features

bmvc2022 computer-vision deep-learning kitti-dataset mots multiple-object-tracking segementation tracking

Last synced: 01 Jul 2024

https://github.com/SurajDonthi/Multi-Camera-Person-Re-Identification

State-of-the-art model for person re-identification in Multi-camera Multi-Target Tracking. Benchmarked on Market-1501 and DukeMTMTC-reID datasets.

computer-vision convolutional-neural-networks deep-learning dukemtmc-reid market-1501 mtmct multi-camera-tracking person-reidentification pytorch pytorch-lightning spatial-temporal

Last synced: 01 Jul 2024

https://github.com/fubel/synthehicle

[WACVW 2023] A massive synthetic dataset for 3D multi-target multi-camera tracking and segmentation.

computer-vision object-detection object-tracking

Last synced: 01 Jul 2024

https://github.com/cs230-stanford/cs230-code-examples

Code examples in pyTorch and Tensorflow for CS230

computer-vision natural-language-processing pytorch tensorflow

Last synced: 01 Jul 2024

https://github.com/autodistill/autodistill-yolov8

YOLOv8 Target Model plugin for Autodistill

autodistill computer-vision yolov8

Last synced: 01 Jul 2024

https://github.com/autodistill/autodistill-gpt-4v

GPT-4V(ision) module for use with Autodistill.

autodistill computer-vision gpt-4 gpt-4v object-detection

Last synced: 01 Jul 2024

https://github.com/extreme-assistant/Awesome-CV-Team

国内外优秀的计算机视觉团队汇总,极市团队整理

computer-vision deep-learning machine-learning team

Last synced: 01 Jul 2024

https://github.com/OpenPerceptionX/PersFormer_3DLane

[ECCV2022 oral] Perspective Transformer on 3D Lane Detection

3d-lane-detection autonomous-driving computer-vision lane-detection

Last synced: 01 Jul 2024

https://github.com/suhangpro/mvcnn

Multi-view CNN (MVCNN) for shape recognition

computer-graphics computer-vision convolutional-neural-networks deep-learning

Last synced: 01 Jul 2024

https://github.com/roboticslab-uc3m/textiles

Research on algorithms for garment perception, manipulation...

computer-vision manipulation research robotics textile

Last synced: 01 Jul 2024

https://github.com/enpeizhao/CVprojects

computer vision projects | 计算机视觉相关好玩的AI项目(Python、C++、embedded system)

computer-vision cpp cuda deep-learning embedded-systems machine-learning python tensorrt

Last synced: 01 Jul 2024

https://github.com/digantamisra98/Mish

Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]

activation-functions bmvc bmvc20 computer-vision deep-learning image-classification mathematics neural-networks object-detection

Last synced: 01 Jul 2024

https://github.com/open-mmlab/OpenMMLabCourse

OpenMMLab course index and stuff

computer-vision tutorials

Last synced: 01 Jul 2024

https://github.com/abcSup/NotEnoughSleepAI

Implementation of the Multi-Task Multi-Sensor Fusion for 3D Object Detection paper by Uber

computer-vision deep-learning neural-networks

Last synced: 01 Jul 2024

https://github.com/mindspore-courses/d2l-mindspore

《动手学深度学习》的MindSpore实现。供MindSpore学习者配合李沐老师课程使用。

computer-vision deep-learning machine-learning mindspore natural-language-processing notebook

Last synced: 01 Jul 2024

https://github.com/lijx10/DeepI2P

DeepI2P: Image-to-Point Cloud Registration via Deep Classification. CVPR 2021

3d-vision cnn computer-vision deep-learning image-processing localization neural-network point-cloud registration

Last synced: 01 Jul 2024

https://github.com/cvondrick/soundnet

SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016

computer-vision deep-learning sound

Last synced: 01 Jul 2024

https://github.com/andyzeng/3dmatch-toolbox

3DMatch - a 3D ConvNet-based local geometric descriptor for aligning 3D meshes and point clouds.

3d 3d-deep-learning 3dmatch artificial-intelligence computer-vision deep-learning geometry-processing point-cloud rgbd vision

Last synced: 01 Jul 2024

https://github.com/CVCUDA/CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

bytedance cloud computer-vision cpp cuda cv-cuda gpu image-processing machine-learning nvidia python

Last synced: 01 Jul 2024

https://github.com/joffman/ros_object_recognition

A framework for ROS-based 2D and 3D object recognition.

computer-vision cplusplus opencv pcl python robotics ros

Last synced: 01 Jul 2024

https://github.com/Eagleflag88/MultiModalEvnPercption

Experimental platform to achieve fused environment perception using different modalities of sensors

computer-vision ekf-slam factor-graph imu lidar-point-cloud ros slam

Last synced: 01 Jul 2024

https://github.com/viplix3/BoTSORT-cpp

C++ implementation of BoT-SORT MOT algorithm with Re-ID and Camera Motion Compensation

computer-vision cpp kalman-filter multi-object-tracking reid

Last synced: 01 Jul 2024

https://github.com/SHI-Labs/Self-Similarity-Grouping

Self-similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-identification (ICCV 2019, Oral)

computer-vision deep-learning domain-adaptation person-reidentification

Last synced: 01 Jul 2024

https://github.com/Masao-Taketani/FOTS_OCR

TensorFlow Implementation of FOTS, Fast Oriented Text Spotting with a Unified Network.

computer-vision deep-learning image-recognition ocr scene-text-recognition tensorflow

Last synced: 30 Jun 2024

https://github.com/fcakyon/craft-text-detector

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

actions anaconda computer-vision craft deep-learning document hacktoberfest linux macos neural-network ocr pypi python pytorch text text-detection vision windows workflow

Last synced: 30 Jun 2024

https://github.com/HusseinYoussef/Arabic-OCR

OCR system for Arabic language that converts images of typed text to machine-encoded text.

arabic character-segmentation computer-vision dataset image-processing machine-learning neural-network ocr opencv-python scikit-learn segmentation

Last synced: 30 Jun 2024

https://github.com/bgshih/aster

Recognizing cropped text in natural images.

computer-vision ocr recognition scene-text

Last synced: 30 Jun 2024

https://github.com/vsymbol/CUTIE

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

computer-vision deep-learning text-extraction

Last synced: 30 Jun 2024

https://github.com/matlab-deep-learning/Pretrained-YOLOv8-Network-For-Object-Detection

YOLO v8 inference in MATLAB for Object Detection with yolov8n, yolov8s, yolov8m, yolov8l, yolov8x, networks

computer-vision deep-learning image-processing matlab matlab-deep-learning object-detection pretrained-models yolo yolov8

Last synced: 30 Jun 2024

https://github.com/vkt3d/natdevice

High performance, cross-platform media device streaming for Unity Engine.

android avfoundation camera camera2-api computer-vision ios machine-learning mediafoundation microphone natml unity unity3d webassembly webgl windows

Last synced: 30 Jun 2024

https://github.com/visipedia/inat_comp

iNaturalist competition details

competition computer-vision dataset inaturalist

Last synced: 30 Jun 2024

https://github.com/norlab-ulaval/PercepTreeV1

Implementation of Grondin et al. 2022 "Tree Detection and Diameter Estimation Based on Deep Learning". Also includes datasets and some of the pretrained models.

computer-vision datasets forestry

Last synced: 30 Jun 2024

https://github.com/anupamchugh/iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI additions

alamofire arkit computer-vision coreml coremltools ios keras ml-kit natural-language-processing nlp realitykit swift swiftui vision vision-framework

Last synced: 29 Jun 2024

https://github.com/hysts/anime-face-detector

Anime Face Detector using mmdet and mmpose

anime computer-vision face-detection face-landmark-detection pytorch

Last synced: 29 Jun 2024

https://github.com/pkhungurn/talking-head-anime-3-demo

Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project

ai anime computer-graphics computer-vision machine-learning python pytorch vtuber

Last synced: 29 Jun 2024

https://github.com/pkhungurn/talking-head-anime-2-demo

Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.

ai anime computer-graphics computer-vision deep-learning machine-learning python pytorch vtuber

Last synced: 29 Jun 2024

https://github.com/osu-cvl/learning-idk

Code for "Learning When to Say 'I Don't Know'".

computer-vision deep-learning reject-option-classification uncertainty

Last synced: 29 Jun 2024