An open API service indexing awesome lists of open source software.

Computer vision

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding of digital images and videos.

https://github.com/SimonVandenhende/Multi-Task-Learning-PyTorch

PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).

computer-vision eccv2020 multi-task-learning nyud pascal pytorch scene-understanding segmentation

Last synced: 06 May 2025

https://github.com/simonvandenhende/multi-task-learning-pytorch

PyTorch implementation of multi-task learning architectures, incl. MTI-Net (ECCV2020).

computer-vision eccv2020 multi-task-learning nyud pascal pytorch scene-understanding segmentation

Last synced: 04 Apr 2025

https://github.com/patrikhuber/4dface

Real-time 3D face tracking and reconstruction from 2D video

computer-vision face-models modern-cpp video-processing

Last synced: 04 Apr 2025

https://github.com/megvii-research/IJCAI2023-CoNR

IJCAI2023 - Collaborative Neural Rendering using Anime Character Sheets

anime computer-graphics computer-vision deep-learning pytorch

Last synced: 14 Apr 2025

https://github.com/baegwangbin/dsine

[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation

3d-from-images 3d-reconstruction computer-vision cvpr2024 deep-learning surface-normal surface-normals surface-normals-estimation

Last synced: 13 Apr 2025

https://github.com/ziqi-jin/finetune-anything

Fine-tune SAM (Segment Anything Model) for computer vision tasks such as semantic segmentation, matting, detection ... in specific scenarios

computer-vision deep-learning fine-tune segment-anything

Last synced: 06 May 2025

https://github.com/vandit15/Class-balanced-loss-pytorch

Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"

computer-vision cvpr2019 deep-learning loss-functions pytorch

Last synced: 08 May 2025

https://github.com/noahcao/OC_SORT

[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

computer-vision deep-learning object-detection object-tracking tracking

Last synced: 20 Mar 2025

https://github.com/vacancy/PreciseRoIPooling

Precise RoI Pooling with coordinate gradient support, proposed in the paper "Acquisition of Localization Confidence for Accurate Object Detection" (https://arxiv.org/abs/1807.11590).

computer-vision object-detection

Last synced: 04 May 2025

https://github.com/microsoft/CameraTraps

PyTorch Wildlife: a Collaborative Deep Learning Framework for Conservation.

camera-traps computer-vision conservation machine-learning megadetector pytorch pytorch-wildlife wildlife

Last synced: 27 Nov 2024

https://github.com/Jumpat/SegmentAnythingin3D

Segment Anything in 3D with NeRFs (NeurIPS 2023)

3d 3d-segmentation computer-vision deep-learning nerf segment-anything segmentation

Last synced: 20 Mar 2025

https://github.com/fregu856/deeplabv3

PyTorch implementation of DeepLabV3, trained on the Cityscapes dataset.

autonomous-driving computer-vision deep-learning machine-learning pytorch semantic-segmentation

Last synced: 27 Nov 2024

https://github.com/dog-qiuqiu/fastestdet

:zap: A newly designed ultra lightweight anchor free target detection algorithm, weight only 250K parameters, reduces the time consumption by 10% compared with yolo-fastest, and the post-processing is simpler

computer-vision deep-learning object-detection

Last synced: 04 Apr 2025

https://github.com/dog-qiuqiu/FastestDet

:zap: A newly designed ultra lightweight anchor free target detection algorithm, weight only 250K parameters, reduces the time consumption by 10% compared with yolo-fastest, and the post-processing is simpler

computer-vision deep-learning object-detection

Last synced: 14 Mar 2025

https://github.com/johnolafenwa/DeepStack

The World's Leading Cross Platform AI Engine for Edge Devices

ai-engine computer-vision deepstack face-detection face-recognition object-detection scene-recognition

Last synced: 06 Apr 2025

https://github.com/adamdad/kat

[ICLR2025] Kolmogorov-Arnold Transformer

computer-vision kan kolmogorov-arnold-networks transformer

Last synced: 14 Apr 2025

https://github.com/johnolafenwa/deepstack

The World's Leading Cross Platform AI Engine for Edge Devices

ai-engine computer-vision deepstack face-detection face-recognition object-detection scene-recognition

Last synced: 12 Apr 2025

https://github.com/nianticlabs/acezero

[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.

3d-reconstruction camera-relocalization computer-vision eccv eccv2024 machine-learning pose-estimation sfm structure-from-motion visual-relocalization

Last synced: 14 Apr 2025

https://github.com/amusi/AI-Job-Recommend

国内公司人工智能方向(含机器学习、深度学习、计算机视觉和自然语言处理)岗位的招聘信息(含全职、实习和校招)

artificial-intelligence computer-vision deep-learning job-search jobs machine-learning natural-language-processing

Last synced: 07 Dec 2024

https://github.com/hustvl/gaussiandreamer

[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models

aigc computer-vision cvpr2024 diffusion-models dreamfusion gaussian-splatting nerf radiance-field smpl text-to-3d

Last synced: 14 Apr 2025

https://github.com/AntonioTepsich/Convolutional-KANs

This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learnable non linear activations in each pixel.

cnn computer-vision deep-learning

Last synced: 05 Dec 2024

https://github.com/xtreme1-io/xtreme1

Xtreme1 is an all-in-one data labeling and annotation platform for multimodal data training and supports 3D LiDAR point cloud, image, and LLM.

3d-annotation annotation annotation-tool computer-vision image-annotation image-classification image-labelling-tool labeling-tool multimodal point-cloud rlhf

Last synced: 20 Mar 2025

https://github.com/quic/sense

Enhance your application with the ability to see and interact with humans using any RGB camera.

activity-recognition calorie-estimation computer-vision deep-learning fitness-app gesture-recognition neural-networks pytorch video

Last synced: 22 Nov 2024

https://github.com/bgshih/aster

Recognizing cropped text in natural images.

computer-vision ocr recognition scene-text

Last synced: 13 Apr 2025

https://github.com/ahangchen/torch_base

Quickly bring up your PyTorch project(a skeleton)

computer-vision deep-learning machine-learning pytorch

Last synced: 04 Apr 2025

https://github.com/Guanghan/lighttrack

LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking

artificial-intelligence computer-vision pose-estimation pose-tracking visual-object-tracking

Last synced: 27 Nov 2024

https://github.com/adamspannbauer/python_video_stab

A Python package to stabilize videos using OpenCV

computer-vision opencv python video video-stabilization

Last synced: 08 Apr 2025

https://github.com/hkchengrex/Cutie

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

computer-vision cvpr2024 deep-learning pytorch segmentation video-editing video-object-segmentation video-segmentation

Last synced: 17 Apr 2025

https://github.com/mhamilton723/stego

Unsupervised Semantic Segmentation by Distilling Feature Correspondences

computer-vision deep-learning iclr2022 pytorch semantic-segmentation unsupervised-learning

Last synced: 04 Apr 2025

https://github.com/mhamilton723/STEGO

Unsupervised Semantic Segmentation by Distilling Feature Correspondences

computer-vision deep-learning iclr2022 pytorch semantic-segmentation unsupervised-learning

Last synced: 08 May 2025

https://github.com/MarekKowalski/FaceSwap

3D face swapping implemented in Python

3d-models computer-vision face-alignment face-swap optimization

Last synced: 18 Nov 2024

https://github.com/cvondrick/videogan

Generating Videos with Scene Dynamics. NIPS 2016.

computer-vision deep-learning generative-adversarial-network video

Last synced: 28 Mar 2025

https://github.com/stevenygd/PointFlow

PointFlow : 3D Point Cloud Generation with Continuous Normalizing Flows

3d-point-clouds computer-vision continuous-normalizing-flows machine-learning pytorch shapes

Last synced: 23 Nov 2024

https://github.com/gjy3035/c-3-framework

An open-source PyTorch code for crowd counting

computer-vision crowd-analysis crowd-counting deep-learning

Last synced: 12 Apr 2025

https://github.com/PeterWang512/GANSketching

Sketch Your Own GAN: Customizing a GAN model with hand-drawn sketches.

computer-graphics computer-vision deep-learning gans hci

Last synced: 04 Apr 2025

https://github.com/skalskip/top-cvpr-2024-papers

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

computer-vision cvpr cvpr2024 image-segmentation object-detection paper transformers vision-and-language

Last synced: 04 Apr 2025

https://github.com/onepanelio/onepanel

The open source, end-to-end computer vision platform. Label, build, train, tune, deploy and automate in a unified platform that runs on any cloud and on-premises.

ai aiops annotation computer-vision deeplearning etl hyperparameter-tuning inference jupyterlab labeling machinelearning mlops pipelines pytorch tensorboard tensorflow training workflows

Last synced: 04 Apr 2025

https://github.com/gjy3035/C-3-Framework

An open-source PyTorch code for crowd counting

computer-vision crowd-analysis crowd-counting deep-learning

Last synced: 17 Nov 2024

https://github.com/Rubikplayer/flame-fitting

Example code for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 3D keypoints and 3D scans.

3d-face-alignment 3d-model 3d-reconstruction chumpy computer-graphics computer-vision face face-alignment face-model flame flame-fitting flame-model morphable-model smpl-x

Last synced: 19 Apr 2025

https://github.com/semperai/amica

Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.

ai assistant-chat-bots computer-vision llm speech-recognition tts

Last synced: 24 Mar 2025

https://github.com/ika-rwth-aachen/Cam2BEV

TensorFlow Implementation for Computing a Semantically Segmented Bird's Eye View (BEV) Image Given the Images of Multiple Vehicle-Mounted Cameras.

autonomous-vehicles birds-eye-view computer-vision deep-learning ipm machine-learning segmentation sim2real simulation

Last synced: 20 Mar 2025

https://github.com/swz30/mirnet

[ECCV 2020] Learning Enriched Features for Real Image Restoration and Enhancement. SOTA results for image denoising, super-resolution, and image enhancement.

attention-mechanism computer-vision eccv2020 image-denoising image-enhancement image-restoration low-level-vision multi-resolution-streams pytorch super-resolution

Last synced: 04 Apr 2025

https://github.com/swz30/MIRNet

[ECCV 2020] Learning Enriched Features for Real Image Restoration and Enhancement. SOTA results for image denoising, super-resolution, and image enhancement.

attention-mechanism computer-vision eccv2020 image-denoising image-enhancement image-restoration low-level-vision multi-resolution-streams pytorch super-resolution

Last synced: 02 Apr 2025

https://github.com/HKUST-Aerial-Robotics/DenseSurfelMapping

This is the open-source version of ICRA 2019 submission "Real-time Scalable Dense Surfel Mapping"

computer-vision drones quadrotor robotics

Last synced: 20 Apr 2025

https://github.com/visipedia/inat_comp

iNaturalist competition details

competition computer-vision dataset inaturalist

Last synced: 05 Apr 2025

https://github.com/zhangxiaosong18/FreeAnchor

FreeAnchor: Learning to Match Anchors for Visual Object Detection (NeurIPS 2019)

computer-vision freeanchor neurips-2019 object-detection one-stage pytorch

Last synced: 14 Mar 2025

https://github.com/jwchoi384/Gaussian_YOLOv3

Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving (ICCV, 2019)

computer-vision deep-learning dnn gaussian-yolov3 gaussian-yolov3-implementation gaussianyolov3 neural-network object-detection

Last synced: 21 Apr 2025

https://github.com/lagadic/visp

Open Source Visual Servoing Platform

c-plus-plus computer-vision visp visual-servoing

Last synced: 08 Apr 2025

https://github.com/DerrickXuNu/OpenCOOD

[ICRA 2022] An opensource framework for cooperative detection. Official implementation for OPV2V.

autonomous-driving collaborative-perception computer-vision deep-learning multi-agent-perception multi-agent-systems pytorch simulations

Last synced: 20 Mar 2025

https://github.com/ThibaultGROUEIX/AtlasNet

This repository contains the source codes for the paper "AtlasNet: A Papier-Mâché Approach to Learning 3D Surface Generation ". The network is able to synthesize a mesh (point cloud + connectivity) from a low-resolution point cloud, or from an image.

3d 3d-deep-learning computer-vision cvpr2018 geometry-processing pytorch

Last synced: 20 Mar 2025

https://github.com/summergift/embeddedsystem

:books: 计算机体系架构、嵌入式系统基础与主流编程语言相关内容总结

c cnn computer-vision data-structures iot linux machine-learning network python

Last synced: 14 Apr 2025

https://github.com/kylemcdonald/ofxCv

Alternative approach to interfacing with OpenCv from openFrameworks.

addon computer-vision opencv openframeworks wrapper

Last synced: 15 Mar 2025

https://github.com/kylemcdonald/ofxcv

Alternative approach to interfacing with OpenCv from openFrameworks.

addon computer-vision opencv openframeworks wrapper

Last synced: 12 Apr 2025

https://github.com/arunponnusamy/cvlib

A simple, high level, easy to use, open source Computer Vision library for Python.

computer-vision deep-learning image-processing machine-learning python

Last synced: 09 May 2025

https://github.com/MOLAorg/mola

A Modular Optimization framework for Localization and mApping (MOLA)

computer-vision cxx cxx17 datasets graph-slam lidar lidar-point-cloud localization mobile-robots slam toolkit visual-slam

Last synced: 05 May 2025

https://github.com/pathak22/pyflow

Fast, accurate and easy to run dense optical flow with python wrapper

computer-vision dense-flow image-processing optical-flow python-optical-flow

Last synced: 05 Apr 2025

https://github.com/wvangansbeke/lanedetection_end2end

End-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)

computer-vision deep-learning end-to-end lane-detection least-squares pytorch self-driving-cars

Last synced: 05 Apr 2025

https://github.com/wvangansbeke/LaneDetection_End2End

End-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)

computer-vision deep-learning end-to-end lane-detection least-squares pytorch self-driving-cars

Last synced: 07 May 2025

https://github.com/ch-sa/labelCloud

A lightweight tool for labeling 3D bounding boxes in point clouds.

3d-object-detection 6d-pose-estimation annotation bounding-boxes computer-vision labeling machine-learning point-clouds tool

Last synced: 20 Mar 2025

https://github.com/skalskip/top-cvpr-2023-papers

This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code]

computer-vision cvpr cvpr2023 image-segmentation object-detection paper transformers vision-and-language

Last synced: 04 Apr 2025

https://github.com/leblancfg/autocrop

:relieved: Automatically detects and crops faces from batches of pictures.

autocrop computer-vision crop face face-detection facedetect opencv pictures python

Last synced: 08 Apr 2025

https://github.com/wanmeihuali/taichi_3d_gaussian_splatting

An unofficial implementation of paper 3D Gaussian Splatting for Real-Time Radiance Field Rendering by taichi lang.

3d-reconstruction 3d-rendering computer-graphics computer-vision machine-learning nerf python pytorch real-time-rendering taichi

Last synced: 11 Apr 2025

https://github.com/imagej/ImageJ

Public domain software for processing and analyzing scientific images

computer-vision image-processing

Last synced: 07 May 2025

https://github.com/SkalskiP/top-cvpr-2023-papers

This repository is a curated collection of the most exciting and influential CVPR 2023 papers. 🔥 [Paper + Code]

computer-vision cvpr cvpr2023 image-segmentation object-detection paper transformers vision-and-language

Last synced: 03 May 2025

https://github.com/rnchg/apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 10 Apr 2025

https://github.com/rnchg/Apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 24 Mar 2025

https://github.com/imagej/imagej

Public domain software for processing and analyzing scientific images

computer-vision image-processing

Last synced: 13 Apr 2025

https://github.com/baudm/parseq

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

computer-vision eccv eccv2022 ocr optical-character-recognition scene-text-recognition text-recognition vision-transformer

Last synced: 04 Apr 2025

Computer vision Awesome Lists
Awesome-pytorch-list 704 awesome-multimodal-ml 480 awesome-self-supervised-learning 453 Awesome-Transformer-Attention 2,160 awesome_3DReconstruction_list 169 awesome-hand-pose-estimation 497 awesome-image-classification 220 Awesome-Crowd-Counting 460 awesome-human-pose-estimation 89 awesome-autonomous-vehicles 296 awesome-industrial-anomaly-detection 781 Awesome-Federated-Learning 558 Awesome-pytorch-list-CNVersion 692 Awesome-FL 2,545 Awesome-Interaction-aware-Trajectory-Prediction 564 Awesome-Implicit-NeRF-Robotics 190 iOS_ML 39 CV-pretrained-model 103 awesome-tensorflow-lite 110 awesome-attention-mechanism-in-cv 195 Awesome-Image-Colorization 126 awesome-grounding 154 awesome-capsule-networks 67 Awesome-World-Model 269 Awesome-Open-Vocabulary 161 awesome-ai-awesomeness 236 openstl 43 awesome-autonomous-vehicle 181 awesome-6d-object 600 awesome-scene-understanding 199 awesome-multi-task-learning 229 awesome-open-data-centric-ai 56 awesome-robotics-3d 106 Awesome-Skeleton-based-Action-Recognition 104 awesome-photogrammetry 68 Awesome-3D-Object-Detection 169 awesome-holistic-3d 129 awesome-data-annotation 93 awesome-panoptic-segmentation 45 awesome-optical-flow 110 Awesome-Parameter-Efficient-Transfer-Learning 121 awesome-computer-vision-models 189 awesome-ai-data-guided-projects 56 awesome-nerf-editing 419 awesome-state-of-depth-completion 71 awesome-image-alignment-and-stitching 108 Awesome-Distributed-Deep-Learning 44 awesome_computer_science 118 awesome-robotics-datasets 79 Awesome-Monocular-3D-detection 97