Projects in Awesome Lists by open-mmlab

https://github.com/open-mmlab/mmdetection

OpenMMLab Detection Toolbox and Benchmark

cascade-rcnn convnext detr fast-rcnn faster-rcnn glip grounding-dino instance-segmentation mask-rcnn object-detection panoptic-segmentation pytorch retinanet rtmdet semisupervised-learning ssd swin-transformer transformer vision-transformer yolo

Last synced: 12 May 2025

https://github.com/open-mmlab/amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion

Last synced: 12 May 2025

https://github.com/open-mmlab/mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

deeplabv3 image-segmentation medical-image-segmentation pspnet pytorch realtime-segmentation retinal-vessel-segmentation semantic-segmentation swin-transformer transformer vessel-segmentation

Last synced: 15 May 2025

https://github.com/open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion

Last synced: 28 Mar 2025

https://github.com/open-mmlab/mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution

Last synced: 13 May 2025

https://github.com/open-mmlab/mmediting

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution

Last synced: 05 Apr 2025

https://github.com/open-mmlab/mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

animal-pose-estimation benchmark cpm crowdpose face-keypoint freihand hand-pose-estimation higher-hrnet hourglass hrnet human-pose mmpose mpii mspn ochuman pose-estimation pytorch rsn rtmpose udp

Last synced: 12 May 2025

https://github.com/open-mmlab/mmcv

OpenMMLab Computer Vision Foundation

Last synced: 12 May 2025

https://github.com/open-mmlab/mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

3d-object-detection object-detection point-cloud pytorch

Last synced: 23 Apr 2025

https://github.com/open-mmlab/openpcdet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

3d-detection autonomous-driving object-detection point-cloud pv-rcnn pytorch

Last synced: 14 May 2025

https://github.com/open-mmlab/OpenPCDet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

3d-detection autonomous-driving object-detection point-cloud pv-rcnn pytorch

Last synced: 20 Mar 2025

https://github.com/open-mmlab/mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

action-recognition ava benchmark deep-learning i3d non-local openmmlab posec3d pytorch slowfast spatial-temporal-action-detection temporal-action-localization tsm tsn uniformerv2 video-classification video-understanding x3d

Last synced: 13 May 2025

https://github.com/open-mmlab/mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

abcnet abinet crnn dbnet deep-learning fcenet key-information-extraction maskrcnn ocr pan panet psenet pytorch sar sdmg-r segmentation-based-text-recognition spts svtr text-detection text-recognition

Last synced: 14 May 2025

https://github.com/open-mmlab/mmtracking

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

multi-object-tracking single-object-tracking tracking video-instance-segmentation video-object-detection

Last synced: 14 May 2025

https://github.com/open-mmlab/mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

beit clip constrastive-learning convnext deep-learning image-classification mae masked-image-modeling mobilenet moco multimodal pretrained-models pytorch resnet self-supervised-learning swin-transformer vision-transformer

Last synced: 07 May 2025

https://github.com/open-mmlab/mmselfsup

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

beit mae masked-image-modeling moco pytorch self-supervised-learning simclr simsiam unsupervised-learning

Last synced: 14 May 2025

https://github.com/open-mmlab/mmyolo

OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.

deep-learning object-detection ppyoloe pytorch rotated-object-detection rtmdet yolo yolov5 yolov6 yolov7 yolov8 yolox

Last synced: 14 May 2025

https://github.com/open-mmlab/mmskeleton

A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.

action-recognition deep-learning graph-convolutional-network pytorch skeleton-based-action-recognition

Last synced: 13 Apr 2025

https://github.com/open-mmlab/mmdeploy

OpenMMLab Model Deployment Framework

computer-vision deep-learning deployment mmdetection mmsegmentation model-converter ncnn onnx onnxruntime openvino pplnn pytorch sdk tensorrt

Last synced: 13 May 2025

https://github.com/open-mmlab/mmrotate

OpenMMLab Rotated Object Detection Toolbox and Benchmark

detection openmmlab pytorch rotated-object

Last synced: 14 May 2025

https://github.com/open-mmlab/mmgeneration

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

diffusion-models gan generative generative-adversarial-network mmcv openmmlab pytorch

Last synced: 15 May 2025

https://github.com/open-mmlab/mmaction

An open-source toolbox for action understanding based on PyTorch

action-detection action-recognition pytorch spatial-temporal-action-detection temporal-action-detection temporal-action-localization video-understanding

Last synced: 15 May 2025

https://github.com/open-mmlab/mmrazor

OpenMMLab Model Compression Toolbox and Benchmark.

autoslim classification darts detection knowledge-distillation nas pruning pytorch quantization segmentation spos

Last synced: 14 May 2025

https://github.com/open-mmlab/multimodal-gpt

Multimodal-GPT

flamingo gpt gpt-4 llama multimodal transformer vision-and-language

Last synced: 15 May 2025

https://github.com/open-mmlab/mmhuman3d

OpenMMLab 3D Human Parametric Model Toolbox and Benchmark

Last synced: 14 May 2025

https://github.com/open-mmlab/mmfashion

Open-source toolbox for visual fashion analysis based on PyTorch

attribute-prediction clothes-retrieval deep-learning fashion-ai landmark-detection visual-fashion-analysis

Last synced: 15 May 2025

https://github.com/open-mmlab/mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

ai computer-vision deep-learning machine-learning python pytorch

Last synced: 13 May 2025

https://github.com/open-mmlab/playground

A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.

mmdetection mmocr mmpose mmrotate openmmlab sam

Last synced: 15 May 2025

https://github.com/open-mmlab/openmmlabcourse

OpenMMLab course index and stuff

computer-vision tutorials

Last synced: 16 May 2025

https://github.com/open-mmlab/OpenMMLabCourse

OpenMMLab course index and stuff

computer-vision tutorials

Last synced: 20 Mar 2025

https://github.com/open-mmlab/mmflow

OpenMMLab optical flow toolbox and benchmark

openmmlab optical-flow pytorch

Last synced: 15 May 2025

https://github.com/open-mmlab/pia

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA，你的个性化图像动画生成器，利用文本提示将图像变为奇妙的动画

aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion

Last synced: 16 May 2025

https://github.com/open-mmlab/PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA，你的个性化图像动画生成器，利用文本提示将图像变为奇妙的动画

aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion

Last synced: 28 Mar 2025

https://github.com/open-mmlab/powerpaint

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型，可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成，只需要一个模型

aigc generation image-editing image-inpainting stable-diffusion

Last synced: 07 Apr 2025