Projects in Awesome Lists by open-mmlab
A curated list of projects in awesome lists by open-mmlab .
https://github.com/open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
cascade-rcnn convnext detr fast-rcnn faster-rcnn glip grounding-dino instance-segmentation mask-rcnn object-detection panoptic-segmentation pytorch retinanet rtmdet semisupervised-learning ssd swin-transformer transformer vision-transformer yolo
Last synced: 12 May 2025
https://github.com/open-mmlab/amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion
Last synced: 12 May 2025
https://github.com/open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
deeplabv3 image-segmentation medical-image-segmentation pspnet pytorch realtime-segmentation retinal-vessel-segmentation semantic-segmentation swin-transformer transformer vessel-segmentation
Last synced: 15 May 2025
https://github.com/open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion
Last synced: 28 Mar 2025
https://github.com/open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution
Last synced: 13 May 2025
https://github.com/open-mmlab/mmediting
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution
Last synced: 05 Apr 2025
https://github.com/open-mmlab/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
animal-pose-estimation benchmark cpm crowdpose face-keypoint freihand hand-pose-estimation higher-hrnet hourglass hrnet human-pose mmpose mpii mspn ochuman pose-estimation pytorch rsn rtmpose udp
Last synced: 12 May 2025
https://github.com/open-mmlab/mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
3d-object-detection object-detection point-cloud pytorch
Last synced: 23 Apr 2025
https://github.com/open-mmlab/openpcdet
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
3d-detection autonomous-driving object-detection point-cloud pv-rcnn pytorch
Last synced: 14 May 2025
https://github.com/open-mmlab/OpenPCDet
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
3d-detection autonomous-driving object-detection point-cloud pv-rcnn pytorch
Last synced: 20 Mar 2025
https://github.com/open-mmlab/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
action-recognition ava benchmark deep-learning i3d non-local openmmlab posec3d pytorch slowfast spatial-temporal-action-detection temporal-action-localization tsm tsn uniformerv2 video-classification video-understanding x3d
Last synced: 13 May 2025
https://github.com/open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
abcnet abinet crnn dbnet deep-learning fcenet key-information-extraction maskrcnn ocr pan panet psenet pytorch sar sdmg-r segmentation-based-text-recognition spts svtr text-detection text-recognition
Last synced: 14 May 2025
https://github.com/open-mmlab/mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
multi-object-tracking single-object-tracking tracking video-instance-segmentation video-object-detection
Last synced: 14 May 2025
https://github.com/open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
beit clip constrastive-learning convnext deep-learning image-classification mae masked-image-modeling mobilenet moco multimodal pretrained-models pytorch resnet self-supervised-learning swin-transformer vision-transformer
Last synced: 07 May 2025
https://github.com/open-mmlab/mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
beit mae masked-image-modeling moco pytorch self-supervised-learning simclr simsiam unsupervised-learning
Last synced: 14 May 2025
https://github.com/open-mmlab/mmyolo
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.
deep-learning object-detection ppyoloe pytorch rotated-object-detection rtmdet yolo yolov5 yolov6 yolov7 yolov8 yolox
Last synced: 14 May 2025
https://github.com/open-mmlab/mmskeleton
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
action-recognition deep-learning graph-convolutional-network pytorch skeleton-based-action-recognition
Last synced: 13 Apr 2025
https://github.com/open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
computer-vision deep-learning deployment mmdetection mmsegmentation model-converter ncnn onnx onnxruntime openvino pplnn pytorch sdk tensorrt
Last synced: 13 May 2025
https://github.com/open-mmlab/mmrotate
OpenMMLab Rotated Object Detection Toolbox and Benchmark
detection openmmlab pytorch rotated-object
Last synced: 14 May 2025
https://github.com/open-mmlab/mmgeneration
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
diffusion-models gan generative generative-adversarial-network mmcv openmmlab pytorch
Last synced: 15 May 2025
https://github.com/open-mmlab/mmaction
An open-source toolbox for action understanding based on PyTorch
action-detection action-recognition pytorch spatial-temporal-action-detection temporal-action-detection temporal-action-localization video-understanding
Last synced: 15 May 2025
https://github.com/open-mmlab/mmrazor
OpenMMLab Model Compression Toolbox and Benchmark.
autoslim classification darts detection knowledge-distillation nas pruning pytorch quantization segmentation spos
Last synced: 14 May 2025
https://github.com/open-mmlab/multimodal-gpt
Multimodal-GPT
flamingo gpt gpt-4 llama multimodal transformer vision-and-language
Last synced: 15 May 2025
https://github.com/open-mmlab/mmhuman3d
OpenMMLab 3D Human Parametric Model Toolbox and Benchmark
Last synced: 14 May 2025
https://github.com/open-mmlab/mmfashion
Open-source toolbox for visual fashion analysis based on PyTorch
attribute-prediction clothes-retrieval deep-learning fashion-ai landmark-detection visual-fashion-analysis
Last synced: 15 May 2025
https://github.com/open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
ai computer-vision deep-learning machine-learning python pytorch
Last synced: 13 May 2025
https://github.com/open-mmlab/playground
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
mmdetection mmocr mmpose mmrotate openmmlab sam
Last synced: 15 May 2025
https://github.com/open-mmlab/openmmlabcourse
OpenMMLab course index and stuff
Last synced: 16 May 2025
https://github.com/open-mmlab/OpenMMLabCourse
OpenMMLab course index and stuff
Last synced: 20 Mar 2025
https://github.com/open-mmlab/mmflow
OpenMMLab optical flow toolbox and benchmark
openmmlab optical-flow pytorch
Last synced: 15 May 2025
https://github.com/open-mmlab/pia
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion
Last synced: 16 May 2025
https://github.com/open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画
aigc animation diffusion-models image-to-video image-to-video-generation personalized-generation stable-diffusion
Last synced: 28 Mar 2025
https://github.com/open-mmlab/powerpaint
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model. 一个高质量多功能的图像修补模型,可以同时支持插入物体、移除物体、图像扩展、形状可控的物体生成,只需要一个模型
aigc generation image-editing image-inpainting stable-diffusion
Last synced: 07 Apr 2025
https://github.com/open-mmlab/mmfewshot
OpenMMLab FewShot Learning Toolbox and Benchmark
few-shot-learning openmmlab pytorch
Last synced: 16 May 2025
https://github.com/open-mmlab/foleycrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio
Last synced: 04 Apr 2025
https://github.com/open-mmlab/FoleyCrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio
Last synced: 15 Feb 2025
https://github.com/open-mmlab/openunreid
PyTorch open-source toolbox for unsupervised or domain adaptive object re-ID.
domain-translation image-retrieval open-set-domain-adaptation pseudo-labeling re-identification unsupervised-domain-adaptation unsupervised-learning
Last synced: 06 Apr 2025
https://github.com/open-mmlab/labelbee-client
Out-of-the-box Annotation Toolbox
Last synced: 05 Apr 2025
https://github.com/open-mmlab/styleshot
StyleShot: A SnapShot on Any Style. 一款可以迁移任意风格到任意内容的模型,无需针对图片微调,即能生成高质量的个性风格化图片!
controllable-generation style-transfer text-to-image
Last synced: 16 May 2025
https://github.com/open-mmlab/labelbee
LabelBee is an annotation Library
annotation frontend javascript
Last synced: 15 May 2025
https://github.com/open-mmlab/mmeval
A unified evaluation library for multiple machine learning libraries
machine-learning metrics python pytorch tensorflow
Last synced: 04 Apr 2025
https://github.com/open-mmlab/live2diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
aigc diffusion-models personalized-generation stable-diffusion video-to-video video-to-video-translation
Last synced: 08 May 2025
https://github.com/open-mmlab/Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
aigc diffusion-models personalized-generation stable-diffusion video-to-video video-to-video-translation
Last synced: 28 Mar 2025
https://github.com/open-mmlab/mmstyles
Latex style file to facilitate writing of technical papers
Last synced: 27 Feb 2025