https://github.com/open-mmlab/mmengine

OpenMMLab Foundational Library for Training Deep Learning Models
https://github.com/open-mmlab/mmengine
ai computer-vision deep-learning machine-learning python pytorch
Last synced: 2 months ago
JSON representation
OpenMMLab Foundational Library for Training Deep Learning Models
Host: GitHub
URL: https://github.com/open-mmlab/mmengine
Owner: open-mmlab
License: apache-2.0
Created: 2022-02-08T09:05:09.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2025-03-04T12:22:42.000Z (4 months ago)
Last Synced: 2025-04-12T07:16:40.376Z (3 months ago)
Topics: ai, computer-vision, deep-learning, machine-learning, python, pytorch
Language: Python
Homepage: https://mmengine.readthedocs.io/
Size: 3.99 MB
Stars: 1,264
Watchers: 24
Forks: 386
Open Issues: 243
Metadata Files:
- Readme: README.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
- Code of conduct: .github/CODE_OF_CONDUCT.md
- Citation: CITATION.cff
Awesome Lists containing this project

README

        


  

   

  

    OpenMMLab website

    ^HOT

        

    OpenMMLab platform

    ^{TRY IT OUT}

  

   


[![PyPI - Python Version](https://img.shields.io/pypi/pyversions/mmengine)](https://pypi.org/project/mmengine/)

[![pytorch](https://img.shields.io/badge/pytorch-1.6~2.1-yellow)](#installation)

[![PyPI](https://img.shields.io/pypi/v/mmengine)](https://pypi.org/project/mmengine)

[![license](https://img.shields.io/github/license/open-mmlab/mmengine.svg)](https://github.com/open-mmlab/mmengine/blob/main/LICENSE)

[Introduction](#introduction) |

[Installation](#installation) |

[Get Started](#get-started) |

[📘Documentation](https://mmengine.readthedocs.io/en/latest/) |

[🤔Reporting Issues](https://github.com/open-mmlab/mmengine/issues/new/choose)





English | [简体中文](README_zh-CN.md)





  

    

  

  

    

  

  

    

  

  

    

  

  

    

  

  

    



## What's New

v0.10.6 was released on 2025-01-13.

Highlights:

- Support custom `artifact_location` in MLflowVisBackend [#1505](#1505)

- Enable `exclude_frozen_parameters` for `DeepSpeedEngine._zero3_consolidated_16bit_state_dict` [#1517](#1517)

Read [Changelog](./docs/en/notes/changelog.md#v0104-2342024) for more details.

## Introduction

MMEngine is a foundational library for training deep learning models based on PyTorch. It serves as the training engine of all OpenMMLab codebases, which support hundreds of algorithms in various research areas. Moreover, MMEngine is also generic to be applied to non-OpenMMLab projects. Its highlights are as follows:

**Integrate mainstream large-scale model training frameworks**

- [ColossalAI](https://mmengine.readthedocs.io/en/latest/common_usage/large_model_training.html#colossalai)

- [DeepSpeed](https://mmengine.readthedocs.io/en/latest/common_usage/large_model_training.html#deepspeed)

- [FSDP](https://mmengine.readthedocs.io/en/latest/common_usage/large_model_training.html#fullyshardeddataparallel-fsdp)

**Supports a variety of training strategies**

- [Mixed Precision Training](https://mmengine.readthedocs.io/en/latest/common_usage/speed_up_training.html#mixed-precision-training)

- [Gradient Accumulation](https://mmengine.readthedocs.io/en/latest/common_usage/save_gpu_memory.html#gradient-accumulation)

- [Gradient Checkpointing](https://mmengine.readthedocs.io/en/latest/common_usage/save_gpu_memory.html#gradient-checkpointing)

**Provides a user-friendly configuration system**

- [Pure Python-style configuration files, easy to navigate](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/config.html#a-pure-python-style-configuration-file-beta)

- [Plain-text-style configuration files, supporting JSON and YAML](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/config.html)

**Covers mainstream training monitoring platforms**

- [TensorBoard](https://mmengine.readthedocs.io/en/latest/common_usage/visualize_training_log.html#tensorboard) | [WandB](https://mmengine.readthedocs.io/en/latest/common_usage/visualize_training_log.html#wandb) | [MLflow](https://mmengine.readthedocs.io/en/latest/common_usage/visualize_training_log.html#mlflow-wip)

- [ClearML](https://mmengine.readthedocs.io/en/latest/common_usage/visualize_training_log.html#clearml) | [Neptune](https://mmengine.readthedocs.io/en/latest/common_usage/visualize_training_log.html#neptune) | [DVCLive](https://mmengine.readthedocs.io/en/latest/common_usage/visualize_training_log.html#dvclive) | [Aim](https://mmengine.readthedocs.io/en/latest/common_usage/visualize_training_log.html#aim)

## Installation

Supported PyTorch Versions

| MMEngine           | PyTorch      | Python         |

| ------------------ | ------------ | -------------- |

| main               | >=1.6 \<=2.1 | >=3.8, \<=3.11 |

| >=0.9.0, \<=0.10.4 | >=1.6 \<=2.1 | >=3.8, \<=3.11 |

Before installing MMEngine, please ensure that PyTorch has been successfully installed following the [official guide](https://pytorch.org/get-started/locally/).

Install MMEngine

```bash

pip install -U openmim

mim install mmengine

```

Verify the installation

```bash

python -c 'from mmengine.utils.dl_utils import collect_env;print(collect_env())'

```

## Get Started

Taking the training of a ResNet-50 model on the CIFAR-10 dataset as an example, we will use MMEngine to build a complete, configurable training and validation process in less than 80 lines of code.

Build Models

First, we need to define a **model** which 1) inherits from `BaseModel` and 2) accepts an additional argument `mode` in the `forward` method, in addition to those arguments related to the dataset.

- During training, the value of `mode` is "loss", and the `forward` method should return a `dict` containing the key "loss".

- During validation, the value of `mode` is "predict", and the forward method should return results containing both predictions and labels.

```python

import torch.nn.functional as F

import torchvision

from mmengine.model import BaseModel

class MMResNet50(BaseModel):

    def __init__(self):

        super().__init__()

        self.resnet = torchvision.models.resnet50()

    def forward(self, imgs, labels, mode):

        x = self.resnet(imgs)

        if mode == 'loss':

            return {'loss': F.cross_entropy(x, labels)}

        elif mode == 'predict':

            return x, labels

```

Build Datasets

Next, we need to create **Dataset**s and **DataLoader**s for training and validation.

In this case, we simply use built-in datasets supported in TorchVision.

```python

import torchvision.transforms as transforms

from torch.utils.data import DataLoader

norm_cfg = dict(mean=[0.491, 0.482, 0.447], std=[0.202, 0.199, 0.201])

train_dataloader = DataLoader(batch_size=32,

                              shuffle=True,

                              dataset=torchvision.datasets.CIFAR10(

                                  'data/cifar10',

                                  train=True,

                                  download=True,

                                  transform=transforms.Compose([

                                      transforms.RandomCrop(32, padding=4),

                                      transforms.RandomHorizontalFlip(),

                                      transforms.ToTensor(),

                                      transforms.Normalize(**norm_cfg)

                                  ])))

val_dataloader = DataLoader(batch_size=32,

                            shuffle=False,

                            dataset=torchvision.datasets.CIFAR10(

                                'data/cifar10',

                                train=False,

                                download=True,

                                transform=transforms.Compose([

                                    transforms.ToTensor(),

                                    transforms.Normalize(**norm_cfg)

                                ])))

```

Build Metrics

To validate and test the model, we need to define a **Metric** called accuracy to evaluate the model. This metric needs to inherit from `BaseMetric` and implements the `process` and `compute_metrics` methods.

```python

from mmengine.evaluator import BaseMetric

class Accuracy(BaseMetric):

    def process(self, data_batch, data_samples):

        score, gt = data_samples

        # Save the results of a batch to `self.results`

        self.results.append({

            'batch_size': len(gt),

            'correct': (score.argmax(dim=1) == gt).sum().cpu(),

        })

    def compute_metrics(self, results):

        total_correct = sum(item['correct'] for item in results)

        total_size = sum(item['batch_size'] for item in results)

        # Returns a dictionary with the results of the evaluated metrics,

        # where the key is the name of the metric

        return dict(accuracy=100 * total_correct / total_size)

```

Build a Runner

Finally, we can construct a **Runner** with previously defined `Model`, `DataLoader`, and `Metrics`, with some other configs, as shown below.

```python

from torch.optim import SGD

from mmengine.runner import Runner

runner = Runner(

    model=MMResNet50(),

    work_dir='./work_dir',

    train_dataloader=train_dataloader,

    # a wrapper to execute back propagation and gradient update, etc.

    optim_wrapper=dict(optimizer=dict(type=SGD, lr=0.001, momentum=0.9)),

    # set some training configs like epochs

    train_cfg=dict(by_epoch=True, max_epochs=5, val_interval=1),

    val_dataloader=val_dataloader,

    val_cfg=dict(),

    val_evaluator=dict(type=Accuracy),

)

```

Launch Training

```python

runner.train()

```

## Learn More

Tutorials

- [Runner](https://mmengine.readthedocs.io/en/latest/tutorials/runner.html)

- [Dataset and DataLoader](https://mmengine.readthedocs.io/en/latest/tutorials/dataset.html)

- [Model](https://mmengine.readthedocs.io/en/latest/tutorials/model.html)

- [Evaluation](https://mmengine.readthedocs.io/en/latest/tutorials/evaluation.html)

- [OptimWrapper](https://mmengine.readthedocs.io/en/latest/tutorials/optim_wrapper.html)

- [Parameter Scheduler](https://mmengine.readthedocs.io/en/latest/tutorials/param_scheduler.html)

- [Hook](https://mmengine.readthedocs.io/en/latest/tutorials/hook.html)

Advanced tutorials

- [Registry](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/registry.html)

- [Config](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/config.html)

- [BaseDataset](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/basedataset.html)

- [Data Transform](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/data_transform.html)

- [Weight Initialization](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/initialize.html)

- [Visualization](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/visualization.html)

- [Abstract Data Element](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/data_element.html)

- [Distribution Communication](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/distributed.html)

- [Logging](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/logging.html)

- [File IO](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/fileio.html)

- [Global manager (ManagerMixin)](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/manager_mixin.html)

- [Use modules from other libraries](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/cross_library.html)

- [Test Time Agumentation](https://mmengine.readthedocs.io/en/latest/advanced_tutorials/test_time_augmentation.html)

Examples

- [Train a GAN](https://mmengine.readthedocs.io/en/latest/examples/train_a_gan.html)

Common Usage

- [Resume Training](https://mmengine.readthedocs.io/en/latest/common_usage/resume_training.html)

- [Speed up Training](https://mmengine.readthedocs.io/en/latest/common_usage/speed_up_training.html)

- [Save Memory on GPU](https://mmengine.readthedocs.io/en/latest/common_usage/save_gpu_memory.html)

Design

- [Hook](https://mmengine.readthedocs.io/en/latest/design/hook.html)

- [Runner](https://mmengine.readthedocs.io/en/latest/design/runner.html)

- [Evaluation](https://mmengine.readthedocs.io/en/latest/design/evaluation.html)

- [Visualization](https://mmengine.readthedocs.io/en/latest/design/visualization.html)

- [Logging](https://mmengine.readthedocs.io/en/latest/design/logging.html)

- [Infer](https://mmengine.readthedocs.io/en/latest/design/infer.html)

Migration guide

- [Migrate Runner from MMCV to MMEngine](https://mmengine.readthedocs.io/en/latest/migration/runner.html)

- [Migrate Hook from MMCV to MMEngine](https://mmengine.readthedocs.io/en/latest/migration/hook.html)

- [Migrate Model from MMCV to MMEngine](https://mmengine.readthedocs.io/en/latest/migration/model.html)

- [Migrate Parameter Scheduler from MMCV to MMEngine](https://mmengine.readthedocs.io/en/latest/migration/param_scheduler.html)

- [Migrate Data Transform to OpenMMLab 2.0](https://mmengine.readthedocs.io/en/latest/migration/transform.html)

## Contributing

We appreciate all contributions to improve MMEngine. Please refer to [CONTRIBUTING.md](CONTRIBUTING.md) for the contributing guideline.

## Citation

If you find this project useful in your research, please consider cite:

```

@article{mmengine2022,

  title   = {{MMEngine}: OpenMMLab Foundational Library for Training Deep Learning Models},

  author  = {MMEngine Contributors},

  howpublished = {\url{https://github.com/open-mmlab/mmengine}},

  year={2022}

}

```

## License

This project is released under the [Apache 2.0 license](LICENSE).

## Ecosystem

- [APES: Attention-based Point Cloud Edge Sampling](https://github.com/JunweiZheng93/APES)

- [DiffEngine: diffusers training toolbox with mmengine](https://github.com/okotaku/diffengine)

## Projects in OpenMMLab

- [MIM](https://github.com/open-mmlab/mim): MIM installs OpenMMLab packages.

- [MMCV](https://github.com/open-mmlab/mmcv): OpenMMLab foundational library for computer vision.

- [MMEval](https://github.com/open-mmlab/mmeval): A unified evaluation library for multiple machine learning libraries.

- [MMPreTrain](https://github.com/open-mmlab/mmpretrain): OpenMMLab pre-training toolbox and benchmark.

- [MMagic](https://github.com/open-mmlab/mmagic): Open**MM**Lab **A**dvanced, **G**enerative and **I**ntelligent **C**reation toolbox.

- [MMDetection](https://github.com/open-mmlab/mmdetection): OpenMMLab detection toolbox and benchmark.

- [MMYOLO](https://github.com/open-mmlab/mmyolo): OpenMMLab YOLO series toolbox and benchmark.

- [MMDetection3D](https://github.com/open-mmlab/mmdetection3d): OpenMMLab's next-generation platform for general 3D object detection.

- [MMRotate](https://github.com/open-mmlab/mmrotate): OpenMMLab rotated object detection toolbox and benchmark.

- [MMTracking](https://github.com/open-mmlab/mmtracking): OpenMMLab video perception toolbox and benchmark.

- [MMPose](https://github.com/open-mmlab/mmpose): OpenMMLab pose estimation toolbox and benchmark.

- [MMSegmentation](https://github.com/open-mmlab/mmsegmentation): OpenMMLab semantic segmentation toolbox and benchmark.

- [MMOCR](https://github.com/open-mmlab/mmocr): OpenMMLab text detection, recognition, and understanding toolbox.

- [MMHuman3D](https://github.com/open-mmlab/mmhuman3d): OpenMMLab 3D human parametric model toolbox and benchmark.

- [MMSelfSup](https://github.com/open-mmlab/mmselfsup): OpenMMLab self-supervised learning toolbox and benchmark.

- [MMFewShot](https://github.com/open-mmlab/mmfewshot): OpenMMLab fewshot learning toolbox and benchmark.

- [MMAction2](https://github.com/open-mmlab/mmaction2): OpenMMLab's next-generation action understanding toolbox and benchmark.

- [MMFlow](https://github.com/open-mmlab/mmflow): OpenMMLab optical flow toolbox and benchmark.

- [MMDeploy](https://github.com/open-mmlab/mmdeploy): OpenMMLab model deployment framework.

- [MMRazor](https://github.com/open-mmlab/mmrazor): OpenMMLab model compression toolbox and benchmark.

- [Playground](https://github.com/open-mmlab/playground): A central hub for gathering and showcasing amazing projects built upon OpenMMLab.
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/open-mmlab/mmengine

Awesome Lists containing this project

README