Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/aws/sagemaker-mxnet-training-toolkit
Toolkit for running MXNet training scripts on SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https://github.com/aws/deep-learning-containers.
https://github.com/aws/sagemaker-mxnet-training-toolkit
aws docker mxnet sagemaker
Last synced: about 18 hours ago
JSON representation
Toolkit for running MXNet training scripts on SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https://github.com/aws/deep-learning-containers.
- Host: GitHub
- URL: https://github.com/aws/sagemaker-mxnet-training-toolkit
- Owner: aws
- License: apache-2.0
- Created: 2018-01-25T22:55:24.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2023-04-26T15:07:53.000Z (almost 2 years ago)
- Last Synced: 2025-01-28T19:48:32.120Z (8 days ago)
- Topics: aws, docker, mxnet, sagemaker
- Language: Python
- Homepage:
- Size: 12.9 MB
- Stars: 60
- Watchers: 56
- Forks: 49
- Open Issues: 5
-
Metadata Files:
- Readme: README.rst
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
================================
SageMaker MXNet Training Toolkit
================================SageMaker MXNet Training Toolkit is an open-source library for using MXNet to train models on Amazon SageMaker.
For inference, see `SageMaker MXNet Inference Toolkit `__.
For the Dockerfiles used for building SageMaker MXNet Containers, see `AWS Deep Learning Containers `__.
For information on running MXNet jobs on Amazon SageMaker, please refer to the `SageMaker Python SDK documentation `__.Contributing
------------Please read `CONTRIBUTING.md `__
for details on our code of conduct, and the process for submitting pull requests to us.Testing
-------Set up a virtual environment for testing.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~One of the multiple ways to setup a virtual environment
::
# use a package virtualenv
# create a virtualenv
virtualenv -p python3
# activate the virtualenv
source /bin/activateInstall requirements
~~~~~~~~~~~~~~~~~~~~::
pip install --upgrade .[test]
Local Test
~~~~~~~~~~To run specific test
::
tox -- -k test/unit/test_training.py::test_train_for_distributed_scheduler
To run an entire file
::
tox -- test/unit/test_training.py
To run all tests within a folder [e.g. integration/local/]
Note: To run integration tests locally, one needs to build an image. To trigger image build, use `-B` flag.
::
tox -- test/integration/local
You can also run them in parallel:::
tox -- -n auto test/integration/local
To run for specific interpreter [Python environment], use the ``-e`` flag
::
tox -e py37 -- test/unit/test_training.py
Remote Test
~~~~~~~~~~~Make sure to provide AWS account ID, Region, Docker base name & Tag.
Docker Registry is composed of (aws_id, region)
Image URI is composed of (docker_registry, docker_base_name, tag)Resulting Image URI is composed as: ``{aws_id}.dkr.ecr.{region}.amazonaws.com/{docker_base_name}:{tag}``
::
tox -- --aws-id --region --docker-base-name --tag test/integration/sagemaker
For more details, refer `conftest.py `_
License
-------SageMaker MXNet Training Toolkit is licensed under the Apache 2.0 License.
It is copyright 2018 Amazon.com, Inc. or its affiliates. All Rights Reserved.
The license is available at: http://aws.amazon.com/apache2.0/