https://github.com/outerbounds/pydata-global-2023-hpc
https://github.com/outerbounds/pydata-global-2023-hpc
Last synced: 12 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/outerbounds/pydata-global-2023-hpc
- Owner: outerbounds
- Created: 2023-12-07T13:34:27.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-12-07T15:25:01.000Z (over 2 years ago)
- Last Synced: 2025-06-16T13:12:42.484Z (12 months ago)
- Language: Python
- Size: 3.09 MB
- Stars: 3
- Watchers: 5
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Instructions
```
git clone https://github.com/outerbounds/pydata-global-2023-hpc.git
```
## Part 1: [Get started - torchx for scheduling local jobs](./torchx-local/)
This section will help you familiarize with the Torchx tool, which helps you schedule batch jobs on a variety of solutions.
## Part 2: [Build systems - torchrun with Metaflow](./torchrun-cluster/)
This section will teach you to learn Metaflow by training a GPT model from scratch, all from Python code.
## Slides
You can find a PDF of the presentation slides [here](./PyData%20-%20HPC%20in%20the%20cloud.pdf).