Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/bsc-wdc/tutorial_sc24

SuperComputing 2024 Tutorial
https://github.com/bsc-wdc/tutorial_sc24

Last synced: about 1 month ago
JSON representation

SuperComputing 2024 Tutorial

Awesome Lists containing this project

README

        

# Optimising workflow lifecycle management: development, HPC-ready containers deployment and reproducibility

## SC24 tutorial, Atlanta, November 18th, 2024

## Abstract

With Exaflop systems already here, the application communities are eager to leverage these large and complex systems. The complexity is further increased by the applications' need to combine different aspects beyond traditional HPC solvers and simulators, with artificial intelligence (AI) and data analytics (DA). The eFlows4HPC project proposed a software stack and the HPC Workflows-as-a-Service (HPCWaaS) methodology to provide tools to simplify the development, deployment, execution, and reuse of workflows. These results are leveraged in the DT-GEO and CAELESTIS projects. These tools also aim to support the reproducibility, portability and ease of use of complex workflows.

The tutorial will focus on a set of tools and methodologies for managing the whole application workflow lifecycle. In particular, the tutorial will cover aspects of developing computational workflows with PyCOMPSs and new extensions to better integrate with AI and DA with examples from DT-GEO and CAELESTIS projects. The tutorial will also describe how to automatically record workflow provenance with PyCOMPSs to share FAIR workflows in public repositories, enabling their reproducibility. Finally, we will explain how to generate specific containers that leverage HPC systems features and use them in the workflow deployment phase. The tutorial will include hands-on sessions on different aspects.

## Agenda

| Time | Session | Presenter |
| --- | --- | --- |
| 08:30 – 08:35 | Overview of tutorial agenda | Rosa M Badia |
| 8:35 - 9:05 | [Part 1.1: Hybrid HPC+AI+DA workflow development with PyCOMPSs](slides/Agenda_and_Part_1.1.pdf) | Rosa M Badia |
| 9:05 - 9:35 | [Part 1.2: Workflows’ reproducibility through provenance](slides/Part_1.2.pdf) | Raül Sirvent |
| 9:35 - 9:50 | [Part 1.3: HPC ready container images Motivation for architecture specific containers](slides/Part_1.3.pdf) | Rosa M Badia |
| 9:50 - 10:00 | Hands-on preparation | All presenters |
| 10:00 - 10:30 | Break (30 min) | |
| 10:30 - 11:00 | [Part 2.1: Hands-on session: Sample workflows with PyCOMPSs and execution with containers](handson/Hands-on%20guide%20and%20questions.pdf) | Rosa M Badia |
| 11:00 - 11:45 | [Part 2.2: Hands-on session: How to automatically record workflow provenance and use it to share experiments in WorkflowHub](handson/Workflow%20Provenance%20Hands-on%20guide%20and%20questions.pdf) | Raül Sirvent |
| 11:45 - 12:00 | Tutorial conclusions | All presenters |