Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/willguimont/sjm

Manage SLURM jobs on multiple clusters easily.
https://github.com/willguimont/sjm

cli machine-learning ocaml slurm tools

Last synced: about 11 hours ago
JSON representation

Manage SLURM jobs on multiple clusters easily.

Awesome Lists containing this project

README

        

# sjm: **S**LURM **J**ob **M**anager

Manage SLURM jobs on multiple clusters easily.

## Installation

```bash
git clone https://github.com/willGuimont/sjm
opam install .
```

## Usage

```bash
# Add remote (ssh-copy-id is required)
sjm add
# List remotes
sjm ls
# Remove remote
sjm rm
# List jobs on remote
sjm ps
# Submit job on remote
sjm run [list of replace patterns of the form "pattern=value" that will be place each $pattern in the script by value]
# Clear tmp jobs cache
sjm clr
# Clear tmp jobs cache on host
sjm clr-remote
# git pull in a directory on a remote
sjm pull
```

## Example

```bash
sjm add mycluster [email protected]
# See test_job.sh for an example of a job script
sjm pull mycluster my_project
sjm run mycluster test_job.sh NUMBER_GPU=2 CONFIG=my_config.yml
sjm ps mycluster
```