Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/vwxyzjn/benchmark-ci
https://github.com/vwxyzjn/benchmark-ci
Last synced: 4 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/vwxyzjn/benchmark-ci
- Owner: vwxyzjn
- Created: 2023-09-07T14:23:03.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-10-25T20:00:35.000Z (over 1 year ago)
- Last Synced: 2024-10-11T05:50:58.185Z (4 months ago)
- Language: Python
- Size: 1.06 MB
- Stars: 7
- Watchers: 2
- Forks: 1
- Open Issues: 11
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Benchmark-CI
This is a project setting up automatic benchmark experiments with CI triggered by github comments.
Basically, we want to input a command in the issue like `/deploy` and our CI will run trigger the benchmark experiments and report the results back to the issue, like this:
data:image/s3,"s3://crabby-images/43981/4398143c38e9590af8aca9b0473bfb13ca36f48a" alt=""
## Basic design
The process is as follows:
1. In `.github/workflows/blank.yml` we trigger the CI when there is a `/deploy` comment in the issue.
1. The CI runs `benchmark/benchmark_and_report.sh`
1. The script with run `benchmark/benchmark.sh` to run the benchmark experiments in a SLURM cluster.
1. The script then parse the job ids and run a dependency job `sbatch --dependency=afterany:$job_ids benchmark/post_github_comment.sbatch`, basically invoking the script `benchmark/post_github_comment.py` to post the benchmark results to the issue once all the experiments finished.## Requirement
* You need a slurm cluster and use its head node to set up a Github self-hosted runner that has access to the `sbatch` command.
data:image/s3,"s3://crabby-images/5a050/5a050b559c4321ebf2c2b73765264e668161af2e" alt=""
* You need to set up the required secrets, which are:
* `HUGGING_FACE_HUB_TOKEN`: a token to access the HuggingFace hub.
* `PERSONAL_ACCESS_TOKEN_GITHUB`: a token to access the Github API. This is required because the Github action token only lasts for the duration of the workflow, which is not enough for the benchmark experiments to finish.
* `WANDB_API_KEY`: a token to access the Weights and Biases API.
data:image/s3,"s3://crabby-images/885ee/885ee2d44edf1eed485aea2b1785bb32112a06a9" alt=""## Reference
* https://dev.to/zirkelc/trigger-github-workflow-for-comment-on-pull-request-45l2