https://github.com/glavin001/llm-trainer

Training/Fine-Tuning Large Language Models
https://github.com/glavin001/llm-trainer

Last synced: 4 months ago
JSON representation

Training/Fine-Tuning Large Language Models

README

# LLM Trainer

> Training/Fine-Tuning Large Language Models

```bash
sudo docker login
```

Pull existing Docker image to warm build cache:

```bash
sudo docker pull glavin001/llm-trainer:may-28-433
```

Build & start container with Jupyter labs/notebook:

```bash
sudo docker compose up --build
```

If you see error:
> tornado.web.HTTPError: HTTP 403: Forbidden

Then stop the docker container and retry.

```bash
wandb login -u USERNAME
```

```bash
./scripts/train.sh
```