https://github.com/anto18671/pretraining-custom-timm

A flexible and extensible PyTorch pretraining script built atop the timm library.
https://github.com/anto18671/pretraining-custom-timm

computer-vision pretraining pytorch timm

Last synced: about 2 months ago
JSON representation

A flexible and extensible PyTorch pretraining script built atop the timm library.

Host: GitHub
URL: https://github.com/anto18671/pretraining-custom-timm
Owner: anto18671
License: mit
Created: 2025-08-31T01:49:53.000Z (10 months ago)
Default Branch: main
Last Pushed: 2025-09-03T19:42:09.000Z (10 months ago)
Last Synced: 2025-10-04T05:52:56.424Z (9 months ago)
Topics: computer-vision, pretraining, pytorch, timm
Language: Python
Homepage:
Size: 20.5 KB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# pretraining-custom-timm

A flexible and extensible **PyTorch pretraining script** built atop the `timm` library. Designed for pretraining custom vision models with modern techniques like mixed precision, advanced augmentations, learning rate scheduling, EMA, and checkpointing. Updated **August 31, 2025**.

---

## Features

- **Model Configuration**: Leverages `timm` for easily switching architectures.
- **Data Loading & Augmentation**: Includes standard and advanced augment techniques (e.g., RandAugment, RandomErasing).
- **Mixed Precision Training**: Built-in support for AMP using PyTorch's `torch.amp`.
- **Learning Rate Scheduling**: Configurable warmup + cosine decay.
- **EMA (Exponential Moving Average)**: Maintains smoothed model weights for robust validation performance.
- **Checkpoint Strategy**: Saves robust checkpoints: `last.pth`, `best.pth`, `last_ema.pth`, and `best_ema.pth`.
- **Training Logging**: Progress tracking with `tqdm`; summary of model via `torchsummary`.

---

## Installation

```bash
git clone https://github.com/anto18671/pretraining-custom-timm.git
cd pretraining-custom-timm
pip install -r requirements.txt
```

**Requirements** (aligning with this repo’s `requirements.txt`):

- `torch`
- `torchvision`
- `timm`
- `tqdm`
- `torchsummary`
- `datasets` _(if Hugging Face datasets are used)_
_(Adjust based on what's listed in the actual `requirements.txt`.)_

---

## Usage

Run the training script:

```bash
python train.py
```

### Default Hyperparameters

```text
Image size: 256
Batch size: 128
Epochs: 300
Warmup epochs: 20
Base LR: scaled as 5e-4 × (batch_size / 512)
Weight decay: 0.05
Mixup alpha: 0.8
CutMix alpha: 1.0
EMA decay: 0.9999
```

_(Update these defaults if your script uses different values.)_

---

## Checkpoints & Logging

- **Checkpoints**:

- `last.pth`: Last epoch weights.
- `best.pth`: Best validation performance (raw model).
- `last_ema.pth`: EMA model at last epoch.
- `best_ema.pth`: EMA model at best validation accuracy.

- **Training Logs**: Displayed via `tqdm` (loss, accuracy, learning rate).

---

## Example Training Console Output

---

## Additional Usage Examples

### Resuming from a Checkpoint

To resume training from the best checkpoint:

```python
checkpoint = torch.load("best.pth")
model.load_state_dict(checkpoint['state_dict'])
start_epoch = checkpoint['epoch'] + 1
```

### Evaluating EMA Weights

```python
ema_checkpoint = torch.load("best_ema.pth")
ema_model.load_state_dict(ema_checkpoint['state_dict'])
# Proceed with validation using ema_model...
```

---

## Future Extensions

- Distributed training with PyTorch (`torch.distributed`).
- Logging with TensorBoard or Weights & Biases.
- Fine-tuning scripts with custom head layers.
- Additional augmentations and data sampling strategies.

---

## License

This project is licensed under the **MIT License** — see the `LICENSE` file for details.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/anto18671/pretraining-custom-timm

Awesome Lists containing this project

README