Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/stazizov/vit_cifar100
A PyTorch implementation of Vision Transformers as described in: An Image Is Worth 16 x 16 Words: Transformers for Image Recognition at Scale trained on CIFAR-100 dataset
https://github.com/stazizov/vit_cifar100
Last synced: 27 days ago
JSON representation
A PyTorch implementation of Vision Transformers as described in: An Image Is Worth 16 x 16 Words: Transformers for Image Recognition at Scale trained on CIFAR-100 dataset
- Host: GitHub
- URL: https://github.com/stazizov/vit_cifar100
- Owner: stazizov
- Created: 2023-06-24T21:44:23.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-10-12T12:07:08.000Z (about 1 year ago)
- Last Synced: 2024-05-15T09:46:54.333Z (6 months ago)
- Language: Python
- Homepage:
- Size: 6.84 KB
- Stars: 6
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Description
A PyTorch implementation of Vision Transformers as described in: `An Image Is Worth 16 x 16 Words: Transformers for Image Recognition at Scale` trained on CIFAR-100 dataset## Requirements
- torch==2.0.1
- pytorch_lightning==1.9.0
- torchmetrics==0.11.4
- torchvision==0.15.2## How to run
```bash
pip install -r requirements.txt
python main.py
```## Monitor metrics
```bash
tensorboard --logdir logs
```## Usage
```python
model = LightningVisionTransformer(
image_size = 384,
patch_size = 16,
in_channels = 3,
n_classes = 1000,
embedding_dimension=768,
depth=12,
n_heads=12,
mlp_ratio=4.,
qkv_bias=True,
proj_p=0.,
attn_p=0
)
```## References
- https://arxiv.org/abs/2010.11929
- https://github.com/huggingface/pytorch-image-models/blob/main/timm/models/vision_transformer.py