https://github.com/liyuanlucasliu/transformer-clinic
Understanding the Difficulty of Training Transformers
- Host: GitHub
- URL: https://github.com/liyuanlucasliu/transformer-clinic
- Owner: LiyuanLucasLiu
- License: apache-2.0
- Created: 2020-04-01T16:24:54.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2022-05-31T18:27:35.000Z (about 3 years ago)
- Last Synced: 2025-04-09T15:08:39.893Z (about 2 months ago)
- Topics: initialization, nmt, transformer, wmt
- Language: Python
- Homepage: https://arxiv.org/abs/2004.08249
- Size: 5.15 MB
- Stars: 328
- Watchers: 11
- Forks: 19
- Open Issues: 1