An open API service indexing awesome lists of open source software.

https://github.com/collabora/sigmareparam-pytorch

An unofficial implementation of the σReparam from the "Stabilizing Transformer Training by Preventing Attention Entropy Collapse" paper
https://github.com/collabora/sigmareparam-pytorch

Last synced: 3 months ago
JSON representation

An unofficial implementation of the σReparam from the "Stabilizing Transformer Training by Preventing Attention Entropy Collapse" paper

Awesome Lists containing this project