awesome-transformer

This repo is not maintained. For the latest version, please visit https://github.com/ictnlp. A collection of guides, implementations and variants of the transformer.
https://github.com/SkyAndCloud/awesome-transformer


Papers
- NMT Basic
- Transformer original paper
  - Attention is All You Need
Implementations & How to reproduce the paper's results?
- Minimal, paper-equivalent but not necessarily performance-reproducible implementations (both are *PyTorch* implementations)
  - code
  - code
- Complex, performance-reproducible implementations
  - tensor2tensor
  - OpenNMT-py
  - sacrebleu - `v13a.pl` but more convenient; use it to calculate the BLEU score and report the signature, e.g. `BLEU+case.mixed+lang.de-en+test.wmt17 = 32.97 66.1/40.2/26.6/18.1 (BP = 0.980 ratio = 0.980 hyp_len = 63134 ref_len = 64399)`, for easy reproduction.
  - tensor2tensor transformer.py
  - OpenNMT-tf
  - get_ende_bleu.sh
  - transformer_base_multistep8
  - t2t issue 539
  - t2t issue 444
  - t2t issue 317
  - Tensor2Tensor for Neural Machine Translation
  - corpus preprocessed by OpenNMT (trainingdata/wmt_ende_sp_model.tar.gz). Note that preprocessing includes tokenization and a BPE/word-piece step (here done with Google's [sentencepiece](https://github.com/google/sentencepiece), which implements subword segmentation); see the [OpenNMT-tf script](https://github.com/OpenNMT/OpenNMT-tf/blob/master/scripts/wmt/prepare_data.sh) for more details.
  - OpenNMT-py issue
  - OpenNMT: Open-Source Toolkit for Neural Machine Translation
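  - doc - use `--update-freq` when training to accumulate the gradients of every `N` batches before each update, so `N` is `8` for 1 GPU, `2` for 4 GPUs and so on.
  - Why the "Transformer" Is So Powerful: A Comprehensive Analysis of Google's Tensor2Tensor System, from Model to Code (article in Chinese)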
  - the preprocessed WMT'16 EN-DE data provided by Google
- Transformer original paper
  - transformer result
- Complex, not necessarily performance-reproducible implementations
  - Marian
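
The sacrebleu signature quoted in the list above carries enough information to sanity-check the reported score by hand. The sketch below is not sacrebleu's own code: it is a minimal re-derivation that assumes the standard corpus BLEU definition (brevity penalty times the geometric mean of the four n-gram precisions). The function names `brevity_penalty` and `bleu_from_signature` are hypothetical helpers introduced for illustration.

```python
import math


def brevity_penalty(hyp_len: int, ref_len: int) -> float:
    """Standard BLEU brevity penalty: 1 unless the hypothesis is shorter than the reference."""
    if hyp_len >= ref_len:
        return 1.0
    return math.exp(1.0 - ref_len / hyp_len)


def bleu_from_signature(precisions, hyp_len: int, ref_len: int) -> float:
    """Combine n-gram precisions (given in percent) into a corpus-level BLEU score."""
    # Geometric mean of the precisions, computed in log space.
    log_mean = sum(math.log(p) for p in precisions) / len(precisions)
    return brevity_penalty(hyp_len, ref_len) * math.exp(log_mean)


# Figures copied from the example signature in the list above.
precisions = [66.1, 40.2, 26.6, 18.1]
hyp_len, ref_len = 63134, 64399

bp = brevity_penalty(hyp_len, ref_len)
bleu = bleu_from_signature(precisions, hyp_len, ref_len)
print(f"ratio = {hyp_len / ref_len:.3f}  BP = {bp:.3f}  BLEU = {bleu:.2f}")
```

Because the precisions in the signature are already rounded to one decimal place, the recomputed score should agree with the reported 32.97 only to within rounding, which is still enough to catch a mismatched reference set or a mistyped signature.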
Training tips
- Complex, not necessarily performance-reproducible implementations
  - Training Tips for the Transformer Model
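
The `--update-freq` note in the implementations list (`8` for 1 GPU, `2` for 4 GPUs) comes down to keeping the effective batch size constant. The sketch below shows only that arithmetic, assuming the reference setup uses 8 GPUs as the note implies. The helper names `update_freq` and `effective_batch`, and the per-GPU batch size of 4096 tokens, are hypothetical values chosen for illustration and are not taken from any toolkit.

```python
def update_freq(target_gpus: int, available_gpus: int) -> int:
    """Number of batches to accumulate per update so that `available_gpus`
    GPUs see as much data per update as `target_gpus` GPUs would."""
    if available_gpus <= 0 or target_gpus % available_gpus != 0:
        raise ValueError("target_gpus must be a positive multiple of available_gpus")
    return target_gpus // available_gpus


def effective_batch(tokens_per_gpu: int, gpus: int, freq: int) -> int:
    """Tokens contributing to a single parameter update."""
    return tokens_per_gpu * gpus * freq


TARGET_GPUS = 8        # assumed reference setup, implied by "8 for 1 GPU"
TOKENS_PER_GPU = 4096  # hypothetical per-GPU batch size in tokens

for gpus in (1, 2, 4, 8):
    freq = update_freq(TARGET_GPUS, gpus)
    total = effective_batch(TOKENS_PER_GPU, gpus, freq)
    print(f"{gpus} GPU(s): update frequency {freq}, {total} tokens per update")
```

Every configuration yields the same number of tokens per update, which is why a model trained on fewer GPUs with accumulation can follow the same learning-rate schedule as the multi-GPU run.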
Further
- Complex, not necessarily performance-reproducible implementations
Uncategorized
- Uncategorized
  - Yong Shan
  - Jinchao Zhang
