An open API service indexing awesome lists of open source software.

https://github.com/naetherm/gptllm

Just a small learning project for implementing GPT2 LLM
https://github.com/naetherm/gptllm

Last synced: about 2 months ago
JSON representation

Just a small learning project for implementing GPT2 LLM

Awesome Lists containing this project

README

        

# gptLLM

Reimplementation of the papers "Language models are unsupervised multitask learners" and by that a reimplementation of GPT2.

## References

```
@article{radford2019language,
title={Language models are unsupervised multitask learners},
author={Radford, Alec and Wu, Jeffrey and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya and others},
journal={OpenAI blog},
volume={1},
number={8},
pages={9},
year={2019}
}

@article{radford2018improving,
title={Improving language understanding by generative pre-training},
author={Radford, Alec and Narasimhan, Karthik and Salimans, Tim and Sutskever, Ilya and others},
year={2018},
publisher={OpenAI}
}

@article{hendrycks2016gaussian,
title={Gaussian error linear units (gelus)},
author={Hendrycks, Dan and Gimpel, Kevin},
journal={arXiv preprint arXiv:1606.08415},
year={2016}
}
```