https://github.com/losttech/gpt
GPT-2 port to C#
- Host: GitHub
- URL: https://github.com/losttech/gpt
- Owner: losttech
- License: MIT
- Created: 2021-03-28T20:39:59.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2022-09-27T20:01:51.000Z (over 3 years ago)
- Last Synced: 2025-04-15T10:56:48.518Z (10 months ago)
- Language: C#
- Size: 591 KB
- Stars: 10
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
C# implementation of [GPT-2](https://en.wikipedia.org/wiki/GPT-2).
NuGet package: [LostTech.TensorFlow.GPT](https://www.nuget.org/packages/LostTech.TensorFlow.GPT/)
## Known issues
### CUDA out of host memory
TensorFlow's default GPU memory allocator appears to consume more memory than it needs.
If you know you have enough RAM and GPU memory, setting the `TF_GPU_ALLOCATOR` environment variable to `cuda_malloc` might help.
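
As a minimal sketch, assuming a POSIX shell such as bash, the variable can be exported before the application is launched, so the process inherits it at startup. The `echo` line is only a sanity check and is not required.

```shell
# Select the cuda_malloc allocator for every process started from this shell session.
export TF_GPU_ALLOCATOR=cuda_malloc

# Sanity check: show the value that child processes will inherit.
echo "TF_GPU_ALLOCATOR=$TF_GPU_ALLOCATOR"
```

Start the application from the same shell session afterwards. In a Windows Command Prompt the equivalent is `set TF_GPU_ALLOCATOR=cuda_malloc`.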