https://github.com/es7/gpt-from-scratch
This repository implements the GPT architecture from scratch, providing the code to build the model and a demonstration of how to train it.
- Host: GitHub
- URL: https://github.com/es7/gpt-from-scratch
- Owner: ES7
- License: MIT
- Created: 2024-06-22T08:56:06.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-06-22T09:17:55.000Z (almost 2 years ago)
- Last Synced: 2025-01-11T08:17:18.025Z (over 1 year ago)
- Topics: deep-learning, gpt2, large-language-models, llm, machine-learning, natural-language-processing
- Language: Jupyter Notebook
- Homepage:
- Size: 493 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# GPT-from-Scratch