https://github.com/mrshibly/minigpt-from-scratch
A complete implementation of a GPT-style decoder-only Transformer language model built from scratch in PyTorch, including BPE tokenization, dataset preprocessing, training pipeline, mixed-precision training, checkpointing, and text generation.
https://github.com/mrshibly/minigpt-from-scratch
deep-learning gpt language-model llm machine-learning minigpt nlp pyt self-attention transformer
Last synced: about 1 month ago
JSON representation
A complete implementation of a GPT-style decoder-only Transformer language model built from scratch in PyTorch, including BPE tokenization, dataset preprocessing, training pipeline, mixed-precision training, checkpointing, and text generation.
- Host: GitHub
- URL: https://github.com/mrshibly/minigpt-from-scratch
- Owner: mrshibly
- Created: 2026-03-13T20:53:17.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2026-03-13T21:41:36.000Z (3 months ago)
- Last Synced: 2026-03-14T09:21:22.351Z (3 months ago)
- Topics: deep-learning, gpt, language-model, llm, machine-learning, minigpt, nlp, pyt, self-attention, transformer
- Language: Python
- Homepage:
- Size: 3.22 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files: