An open API service indexing awesome lists of open source software.

https://github.com/jancarauma/nanogpt

nanoGPT - A simple GPT-Style Transformer from Scratch in PyTorch
https://github.com/jancarauma/nanogpt

artificial-intelligence attention-mechanism deep-learning gpt language-model language-modeling machine-learning nano-gpt natural-language-processing neural-networks nlp pytorch self-attention text-generation transformer

Last synced: 27 days ago
JSON representation

nanoGPT - A simple GPT-Style Transformer from Scratch in PyTorch

Awesome Lists containing this project

README

          

# nanoGPT

A minimalist implementation of a GPT-style transformer model built from scratch using PyTorch.

This project demonstrates the core components of transformer architecture, including multi-head self-attention, positional embeddings, and causal masking, trained on a small custom dataset for simple conversational tasks.
Designed for educational purposes and quick experimentation with language modeling fundamentals.

![image](https://github.com/user-attachments/assets/701bed97-82a3-4314-a2ba-157397ecfe9e)