https://github.com/hesamsheikh/llm-mechanics
Coding an LLM and its building blocks from scratch.
https://github.com/hesamsheikh/llm-mechanics
gpt large-language-model llm natural-language-processing python pytorch
Last synced: 8 months ago
JSON representation
Coding an LLM and its building blocks from scratch.
- Host: GitHub
- URL: https://github.com/hesamsheikh/llm-mechanics
- Owner: hesamsheikh
- Created: 2024-12-19T11:20:08.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2025-01-27T17:24:59.000Z (9 months ago)
- Last Synced: 2025-01-27T18:34:42.181Z (9 months ago)
- Topics: gpt, large-language-model, llm, natural-language-processing, python, pytorch
- Language: Jupyter Notebook
- Homepage: https://x.com/Hesamation
- Size: 2.37 MB
- Stars: 15
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# LLM Mechanics (implementing from scratch)
This repository is my personal playground to implement large language models (LLM) and their building blocks from scratch using PyTorch. If you like to know the theory behind how LLMs operate, check out my [Advanced ML Learning Journal](https://github.com/hesamsheikh/ml-retreat).
![]()