An open API service indexing awesome lists of open source software.

https://github.com/mukeshmithrakumar/llm-poc-2024

Popular Large Language Models from scratch - 2024
https://github.com/mukeshmithrakumar/llm-poc-2024

gpt llama llm llm-inference llm-training transformer

Last synced: 25 days ago
JSON representation

Popular Large Language Models from scratch - 2024

Awesome Lists containing this project

README

          


llm_poc_2024_banner

Large Language Models POC 2024



Python 3.10.14
PyTorch
PyTorch Lightning
Huggingface Transformers
Github Issues
Github Stars


🚧 🚀 WIP... 🚧



About   |  
Models   |  
Helpful Notebooks   |  
Requirements   |  
License   |  
Author


## :dart: About

My goal is to work my way through certain LLM models starting from Transformers to help me understand how each model works and builds from the previous models. Once I have a set of models I am interested in, the next focus will be on fine-tuning and optimizing the models to run on the cheapest hardware possible.

## :sparkles: Models


✅ Transformer

✅ GPT

✅ LLaMA

◻️ LLM Inference Optimization

◻️ In-flight Batching

◻️ Speculative inference

◻️ Key-Value Caching

◻️ PagedAttention

◻️ Pipeline Parallelism

◻️ Tensor Parallelism

◻️ Sequence Parallelism

◻️ Flash Attention

◻️ Quantization

◻️ Sparsity

◻️ Distillation

◻️

◻️

## :fire: Helpful Notebooks


Transformer Arithmetic

[WIP] Transformer Scaling

◻️

## :test_tube: Requirements

Requirements for all the models are stored under a single `requirements.txt` file.

## :memo: License

This project is under license from MIT. For more details, see the [LICENSE](LICENSE) file.

---

Made with :heart: by Mukesh Mithrakumar

 

Back to top