awesome-llm
  
  
    Awesome series for Large Language Model(LLM)s 
    https://github.com/KennethanCeyer/awesome-llm
  
        Last synced: 2 days ago 
        JSON representation
    
- 
            
Materials
- 
                    
Posts
- BloombergGPT (50B) - Announced by Bloomberg / 2023
 - Gemma - Gemma: Introducing new state-of-the-art open models / 2024
 - Grok-1 - Open Release of Grok-1 / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - Luminous (13B) - Announced by Aleph Alpha / 2021
 - Turing NLG (17B) - Announced by Microsoft / 2020
 - Claude (52B) - Announced by Anthropic / 2021
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - AlexaTM (20B - Announced by Amazon / 2023
 - Dolly (6B) - Announced by Databricks / 2023
 - Jurassic-1 - Announced by AI21 / 2022
 - Jurassic-2 - Announced by AI21 / 2023
 - Koala - Announced by Berkeley Artificial Intelligence Research(BAIR) / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - Minerva (Parameter size unannounced) - Announced by Google / 2022
 - Grok-1.5 - Announced by XAI / 2024
 - DBRX - Announced by Databricks / 2024
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - Llama 2 (70B) - Announced by Meta / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - Luminous (13B) - Announced by Aleph Alpha / 2021
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - Claude (52B) - Announced by Anthropic / 2021
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - BloombergGPT (50B) - Announced by Bloomberg / 2023
 - Grok-2 - Announced by XAI / 2025
 
 - 
                    
Papers
- Megatron-Turing NLG (530B) - Announced by NVIDIA and Microsoft / 2021
 - LaMDA (137B) - Announced by Google / 2021
 - PaLM (540B) - Announced by Google / 2022
 - AlphaCode (41.4B) - Announced by DeepMind / 2022
 - Chinchilla (70B) - Announced by DeepMind / 2022
 - Sparrow (70B) - Announced by DeepMind / 2022
 - NLLB (54.5B) - Announced by Meta / 2022
 - LLaMA (65B) - Announced by Meta / 2023
 - AlexaTM (20B) - Announced by Amazon / 2022
 - Galactica (120B) - Announced by Meta / 2022
 - LIMA - Announced by Meta / 2023
 - DeekSeek-R1 (631B) - Announced by DeepSeek-AI / 2025
 - Gopher (280B) - Announced by DeepMind / 2021
 - GLaM (1.2T) - Announced by Google / 2021
 
 - 
                    
Projects
- BigScience - Maintained by HuggingFace ([Twitter](https://twitter.com/BigScienceLLM)) ([Notion](https://bigscience.notion.site/BLOOM-BigScience-176B-Model-ad073ca07cdf479398d5f95d88e218c4))
 - HuggingChat - Maintained by HuggingFace / 2023
 - OpenAssistant - Maintained by Open Assistant / 2023
 - Eleuther AI Language Model - Maintained by Eleuther AI / 2023
 - Falcon LLM - Maintained by Technology Innovation Institute / 2023
 - StableLM - Maintained by Stability AI / 2023
 - HuggingChat - Maintained by HuggingFace / 2023
 - Gemma - Maintained by Google / 2024
 
 - 
                    
HuggingFace repositories
- Vicuna Delta v0 - An open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT.
 - MPT 7B - A decoder-style transformer pre-trained from scratch on 1T tokens of English text and code. This model was trained by MosaicML.
 - OpenAssistant SFT 6 - 30 billion LLaMa-based model made by HuggingFace for the chatting conversation.
 - Falcon 7B - A 7B parameters causal decoder-only model built by TII and trained on 1,500B tokens of RefinedWeb enhanced with curated corpora.
 
 - 
                    
Reading materials
 - 
                    
GitHub repositories
- Stanford Alpaca -  - A repository of Stanford Alpaca project, a model fine-tuned from the LLaMA 7B model on 52K instruction-following demonstrations.
 - Dolly -  - A large language model trained on the Databricks Machine Learning Platform.
 - dalai -  - The cli tool to run LLaMA on the local machine.
 - LLaMA-Adapter -  - Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters.
 - alpaca-lora -  - Instruct-tune LLaMA on consumer hardware.
 - openai/evals -  - A curated list of reinforcement learning with human feedback resources.
 - trlx -  - A repo for distributed training of language models with Reinforcement Learning via Human Feedback. (RLHF)
 - pythia -  - A suite of 16 LLMs all trained on public data seen in the exact same order and ranging in size from 70M to 12B parameters.
 - Embedchain -  - Framework to create ChatGPT like bots over your dataset.
 - google-deepmind/gemma -  - Open weights LLM from Google DeepMind.
 - llama_index -  - A project that provides a central interface to connect your LLM's with external data.
 - DeepSeek-R1 -  - A first-generation reasoning model from DeepSeek-AI.
 - AutoGPT -  - An experimental open-source attempt to make GPT-4 fully autonomous.
 
 
 - 
                    
 - 
            
Models
- 
                    
Open models
- T5 (11B) - Announced by Google / 2020
 - T5 FLAN (540B) - Announced by Google / 2022
 - T0 (11B) - Announced by BigScience (HuggingFace) / 2021
 - OPT-175B (175B) - Announced by Meta / 2022
 - Bloom (176B) - Announced by BigScience (HuggingFace) / 2022
 - BERT-Large (336M) - Announced by Google / 2018
 - Macaw (11B) - Announced by AI2 / 2021
 - Stanford Alpaca (7B) - Announced by Stanford University / 2023
 - GPT-NeoX 2.0 (20B) - Announced by EleutherAI / 2023
 - UL2 (20B) - Announced by Google / 2022
 - Gemma 3 (1B, 4B, 12B, 27B) - Announced by DeepSeek / 2025
 - LLaMA 3 (8B, 70B) - Announced by Meta / 2024
 - Mistral 7B - Announced by Mistral AI / 2023
 - Solar (10.7B) - Announced by Upstage / 2023
 - DBRX (132B) - Announced by Databricks / 2024
 - Dolly (6B, 12B) - Announced by Databricks / 2023
 - GPT-NEOX 20B - Announced by EleutherAI / 2023
 
 - 
                    
Commercial models
- ChatGPT Plus (175B) - Announced by OpenAI / 2023
 - GPT 3.5 (175B, text-davinci-003) - Announced by OpenAI / 2022
 - Gemini - Announced by Google Deepmind / 2023
 - ChatGPT (175B) - Announced by OpenAI / 2022
 - ChatGPT Plus (175B) - Announced by OpenAI / 2023
 - Codex (11B) - Announced by OpenAI / 2021
 - Bard - Announced by Google / 2023
 - GPT 4 (Parameter size unannounced, gpt-4-32k) - Announced by OpenAI / 2023
 
 - 
                    
Projects
- LMOps - Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities.
 - Visual ChatGPT - Announced by Microsoft / 2023
 
 
 - 
                    
 - 
            
Datasets
 - 
            
Benchmarks
 
            Programming Languages
          
          
        
            Categories
          
          
        
            Sub Categories
          
          
        
            Keywords
          
          
              
                language-model
                3
              
              
                llm
                3
              
              
                gpt
                2
              
              
                deep-learning
                1
              
              
                visual-analysis
                1
              
              
                leaderboard
                1
              
              
                dataset
                1
              
              
                x-prompt
                1
              
              
                promptist
                1
              
              
                prompt
                1
              
              
                pretraining
                1
              
              
                nlp
                1
              
              
                lmops
                1
              
              
                lm
                1
              
              
                agi
                1
              
              
                transformers
                1
              
              
                gpt-3
                1
              
              
                deepspeed-library
                1
              
              
                vector-database
                1
              
              
                rag
                1
              
              
                multi-agents
                1
              
              
                llamaindex
                1
              
              
                framework
                1
              
              
                fine-tuning
                1
              
              
                data
                1
              
              
                application
                1
              
              
                agents
                1
              
              
                reinforcement-learning
                1
              
              
                pytorch
                1
              
              
                machine-learning
                1
              
              
                llama
                1
              
              
                ai
                1
              
              
                dolly
                1
              
              
                databricks
                1
              
              
                chatbot
                1
              
              
                instruction-following
                1