Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-llm-list

An overview of Large Language Model (LLM) options
https://github.com/Barnacle-ai/awesome-llm-list

Last synced: 5 days ago
JSON representation

GEN-AI FOR DEVELOPERS
GPT4V ALTERNATIVES
- Fuyu-8B
- BakLLaVA
- CogVLM
- Qwen-VL
CUSTOM GPTs
EDUCATION
LEADERBOARDS
INFERENCING FRAMEWORKS
OPEN SOURCE MODELS
- φ [Phi-2](https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/)
  - Microsoft
- 🌬️ [Mistral 8x7B](https://mistral.ai)
  - Mistral
- 🐦‍⬛ [Starling](https://starling.cs.berkeley.edu)
- 1️⃣ [Yi](https://01.ai)
  - 01.AI
  - request
- 🐳 [Orca 2](https://www.microsoft.com/en-us/research/blog/orca-2-teaching-small-language-models-how-to-reason/)
  - MS Research License
- 🏯 [Qwen](https://huggingface.co/Qwen)
  - Tongyi Qianwen
  - Tongyi Qianwen
- 🐋 [Stable Beluga](https://stability.ai/blog/stable-beluga-large-instruction-fine-tuned-models)
  - CC BY-NC-4.0
- 🦙 [Stanford Alpaca](https://crfm.stanford.edu/2023/03/13/alpaca.html)
  - Stanford
  - blog post - tuning process. They used 4x A100 80GB GPUs for 1.5 hours. For total training cost, the cost of training the underlying LLaMA model also needs to be taken into account.
- 🌸 [Bloom](https://bigscience.huggingface.co/blog/bloom)
  - BigScience Rail License
- 🦙 [IBM Dromedary](https://github.com/IBM/Dromedary)
  - IBM
  - IBM
- 🌬️ [Notus](https://argilla.io/blog/notus7b/)
  - Argilla - tuneed from Mistral
- G [Gemma](https://blog.google/technology/developers/gemma-open-models/)
  - Google
  - Gemma
- 🦙 [LLaMA2](https://ai.meta.com/llama/)
  - Llama 2 Community License Agreement
- 🧱 [Databricks Dolly 2](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm)
  - CC BY-SA-4.0
- 🧠 [Cerebras-GPT](https://www.cerebras.net/blog/cerebras-gpt-a-family-of-open-compute-efficient-large-language-models/)
  - https://arxiv.org/abs/2304.03208
  - Cerebras
- 🍮 [Google FLAN-T5](https://ai.googleblog.com/2020/02/exploring-transfer-learning-with-t5.html)
  - Google
  - Apache 2.0
- 🧩 [MosaicML MPT-7B](https://www.mosaicml.com/blog/mpt-7b)
  - CC-By-NC-SA-4.0
  - CC-By-SA-3.0
- 🍃 [Zephyr](https://huggingface.co/collections/HuggingFaceH4/zephyr-7b-6538c6d6d5ddd1cbb1744a66)
  - MIT
COMMERCIAL MODELS
- [IBM](https://www.ibm.com/products/watsonx-ai)
LLM CHAT
RESEARCH PAPERS
BENCHMARKS
- GAIA
- ARC
- HellaSwag
- MMLU
- TruthfulQA
- Winogrande
- GSM8K
- DROP
- IDEFICS
- Winogrande

Programming Languages

Python 8 Jupyter Notebook 3 C++ 1 MDX 1

Categories

RESEARCH PAPERS 28 OPEN SOURCE MODELS 28 EDUCATION 16 BENCHMARKS 10 LLM CHAT 9 CUSTOM GPTs 7 LEADERBOARDS 7 GEN-AI FOR DEVELOPERS 6 INFERENCING FRAMEWORKS 5 GPT4V ALTERNATIVES 4 COMMERCIAL MODELS 3

Sub Categories

🐦‍⬛ [Starling](https://starling.cs.berkeley.edu) 3 [IBM](https://www.ibm.com/products/watsonx-ai) 3 🦙 [Stanford Alpaca](https://crfm.stanford.edu/2023/03/13/alpaca.html) 2 🦙 [IBM Dromedary](https://github.com/IBM/Dromedary) 2 🍮 [Google FLAN-T5](https://ai.googleblog.com/2020/02/exploring-transfer-learning-with-t5.html) 2 1️⃣ [Yi](https://01.ai) 2 🧠 [Cerebras-GPT](https://www.cerebras.net/blog/cerebras-gpt-a-family-of-open-compute-efficient-large-language-models/) 2 🧩 [MosaicML MPT-7B](https://www.mosaicml.com/blog/mpt-7b) 2 🏯 [Qwen](https://huggingface.co/Qwen) 2 G [Gemma](https://blog.google/technology/developers/gemma-open-models/) 2 🐳 [Orca 2](https://www.microsoft.com/en-us/research/blog/orca-2-teaching-small-language-models-how-to-reason/) 1 🐋 [Stable Beluga](https://stability.ai/blog/stable-beluga-large-instruction-fine-tuned-models) 1 φ [Phi-2](https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/) 1 🌸 [Bloom](https://bigscience.huggingface.co/blog/bloom) 1 🧱 [Databricks Dolly 2](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm) 1 🍃 [Zephyr](https://huggingface.co/collections/HuggingFaceH4/zephyr-7b-6538c6d6d5ddd1cbb1744a66) 1 🌬️ [Notus](https://argilla.io/blog/notus7b/) 1 🌬️ [Mistral 8x7B](https://mistral.ai) 1 🦙 [LLaMA2](https://ai.meta.com/llama/) 1

Keywords

deep-learning 5 inference 4 llm 4 large-language-models 3 language-model 3 gpt 3 chatgpt 3 pytorch 3 generative-ai 2 falcon 2 openai 2 ai 2 prompt-engineering 2 transformer 2 nlp 2 mlops 2 llmops 2 llm-serving 2 llama 2 cuda 2 machine-learning 1 roadmap 1 hallucinations 1 evaluation 1 foundation-models 1 instruction-following 1 leaderboard 1 rlhf 1 amd 1 hpu 1 agentgpt 1 ai-agents 1 awesome-gpt-store 1 awesome-gpts 1 awesome-list 1 chatgpt-api 1 chatgpt-plugins 1 customgpt 1 gpt-4 1 gpt-store 1 gpts 1 gptshowcas 1 gptslist 1 gptstore 1 azure 1 dall-e 1 generativeai 1 llms 1 microsoft-for-beginners 1 semantic-search 1