Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-llm-list
An overview of Large Language Model (LLM) options
https://github.com/Barnacle-ai/awesome-llm-list
GEN-AI FOR DEVELOPERS
GPT4V ALTERNATIVES
CUSTOM GPTs
EDUCATION
- Generative AI for Beginners - Microsoft
- Prompt Engineering Guide
- Stanford: CS324 - Large Language Models
- Large Language Model Course
- Brex's Prompt Engineering Guide
- OpenAI Prompt Engineering Guide
- LLM Bootcamp
- Best practices for prompt engineering with OpenAI API
- Lil'Log: Prompt Engineering
- Prompt Engineering Guide
- Cohere LLM University
- DeepLearning.AI: ChatGPT Prompt Engineering for Developers
- DeepLearning.AI: Learn the fundamentals of generative AI for real-world applications
- DeepLearning.AI: LangChain for LLM Application Development
- Princeton: COS 597G - Understanding Large Language Models
- Machine Learning Engineering Online Book
LEADERBOARDS
INFERENCING FRAMEWORKS
OPEN SOURCE MODELS
φ [Phi-2](https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/)
🌬️ [Mistral 8x7B](https://mistral.ai)
🐦⬛ [Starling](https://starling.cs.berkeley.edu)
1️⃣ [Yi](https://01.ai)
🐳 [Orca 2](https://www.microsoft.com/en-us/research/blog/orca-2-teaching-small-language-models-how-to-reason/)
🏯 [Qwen](https://huggingface.co/Qwen)
🐋 [Stable Beluga](https://stability.ai/blog/stable-beluga-large-instruction-fine-tuned-models)
🦙 [Stanford Alpaca](https://crfm.stanford.edu/2023/03/13/alpaca.html)
🌸 [Bloom](https://bigscience.huggingface.co/blog/bloom)
🦙 [IBM Dromedary](https://github.com/IBM/Dromedary)
🌬️ [Notus](https://argilla.io/blog/notus7b/)
- Argilla - tuned from Mistral
G [Gemma](https://blog.google/technology/developers/gemma-open-models/)
🧱 [Databricks Dolly 2](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm)
🧠 [Cerebras-GPT](https://www.cerebras.net/blog/cerebras-gpt-a-family-of-open-compute-efficient-large-language-models/)
🧩 [MosaicML MPT-7B](https://www.mosaicml.com/blog/mpt-7b)
🍮 [Google FLAN-T5](https://ai.googleblog.com/2020/02/exploring-transfer-learning-with-t5.html)
🍃 [Zephyr](https://huggingface.co/collections/HuggingFaceH4/zephyr-7b-6538c6d6d5ddd1cbb1744a66)
COMMERCIAL MODELS
[IBM](https://www.ibm.com/products/watsonx-ai)
LLM CHAT
RESEARCH PAPERS
- Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
- AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
- Memory Augmented Large Language Models are Computationally Universal
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters
- A Survey of Large Language Models
- A Comprehensive Overview of Large Language Models
- Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
- Aligning Large Language Models with Human: A Survey
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
- Generative Agents: Interactive Simulacra of Human Behavior
- QLoRA: Efficient Finetuning of Quantized LLMs
- Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned
- Scaling Instruction-Finetuned Language Models
- Constitutional AI: Harmlessness from AI Feedback
- What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?
- Training language models to follow instructions with human feedback
- Emergent Abilities of Large Language Models
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
- LoRA: Low-Rank Adaptation of Large Language Models
- The Power of Scale for Parameter-Efficient Prompt Tuning
- On the Opportunities and Risks of Foundation Models
- Language Models are Few-Shot Learners
- Scaling Laws for Neural Language Models
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Attention Is All You Need
BENCHMARKS