Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-LLM-resourses
🧑🚀 Summary of the world's best LLM resources.
https://github.com/WangRongsheng/awesome-LLM-resourses
-
Evaluation
- aisuite
- DeepSeek-v3
- lm-evaluation-harness - shot evaluation of language models.
- opencompass - 4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
- llm-comparator - by-side, developed.
- Ollama Benchmark
- Volcano Engine (火山引擎)
- Wenxin Qianfan (文心千帆)
- DashScope
- Groq
- SiliconFlow (硅基流动)
- VLMEvalKit - source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks.
- EvalScope
- Weave
- Evaluation guidebook
- MixEval
- AGI-Eval
- DeerAPI
- Qwen-Chat
-
Knowledge Base (RAG)
- LightRAG - Agent-Generator pipelines.
- RAGLite - Augmented Generation (RAG) with PostgreSQL or SQLite.
- GraphRAG-Ollama-UI
- MiniRAG - augmented generation framework that enables small models to achieve good RAG performance through heterogeneous graph indexing and lightweight topology-enhanced retrieval.
- FlashRAG
- AnythingLLM - in-one AI app for any LLM with full RAG and AI Agent capabilities.
- MaxKB
- RAGFlow - source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
- Dify - source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
- FastGPT - based platform built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization.
- Langchain-Chatchat
- QAnything
- Quivr - augmented generation.
- Verba
- GraphRAG - based Retrieval-Augmented Generation (RAG) system.
- TEN - Gen AI-Agent Framework, the world's first truly real-time multimodal AI agent framework.
- nano-GraphRAG - to-hack GraphRAG implementation.
- RAG Techniques - Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
- ragas
- kotaemon - source clean & customizable RAG UI for chatting with your documents. Built with both end users and developers in mind.
- Fast-GraphRAG
- AutoRAG
- RAGapp
- KAG - guided reasoning and retrieval framework based on OpenSPG engine and LLMs.
- Tiny-GraphRAG
- XRAG - Augmented Generation (RAG) systems.
- TurboRAG - Augmented Generation with Precomputed KV Caches for Chunked Text.
- LightRAG - Augmented Generation.
- DB-GPT GraphRAG - DB-GPT GraphRAG integrates both triplet-based knowledge graphs and document structure graphs while leveraging community and document retrieval mechanisms to enhance RAG capabilities, achieving comparable performance while consuming only 50% of the tokens required by Microsoft's GraphRAG. Refer to the DB-GPT [Graph RAG User Manual](http://docs.dbgpt.cn/docs/cookbook/rag/graph_rag_app_develop/) for details.
- Chonkie - nonsense RAG chunking library that's lightweight, lightning-fast, and ready to CHONK your texts.
- RAG-GPT - RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information retrieval.
- CAG
-
Paper
- Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
- TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
- Yi-Lightning Technical Report
- Qwen2.5 Technical Report
- YuLan-Mini: An Open Data-efficient Language Model
- An Introduction to Vision-Language Modeling
- Huggingface Daily Papers
- OLMoE: Open Mixture-of-Experts Language Models
- The Llama 3 Herd of Models
- Qwen Technical Report
- Hermes-3-Technical-Report
- Qwen2 Technical Report
- DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
- Baichuan 2: Open Large-scale Language Models
- DataComp-LM: In search of the next generation of training sets for language models
- OLMo: Accelerating the Science of Language Models
- MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
- Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
- Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
- Jamba: A Hybrid Transformer-Mamba Language Model
- Textbooks Are All You Need
- Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
- Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
- Baichuan Alignment Technical Report
- Qwen2-vl Technical Report
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- Baichuan-Omni Technical Report
- Model Merging Paper
- 1.5-Pints Technical Report: Pretraining in Days, Not Months – Your Language Model Thrives on Quality Data
- Phi-4 Technical Report
- Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
- 2 OLMo 2 Furious
-
Tips
- LLM-Dojo - an open-source place to learn large models, building a model-training framework with concise, readable code
- o1 isn’t a chat model (and that’s the point)
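- Beam Search: a quick explanation with a code walkthrough
- Diverse text generation with the transformers generate() method: parameter meanings and algorithm principles explained
- An easy introduction to large language models (LLMs)
- LLM training: pretrain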
- Zero-Qwen-VL
- MiniMind
- Knowledge distillation: Teaching LLM's with synthetic data
- Part 1: Methods for adapting large language models
- Part 2: To fine-tune or not to fine-tune
- Part 3: How to fine-tune: Focus on effective datasets
- Reader-LM: Small Language Models for Cleaning and Converting HTML to Markdown
- Distributed Training Guide
- Chat Templates
- LLMs for Text Classification: A Guide to Supervised Learning
- Unsupervised Text Classification: Categorize Natural Language With LLMs
- Text Classification With LLMs: A Roundup of the Best Methods
- Tiny LLM Universe
- Zero-Chatgpt
- LLM Pricing
- Uncensor any LLM with abliteration
- finetune-Qwen2-VL
- build_MiniLLM_from_scratch
- Tiny LLM zh
- Top 20+ RAG Interview Questions
- Lessons from a year of building LLM applications
- What We Learned from a Year of Building with LLMs (Part I)
- What We Learned from a Year of Building with LLMs (Part II)
- What We Learned from a Year of Building with LLMs (Part III): Strategy
- pytorch-llama
- Preference Optimization for Vision Language Models with TRL
- Fine-tuning visual language models using SFTTrainer
- A Visual Guide to Mixture of Experts (MoE)
- MPP-LLaVA
- LLM-Travel
- Role-Playing in Large Language Models like ChatGPT
-
Book
- Foundations of Large Language Models
- 《Large Language Models: From Theory to Practice》 (大规模语言模型:从理论到实践)
- 《Large Language Models》 (大语言模型)
- 《Hands-On AI Agents》 (动手做AI Agent)
- 《Dive into LLMs》 (动手学大模型)
- 《Build a Large Language Model (From Scratch)》
- 《Multimodal Large Models》 (多模态大模型)
- 《Understanding Deep Learning》
- 《Illustrated book to learn about Transformers & LLMs》
- 《Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG》
- 《Foundations of Large Models》 (大模型基础)
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software
- 《Hands-On Large Language Models》
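- 《Natural Language Processing: Large Model Theory and Practice》 (自然语言处理:大模型理论与实践)
- 《Hands-On Reinforcement Learning》 (动手学强化学习)
- 《An Introduction to LLMs for Developers》 (面向开发者的LLM入门教程)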
- 《Generative AI Handbook: A Roadmap for Learning Resources》
-
Course
- HuggingFace Learn
- Hung-yi Lee: GenAI course
- HuggingFace NLP Course
- Tsinghua NLP (Zhiyuan Liu's team): open course on large models
- Mistral: Getting Started with Mistral
- Knowledge Graphs for RAG
- OpenRAG
- The Road to AGI (通往AGI之路)
- Stanford CS224N: Natural Language Processing with Deep Learning
- Andrew Ng: Generative AI for Everyone
- Andrew Ng: LLM series of courses
- ACL 2023 Tutorial: Retrieval-based Language Models and Applications
- Microsoft: Generative AI for Beginners
- Stanford CS324: Large Language Models
- openai-cookbook
- mistralai-cookbook
- build nanoGPT
- LLMs From Scratch (Datawhale Version)
- Multimodal RAG: Chat with Videos
- Large Language Model Agents
- Cohere LLM University
- LLMs and Transformers
- Smol Vision
- Learn RAG From Scratch – Python AI Tutorial from a LangChain Engineer
- Microsoft: State of GPT
- Stanford CS25: Transformers United V4
- Princeton COS 597G (Fall 2022): Understanding Large Language Models
- Johns Hopkins CS 601.471/671 NLP: Self-supervised Models
- University of Waterloo CS 886: Recent Advances on Foundation Models
- LLM101n
- Interactive visualization of Transformer
- andysingal/llm-course
- LM-class
- Google Advanced: Generative AI for Developers Learning Path
- Anthropic: Prompt Engineering Interactive Tutorial
- Hands on llms - time financial advisor LLM system.
- LangGPT
- LLMs Interview Note
- LLMsBook
- Coursera: Prompt Engineering for ChatGPT
- LLM Evaluation: A Complete Course
- Introduction to Generative AI 2024 Spring
- llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
- RAG++ : From POC to production
- Weights & Biases AI Academy
- Prompt Engineering & AI tutorials & Resources
- LLM Resources Hub
-
Tutorial
- AI-Guide-and-Demos
- AI Developer Channel (AI开发者频道)
- Bilibili: 五里墩茶社
- Bilibili: 木羽Cheney
- Bilibili: 漆妮妮
- Prompt Engineering Guide
- Hands-On LLM Application Development (动手学大模型应用开发)
- Zhihu: ybq
- YouTube: Hyung Won Chung
- Blog: Tejaswi kashyap
- Blog: Xiaosheng's Blog (小昇的博客)
- Blog: GbyAI
- Bilibili: TechBeat人工智能社区
- Bilibili: 黄益贺
- LLM Visualization
- Bilibili: AI老兵文哲
- YouTube: IBM Technology
- YouTube: Unify Reading Paper Group
- Chip Huyen
- How Much VRAM
- Blog: Scientific Spaces (科学空间, Su Jianlin)
- Zhihu: 原石人类
- Bilibili: 小黑黑讲AI
- Bilibili: 面壁的车辆工程师
- Blog: mlabonne
- Blog: Lil’Log (OpenAI)
- LLM-Action
- Bilibili: 深度学习自然语言处理
- W&B articles
- Huggingface Blog
- Bilibili: 毛玉仁
-
Data
- datasketch
- semhash
- ReaderLM-v2
- Bespoke Curator - Training & Structured Data Extraction.
- AotoLabel
- LabelLLM - Source Data Annotation Platform.
- OmniParser
- MinerU - stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.
- PDF-Extract-Kit - Quality PDF Content Extraction.
- Docling
- GOT-OCR2.0
- Zerox - 4o-mini.
- data-juicer - stop data processing system to make data higher-quality, juicier, and more digestible for LLMs!
- Parsera - sites with LLMs.
- Sparrow - source solution for efficient data extraction and processing from various documents and images.
- pdf-extract-api
- MegaParse
- DocLayout-YOLO - to-Local Adaptive Perception.
- TensorZero
- Promptwright
- LLM Decontaminator
- DataTrove
- llm-swarm
- LangKit - source toolkit for monitoring Large Language Models (LLMs). Extracts signals from prompts & responses, ensuring safety & security.
- Tabled
- Distilabel
- Common-Crawl-Pipeline-Creator
- pdf2htmlEX
- Extractous
- MarkItDown
-
Fine-Tuning
- Online-RLHF
- InternEvo - sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
- 360-LLaMA-Factory - Tuning of 100+ LLMs. (add Sequence Parallelism for supporting long context training)
- AutoTrain - of-the-art Machine Learning models.
- workbench-llamafactory - to-end model development workflow using Llamafactory.
- TRL
- LLaMA-Factory - Tuning of 100+ LLMs.
- unsloth - 5X faster 80% less memory LLM finetuning.
- Firefly
- Xtuner - featured toolkit for fine-tuning large models.
- torchtune - PyTorch Library for LLM Fine-tuning.
- Ludwig - code framework for building custom LLMs, neural networks, and other AI models.
- aikit - tune, build, and deploy open-source LLMs easily!
- H2O-LLMStudio - a framework and no-code GUI for fine-tuning LLMs.
- LitGPT - of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
- LLMBox
- PaddleNLP - to-use and powerful NLP and LLM library.
- TinyLLaVA Factory - scale Large Multimodal Models.
- LLM-Foundry
- ChatLearn - scale alignment.
- Meta Lingua - to-hack codebase to research LLMs.
- mistral-finetune - weight codebase that enables memory-efficient and performant finetuning of Mistral's models.
- lmms-finetune - 1.5, qwen-vl, llava-interleave, llava-next-video, phi3-v etc.
- Simplifine
- Transformer Lab - tune, and evaluate large language models on your own computer.
- Liger-Kernel
- Vision-LLM Alignment - based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
- OpenRLHF - to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral).
- nanotron - parallelism training.
- Proxy Tuning
- Effective LLM Alignment
- Autotrain-advanced
-
Agents
- LinkAI
- Baidu APPBuilder
- AutoGen
- AgentGPT
- XAgent
- MobileAgent
- Lagent - based agents.
- Qwen-Agent
- LazyLLM
- AgentScope - empowered multi-agent applications in an easier way.
- MoA - of-the-art results.
- Agently
- Tribe - agent teams.
- CAMEL - agent framework and an open-source community dedicated to finding the scaling law of agents.
- IoA - source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
- Agent Zero
- Agents - source Framework for Data-centric, Self-evolving Autonomous Language Agents.
- FastAgency - agent workflows to production.
- agentUniverse - agent framework that allows developers to easily build multi-agent applications. Furthermore, through the community, they can exchange and share practices of patterns across different domains.
- OmAgent
- Agent-S
- PydanticAI
- CrewAI - playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
- Coze
- Swarm - agent systems. Managed by OpenAI Solutions team. Experimental framework.
- llama-agentic-system
- PraisonAI - code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collaboration.
- Agentarium - source framework for creating and managing simulations populated with AI-powered agents.
- smolagents
-
Usage
-
Course
-
Tutorial
-
Software
- Immersive Translate (沉浸式翻译)
- POT
- Bob
- OpenAI Translator Bob Plugin
-
Inference
- ollama
- Open WebUI - friendly WebUI for LLMs (Formerly Ollama WebUI).
- Text Generation WebUI
- Xinference
- LangChain - aware reasoning applications.
- LlamaIndex
- TensorRT-LLM - TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
- LlamaChat
- RouteLLM - save LLM costs without compromising quality!
- MInference
- Mem0
- SGLang
- GuideLLM
- LLM-Engines - source models (VLLM, SGLang, Together) and commercial models (OpenAI, Mistral, Claude).
- lobe-chat - source, modern-design LLMs/AI chat framework. Supports Multi AI Providers, Multi-Modals (Vision/TTS) and plugin system.
- vllm - throughput and memory-efficient inference and serving engine for LLMs.
- NVIDIA ChatRTX
- LM Studio
- chat-with-mlx
- LLM Pricing
- Open Interpreter
- Chat-ollama
- chat-ui
- MemGPT - term memory and custom tools.
- koboldcpp - file way to run various GGML and GGUF models with KoboldAI's UI.
- LLMFarm
- enchanted
- Flowise
- Jan - LLM).
- LMDeploy
- AirLLM
- LLMHub
- YuanChat
- LiteLLM
- OARC - TTS speech models, Keras classifiers, Llava vision, Whisper recognition, and more to create a unified chatbot agent for local, custom automation.
- g1 - 3.1 70b on Groq to create o1-like reasoning chains.
- MemoryScope - term memory capabilities, offering a framework for building such abilities.
- OpenLLM - source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
- Infinity - native database built for LLM applications, providing incredibly fast hybrid search of dense embedding, sparse embedding, tensor and full-text.
- optillm - of-the-art techniques that can improve the accuracy and performance of LLMs.
- LocalAI - hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required.
- ZhiLight
- LLaMA Box
- DashInfer - leading performance atop various hardware architectures.
-
RAG
- LightRAG - Agent-Generator pipelines.
-
Search
- OpenSearch GPT
- nanoPerplexityAI - source implementation of perplexity.ai.
- curiosity - like user experience.
- MindSearch - based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT).
-
Community
Keywords
- llm: 73
- gpt: 31
- llama: 30
- ai: 29
- rag: 26
- llms: 23
- openai: 21
- large-language-models: 21
- python: 20
- chatgpt: 18
- agent: 17
- llama3: 16
- chatbot: 14
- gpt-4: 13
- llama2: 13
- pytorch: 12
- mistral: 12
- machine-learning: 10
- llmops: 10
- fine-tuning: 10
- langchain: 9
- ollama: 9
- qwen: 9
- deep-learning: 9
- llava: 8
- generative-ai: 8
- transformers: 8
- nlp: 8
- gemma: 7
- llm-training: 7
- framework: 7
- llm-inference: 7
- pdf: 7
- artificial-intelligence: 7
- finetuning: 6
- chatglm: 6
- lora: 6
- retrieval-augmented-generation: 6
- language-model: 5
- agents: 5
- cuda: 5
- llamacpp: 5
- qlora: 5
- llm-serving: 5
- transformer: 5
- evaluation: 5
- genai: 5
- huggingface: 5
- llm-agent: 5
- inference: 5