awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结（数据处理、模型训练、模型部署、o1 模型、MCP、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.
https://github.com/WangRongsheng/awesome-LLM-resourses

Last synced: 4 days ago
JSON representation

评估 Evaluation
- aisuite
- DeepSeek-v3
- DeepEval - to-use, open-source LLM evaluation framework, for evaluating and testing large-language model systems.
- Lighteval - in-one toolkit for evaluating LLMs across multiple backends.
- QwQ/eval
- 火山引擎
- 文心千帆
- DashScope
- Groq
- 硅基流动
- VLMEvalKit - source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks.
- Weave
- lm-evaluation-harness - shot evaluation of language models.
- opencompass - 4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
- llm-comparator - by-side, developed.
- EvalScope
- Evaluation guidebook
- Ollama Benchmark
- 硅基流动
- MixEval
- AGI-Eval
- DeerAPI
- Qwen-Chat
- Evalchemy - to-use toolkit for evaluating post-trained language models.
- MathArena
- YourBench
知识库 RAG
- LightRAG - Agent-Generator pipelines.
- MiniRAG - augmented generation framework that enables small models to achieve good RAG performance through heterogeneous graph indexing and lightweight topology-enhanced retrieval.
- FlashRAG
- Quivr - augmented generation.
- Verba
- TEN - Gen AI-Agent Framework, the world's first truly real-time multimodal AI agent framework.
- nano-GraphRAG - to-hack GraphRAG implementation.
- RAG Techniques - Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
- ragas
- kotaemon - source clean & customizable RAG UI for chatting with your documents. Built with both end users and developers in mind.
- Fast-GraphRAG
- AutoRAG
- RAGapp
- Tiny-GraphRAG
- AnythingLLM - in-one AI app for any LLM with full RAG and AI Agent capabilites.
- MaxKB
- RAGFlow - source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
- Dify - source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
- FastGPT - based platform built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization.
- Langchain-Chatchat
- QAnything
- GraphRAG - based Retrieval-Augmented Generation (RAG) system.
- RAGLite - Augmented Generation (RAG) with PostgreSQL or SQLite.
- XRAG - Augmented Generation (RAG) systems.
- TurboRAG - Augmented Generation with Precomputed KV Caches for Chunked Text.
- LightRAG - Augmented Generation.
- DB-GPT GraphRAG - GPT GraphRAG integrates both triplet-based knowledge graphs and document structure graphs while leveraging community and document retrieval mechanisms to enhance RAG capabilities, achieving comparable performance while consuming only 50% of the tokens required by Microsoft's GraphRAG. Refer to the DB-GPT [Graph RAG User Manual](http://docs.dbgpt.cn/docs/cookbook/rag/graph_rag_app_develop/) for details.
- Chonkie - nonsense RAG chunking library that's lightweight, lightning-fast, and ready to CHONK your texts.
- RAG-GPT - GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information retrieval.
- CAG
- GraphRAG-Ollama-UI
- KAG - guided reasoning and retrieval framework based on OpenSPG engine and LLMs.
- Rankify - Ranking, and Retrieval-Augmented Generation.
论文 Paper
- TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
- Yi-Lightning Technical Report
- An Introduction to Vision-Language Modeling
- Gemma 3 Technical Report
- Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining
- Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
- Baichuan Alignment Technical Report
- Huggingface Daily Papers - ai/ML-Papers-Explained)
- Hermes-3-Technical-Report
- The Llama 3 Herd of Models
- Qwen Technical Report
- Qwen2 Technical Report
- Qwen2-vl Technical Report
- DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
- Baichuan 2: Open Large-scale Language Models
- DataComp-LM: In search of the next generation of training sets for language models
- OLMo: Accelerating the Science of Language Models
- MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
- Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
- Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs
- Baichuan-Omni Technical Report
- Model Merging Paper
- 1.5-Pints Technical Report: Pretraining in Days, Not Months – Your Language Model Thrives on Quality Data
- SkyLadder: Better and Faster Pretraining via Context Window Scheduling
- Phi-4 Technical Report
- Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
- Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models
- 2 OLMo 2 Furious
- Qwen2.5-VL Technical Report
- Baichuan-M1: Pushing the Medical Capability of Large Language Models
- Jamba: A Hybrid Transformer-Mamba Language Model
- Textbooks Are All You Need
- Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
- OLMoE: Open Mixture-of-Experts Language Models
- Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
- Qwen2.5 Technical Report
- YuLan-Mini: An Open Data-efficient Language Model
- Qwen2.5-Omni technical report
- Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
- Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources
Tips
- LLM-Dojo 开源大模型学习场所，使用简洁且易阅读的代码构建模型训练框架
- 基于 transformers 的 generate() 方法实现多样化文本生成：参数含义和算法原理解读
- 轻松入门大语言模型（LLM）
- Zero-Qwen-VL
- MiniMind
- Knowledge distillation: Teaching LLM's with synthetic data
- Part 1: Methods for adapting large language models
- Part 2: To fine-tune or not to fine-tune
- Part 3: How to fine-tune: Focus on effective datasets
- Reader-LM: Small Language Models for Cleaning and Converting HTML to Markdown
- Distributed Training Guide
- Chat Templates
- LLMs for Text Classification: A Guide to Supervised Learning
- Unsupervised Text Classification: Categorize Natural Language With LLMs
- Text Classification With LLMs: A Roundup of the Best Methods
- Tiny LLM Universe
- Zero-Chatgpt
- ![Forkers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/network/members)
- ![Stargazers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/stargazers)
- ![Stargazers over time - LLM-resourses)
- LLM Pricing
- Uncensor any LLM with abliteration
- finetune-Qwen2-VL
- build_MiniLLM_from_scratch
- Tiny LLM zh
- Top 20+ RAG Interview Questions
- ![Forkers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/network/members)
- ![Stargazers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/stargazers)
- LLMs应用构建一年之心得
- What We Learned from a Year of Building with LLMs (Part I)
- What We Learned from a Year of Building with LLMs (Part II)
- What We Learned from a Year of Building with LLMs (Part III): Strategy
- pytorch-llama
- Preference Optimization for Vision Language Models with TRL
- A Visual Guide to Mixture of Experts (MoE)
- The Ultra-Scale Playbook: Training LLMs on GPU Clusters
- MPP-LLaVA
- LLM-Travel
- Role-Playing in Large Language Models like ChatGPT
- LLM训练-pretrain
- Fine-tuning visual language models using SFTTrainer - sfttrainer-for-vision-language-models)】
- o1 isn’t a chat model (and that’s the point)
- Beam Search快速理解及代码解析
书籍 Book
- Foundations of Large Language Models
- 《动手做AI Agent》
- 《Illustrated book to learn about Transformers & LLMs》
- 《大模型基础》
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software
- 《Hands-On Large Language Models》
- 《自然语言处理：大模型理论与实践》
- 《大规模语言模型：从理论到实践》
- 《大语言模型》
- 《动手学大模型Dive into LLMs》
- 《Build a Large Language Model (From Scratch)》
- 《多模态大模型》
- 《Understanding Deep Learning》
- 《Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG》
- 《动手学强化学习》
- 《面向开发者的LLM入门教程》
- 《Generative AI Handbook: A Roadmap for Learning Resources》
- Textbook on reinforcement learning from human feedback
课程 Course
- HuggingFace Learn
- Mistral: Getting Started with Mistral
- Knowledge Graphs for RAG
- OpenRAG
- 通往AGI之路
- Large Language Model Agents
- Interactive visualization of Transformer
- andysingal/llm-course
- LM-class
- Google Advanced: Generative AI for Developers Learning Path
- Anthropics：Prompt Engineering Interactive Tutorial
- Hands on llms - time financial advisor LLM system.
- LangGPT
- LLMs Interview Note
- LLMsBook
- Coursera: Chatgpt 应用提示工程
- LLM Evaluation: A Complete Course
- Introduction to Generative AI 2024 Spring
- 斯坦福 CS224N: Natural Language Processing with Deep Learning
- 吴恩达: Generative AI for Everyone
- 吴恩达: LLM series of courses
- ACL 2023 Tutorial: Retrieval-based Language Models and Applications
- llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
- 微软: Generative AI for Beginners
- 微软: State of GPT
- 斯坦福 CS25: Transformers United V4
- 斯坦福 CS324: Large Language Models
- 普林斯顿 COS 597G (Fall 2022): Understanding Large Language Models
- 约翰霍普金斯 CS 601.471/671 NLP: Self-supervised Models
- openai-cookbook
- 滑铁卢大学 CS 886: Recent Advances on Foundation Models
- mistralai-cookbook
- build nanoGPT
- LLM101n
- LLMs From Scratch (Datawhale Version)
- Cohere LLM University
- LLMs and Transformers
- Smol Vision
- Multimodal RAG: Chat with Videos
- RAG++ : From POC to production
- Weights & Biases AI Academy
- Learn RAG From Scratch – Python AI Tutorial from a LangChain Engineer
- Prompt Engineering & AI tutorials & Resources
- LLM Resources Hub
- LLM技术科普
- HuggingFace NLP Course
- 清华 NLP 刘知远团队大模型公开课
- 李宏毅 GenAI课程
- Andrej Karpathy: Deep Dive into LLMs like ChatGPT
教程 Tutorial
- AI-Guide-and-Demos
- AI开发者频道
- B站：漆妮妮
- Prompt Engineering Guide
- B站：AI老兵文哲
- YTB：IBM Technology
- YTB: Unify Reading Paper Group
- Chip Huyen
- How Much VRAM
- 知乎: 原石人类
- B站：小黑黑讲AI
- B站：面壁的车辆工程师
- Blog: mlabonne
- Blog: Lil’Log (OponAI)
- YTB: Unify Reading Paper Group
- LLM-Action
- B站：TechBeat人工智能社区
- B站：黄益贺
- B站：深度学习自然语言处理
- LLM Visualization
- Blog: 科学空间（苏剑林）
- YTB: Hyung Won Chung
- Blog: Tejaswi kashyap
- Blog: 小昇的博客
- 知乎: ybq
- W&B articles
- Huggingface Blog
- Blog: GbyAI
- 动手学大模型应用开发
- B站：五里墩茶社
- B站：木羽Cheney
- Theoretical Machine Learning: A Handbook for Everyone
- Implementation of all RAG techniques in a simpler way.
- B站: 毛玉仁
- cnblog: 第七子
数据 Data
- datasketch
- semhash
- ReaderLM-v2
- Curator - training and structured data extraction.
- AotoLabel
- LabelLLM - Source Data Annotation Platform.
- OmniParser
- MinerU - stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.
- Docling
- GOT-OCR2.0
- Zerox - 4o-mini.
- data-juicer - stop data processing system to make data higher-quality, juicier, and more digestible for LLMs!
- Parsera - sites with LLMs.
- Sparrow - source solution for efficient data extraction and processing from various documents and images.
- pdf-extract-api
- MegaParse
- DocLayout-YOLO - to-Local Adaptive Perception.
- TensorZero
- PDF-Extract-Kit - Quality PDF Content Extraction.
- LangKit - source toolkit for monitoring Large Language Models (LLMs). Extracts signals from prompts & responses, ensuring safety & security.
- LLM Decontaminator
- DataTrove
- llm-swarm
- BabelDOC
- Tabled
- Distilabel
- Common-Crawl-Pipeline-Creator
- pdf2htmlEX
- Extractous
- Easy Dataset - tuning datasets for LLM.
- MarkItDown
- olmOCR
- Promptwright
微调 Fine-Tuning
- Online-RLHF
- 360-LLaMA-Factory - Tuning of 100+ LLMs. (add Sequence Parallelism for supporting long context training)
- TRL
- ChatLearn - scale alignment.
- Meta Lingua - to-hack codebase to research LLMs.
- MLX-VLM - VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
- Simplifine
- Transformer Lab - tune, and evaluate large language models on your own computer.
- Vision-LLM Alignemnt - based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
- OpenRLHF - to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral).
- nanotron - parallelism training.
- Proxy Tuning
- Effective LLM Alignment
- LLaMA-Factory - Tuning of 100+ LLMs.
- unsloth - 5X faster 80% less memory LLM finetuning.
- Firefly
- Xtuner - featured toolkit for fine-tuning large models.
- torchtune - PyTorch Library for LLM Fine-tuning.
- AutoTrain - of-the-art Machine Learning models.
- Ludwig - code framework for building custom LLMs, neural networks, and other AI models.
- mistral-finetune - weight codebase that enables memory-efficient and performant finetuning of Mistral's models.
- aikit - tune, build, and deploy open-source LLMs easily!
- H2O-LLMStudio - a framework and no-code GUI for fine-tuning LLMs.
- LitGPT - of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
- LLMBox
- PaddleNLP - to-use and powerful NLP and LLM library.
- workbench-llamafactory - to-end model development workflow using Llamafactory.
- TinyLLaVA Factory - scale Large Multimodal Models.
- lmms-finetune - 1.5, qwen-vl, llava-interleave, llava-next-video, phi3-v etc.
- Liger-Kernel
- Autotrain-advanced
- InternEvo - sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
- veRL
- LLM-Foundry
- Axolotl - tune a model, run model inference or evaluation, and much more.
- Oumi - of-the-art foundation models, end-to-end.
- Kiln - tuning LLM models, synthetic data generation, and collaborating on datasets.
- DeepSeek-671B-SFT-Guide - source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.
智能体 Agents
- LinkAI
- Baidu APPBuilder
- AutoGen - studio.com/)
- Agent Zero
- Agents - source Framework for Data-centric, Self-evolving Autonomous Language Agents.
- OmAgent
- Agent-S
- PydanticAI
- Coze
- CrewAI - playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
- AgentGPT
- XAgent
- MobileAgent
- Lagent - based agents.
- Qwen-Agent
- agentUniverse - agent framework that allows developers to easily build multi-agent applications. Furthermore, through the community, they can exchange and share practices of patterns across different domains.
- LazyLLM
- MoA - of-the-art results.
- Agently
- Tribe - agent teams.
- CAMEL - agent framework and an open-source community dedicated to finding the scaling law of agents.
- IoA - source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
- AgentScope - empowered multi-agent applications in an easier way.
- FastAgency - agent workflows to production.
- Swarm - agent systems. Managed by OpenAI Solutions team. Experimental framework.
- PraisonAI - code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collaboration.
- Agentarium - source framework for creating and managing simulations populated with AI-powered agents.
- smolagents
- Cooragent
体验 Usage
- 林哥的大模型野榜
- AnyChat
- LMSYS Chatbot Arena: Benchmarking LLMs in the Wild
- CompassArena 司南大模型竞技场
- 琅琊榜
- Huggingface Spaces
- WiseModel Spaces
- OpenRouter
- 智谱Z.AI
课程
- 李宏毅 GenAI课程
教程
- YTB：AI Anytime
- YTB: AI超元域
软件
- 沉浸式翻译
- ![Forkers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/network/members)
- ![Stargazers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/stargazers)
- POT
- Bob
- OpenAI Translator Bob Plugin
RAG
- LightRAG - Agent-Generator pipelines.
MCP
- mcp.ad
- FastAPI-MCP
- modelscope/mcp
- mcpm.sh
- MCP是啥？技术原理是什么？一个视频搞懂MCP的一切。Windows系统配置MCP，Cursor,Cline 使用MCP
- MCP是什么？为啥是下一代AI标准？MCP原理+开发实战！在Cursor、Claude、Cline中使用MCP，让AI真正自动化！
- smithery.ai
- mcp.so
- modelcontextprotocol/servers
- mcp.ad
- pulsemcp.com
- awesome-mcp-servers
- glama.ai
- mcp.composio.dev
- awesome-mcp-list
- mcpo
- FastMCP
- sharemcp.cn
- mcpstore.co
搜索 Search
- OpenSearch GPT
- MiniPerplx - powered search engine that helps you find information on the internet.
- MindSearch - based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT).
- nanoPerplexityAI - source implementation of perplexity.ai.
- curiosity - like user experience.
推理 Inference
- Chitu - performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
- OARC - TTS speech models, Keras classifiers, Llava vision, Whisper recognition, and more to create a unified chatbot agent for local, custom automation.
- g1 - 3.1 70b on Groq to create o1-like reasoning chains.
- MemoryScope - term memory capabilities, offering a framework for building such abilities.
- OpenLLM - source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
- Infinity - native database built for LLM applications, providing incredibly fast hybrid search of dense embedding, sparse embedding, tensor and full-text.
- optillm - of-the-art techniques that can improve the accuracy and performance of LLMs.
- Text Generation WebUI
- Xinference
- LangChain - aware reasoning applications.
- LlamaIndex
- lobe-chat - source, modern-design LLMs/AI chat framework. Supports Multi AI Providers, Multi-Modals (Vision/TTS) and plugin system.
- TensorRT-LLM - LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
- Chat-ollama
- chat-ui
- koboldcpp - file way to run various GGML and GGUF models with KoboldAI's UI.
- LLMFarm
- Flowise
- Jan - LLM).
- LMDeploy
- RouteLLM - save LLM costs without compromising quality!
- MInference
- Mem0
- SGLang
- AirLLM
- LLMHub
- YuanChat
- LiteLLM
- LocalAI - hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required.
- ZhiLight
- LLaMA Box
- DashInfer - leading performance atop various hardware architectures.
- SkyPilot
- ollama
- Open WebUI - friendly WebUI for LLMs (Formerly Ollama WebUI).
- vllm - throughput and memory-efficient inference and serving engine for LLMs.
- LlamaChat
- NVIDIA ChatRTX
- LM Studio
- chat-with-mlx
- LLM Pricing
- Open Interpreter
- MemGPT - term memory and custom tools.
- GuideLLM
- LLM-Engines - source models (VLLM, SGLang, Together) and commercial models (OpenAI, Mistral, Claude).
- ktransformers - edge LLM Inference Optimizations.
- TokenSwift
社区 Community
- 魔乐社区
- HuggingFace
- ModelScope
- WiseModel
- OpenCSG
Small Language Model
- 使用Cosmopedia训练cosmo-1b

Programming Languages

Python 139 Jupyter Notebook 25 TypeScript 16 JavaScript 7 C++ 6 HTML 3 Go 3 Swift 2 Rust 2 C 2

Categories

课程 Course 49 推理 Inference 47 论文 Paper 44 Tips 44 微调 Fine-Tuning 38 教程 Tutorial 35 数据 Data 33 知识库 RAG 33 智能体 Agents 29 评估 Evaluation 26 MCP 19 书籍 Book 18 体验 Usage 9 软件 6 社区 Community 5 搜索 Search 5 教程 2 RAG 1 课程 1 Small Language Model 1

Sub Categories

Keywords

llm 82 ai 38 rag 31 gpt 29 llama 29 llms 24 openai 23 python 23 large-language-models 22 agent 21 chatgpt 20 llama3 17 pytorch 13 qwen 13 chatbot 13 machine-learning 12 mistral 12 deep-learning 11 gpt-4 11 llama2 11 fine-tuning 11 langchain 10 ollama 10 deepseek 10 llava 9 llmops 9 nlp 9 transformers 8 llm-training 8 gemma 8 generative-ai 8 framework 8 llm-serving 7 artificial-intelligence 7 agents 7 pdf 7 language-model 6 lora 6 javascript 6 retrieval-augmented-generation 6 huggingface 6 genai 6 llm-inference 6 finetuning 6 evaluation 6 rlhf 6 transformer 6 ai-agents 5 mcp 5 chatglm 5