Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-LLM-resourses
🧑🚀 Summary of the world's best LLM resources.
https://github.com/WangRongsheng/awesome-LLM-resourses
Fine-Tuning
- AutoTrain - State-of-the-art Machine Learning models.
- workbench-llamafactory - End-to-end model development workflow using Llamafactory.
- TRL
- LLaMA-Factory - Fine-Tuning of 100+ LLMs.
- unsloth - 5X faster 80% less memory LLM finetuning.
- Firefly
- Xtuner - Full-featured toolkit for fine-tuning large models.
- torchtune - PyTorch Library for LLM Fine-tuning.
- Ludwig - Low-code framework for building custom LLMs, neural networks, and other AI models.
- aikit - Fine-tune, build, and deploy open-source LLMs easily!
- H2O-LLMStudio - A framework and no-code GUI for fine-tuning LLMs.
- LitGPT - State-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
- LLMBox
- PaddleNLP - Easy-to-use and powerful NLP and LLM library.
- TinyLLaVA Factory - Small-scale Large Multimodal Models.
- LLM-Foundry
- ChatLearn - Large-scale alignment.
- Meta Lingua - Easy-to-hack codebase to research LLMs.
- mistral-finetune - Light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models.
- lmms-finetune - llava-1.5, qwen-vl, llava-interleave, llava-next-video, phi3-v, etc.
- Simplifine
- Transformer Lab - Fine-tune and evaluate large language models on your own computer.
- Liger-Kernel
- nanotron - 3D-parallelism training.
- Proxy Tuning
- Effective LLM Alignment
- Autotrain-advanced
Agents
- LinkAI
- Baidu APPBuilder
- AutoGen
- AgentGPT
- XAgent
- MobileAgent
- Lagent - LLM-based agents.
- Qwen-Agent
- LazyLLM
- AgentScope - LLM-empowered multi-agent applications in an easier way.
- MoA - State-of-the-art results.
- Agently
- Tribe - Multi-agent teams.
- CAMEL - Multi-agent framework.
- IoA - Open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
- Agent Zero
- Agents - Open-source framework for data-centric, self-evolving autonomous language agents.
- FastAgency - Multi-agent workflows to production.
- agentUniverse - Multi-agent framework that lets developers easily build multi-agent applications. Through the community, developers can also exchange and share pattern practices across different domains.
- OmAgent
- CrewAI - Role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
- Coze
- Swarm - Multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.
Course
- HuggingFace NLP Course
- Tsinghua NLP (Liu Zhiyuan's team): Open Course on Large Models
- Mistral: Getting Started with Mistral
- Knowledge Graphs for RAG
- OpenRAG
- The Road to AGI (通往AGI之路)
- Stanford CS224N: Natural Language Processing with Deep Learning
- Andrew Ng: Generative AI for Everyone
- Andrew Ng: LLM series of courses
- ACL 2023 Tutorial: Retrieval-based Language Models and Applications
- Microsoft: Generative AI for Beginners
- Stanford CS324: Large Language Models
- openai-cookbook
- mistralai-cookbook
- build nanoGPT
- LLMs From Scratch (Datawhale Version)
- Multimodal RAG: Chat with Videos
- Large Language Model Agents
- Cohere LLM University
- LLMs and Transformers
- Smol Vision
- LLMsBook
- Learn RAG From Scratch – Python AI Tutorial from a LangChain Engineer
- Microsoft: State of GPT
- Stanford CS25: Transformers United V4
- Princeton COS 597G (Fall 2022): Understanding Large Language Models
- Johns Hopkins CS 601.471/671 NLP: Self-supervised Models
- University of Waterloo CS 886: Recent Advances on Foundation Models
- LLM101n
- Andrej Karpathy - Neural Networks: Zero to Hero
- Interactive visualization of Transformer
- andysingal/llm-course
- LM-class
- Google Advanced: Generative AI for Developers Learning Path
- Anthropic: Prompt Engineering Interactive Tutorial
- Hands on llms - Real-time financial advisor LLM system.
- LangGPT
- LLMs Interview Note
- Coursera: Applied Prompt Engineering with ChatGPT
- LLM Evaluation: A Complete Course
- Introduction to Generative AI 2024 Spring
- llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
- RAG++ : From POC to production
- Weights & Biases AI Academy
- Prompt Engineering & AI tutorials & Resources
Tutorial
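- AI Developer Channel (AI开发者频道)
- Bilibili: 五里墩茶社
- Bilibili: 木羽Cheney
- Bilibili: 漆妮妮
- Prompt Engineering Guide
- Hands-On LLM Application Development (动手学大模型应用开发)
- Zhihu: ybq
- YouTube: Hyung Won Chung
- Blog: Tejaswi kashyap
- Blog: 小昇的博客 (Xiaosheng's Blog)
- Blog: GbyAI
- Bilibili: TechBeat人工智能社区 (TechBeat AI Community)
- Bilibili: 黄益贺
- LLM Visualization
- Bilibili: AI老兵文哲
- Blog: mlabonne
- YouTube: IBM Technology
- YouTube: Unify Reading Paper Group
- Chip Huyen
- How Much VRAM
- Blog: 科学空间 (Scientific Spaces, by Su Jianlin)
- Zhihu: 原石人类
- Bilibili: 小黑黑讲AI
- Bilibili: 面壁的车辆工程师
- LLM-Action
- Bilibili: 深度学习自然语言处理 (Deep Learning and NLP)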
- W&B articles
- Huggingface Blog
Usage
Tips
- An Easy Introduction to Large Language Models (LLMs)
- LLM Training: Pretraining
- Zero-Qwen-VL
- finetune-Qwen2-VL
- MiniMind
- Knowledge distillation: Teaching LLM's with synthetic data
- Part 1: Methods for adapting large language models
- Part 2: To fine-tune or not to fine-tune
- Part 3: How to fine-tune: Focus on effective datasets
- Reader-LM: Small Language Models for Cleaning and Converting HTML to Markdown
- Distributed Training Guide
- Chat Templates
- LLMs for Text Classification: A Guide to Supervised Learning
- Unsupervised Text Classification: Categorize Natural Language With LLMs
- Text Classification With LLMs: A Roundup of the Best Methods
- Tiny LLM Universe
- Zero-Chatgpt
- LLM Pricing
- Uncensor any LLM with abliteration
- build_MiniLLM_from_scratch
- Tiny LLM zh
- Top 20+ RAG Interview Questions
- Lessons from a Year of Building LLM Applications
- What We Learned from a Year of Building with LLMs (Part I)
- What We Learned from a Year of Building with LLMs (Part II)
- What We Learned from a Year of Building with LLMs (Part III): Strategy
- pytorch-llama
- Preference Optimization for Vision Language Models with TRL
- Fine-tuning visual language models using SFTTrainer
- A Visual Guide to Mixture of Experts (MoE)
- MPP-LLaVA
- LLM-Travel
- Role-Playing in Large Language Models like ChatGPT
Book
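- 《Large Language Models: From Theory to Practice》 (大规模语言模型:从理论到实践)
- 《Large Language Models》 (大语言模型)
- 《Hands-On AI Agents》 (动手做AI Agent)
- 《Dive into LLMs》 (动手学大模型)
- 《Build a Large Language Model (From Scratch)》
- 《Multimodal Large Models》 (多模态大模型)
- 《Understanding Deep Learning》
- 《Illustrated book to learn about Transformers & LLMs》
- 《Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG》
- 《Foundations of Large Models》 (大模型基础)
- 《Hands-On Large Language Models》
- 《Natural Language Processing: Large Model Theory and Practice》 (自然语言处理:大模型理论与实践)
- 《Hands-On Reinforcement Learning》 (动手学强化学习)
- 《An Introduction to LLMs for Developers》 (面向开发者的LLM入门教程)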
- 《Generative AI Handbook: A Roadmap for Learning Resources》
Software
- Immersive Translate (沉浸式翻译)
- POT
- Bob
- OpenAI Translator Bob Plugin
Knowledge Base (RAG)
- FlashRAG
- AnythingLLM - All-in-one AI app for any LLM with full RAG and AI Agent capabilities.
- MaxKB
- RAGFlow - Open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
- Dify - Open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
- FastGPT - Knowledge-based platform built on LLMs that offers out-of-the-box data processing and model invocation capabilities and allows workflow orchestration through Flow visualization.
- Langchain-Chatchat
- QAnything
- Quivr - Retrieval-augmented generation.
- Verba
- GraphRAG - Graph-based Retrieval-Augmented Generation (RAG) system.
- TEN - Next-Gen AI-Agent Framework, the world's first truly real-time multimodal AI agent framework.
- nano-GraphRAG - Easy-to-hack GraphRAG implementation.
- RAG Techniques - Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
- ragas
- kotaemon - Open-source, clean, and customizable RAG UI for chatting with your documents. Built with both end users and developers in mind.
- AutoRAG
- RAGapp
- KAG - Knowledge-enhanced generation framework based on the OpenSPG engine, used to build knowledge-enhanced, rigorous decision-making and information retrieval knowledge services.
- TurboRAG - Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text.
- LightRAG - Retrieval-Augmented Generation.
Data
- AutoLabel
- LabelLLM - Open-Source Data Annotation Platform.
- OmniParser
- MinerU - One-stop, open-source, high-quality data extraction tool; supports PDF/webpage/e-book extraction.
- PDF-Extract-Kit - High-Quality PDF Content Extraction.
- Docling
- GOT-OCR2.0
- Zerox - gpt-4o-mini.
- data-juicer - One-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs!
- Parsera - Web-sites with LLMs.
- Sparrow - Open-source solution for efficient data extraction and processing from various documents and images.
- DocLayout-YOLO - Global-to-Local Adaptive Perception.
- TensorZero
- Promptwright
- LLM Decontaminator
- DataTrove
- llm-swarm
- Tabled
- Distilabel
- Common-Crawl-Pipeline-Creator
Inference
- ollama
- Open WebUI - User-friendly WebUI for LLMs (Formerly Ollama WebUI).
- Text Generation WebUI
- Xinference
- LangChain - Context-aware reasoning applications.
- LlamaIndex
- TensorRT-LLM - TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
- LlamaChat
- RouteLLM - save LLM costs without compromising quality!
- MInference
- Mem0
- SGLang
- GuideLLM
- LLM-Engines - Open-source models (VLLM, SGLang, Together) and commercial models (OpenAI, Mistral, Claude).
- lobe-chat - Open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers, Multi-Modals (Vision/TTS) and plugin system.
- vllm - High-throughput and memory-efficient inference and serving engine for LLMs.
- NVIDIA ChatRTX
- LM Studio
- chat-with-mlx
- LLM Pricing
- Open Interpreter
- Chat-ollama
- chat-ui
- MemGPT - Long-term memory and custom tools.
- koboldcpp - One-file way to run various GGML and GGUF models with KoboldAI's UI.
- LLMFarm
- enchanted
- Flowise
- Jan
- LMDeploy
- AirLLM
- LLMHub
- YuanChat
- LiteLLM
- OARC - TTS speech models, Keras classifiers, Llava vision, Whisper recognition, and more to create a unified chatbot agent for local, custom automation.
- g1 - Llama-3.1 70b on Groq to create o1-like reasoning chains.
- MemoryScope - Long-term memory capabilities, offering a framework for building such abilities.
- OpenLLM - Open-source LLMs, such as Llama 3.1 and Gemma, as OpenAI-compatible API endpoints in the cloud.
- Infinity - AI-native database built for LLM applications, providing incredibly fast hybrid search of dense embedding, sparse embedding, tensor and full-text.
Evaluation
- lm-evaluation-harness - Few-shot evaluation of language models.
- opencompass - GPT-4, LLaMa2, Qwen, GLM, Claude, etc., over 100+ datasets.
- llm-comparator - Side-by-side, developed.
- Ollama Benchmark
- VLMEvalKit - Open-source evaluation toolkit for large vision-language models (LVLMs); supports ~100 VLMs and 40+ benchmarks.
- EvalScope
- Weave
- Evaluation guidebook
RAG
- LightRAG - Retriever-Agent-Generator pipelines.
Search
- OpenSearch GPT
- nanoPerplexityAI - Open-source implementation of perplexity.ai.
- curiosity - Perplexity-like user experience.
- MindSearch - LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT).
Paper
- Huggingface Daily Papers
- OLMoE: Open Mixture-of-Experts Language Models
- The Llama 3 Herd of Models
- Qwen Technical Report
- Hermes-3-Technical-Report
- Qwen2 Technical Report
- DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
- Baichuan 2: Open Large-scale Language Models
- DataComp-LM: In search of the next generation of training sets for language models
- OLMo: Accelerating the Science of Language Models
- MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
- Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
- Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
- Jamba: A Hybrid Transformer-Mamba Language Model
- Textbooks Are All You Need
- Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
- Baichuan Alignment Technical Report
- Qwen2-vl Technical Report
- Baichuan-Omni Technical Report
- Model Merging Paper
- 1.5-Pints Technical Report: Pretraining in Days, Not Months – Your Language Model Thrives on Quality Data
Keywords
- llm: 61
- gpt: 28
- ai: 24
- llama: 23
- llms: 21
- large-language-models: 18
- chatgpt: 18
- rag: 17
- openai: 17
- python: 15
- llama2: 14
- gpt-4: 13
- agent: 13
- chatbot: 13
- llama3: 11
- llmops: 10
- pytorch: 9
- deep-learning: 9
- mistral: 9
- fine-tuning: 9
- ollama: 8
- machine-learning: 8
- generative-ai: 7
- qwen: 7
- transformers: 7
- langchain: 7
- nlp: 6
- chatglm: 6
- artificial-intelligence: 6
- framework: 6
- llava: 6
- llm-inference: 5
- llamacpp: 5
- llm-agent: 5
- huggingface: 5
- language-model: 5
- gemma: 5
- retrieval-augmented-generation: 5
- lora: 5
- finetuning: 5
- qlora: 5
- agents: 4
- javascript: 4
- vector-database: 4
- chat: 4
- macos: 4
- inference: 4
- data-science: 4
- transformer: 4
- evaluation: 4