Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-LLM-resourses
🧑🚀 全世界最好的中文LLM资料总结
https://github.com/WangRongsheng/awesome-LLM-resourses
Last synced: 3 days ago
JSON representation
-
微调 Fine-Tuning
- AutoTrain - of-the-art Machine Learning models.
- workbench-llamafactory - to-end model development workflow using Llamafactory.
- TRL
- LLaMA-Factory - Tuning of 100+ LLMs.
- unsloth - 5X faster 80% less memory LLM finetuning.
- Firefly
- Xtuner - featured toolkit for fine-tuning large models.
- torchtune - PyTorch Library for LLM Fine-tuning.
- Swift - parameter to finetune 200+ LLMs or 15+ MLLMs.
- Ludwig - code framework for building custom LLMs, neural networks, and other AI models.
- aikit - tune, build, and deploy open-source LLMs easily!
- H2O-LLMStudio - a framework and no-code GUI for fine-tuning LLMs.
- LitGPT - of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
- LLMBox
- PaddleNLP - to-use and powerful NLP and LLM library.
- TinyLLaVA Factory - scale Large Multimodal Models.
- LLM-Foundry
- ChatLearn - scale alignment.
- mistral-finetune - weight codebase that enables memory-efficient and performant finetuning of Mistral's models.
- lmms-finetune - 1.5, qwen-vl, llava-interleave, llava-next-video, phi3-v etc.
- Simplifine
- Transformer Lab - tune, and evaluate large language models on your own computer.
- Liger-Kernel
- nanotron - parallelism training.
- Proxy Tuning
-
Agents
- LinkAI
- Baidu APPBuilder
- OmAgent
- AutoGen - studio.com/)
- AgentGPT
- XAgent
- MobileAgent
- Lagent - based agents.
- Qwen-Agent
- agentUniverse - agent framework that allows developers to easily build multi-agent applications. Furthermore, through the community, they can exchange and share practices of patterns across different domains.
- LazyLLM
- AgentScope - empowered multi-agent applications in an easier way.
- MoA - of-the-art results.
- Agently
- Tribe - agent teams.
- CAMEL - agent framework.
- IoA - source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
- llama-agentic-system
- Agent Zero
- Agents - source Framework for Data-centric, Self-evolving Autonomous Language Agents.
- Coze
-
课程 Course
- HuggingFace NLP Course
- 清华 NLP 刘知远团队大模型公开课
- Mistral: Getting Started with Mistral
- Knowledge Graphs for RAG
- OpenRAG
- 通往AGI之路
- 斯坦福 CS224N: Natural Language Processing with Deep Learning
- 吴恩达: Generative AI for Everyone
- 吴恩达: LLM series of courses
- ACL 2023 Tutorial: Retrieval-based Language Models and Applications
- llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
- 微软: Generative AI for Beginners
- 斯坦福 CS324: Large Language Models
- openai-cookbook
- mistralai-cookbook
- build nanoGPT
- LLMs From Scratch (Datawhale Version)
- LLMsBook
- Multimodal RAG: Chat with Videos
- Large Language Model Agents
- Cohere LLM University
- LLMs and Transformers
- Smol Vision
- 微软: State of GPT
- 斯坦福 CS25: Transformers United V4
- 普林斯顿 COS 597G (Fall 2022): Understanding Large Language Models
- 约翰霍普金斯 CS 601.471/671 NLP: Self-supervised Models
- 滑铁卢大学 CS 886: Recent Advances on Foundation Models
- LLM101n
- Andrej Karpathy - Neural Networks: Zero to Hero
- Interactive visualization of Transformer
- andysingal/llm-course
- LM-class
- Google Advanced: Generative AI for Developers Learning Path
- Anthropics:Prompt Engineering Interactive Tutorial
- Hands on llms - time financial advisor LLM system.
- LangGPT
- LLMs Interview Note
- Coursera: Chatgpt 应用提示工程
- Introduction to Generative AI 2024 Spring
-
教程 Tutorial
- AI开发者频道
- B站:五里墩茶社
- B站:木羽Cheney
- B站:深度学习自然语言处理
- B站:漆妮妮
- Prompt Engineering Guide
- 动手学大模型应用开发
- YTB: Hyung Won Chung
- Blog: Tejaswi kashyap
- Blog: 小昇的博客
- B站:TechBeat人工智能社区
- B站:黄益贺
- LLM Visualization
- B站:AI老兵文哲
- Large Language Models (LLMs) with Colab notebooks
- YTB:IBM Technology
- YTB: Unify Reading Paper Group
- Chip Huyen
- How Much VRAM
- Blog: 科学空间(苏剑林)
- 知乎: 原石人类
- B站:小黑黑讲AI
- B站:面壁的车辆工程师
-
体验 Usage
-
Tips
- 轻松入门大语言模型(LLM)
- Zero-Qwen-VL
- finetune-Qwen2-VL
- MiniMind
- Knowledge distillation: Teaching LLM's with synthetic data
- Part 1: Methods for adapting large language models
- Part 2: To fine-tune or not to fine-tune
- Part 3: How to fine-tune: Focus on effective datasets
- Reader-LM: Small Language Models for Cleaning and Converting HTML to Markdown
- LLMs for Text Classification: A Guide to Supervised Learning
- Unsupervised Text Classification: Categorize Natural Language With LLMs
- Text Classification With LLMs: A Roundup of the Best Methods
- Tiny LLM Universe
- Zero-Chatgpt
- ![Forkers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/network/members)
- ![Stargazers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/stargazers)
- ![Stargazers over time - LLM-resourses)
- LLM Pricing
- Uncensor any LLM with abliteration
- build_MiniLLM_from_scratch
- Tiny LLM zh
- LLMs应用构建一年之心得
- What We Learned from a Year of Building with LLMs (Part I)
- What We Learned from a Year of Building with LLMs (Part II)
- What We Learned from a Year of Building with LLMs (Part III): Strategy
-
书籍 Book
- 《大规模语言模型:从理论到实践》
- 《大语言模型》
- 《动手做AI Agent》
- 《动手学大模型Dive into LLMs》
- 《Build a Large Language Model (From Scratch)》
- 《多模态大模型》
- 《Understanding Deep Learning》
- 《Illustrated book to learn about Transformers & LLMs》
- 《Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG》
- 《Hands-On Large Language Models》
- 《自然语言处理:大模型理论与实践》
- 《Generative AI Handbook: A Roadmap for Learning Resources》
-
课程
-
教程
-
软件
- 沉浸式翻译
- ![Forkers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/network/members)
- ![Stargazers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/stargazers)
- POT
- Bob
- OpenAI Translator Bob Plugin
- ![Forkers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/network/members)
- ![Stargazers repo roster for @WangRongsheng/awesome-LLM-resourses - LLM-resourses/stargazers)
-
RAG
- FlashRAG
- AnythingLLM - in-one AI app for any LLM with full RAG and AI Agent capabilites.
- MaxKB
- RAGFlow - source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
- Dify - source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
- FastGPT - based platform built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization.
- Langchain-Chatchat
- QAnything
- Quivr - augmented generation.
- Verba
- GraphRAG - based Retrieval-Augmented Generation (RAG) system.
- LightRAG - Agent-Generator pipelines.
- nano-GraphRAG - to-hack GraphRAG implementation.
- RAG Techniques - Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
- ragas
- kotaemon - source clean & customizable RAG UI for chatting with your documents. Built with both end users and developers in mind.
- RAGapp
-
数据 Data
- AotoLabel
- LabelLLM - Source Data Annotation Platform.
- OmniParser
- MinerU - stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.
- PDF-Extract-Kit - Quality PDF Content Extraction.
- Docling
- GOT-OCR2.0
- data-juicer - stop data processing system to make data higher-quality, juicier, and more digestible for LLMs!
- Parsera - sites with LLMs.
- Sparrow - source solution for efficient data extraction and processing from various documents and images.
-
推理 Inference
- ollama
- Open WebUI - friendly WebUI for LLMs (Formerly Ollama WebUI).
- Text Generation WebUI
- Xinference
- LangChain - aware reasoning applications.
- LlamaIndex
- TensorRT-LLM - LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
- LlamaChat
- RouteLLM - save LLM costs without compromising quality!
- MInference
- Mem0
- SGLang
- GuideLLM
- LLM-Engines - source models (VLLM, SGLang, Together) and commercial models (OpenAI, Mistral, Claude).
- lobe-chat - source, modern-design LLMs/AI chat framework. Supports Multi AI Providers, Multi-Modals (Vision/TTS) and plugin system.
- vllm - throughput and memory-efficient inference and serving engine for LLMs.
- NVIDIA ChatRTX
- LM Studio
- chat-with-mlx
- LLM Pricing
- Open Interpreter
- Chat-ollama
- chat-ui
- MemGPT - term memory and custom tools.
- koboldcpp - file way to run various GGML and GGUF models with KoboldAI's UI.
- LLMFarm
- enchanted
- Flowise
- Jan - LLM).
- LMDeploy
- AirLLM
- LLMHub
- YuanChat
- LiteLLM
- OARC - TTS speech models, Keras classifiers, Llava vision, Whisper recognition, and more to create a unified chatbot agent for local, custom automation.
- g1 - 3.1 70b on Groq to create o1-like reasoning chains.
- MemoryScope - term memory capabilities, offering a framework for building such abilities.
-
评估 Evaluation
- lm-evaluation-harness - shot evaluation of language models.
- opencompass - 4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
- llm-comparator - by-side, developed.
-
搜索 Search
- OpenSearch GPT
- nanoPerplexityAI - source implementation of perplexity.ai.
- MindSearch - based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT).
-
论文 Paper
- Huggingface Daily Papers
- OLMoE: Open Mixture-of-Experts Language Models
- The Llama 3 Herd of Models
- Qwen Technical Report
- Hermes-3-Technical-Report
- Qwen2 Technical Report
- DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
- Baichuan 2: Open Large-scale Language Models
- DataComp-LM: In search of the next generation of training sets for language models
- OLMo: Accelerating the Science of Language Models
- MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
- Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
- Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
- Jamba: A Hybrid Transformer-Mamba Language Model
- Textbooks Are All You Need
- Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Programming Languages
Categories
Sub Categories
Keywords
llm
55
gpt
26
llama
20
ai
18
chatgpt
16
llms
16
large-language-models
15
rag
15
gpt-4
13
agent
13
openai
13
chatbot
12
llama2
12
llama3
11
python
10
mistral
8
llmops
8
machine-learning
7
ollama
7
fine-tuning
7
deep-learning
7
llava
6
transformers
6
chatglm
6
pytorch
6
langchain
6
generative-ai
6
nlp
6
framework
5
llamacpp
5
artificial-intelligence
5
llm-agent
5
lora
5
qlora
5
qwen
5
finetuning
5
gemma
5
data-science
4
inference
4
peft
4
transformer
4
chat
4
macos
4
javascript
4
language-model
4
open-source
3
llm-framework
3
huggingface
3
agent-based-framework
3
typescript
3