alan_awesome_llm

LLM resources on model, system, application
https://github.com/tangzhenyu/alan_awesome_llm

RAG
- GraphRAG-Ollama-UI
- AnythingLLM - All-in-one AI app for any LLM with full RAG and AI Agent capabilities.
- MaxKB
- RAGFlow - Open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
- Dify - Open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
- FastGPT - Knowledge-based platform built on LLMs that offers out-of-the-box data processing and model invocation capabilities and allows workflow orchestration through Flow visualization.
- Langchain-Chatchat
- QAnything
- Quivr - Retrieval-augmented generation.
- Verba
- FlashRAG
- GraphRAG - Graph-based Retrieval-Augmented Generation (RAG) system.
- nano-GraphRAG - Easy-to-hack GraphRAG implementation.
- RAG Techniques - Techniques for Retrieval-Augmented Generation (RAG) systems, which combine information retrieval with generative models to provide accurate and contextually rich responses.
- ragas
- LightRAG - Retriever-Agent-Generator pipelines.
- RAG-GPT - Leverages LLM and RAG technology to learn from user-customized knowledge bases and provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information retrieval.
Data
- OmniParser
- MinerU - One-stop, open-source, high-quality data extraction tool; supports PDF, webpage, and e-book extraction.
- PDF-Extract-Kit - High-quality PDF content extraction.
- Parsera - Scrape websites with LLMs.
- Autolabel
- LabelLLM - Open-source data annotation platform.
- data-juicer - One-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs!
Fine-Tuning
- LLaMA-Factory - Efficient fine-tuning of 100+ LLMs.
- unsloth - LLM fine-tuning that is 5x faster and uses 80% less memory.
- TRL
- Firefly
- Xtuner - Full-featured toolkit for fine-tuning large models.
- torchtune - PyTorch library for LLM fine-tuning.
- AutoTrain - Train and deploy state-of-the-art machine learning models.
- Ludwig - Low-code framework for building custom LLMs, neural networks, and other AI models.
- mistral-finetune - Lightweight codebase that enables memory-efficient and performant fine-tuning of Mistral's models.
- aikit - Fine-tune, build, and deploy open-source LLMs easily!
- H2O-LLMStudio - A framework and no-code GUI for fine-tuning LLMs.
- LitGPT - Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
- LLMBox
- PaddleNLP - Easy-to-use and powerful NLP and LLM library.
- workbench-llamafactory - End-to-end model development workflow using LLaMA-Factory.
- OpenRLHF - Easy-to-use, scalable, and high-performance RLHF framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral).
- TinyLLaVA Factory - Framework for small-scale Large Multimodal Models.
- LLM-Foundry
- lmms-finetune - Fine-tuning for large multimodal models; supports llava-1.5, qwen-vl, llava-interleave, llava-next-video, phi3-v, etc.
- Simplifine
- Transformer Lab - Fine-tune and evaluate large language models on your own computer.
Inference
- ollama
- Open WebUI - User-friendly WebUI for LLMs (formerly Ollama WebUI).
- Text Generation WebUI
- Xinference
- LangChain - Build context-aware reasoning applications.
- LlamaIndex
- lobe-chat - Open-source, modern-design LLMs/AI chat framework. Supports multiple AI providers, multimodal features (Vision/TTS), and a plugin system.
- TensorRT-LLM - Provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
- vllm - High-throughput and memory-efficient inference and serving engine for LLMs.
- LlamaChat
- NVIDIA ChatRTX
- LM Studio
- chat-with-mlx
- LLM Pricing
- Open Interpreter
- koboldcpp - One-file way to run various GGML and GGUF models with KoboldAI's UI.
- LLMFarm
- Flowise
- LMDeploy
- RouteLLM - Save LLM costs without compromising quality!
- MInference
- Mem0
- SGLang
- MemGPT - LLM agents with long-term memory and custom tools.
- Chat-ollama
- chat-ui
- Jan
- AirLLM
Evaluation
- lm-evaluation-harness - Few-shot evaluation of language models.
- opencompass - LLM evaluation platform supporting a wide range of models (GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
- llm-comparator - Evaluate and analyze LLM responses side-by-side.
Usage
Book
Course
Tutorial
Paper
Software
Agents
- CrewAI - Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
- Coze
- AgentGPT
- XAgent
- AutoGen
- MobileAgent
- Lagent - LLM-based agents.
- Qwen-Agent
- LinkAI
- Baidu APPBuilder
- agentUniverse - Multi-agent framework that allows developers to easily build multi-agent applications. Through the community, they can also exchange and share pattern practices across different domains.
- LazyLLM
- AgentScope - Build LLM-empowered multi-agent applications in an easier way.
- MoA - Mixture-of-Agents approach that achieves state-of-the-art results.
- Agently
- OmAgent
- Tribe - Multi-agent teams.
- CAMEL - Multi-agent framework.
- IoA - Open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
- Agent Zero
- Agents - Open-source framework for data-centric, self-evolving autonomous language agents.
- PraisonAI - Low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collaboration.
Search
- MindSearch - LLM-based multi-agent framework for a web search engine (like Perplexity.ai Pro and SearchGPT).
- nanoPerplexityAI - Open-source implementation of perplexity.ai.
- OpenSearch GPT
Tips

Programming Languages

Python 118 Jupyter Notebook 29 TypeScript 28 JavaScript 10 Go 6 Swift 4 C++ 4 MDX 2 Vue 1
