Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-ai-papers

This repository is used to collect papers and code in the field of AI.
https://github.com/songqiang321/awesome-ai-papers

Last synced: 4 days ago
JSON representation

  • NLP

    • 3. Pretraining

      • [paper
      • [paper - math/MetaMath)\]\[[MathCoder](https://github.com/mathllm/MathCoder)\]
      • [paper - PaLM)\]
      • [paper - Alignment](https://github.com/PKU-Alignment)\]\[[webpage](https://alignmentsurvey.com/)\]
      • [evaluation-guidebook - LLM-Eval](https://github.com/onejune2018/Awesome-LLM-Eval)\]\[[LLM-eval-survey](https://github.com/MLGroupJLU/LLM-eval-survey)\]\[[llm_benchmarks](https://github.com/leobeeson/llm_benchmarks)\]\[[Awesome-LLMs-Evaluation-Papers](https://github.com/tjunlp-lab/Awesome-LLMs-Evaluation-Papers)\]
      • [paper - bench)\]\[[swarm](https://github.com/openai/swarm)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [awesome-llm-interpretability - LLM-Interpretability](https://github.com/cooperleong00/Awesome-LLM-Interpretability)\]
      • [paper - baichuan-mllm/bc-omni)\]
      • [paper - deepmind/synthid-text)\]
      • [paper - hwh/AutoCrawler)\]\[[gpt-crawler](https://github.com/BuilderIO/gpt-crawler)\]\[[webllama](https://github.com/McGill-NLP/webllama)\]\[[gpt-researcher](https://github.com/assafelovic/gpt-researcher)\]\[[skyvern](https://github.com/Skyvern-AI/skyvern)\]\[[Scrapegraph-ai](https://github.com/VinciGit00/Scrapegraph-ai)\]\[[crawl4ai](https://github.com/unclecode/crawl4ai)\]\[[crawlee-python](https://github.com/apify/crawlee-python)\]\[[Agent-E](https://github.com/EmergenceAI/Agent-E)\]\[[CyberScraper-2077](https://github.com/itsOwen/CyberScraper-2077)\]\[[browser-use](https://github.com/gregpr07/browser-use)\]
      • [chatgpt-on-wechat - As-Chatbot](https://github.com/deep-diver/LLM-As-Chatbot)\]\[[HuixiangDou](https://github.com/InternLM/HuixiangDou)\]\[[Streamer-Sales](https://github.com/PeterH0323/Streamer-Sales)\]\[[Tianji](https://github.com/SocialAI-tianji/Tianji)\]\[[metahuman-stream](https://github.com/lipku/metahuman-stream)\]\[[aiavatarkit](https://github.com/uezo/aiavatarkit)\]\[[ai-getting-started](https://github.com/a16z-infra/ai-getting-started)\]\[[chatnio](https://github.com/zmh-program/chatnio)\]\[[VideoChat](https://github.com/Henry-23/VideoChat)\]
      • [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
      • [paper - deepmind/graphcast)\]
      • [paper - fib-lab/ACL24-EconAgent)\]\[[Large Language Models Empowered Agent-based Modeling and Simulation: A Survey and Perspectives](https://arxiv.org/abs/2312.11970)\]
      • [paper
      • [paper - project/selfcodealign)\]
      • [OpenDevin - code-rover](https://github.com/nus-apr/auto-code-rover)\]\[[developer](https://github.com/smol-ai/developer)\]\[[aider](https://github.com/paul-gauthier/aider)\]\[[claude-engineer](https://github.com/Doriandarko/claude-engineer)\]\[[SuperCoder](https://github.com/TransformerOptimus/SuperCoder)\]\[[AIDE](https://github.com/WecoAI/aideml)\]\[[vulnhuntr](https://github.com/protectai/vulnhuntr)\]
      • [paper - table-survey](https://github.com/godaai/llm-table-survey)\]\[[table-transformer](https://github.com/microsoft/table-transformer)\]\[[Awesome-Tabular-LLMs](https://github.com/SpursGoZmy/Awesome-Tabular-LLMs)\]\[[Awesome-LLM-Tabular](https://github.com/johnnyhwu/Awesome-LLM-Tabular)\]\[[Table-LLaVA](https://github.com/SpursGoZmy/Table-LLaVA)\]\[[tablegpt-agent](https://github.com/tablegpt/tablegpt-agent)\]
      • [paper - CoT)\]\[[alphageometry](https://github.com/google-deepmind/alphageometry)\]\[[MathCritique](https://github.com/WooooDyy/MathCritique)\]
      • [paper - PaLM)\]
      • [paper
      • [paper - LM)\]\[[GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism](https://arxiv.org/abs/1811.06965)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[megatron sequence parallelism](https://arxiv.org/abs/2205.05198)\]
      • [paper - to-strong)\]\[[weak-to-strong-deception](https://github.com/keven980716/weak-to-strong-deception)\]\[[Evolving Alignment via Asymmetric Self-Play](https://arxiv.org/abs/2411.00062)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper - Hunyuan-Large)\]
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [paper - coai/Safety-Prompts)\]\[[PurpleLlama](https://github.com/meta-llama/PurpleLlama)\]
      • [paper - instruct)\]
      • [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
      • [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
      • [paper - deepmind/graphcast)\]
      • [paper - PaLM)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper - ai/xgrammar)\]\[[mlc-llm](https://github.com/mlc-ai/mlc-llm)\]
      • [paper - foundation/bitsandbytes)\]\[[unsloth](https://github.com/unslothai/unsloth)\]\[[ir-qlora](https://github.com/htqin/ir-qlora)\]\[[fsdp_qlora](https://github.com/AnswerDotAI/fsdp_qlora)\]
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
      • [MOSS - RLHF](https://github.com/OpenLMLab/MOSS-RLHF)\]
      • [blog
      • [paper - Corpus-Indexer-NCI)\]\[[DSI-transformers](https://github.com/ArvinZhuang/DSI-transformers)\]\[[GDR EACL 2024 Oral](https://arxiv.org/abs/2401.10487)\]
      • [paper - PaLM)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [paper
      • [paper
      • [paper - lab/stanford_alpaca)\]
      • [paper
      • [paper - ml/RoboticsDiffusionTransformer)\]
      • [swarm - AI/AgentStack)\]\[[multi-agent-orchestrator](https://github.com/awslabs/multi-agent-orchestrator)\]
      • [paper - PaLM)\]
      • [paper - prompting)\]\[[docs](http://platform.openai.com/docs/guides/prompt-generation?context=structured-output-schema)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [paper - Transformers](https://github.com/Beomi/BitNet-Transformers)\]\[[BitNet b1.58](https://arxiv.org/abs/2402.17764)\]\[[T-MAC](https://github.com/microsoft/T-MAC)\]\[[BitBLAS](https://github.com/microsoft/BitBLAS)\]\[[BiLLM](https://github.com/Aaronhuang-778/BiLLM)\]
      • [llama-moe - pytorch](https://github.com/lucidrains/PEER-pytorch)\]\[[GRIN-MoE](https://github.com/microsoft/GRIN-MoE)\]\[[MoE-plus-plus](https://github.com/SkyworkAI/MoE-plus-plus)\]\[[MoH](https://github.com/SkyworkAI/MoH)\]
      • [paper - GaLore](https://github.com/VITA-Group/Q-GaLore)\]\[[WeLore](https://github.com/VITA-Group/WeLore)\]\[[Fira](https://github.com/xichen-fy/Fira)\]
      • [paper
      • [RAG-Retrieval - Shanghai/PGRAG)\]\[[CRUD_RAG](https://github.com/IAAR-Shanghai/CRUD_RAG)\]\[[PlanRAG](https://github.com/myeon9h/PlanRAG)\]\[[DPA-RAG](https://github.com/dongguanting/DPA-RAG)\]\[[FollowRAG](https://github.com/dongguanting/FollowRAG)\]\[[LongRAG](https://github.com/TIGER-AI-Lab/LongRAG)\]\[[Controllable-RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[structured-rag](https://github.com/weaviate/structured-rag)\]\[[RAGLab](https://github.com/fate-ubw/RAGLab)\]\[[autogluon-rag](https://github.com/autogluon/autogluon-rag)\]\[[VARAG](https://github.com/adithya-s-k/VARAG)\]\[[PAI-RAG](https://github.com/aigc-apps/PAI-RAG)\]\[[Meta-Chunking](https://github.com/IAAR-Shanghai/Meta-Chunking)\]\[[chonkie](https://github.com/bhavnicksm/chonkie)\]\[[RagVL](https://github.com/IDEA-FinAI/RagVL)\]\[[KAG](https://github.com/OpenSPG/KAG)\]\[[AutoRAG](https://github.com/Marker-Inc-Korea/AutoRAG)\]\[[HtmlRAG](https://github.com/plageon/HtmlRAG)\]\[[OmniSearch](https://github.com/Alibaba-NLP/OmniSearch)\]
      • [paper - deepmind/synthid-text)\]
      • [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
      • [paper - deepmind/graphcast)\]
      • [paper - llm/OpenCoder-llm)\]\[[dataset](https://huggingface.co/collections/OpenCoder-LLM/opencoder-datasets-672e6db6a0fed24bd69ef1c2)\]
      • [paper
      • [paper - cs-nlp/LLMsKnow)\]
      • [paper
      • [PromptPapers - engineering)\]\[[ChatGPT Prompt Engineering for Developers](https://prompt-engineering.xiniushu.com/)\]\[[Prompt Engineering Guide](https://www.promptingguide.ai/zh)\]\[[k12promptguide](https://www.k12promptguide.com/)\]\[[gpt-prompt-engineer](https://github.com/mshumer/gpt-prompt-engineer)\]\[[awesome-chatgpt-prompts](https://github.com/f/awesome-chatgpt-prompts)\]\[[awesome-chatgpt-prompts-zh](https://github.com/PlexPt/awesome-chatgpt-prompts-zh)\]\[[Prompt_Engineering](https://github.com/NirDiamant/Prompt_Engineering)\]
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [paper - nlp/deita)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper - PaLM)\]
      • [paper - Embed/blob/main/modeling_nvmmembed.py)\]\[[magiclens](https://github.com/google-deepmind/magiclens)\]\[[E5-V](https://github.com/kongds/E5-V)\]\[[Visualized BGE](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/visual_bge)\]
      • [paper - Collection)\]
      • [paper - deepmind/synthid-text)\]
      • [paper - S](https://github.com/simular-ai/Agent-S)\]\[[The Dawn of GUI Agent](https://arxiv.org/abs/2411.10323)\]\[[ShowUI](https://github.com/showlab/ShowUI)\]\[[TinyClick](https://github.com/SamsungLabs/TinyClick)\]\[[Large Language Model-Brained GUI Agents: A Survey](https://arxiv.org/abs/2411.18279)\]
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [paper - PaLM)\]
      • [paper - Sequence Recommendation Models Need Decoupled Embeddings](https://arxiv.org/abs/2410.02604)\]
      • [paper
      • [paper
      • [paper - embedding-torch](https://github.com/lucidrains/rotary-embedding-torch)\]
      • [paper - HQ](https://arxiv.org/abs/2410.18505)\]\[[LabelLLM](https://github.com/opendatalab/LabelLLM)\]\[[labelU](https://github.com/opendatalab/labelU)\]\[[MinerU](https://github.com/opendatalab/MinerU)\]\[[PDF-Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper - han-lab/duo-attention)\]\[[Star-Attention](https://github.com/NVIDIA/Star-Attention)\]
      • [PEFT - Factory](https://github.com/hiyouga/LLaMA-Factory)\]\[[LMFlow](https://github.com/OptimalScale/LMFlow)\]\[[unsloth](https://github.com/unslothai/unsloth)\]\[[xtuner](https://github.com/InternLM/xtuner)\]\[[MFTCoder](https://github.com/codefuse-ai/MFTCoder)\]\[[llm-foundry](https://github.com/mosaicml/llm-foundry)\]\[[ms-swift](https://github.com/modelscope/ms-swift)\]\[[Liger-Kernel](https://github.com/linkedin/Liger-Kernel)\]\[[autotrain-advanced](https://github.com/huggingface/autotrain-advanced)\]
      • [paper
      • [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
      • [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
      • [paper - deepmind/graphcast)\]
      • [gpt-investor - quant](https://github.com/goldmansachs/gs-quant)\]\[[stockbot-on-groq](https://github.com/bklieger-groq/stockbot-on-groq)\]\[[Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN](https://github.com/THINK989/Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN)\]\[[openbb-agents](https://github.com/OpenBB-finance/openbb-agents)\]\[[ai-hedge-fund](https://github.com/virattt/ai-hedge-fund)\]
      • [paper - PaLM)\]
      • [paper
      • [paper - Web/AIPress-code)\]
      • [paper
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper
      • [blog - llama/llama3)\]\[[llama-models](https://github.com/meta-llama/llama-models)\]\[[llama-recipes](https://github.com/meta-llama/llama-recipes)\]\[[LLM Adaptation](https://ai.meta.com/blog/adapting-large-language-models-llms/)\]\[[llama3-from-scratch](https://github.com/naklecha/llama3-from-scratch)\]\[[nano-llama31](https://github.com/karpathy/nano-llama31)\]\[[minimind](https://github.com/jingyaogong/minimind)\]\[[felafax](https://github.com/felafax/felafax)\]
      • [blog - llama/llama-32-66f448ffc8c32f949b04c8cf)\]\[[llama-stack](https://github.com/meta-llama/llama-stack)\]\[[llama-stack-apps](https://github.com/meta-llama/llama-stack-apps)\]\[[lingua](https://github.com/facebookresearch/lingua)\]\[[llama-assistant](https://github.com/vietanhdev/llama-assistant)\]\[[minimind-v](https://github.com/jingyaogong/minimind-v)\]
      • [ray - ai/gpt4all)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[dify](https://github.com/langgenius/dify)\]\[[mindsdb](https://github.com/mindsdb/mindsdb)\]\[[bisheng](https://github.com/dataelement/bisheng)\]\[[phidata](https://github.com/phidatahq/phidata)\]\[[guidance](https://github.com/guidance-ai/guidance)\]\[[outlines](https://github.com/outlines-dev/outlines)\]\[[jsonformer](https://github.com/1rgs/jsonformer)\]\[[fabric](https://github.com/danielmiessler/fabric)\]\[[mem0](https://github.com/mem0ai/mem0)\]\[[taipy](https://github.com/Avaiga/taipy)\]
      • [crewAI - llama/llama_deploy)\]\[[gpt-computer-assistant](https://github.com/onuratakan/gpt-computer-assistant)\]\[[agentic_patterns](https://github.com/neural-maze/agentic_patterns)\]
      • [paper - AI4Code/HyperAgent)\]\[[Seeker](https://github.com/XMZhangAI/Seeker)\]\[[AutoKaggle](https://github.com/multimodal-art-projection/AutoKaggle)\]\[[Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level](https://arxiv.org/abs/2411.03562)\]
      • [paper - wu/PMC-LLaMA)\]\[[MMedLM](https://github.com/MAGIC-AI4Med/MMedLM)\]
      • [paper - PaLM)\]
      • [Awesome-LegalAI-Resources - compass/LawBench)\]
      • [paper
      • [paper - Aligner)\]\[[Nemotron-4 340B Technical Report](https://d1qx31qr3h6wln.cloudfront.net/publications/Nemotron_4_340B_8T.pdf)\]\[[Mistral NeMo](https://mistral.ai/news/mistral-nemo/)\]\[[SparseLLM](https://github.com/BaiTheBest/SparseLLM)\]\[[MaskLLM](https://github.com/NVlabs/MaskLLM)\]\[[HelpSteer2-Preference](https://arxiv.org/abs/2410.01257)\]
      • [paper - Infinite](https://arxiv.org/abs/2308.16137)\]
      • [paper - NLP/ProX)\]
      • [paper
      • [paper
      • [paper - cross-capabilities)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper - Honesty-Survey)\]
      • [paper - AILab/flash-attention)\]\[[xformers](https://github.com/facebookresearch/xformers)\]\[[SageAttention](https://github.com/thu-ml/SageAttention)\]
      • [text-generation-inference - quanto](https://github.com/huggingface/optimum-quanto)\]\[[huggingface-inference-toolkit](https://github.com/huggingface/huggingface-inference-toolkit)\]\[[torchao](https://github.com/pytorch/ao)\]
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [m3e-base - embedding-v2](https://huggingface.co/lier007/xiaobu-embedding-v2)\]\[[stella_en_1.5B_v5](https://huggingface.co/dunzhang/stella_en_1.5B_v5)\]\[[Conan-embedding-v1](https://huggingface.co/TencentBAC/Conan-embedding-v1)\]
      • [paper - NLP/VinePPO)\]
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
      • [paper - gpt4s-mistakes-with-gpt-4/)\]\[[Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning](https://arxiv.org/abs/2410.08146)\]\[[Free Process Rewards without Process Labels](https://arxiv.org/abs/2412.01981)\]
      • [paper - 6B](https://github.com/THUDM/ChatGLM-6B)\]\[[ChatGLM2-6B](https://github.com/THUDM/ChatGLM2-6B)\]\[[ChatGLM3](https://github.com/THUDM/ChatGLM3)\]\[[GLM-4](https://github.com/THUDM/GLM-4)\]\[[modeling_chatglm.py](https://huggingface.co/THUDM/glm-4-9b-chat/blob/main/modeling_chatglm.py)\]\[[AgentTuning](https://github.com/THUDM/AgentTuning)\]\[[AlignBench](https://github.com/THUDM/AlignBench)\]\[[GLM-Edge](https://github.com/THUDM/GLM-Edge)\]
      • [paper
      • [paper - PaLM)\]
      • [alphafold
      • [paper - science/RefChecker)\]\[[HaluAgent](https://github.com/RUCAIBox/HaluAgent)\]\[[LLMsKnow](https://github.com/technion-cs-nlp/LLMsKnow)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [paper - GSAI/YuLan-Chat)\]\[[Yulan-GARDEN](https://github.com/RUC-GSAI/Yulan-GARDEN)\]
      • [paper
      • [paper - transformer-lm)\]
      • [paper - 2)\]\[[llm.c](https://github.com/karpathy/llm.c)\]
      • [paper
      • [paper - llm/automix)\]
      • [paper - research/circuit_training)\]
      • [paper - benchmark)\]
      • [paper
      • [paper
      • [paper - ai/OSWorld)\]
      • [paper - Copilot/OS-Copilot)\]\[[OS-Atlas](https://github.com/OS-Copilot/OS-Atlas)\]\[[WindowsAgentArena](https://github.com/microsoft/WindowsAgentArena)\]
      • [paper
      • [paper
      • [paper
      • [paper - NLP/OpenResearcher)\]\[[Paper Copilot](https://arxiv.org/abs/2409.04593)\]\[[SciAgentsDiscovery](https://github.com/lamm-mit/SciAgentsDiscovery)\]\[[paper-qa](https://github.com/Future-House/paper-qa)\]\[[GraphReasoning](https://github.com/lamm-mit/GraphReasoning)\]
      • [paper - Scientist)\]\[[Social_Science](https://github.com/RenqiChen/Social_Science)\]\[[game_theory](https://github.com/Wenyueh/game_theory)\]
      • [paper - Researcher)\]
      • [Awesome-Scientific-Language-Models - husky/gpt_academic)\]\[[ChatPaper](https://github.com/kaixindelele/ChatPaper)\]\[[scispacy](https://github.com/allenai/scispacy)\]\[[awesome-ai4s](https://github.com/hyperai/awesome-ai4s)\]\[[xVal](https://github.com/PolymathicAI/xVal)\]
      • [link
      • [paper - Code-LLM](https://github.com/codefuse-ai/Awesome-Code-LLM)\]\[[MFTCoder](https://github.com/codefuse-ai/MFTCoder)\]\[[Awesome-Code-LLM](https://github.com/huybery/Awesome-Code-LLM)\]\[[CodeFuse-muAgent](https://github.com/codefuse-ai/CodeFuse-muAgent)\]
      • [paper
      • [paper
      • [paper
      • [paper - research/Eureka)\]\[[DrEureka](https://github.com/eureka-research/DrEureka)\]
      • [paper
      • [paper - ToolMaker)\]
      • [paper - chen/ToolQA)\]\[[toolbench](https://github.com/sambanova/toolbench)\]
      • [paper
      • [paper - llm)\]
      • [paper - Ye/ToolEyes)\]
      • [paper
      • [paper - trial-and-error)\]
      • [paper
      • [paper - Bank](https://arxiv.org/abs/2304.08244)\]
      • [paper
      • [paper - Wang/ToolGen)\]
      • [functionary - tool-llm](https://github.com/zorazrw/awesome-tool-llm)\]
      • [blog
      • [blog - architecture-blogpost-encoders-prefixlm-denoising)\]\[[New LLM Pre-training and Post-training Paradigms](https://magazine.sebastianraschka.com/p/new-llm-pre-training-and-post-training)\]
      • [Awesome-LLM-System-Papers - production-llm](https://github.com/jihoo-kim/awesome-production-llm)\]
      • [paper - pytorch-fully-sharded-data-parallel-api/)\]\[[pytorch-fsdp](https://github.com/huggingface/blog/blob/main/zh/pytorch-fsdp.md)\]
      • [paper
      • [paper - h100-clusters-power-network)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]\[[ByteCheckpoint](https://arxiv.org/abs/2407.20143)\]\[[HybridFlow](https://arxiv.org/abs/2409.19256)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - 101B)\]\[[Tele-FLM](https://huggingface.co/CofeAI/Tele-FLM)\]
      • [paper - Tuning-with-GPT-4/GPT-4-LLM)\]
      • [paper - group/textgrad)\]\[[appl](https://github.com/appl-team/appl)\]\[[okhat/blog](https://github.com/okhat/blog/blob/main/2024.09.impact.md)\]
      • [paper - language-RL](https://github.com/waterhorse1/Natural-language-RL)\]
      • [paper - ye/OpenFedLLM)\]
      • [paper - ai/MergeKit)\]\[[DistillKit](https://github.com/arcee-ai/DistillKit)\]\[[A Survey on Collaborative Strategies in the Era of Large Language Models](https://arxiv.org/abs/2407.06089)\]\[[FuseAI](https://github.com/fanqiwan/FuseAI)\]
      • [paper - ConvAI/tree/main/Awesome-Self-Evolution-of-LLM)\]
      • [paper - mini)\]
      • [paper - sys/routellm)\]\[[RouterDC](https://github.com/shuhao02/RouterDC)\]
      • [paper
      • [paper - ai/OpenDiLoCo)\]\[[Prime](https://github.com/PrimeIntellect-ai/Prime)\]\[[DiLoCo](https://arxiv.org/abs/2311.08105)\]\[[DisTrO](https://github.com/NousResearch/DisTrO)\]
      • [paper - piexl/JailbreakZoo)\]\[[jailbreak_llms](https://github.com/verazuo/jailbreak_llms)\]\[[llm-attacks](https://github.com/llm-attacks/llm-attacks)\]
      • [paper
      • [paper - platform)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]
      • [wandb
      • [paper
      • [paper
      • [paper
      • [paper - LLM-preference-learning)\]
      • [alignment-handbook
      • [tokenizer_summary - Tokenizer](https://github.com/NVIDIA/Cosmos-Tokenizer)\]
      • [paper - zh](https://llmbook-zh.github.io/)\]\[[LLMsPracticalGuide](https://github.com/Mooler0410/LLMsPracticalGuide)\]
      • [paper - MLSys-Lab/Efficient-LLMs-Survey)\]
      • [paper
      • [paper
      • [paper - survey](https://github.com/ulab-uiuc/AGI-survey)\]
      • [paper
      • [paper - ai/OSWorld)\]\[[AgentGym](https://github.com/WooooDyy/AgentGym)\]\[[Agent-S](https://github.com/simular-ai/Agent-S)\]\[[Agent-as-a-Judge](https://arxiv.org/abs/2410.10934)\]
      • [paper
      • [paper - rlhf)\]
      • [paper - cn/agents)\]
      • [paper
      • [paper
      • [paper - AGI/AutoAgents)\]
      • [paper
      • [paper
      • [paper - agent/digirl)\]\[[Android-Lab](https://github.com/THUDM/Android-Lab)\]
      • [paper - PLUG/MobileAgent)\]\[[Mobile-Agent-v2](https://arxiv.org/abs/2406.01014)\]\[[LiMAC](https://arxiv.org/abs/2410.17883)\]
      • [paper
      • [paper - Agent](https://github.com/microsoft/RD-Agent)\]\[[TinyTroupe](https://github.com/microsoft/TinyTroupe)\]
      • [paper - ai/camel)\]\[[crab](https://github.com/camel-ai/crab)\]\[[oasis](https://github.com/camel-ai/oasis)\]
      • [paper - pilot](https://github.com/Pythagora-io/gpt-pilot)\]\[[Scaling Large-Language-Model-based Multi-Agent Collaboration](https://arxiv.org/abs/2406.07155)\]\[[ProactiveAgent](https://github.com/thunlp/ProactiveAgent)\]
      • [paper
      • [paper
      • [paper
      • [paper - transformer-lm)\]
      • [paper - 2)\]\[[llm.c](https://github.com/karpathy/llm.c)\]
      • [paper - 3)\]\[[nanoGPT](https://github.com/karpathy/nanoGPT)\]\[[build-nanogpt](https://github.com/karpathy/build-nanogpt)\]\[[gpt-fast](https://github.com/pytorch-labs/gpt-fast)\]\[[modded-nanogpt](https://github.com/KellerJordan/modded-nanogpt)\]\[[nanotron](https://github.com/huggingface/nanotron)\]
      • [paper - RLHF](https://github.com/OpenLMLab/MOSS-RLHF)\]
      • [paper - research/bert)\]\[[BERT-pytorch](https://github.com/codertimo/BERT-pytorch)\]\[[bert4torch](https://github.com/Tongjilibo/bert4torch)\]\[[bert4keras](https://github.com/bojone/bert4keras)\]
      • [paper - BERT-wwm](https://github.com/ymcui/Chinese-BERT-wwm)\]
      • [paper - analysis)\]
      • [paper
      • [paper
      • [paper - mll/jiant)\]
      • [paper - Classification)\]
      • [paper - 3](https://huggingface.co/collections/microsoft/phi-3-6626e15e9585a200d2d761e3)\]\[[SmolLM](https://huggingface.co/blog/smollm)\]\[[Computational Bottlenecks of Training Small-scale Large Language Models](https://arxiv.org/abs/2410.19456)\]\[[SLMs-Survey](https://github.com/FairyFali/SLMs-Survey)\]
      • [LLM101n - course](https://github.com/mlabonne/llm-course)\]\[[intro-llm](https://intro-llm.github.io/)\]\[[llm-cookbook](https://github.com/datawhalechina/llm-cookbook)\]\[[hugging-llm](https://github.com/datawhalechina/hugging-llm)\]\[[generative-ai-for-beginners](https://github.com/microsoft/generative-ai-for-beginners)\]\[[awesome-generative-ai-guide](https://github.com/aishwaryanr/awesome-generative-ai-guide)\]\[[LLMs-from-scratch](https://github.com/rasbt/LLMs-from-scratch)\]\[[llm-action](https://github.com/liguodongiot/llm-action)\]\[[llms_idx](https://dongnian.icu/llms/llms_idx/)\]\[[tiny-universe](https://github.com/datawhalechina/tiny-universe)\]\[[AISystem](https://github.com/chenzomi12/AISystem)\]
      • [paper - deepmind/gemma](https://github.com/google-deepmind/gemma)\]\[[gemma.cpp](https://github.com/google/gemma.cpp)\]\[[model](https://ai.google.dev/gemma)\]\[[paligemma](https://github.com/google-research/big_vision/tree/main/big_vision/configs/proj/paligemma)\]\[[gemma-cookbook](https://github.com/google-gemini/gemma-cookbook)\]
      • [paper - watermarking)\]\[[MarkLLM](https://github.com/THU-BPM/MarkLLM)\]\[[Awesome-LLM-Watermark](https://github.com/hzy312/Awesome-LLM-Watermark)\]
      • [paper
      • [paper - agent/digirl)\]
      • [paper
      • [paper - Role-Play-Papers)\]\[[RPBench-Auto](https://boson.ai/rpbench-blog/)\]\[[Hermes 3 Technical Report](https://arxiv.org/abs/2408.11857)\]
      • [paper - instructions)\]
      • [paper - crfm/helm)\]
      • [paper
      • [paper - Foundation/FinGPT)\]
      • [Awesome-LLM-Eval - eval-survey](https://github.com/MLGroupJLU/LLM-eval-survey)\]\[[llm_benchmarks](https://github.com/leobeeson/llm_benchmarks)\]\[[Awesome-LLMs-Evaluation-Papers](https://github.com/tjunlp-lab/Awesome-LLMs-Evaluation-Papers)\]
      • [paper
      • [paper - sys/FastChat/tree/main/fastchat/llm_judge)\]
      • [paper - RAG](https://github.com/CLUEbenchmark/SuperCLUE-RAG)\]
      • [paper - nlp/ceval)\]\[[chinese-llm-benchmark](https://github.com/jeinlee1991/chinese-llm-benchmark)\]
      • [paper - li/CMMLU)\]
      • [paper - Benchmark/CMMMU)\]
      • [paper
      • [paper - eval/prometheus-eval)\]\[[prometheus](https://github.com/prometheus-eval/prometheus)\]\[[prometheus-vision](https://github.com/prometheus-eval/prometheus-vision)\]
      • [paper - Lab/lmms-eval)\]
      • [paper - Benchmark/MMMU)\]
      • [Open LLM Leaderboard
      • [AlpacaEval Leaderboard - lab/alpaca_eval)\]
      • [Chatbot-Arena-Leaderboard - 05-03-arena/)\]\[[FastChat](https://github.com/lm-sys/FastChat)\]\[[arena-hard](https://github.com/lm-sys/arena-hard)\]
      • [lm-evaluation-harness - evals](https://github.com/openai/simple-evals)\]
      • [OpenCompass - Eval](https://github.com/open-compass/GAOKAO-Eval)\]\[[VLMEvalKit](https://github.com/open-compass/VLMEvalKit)\]
      • [llm-colosseum
      • [blog
      • [paper - hallucination-survey)\]
      • [paper - LLM-hallucination)\]\[[Awesome-MLLM-Hallucination](https://github.com/showlab/Awesome-MLLM-Hallucination)\]
      • [paper - 2.0)\]
      • [paper - NLP/factool)\]\[[OlympicArena](https://github.com/GAIR-NLP/OlympicArena)\]\[[FActScore](https://arxiv.org/abs/2305.14251)\]
      • [paper - ai/aiconfig/tree/main/cookbooks/Chain-of-Verification)\]
      • [paper - ai/DB-GPT)\]\[[DocsGPT](https://github.com/arc53/DocsGPT)\]\[[privateGPT](https://github.com/imartinez/privateGPT)\]\[[localGPT](https://github.com/PromtEngineer/localGPT)\]
      • [paper
      • [paper - research/generative_agents)\]\[[genagents](https://github.com/joonspk-research/genagents)\]\[[GPTeam](https://github.com/101dotxyz/GPTeam)\]
      • [paper
      • [paper - ai/OpenAgents)\]
      • [paper - ai/DeepSeek-Coder)\]
      • [paper - ai/DeepSeek-Coder-V2)\]\[[DeepSeek-V2.5](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)\]
      • [paper - Coder)\]
      • [paper - LLaMA-Alpaca)\]\[[Chinese-LLaMA-Alpaca-2](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2)\]\[[Chinese-LLaMA-Alpaca-3](https://github.com/ymcui/Chinese-LLaMA-Alpaca-3)\]\[[baby-llama2-chinese](https://github.com/DLLXW/baby-llama2-chinese)\]
      • [paper - wpy/SeqXGPT)\]\[[llm-detect-ai](https://github.com/yanqiangmiffy/llm-detect-ai)\]\[[detect-gpt](https://github.com/eric-mitchell/detect-gpt)\]\[[fast-detect-gpt](https://github.com/baoguangsheng/fast-detect-gpt)\]
      • [paper
      • [paper - ai/DB-GPT)\]\[[DocsGPT](https://github.com/arc53/DocsGPT)\]\[[privateGPT](https://github.com/imartinez/privateGPT)\]\[[localGPT](https://github.com/PromtEngineer/localGPT)\]
      • [paper
      • [paper - interpreter](https://github.com/e2b-dev/code-interpreter)\]\[[open-interpreter](https://github.com/KillianLucas/open-interpreter)\]
      • [paper
      • [paper - PLUG/MobileAgent)\]\[[Mobile-Agent-v2](https://arxiv.org/abs/2406.01014)\]
      • [paper
      • [paper
      • [paper - ai/camel)\]\[[crab](https://github.com/camel-ai/crab)\]
      • [paper - pilot](https://github.com/Pythagora-io/gpt-pilot)\]
      • [paper
      • [paper
      • [paper
      • [LeRobot - rs/dora)\]\[[awesome-ai-agents](https://github.com/e2b-dev/awesome-ai-agents)\]\[[IsaacLab](https://github.com/isaac-sim/IsaacLab)\]\[[Awesome-Robotics-3D](https://github.com/zubair-irshad/Awesome-Robotics-3D)\]\[[AimRT](https://github.com/AimRT/AimRT)\]\[[agibot_x1_train](https://github.com/AgibotTech/agibot_x1_train)\]\[[unitree_IL_lerobot](https://github.com/unitreerobotics/unitree_IL_lerobot)\]
      • [AutoGPT - Engineer](https://github.com/gpt-engineer-org/gpt-engineer)\]\[[AgentGPT](https://github.com/reworkd/AgentGPT)\]
      • [BabyAGI
      • [blog
      • [translation-agent - zero](https://github.com/frdel/agent-zero)\]\[[AgentK](https://github.com/mikekelly/AgentK)\]\[[Twitter Personality](https://github.com/wordware-ai/twitter)\]\[[RD-Agent](https://github.com/microsoft/RD-Agent)\]\[[TinyTroupe](https://github.com/microsoft/TinyTroupe)\]
      • [paper
      • [paper
      • [paper - ai/geogalactica)\]\[[sciparser](https://github.com/davendw49/sciparser)\]
      • [paper - ZJU/Scientific-LLM-Survey)\]\[[sciknoweval](https://github.com/hicai-zju/sciknoweval)\]
      • [paper
      • [paper - 7B-Chat)\]
      • [paper - ai/AlphaCodium)\]\[[pr-agent](https://github.com/Codium-ai/pr-agent)\]\[[cover-agent](https://github.com/Codium-ai/cover-agent)\]
      • [paper - ai/DeepSeek-Coder)\]
      • [paper - ai/DeepSeek-Coder-V2)\]\[[DeepSeek-V2.5](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)\]
      • [paper - Coder)\]
      • [paper
      • [paper - 3)\]\[[nanoGPT](https://github.com/karpathy/nanoGPT)\]\[[build-nanogpt](https://github.com/karpathy/build-nanogpt)\]\[[gpt-fast](https://github.com/pytorch-labs/gpt-fast)\]\[[modded-nanogpt](https://github.com/KellerJordan/modded-nanogpt)\]
      • [paper - RLHF](https://github.com/OpenLMLab/MOSS-RLHF)\]
      • [paper - research/bert)\]\[[BERT-pytorch](https://github.com/codertimo/BERT-pytorch)\]\[[bert4torch](https://github.com/Tongjilibo/bert4torch)\]\[[bert4keras](https://github.com/bojone/bert4keras)\]
      • [paper - BERT-wwm](https://github.com/ymcui/Chinese-BERT-wwm)\]
      • [paper - analysis)\]
      • [paper - mll/jiant)\]
      • [paper - Classification)\]
      • [paper - 3](https://huggingface.co/collections/microsoft/phi-3-6626e15e9585a200d2d761e3)\]\[[SmolLM](https://huggingface.co/blog/smollm)\]
      • [LLM101n - course](https://github.com/mlabonne/llm-course)\]\[[intro-llm](https://intro-llm.github.io/)\]\[[llm-cookbook](https://github.com/datawhalechina/llm-cookbook)\]\[[hugging-llm](https://github.com/datawhalechina/hugging-llm)\]\[[generative-ai-for-beginners](https://github.com/microsoft/generative-ai-for-beginners)\]\[[awesome-generative-ai-guide](https://github.com/aishwaryanr/awesome-generative-ai-guide)\]\[[LLMs-from-scratch](https://github.com/rasbt/LLMs-from-scratch)\]\[[llm-action](https://github.com/liguodongiot/llm-action)\]\[[llms_idx](https://dongnian.icu/llms/llms_idx/)\]\[[tiny-universe](https://github.com/datawhalechina/tiny-universe)\]
      • [cs230-code-examples - template](https://github.com/victoresque/pytorch-template)\]\[[songquanpeng/pytorch-template](https://github.com/songquanpeng/pytorch-template)\]\[[Academic-project-page-template](https://github.com/eliahuhorwitz/Academic-project-page-template)\]\[[WritingAIPaper](https://github.com/hzwer/WritingAIPaper)\]
      • [tokenizer_summary
      • [paper - zh](https://llmbook-zh.github.io/)\]\[[LLMsPracticalGuide](https://github.com/Mooler0410/LLMsPracticalGuide)\]
      • [paper - MLSys-Lab/Efficient-LLMs-Survey)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - cdn.anthropic.com/fed9cc193a14b84131812372d8d5857f8f304c52/Model_Card_Claude_3_Addendum.pdf)\]
      • [paper - workshop)\]\[[model](https://huggingface.co/bigscience)\]
      • [paper
      • [paper
      • [paper
      • [paper - neox)\]
      • [paper - media/gemini/gemini_1_report.pdf)\]\[[Gemini 1.5](https://arxiv.org/abs/2403.05530)\]\[[Unofficial Implementation](https://github.com/kyegomez/Gemini)\]\[[MiniGemini](https://github.com/dvlab-research/MGM)\]
      • [paper - deepmind/gemma](https://github.com/google-deepmind/gemma)\]\[[gemma.cpp](https://github.com/google/gemma.cpp)\]\[[model](https://ai.google.dev/gemma)\]\[[paligemma](https://github.com/google-research/big_vision/tree/main/big_vision/configs/proj/paligemma)\]\[[gemma-cookbook](https://github.com/google-gemini/gemma-cookbook)\]
      • [paper - gemma-2/)\]\[[Advancing Responsible AI with Gemma](https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma/)\]\[[Gemma Scope](https://arxiv.org/abs/2408.05147)\]\[[ShieldGemma](https://arxiv.org/abs/2407.21772)\]\[[Gemma-2-9B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Gemma-2-9B-Chinese-Chat)\]
      • [paper
      • [paper - 4o](https://openai.com/index/hello-gpt-4o/)\]\[[GPT-4o System Card](https://arxiv.org/abs/2410.21276)\]
      • [paper
      • [paper - ai/guidance)\]
      • [paper - rlhf-pytorch](https://github.com/conceptofmind/LaMDA-rlhf-pytorch)\]
      • [paper - llama/llama/tree/llama_v1)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[llamafile](https://github.com/Mozilla-Ocho/llamafile)\]
      • [paper - llama/llama)\]\[[llama2.c](https://github.com/karpathy/llama2.c)\]\[[lit-llama](https://github.com/Lightning-AI/lit-llama)\]\[[litgpt](https://github.com/Lightning-AI/litgpt)\]
      • [paper - 460M-1T)\]\[[MobiLlama](https://github.com/mbzuai-oryx/MobiLlama)\]\[[Steel-LLM](https://github.com/zhanshijinwat/Steel-LLM)\]
      • [paper - inference)\]\[[model](https://huggingface.co/mistralai)\]\[[mistral-finetune](https://github.com/mistralai/mistral-finetune)\]
      • [paper
      • [cs230-code-examples - template](https://github.com/victoresque/pytorch-template)\]\[[songquanpeng/pytorch-template](https://github.com/songquanpeng/pytorch-template)\]\[[Academic-project-page-template](https://github.com/eliahuhorwitz/Academic-project-page-template)\]\[[WritingAIPaper](https://github.com/hzwer/WritingAIPaper)\]
      • [paper - llm/automix)\]
      • [paper
      • [paper - benchmark)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper
      • [blog
      • [paper
      • [paper
      • [paper
      • [paper - LLMs-on-device](https://github.com/NexaAI/Awesome-LLMs-on-device)\]
      • [blog
      • [paper
      • [paper
      • [paper - research/generative_agents)\]\[[GPTeam](https://github.com/101dotxyz/GPTeam)\]
      • [paper
      • [paper - ai/OpenAgents)\]
      • [CohereV3
      • [paper - ai/instructor-embedding)\]
      • [paper - mistral-7b-instruct)\]\[[llm2vec](https://github.com/McGill-NLP/llm2vec)\]
      • [paper - ai/contrastors)\]
      • [paper
      • [paper - NLP/llm2vec)\]
      • [paper - Embed-v1)\]
      • [paper
      • [JamAIBase
      • [paper - of-thought-hub](https://github.com/FranxYao/chain-of-thought-hub)\]
      • [paper
      • [paper - takeshi188/zero_shot_cot)\]
      • [paper - science/auto-cot)\]
      • [paper - science/mm-cot)\]
      • [paper
      • [paper
      • [paper - REACT)\]\[[AutoAct](https://github.com/zjunlp/AutoAct)\]
      • [paper - nlp/tree-of-thought-llm)\]\[[Plug in and Play Implementation](https://github.com/kyegomez/tree-of-thoughts)\]\[[tree-of-thought-prompting](https://github.com/dave1010/tree-of-thought-prompting)\]
      • [paper - of-thoughts)\]
      • [paper - ai/cumulative-reasoning)\]\[[On the Diagram of Thought](https://arxiv.org/abs/2409.10038)\]
      • [paper - Of-Thoughts)\]
      • [paper - of-Thoughts-XoT)\]
      • [paper
      • [paper
      • [paper - aim)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - models)\]\[[The Geometry of Concepts: Sparse Autoencoder Feature Structure](https://arxiv.org/abs/2410.19750)\]
      • [paper - rep)\]
      • [paper
      • [blog - interpretability)\]\[[transformer-debugger](https://github.com/openai/transformer-debugger)\]
      • [paper - gemma-2/)\]\[[Advancing Responsible AI with Gemma](https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma/)\]\[[Gemma Scope](https://arxiv.org/abs/2408.05147)\]\[[ShieldGemma](https://arxiv.org/abs/2407.21772)\]\[[Gemma-2-9B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Gemma-2-9B-Chinese-Chat)\]
      • [paper
      • [paper
      • [paper
      • [paper - ai/guidance)\]
      • [paper - rlhf-pytorch](https://github.com/conceptofmind/LaMDA-rlhf-pytorch)\]
      • [paper - llama/llama/tree/llama_v1)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[llamafile](https://github.com/Mozilla-Ocho/llamafile)\]
      • [paper - llama/llama)\]\[[llama2.c](https://github.com/karpathy/llama2.c)\]\[[lit-llama](https://github.com/Lightning-AI/lit-llama)\]\[[litgpt](https://github.com/Lightning-AI/litgpt)\]
      • [blog - llama/llama3)\]\[[llama-models](https://github.com/meta-llama/llama-models)\]\[[llama-recipes](https://github.com/meta-llama/llama-recipes)\]\[[llama-agentic-system](https://github.com/meta-llama/llama-agentic-system)\]\[[LLM Adaptation](https://ai.meta.com/blog/adapting-large-language-models-llms/)\]\[[llama3-from-scratch](https://github.com/naklecha/llama3-from-scratch)\]\[[nano-llama31](https://github.com/karpathy/nano-llama31)\]\[[minimind](https://github.com/jingyaogong/minimind)\]
      • [paper - 460M-1T)\]\[[MobiLlama](https://github.com/mbzuai-oryx/MobiLlama)\]
      • [paper - inference)\]\[[model](https://huggingface.co/mistralai)\]\[[mistral-finetune](https://github.com/mistralai/mistral-finetune)\]
      • [paper
      • [paper
      • [paper - pytorch](https://github.com/lucidrains/PaLM-pytorch)\]\[[PaLM-rlhf-pytorch](https://github.com/lucidrains/PaLM-rlhf-pytorch)\]\[[PaLM](https://github.com/conceptofmind/PaLM)\]
      • [paper
      • [paper - E)\]
      • [paper - research/text-to-text-transfer-transformer)\]\[[t5-pytorch](https://github.com/conceptofmind/t5-pytorch)\]\[[t5-pegasus-pytorch](https://github.com/renmada/t5-pegasus-pytorch)\]
      • [paper
      • [paper - research/t5x/blob/main/docs/models.md#flan-t5-checkpoints)\]
      • [paper - xl)\]
      • [paper
      • [paper - MARCO-Web-Search](https://github.com/microsoft/MS-MARCO-Web-Search)\]
      • [blog - org/grok-1)\]\[[model](https://huggingface.co/xai-org/grok-1)\]\[[modelscope](https://modelscope.cn/models/AI-ModelScope/grok-1/summary)\]\[[hpcai-tech/grok-1](https://huggingface.co/hpcai-tech/grok-1)\]\[[dbrx](https://github.com/databricks/dbrx)\]\[[Command R+](https://huggingface.co/CohereForAI/c4ai-command-r-plus)\]\[[snowflake-arctic](https://github.com/Snowflake-Labs/snowflake-arctic)\]
      • [paper - watermarking)\]\[[MarkLLM](https://github.com/THU-BPM/MarkLLM)\]
      • [paper
      • [paper
      • [paper - lab/stanford_alpaca)\]
      • [blog
      • [paper
      • [paper
      • [paper - hwh/AutoCrawler)\]\[[gpt-crawler](https://github.com/BuilderIO/gpt-crawler)\]\[[webllama](https://github.com/McGill-NLP/webllama)\]\[[gpt-researcher](https://github.com/assafelovic/gpt-researcher)\]\[[skyvern](https://github.com/Skyvern-AI/skyvern)\]\[[Scrapegraph-ai](https://github.com/VinciGit00/Scrapegraph-ai)\]\[[crawl4ai](https://github.com/unclecode/crawl4ai)\]\[[crawlee-python](https://github.com/apify/crawlee-python)\]\[[Agent-E](https://github.com/EmergenceAI/Agent-E)\]\[[CyberScraper-2077](https://github.com/itsOwen/CyberScraper-2077)\]
      • [paper
      • [paper - LLMs-on-device](https://github.com/NexaAI/Awesome-LLMs-on-device)\]
      • [paper - Role-Play-Papers)\]\[[RPBench-Auto](https://boson.ai/rpbench-blog/)\]\[[Hermes 3 Technical Report](https://arxiv.org/abs/2408.11857)\]
      • [paper - cdn.anthropic.com/fed9cc193a14b84131812372d8d5857f8f304c52/Model_Card_Claude_3_Addendum.pdf)\]
      • [paper - workshop)\]\[[model](https://huggingface.co/bigscience)\]
      • [paper
      • [paper
      • [paper
      • [paper - neox)\]
      • [paper - media/gemini/gemini_1_report.pdf)\]\[[Gemini 1.5](https://arxiv.org/abs/2403.05530)\]\[[Unofficial Implementation](https://github.com/kyegomez/Gemini)\]\[[MiniGemini](https://github.com/dvlab-research/MGM)\]
      • [paper - wpy/SeqXGPT)\]\[[llm-detect-ai](https://github.com/yanqiangmiffy/llm-detect-ai)\]\[[detect-gpt](https://github.com/eric-mitchell/detect-gpt)\]\[[fast-detect-gpt](https://github.com/baoguangsheng/fast-detect-gpt)\]
      • [paper
      • [paper - Copilot/FRIDAY)\]\[[WindowsAgentArena](https://github.com/microsoft/WindowsAgentArena)\]
      • [paper
      • [paper
      • [paper - interpreter](https://github.com/e2b-dev/code-interpreter)\]
      • [paper
      • [paper
      • [paper - Shanghai/CTGSurvey)\]\[[guidance](https://github.com/guidance-ai/guidance)\]\[[outlines](https://github.com/outlines-dev/outlines)\]\[[instructor](https://github.com/instructor-ai/instructor)\]
      • [awesome-llm-apps - Domain-LLM](https://github.com/luban-agi/Awesome-Domain-LLM)\]\[[agents](https://github.com/livekit/agents)\]
      • [blog - Agents-Papers](https://github.com/AGI-Edgerunners/LLM-Agents-Papers)\]\[[awesome-language-agents](https://github.com/ysymyth/awesome-language-agents)\]\[[Awesome-Papers-Autonomous-Agent](https://github.com/lafmdp/Awesome-Papers-Autonomous-Agent)\]
      • [paper - Agent-Survey)\]\[[LLM-Agent-Paper-Digest](https://github.com/XueyangFeng/LLM-Agent-Paper-Digest)\]
      • [paper - Agent-Paper-List)\]
      • [paper
      • [paper
      • [paper
      • [paper - research/Eureka)\]\[[DrEureka](https://github.com/eureka-research/DrEureka)\]
      • [paper
      • [paper - arena-x/webarena)\]\[[visualwebarena](https://github.com/web-arena-x/visualwebarena)\]\[[agent-workflow-memory](https://github.com/zorazrw/agent-workflow-memory)\]\[[WindowsAgentArena](https://github.com/microsoft/WindowsAgentArena)\]
      • [paper - NLP-Group/SeeAct)\]\[[WebDreamer](https://github.com/OSU-NLP-Group/WebDreamer)\]
      • [paper - Agents/Cradle)\]
      • [paper - agent](https://github.com/modelscope/modelscope-agent)\]
      • [paper
      • [paper
      • [paper - agent](https://github.com/andrewyng/translation-agent)\]
      • [paper - zero](https://github.com/frdel/agent-zero)\]\[[AgentK](https://github.com/mikekelly/AgentK)\]\[[AFlow: Automating Agentic Workflow Generation](https://arxiv.org/abs/2410.10762)\]
      • [paper - survey/Awesome-Robotics-Foundation-Models)\]\[[Awesome-Implicit-NeRF-Robotics](https://github.com/zubair-irshad/Awesome-Implicit-NeRF-Robotics)\]
      • [paper - SYSU/Embodied_AI_Paper_List)\]
      • [paper - research/robotics_transformer)\]\[[IRASim](https://github.com/bytedance/IRASim)\]
      • [paper - 2)\]\[[RT-H: Action Hierarchies Using Language](https://arxiv.org/abs/2403.01823)\]
      • [paper - deepmind/open_x_embodiment)\]
      • [blog
      • [paper - Embodied-AI/RoboGen)\]
      • [paper
      • [paper - O](https://github.com/GameGen-O/GameGen-O)\]\[[GameGen-X](https://github.com/GameGen-X/GameGen-X)\]\[[Unbounded](https://arxiv.org/abs/2410.18975)\]\[[open-oasis](https://github.com/etched-ai/open-oasis)\]\[[DIAMOND](https://diamond-wm.github.io)\]
      • [paper
      • [paper - pytorch](https://github.com/lucidrains/PaLM-pytorch)\]\[[PaLM-rlhf-pytorch](https://github.com/lucidrains/PaLM-rlhf-pytorch)\]\[[PaLM](https://github.com/conceptofmind/PaLM)\]
      • [paper
      • [paper - E)\]
      • [paper - research/text-to-text-transfer-transformer)\]\[[t5-pytorch](https://github.com/conceptofmind/t5-pytorch)\]\[[t5-pegasus-pytorch](https://github.com/renmada/t5-pegasus-pytorch)\]
      • [paper
      • [paper
      • [paper - research/t5x/blob/main/docs/models.md#flan-t5-checkpoints)\]
      • [paper - xl)\]
      • [paper
      • [paper - MARCO-Web-Search](https://github.com/microsoft/MS-MARCO-Web-Search)\]
      • [blog
      • [paper - Shanghai/CTGSurvey)\]\[[guidance](https://github.com/guidance-ai/guidance)\]\[[outlines](https://github.com/outlines-dev/outlines)\]
      • [ray - ai/gpt4all)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[dify](https://github.com/langgenius/dify)\]\[[mindsdb](https://github.com/mindsdb/mindsdb)\]\[[bisheng](https://github.com/dataelement/bisheng)\]\[[phidata](https://github.com/phidatahq/phidata)\]\[[guidance](https://github.com/guidance-ai/guidance)\]\[[outlines](https://github.com/outlines-dev/outlines)\]\[[jsonformer](https://github.com/1rgs/jsonformer)\]\[[fabric](https://github.com/danielmiessler/fabric)\]\[[mem0](https://github.com/mem0ai/mem0)\]
      • [awesome-llm-apps - Domain-LLM](https://github.com/luban-agi/Awesome-Domain-LLM)\]
      • [chatgpt-on-wechat - As-Chatbot](https://github.com/deep-diver/LLM-As-Chatbot)\]\[[HuixiangDou](https://github.com/InternLM/HuixiangDou)\]\[[Streamer-Sales](https://github.com/PeterH0323/Streamer-Sales)\]\[[metahuman-stream](https://github.com/lipku/metahuman-stream)\]\[[aiavatarkit](https://github.com/uezo/aiavatarkit)\]\[[ai-getting-started](https://github.com/a16z-infra/ai-getting-started)\]
      • [blog - Agents-Papers](https://github.com/AGI-Edgerunners/LLM-Agents-Papers)\]\[[awesome-language-agents](https://github.com/ysymyth/awesome-language-agents)\]\[[Awesome-Papers-Autonomous-Agent](https://github.com/lafmdp/Awesome-Papers-Autonomous-Agent)\]
      • [paper - Agent-Survey)\]
      • [paper - Agent-Paper-List)\]
      • [paper - 3846.12832)\]\[[code](https://github.com/yya518/FinBERT)\]\[[finBERT](https://github.com/ProsusAI/finBERT)\]\[[valuesimplex/FinBERT](https://github.com/valuesimplex/FinBERT)\]
      • [paper - Foundation/FinRobot)\]
      • [paper - Foundation/FinGPT)\]
      • [paper - Foundation/FinGPT/tree/master/fingpt/FinGPT_RAG/instruct-FinGPT)\]
      • [paper - Foundation/FinRL)\]
      • [paper - Foundation/FinRL-Meta)\]
      • [paper - FinLLM)\]
      • [paper
      • [paper - DI/XuanYuan)\]
      • [paper - FinAI/PIXIU)\]
      • [paper
      • [paper - team/rllm)\]
      • [paper - Copilot)\]
      • [paper
      • [paper - proj/AlphaFin)\]
      • [paper
      • [paper
      • [paper - datasets](https://github.com/virattt/financial-datasets)\]
      • [paper
      • [paper - Touchstone](https://github.com/IDEA-FinAI/Golden-Touchstone)\]
      • [paper - sim](https://github.com/ZhuiyiTechnology/roformer-sim)\]
      • [paper - futuredata/ColBERT)\]\[[RAGatouille](https://github.com/AnswerDotAI/RAGatouille)\]\[[A Reproducibility Study of PLAID](https://arxiv.org/abs/2404.14989)\]\[[Jina-ColBERT-v2](https://arxiv.org/abs/2408.16672)\]
      • [paper - louis/xm-retrievers)\]\[[model](https://huggingface.co/antoinelouis/colbert-xm)\]
      • [paper
      • [paper
      • [paper
      • [paper - LLM4IE-Papers)\]\[[UIE](https://github.com/universal-ie/UIE)\]\[[NERRE](https://github.com/LBNLP/NERRE)\]\[[uie_pytorch](https://github.com/HUSTAI/uie_pytorch)\]
      • [paper
      • [paper
      • [paper
      • [paper - NLPIR/GenIR-Survey)\]
      • [paper - ai/D2LLM)\]
      • [paper
      • [paper
      • [paper - ai/OSWorld)\]\[[AgentGym](https://github.com/WooooDyy/AgentGym)\]
      • [paper - cn/agents)\]
      • [paper - AGI/AutoAgents)\]
      • [paper
      • [paper - NLP-Group/Mind2Web)\]\[[AutoWebGLM](https://github.com/THUDM/AutoWebGLM)\]
      • [paper - arena-x/webarena)\]\[[visualwebarena](https://github.com/web-arena-x/visualwebarena)\]\[[agent-workflow-memory](https://github.com/zorazrw/agent-workflow-memory)\]\[[WindowsAgentArena](https://github.com/microsoft/WindowsAgentArena)\]
      • [paper - NLP-Group/SeeAct)\]
      • [paper - Agents/Cradle)\]
      • [paper - agent](https://github.com/modelscope/modelscope-agent)\]
      • [paper
      • [paper - zero](https://github.com/frdel/agent-zero)\]\[[AgentK](https://github.com/mikekelly/AgentK)\]
      • [paper - survey/Awesome-Robotics-Foundation-Models)\]
      • [paper - SYSU/Embodied_AI_Paper_List)\]
      • [paper - research/robotics_transformer)\]\[[IRASim](https://github.com/bytedance/IRASim)\]
      • [paper - 2)\]\[[RT-H: Action Hierarchies Using Language](https://arxiv.org/abs/2403.01823)\]
      • [paper - deepmind/open_x_embodiment)\]
      • [blog
      • [paper - Embodied-AI/RoboGen)\]
      • [paper
      • [paper - O](https://github.com/GameGen-O/GameGen-O)\]
      • [paper - models/octo)\]\[[BodyTransformer](https://github.com/carlosferrazza/BodyTransformer)\]\[[crossformer](https://github.com/rail-berkeley/crossformer)\]
      • [paper
      • [LeRobot - rs/dora)\]\[[awesome-ai-agents](https://github.com/e2b-dev/awesome-ai-agents)\]\[[IsaacLab](https://github.com/isaac-sim/IsaacLab)\]\[[Awesome-Robotics-3D](https://github.com/zubair-irshad/Awesome-Robotics-3D)\]
      • [AutoGPT - Engineer](https://github.com/gpt-engineer-org/gpt-engineer)\]\[[AgentGPT](https://github.com/reworkd/AgentGPT)\]
      • [BabyAGI
      • [paper
      • [paper - Math)\]\[[Qwen2.5-Math-Demo](https://huggingface.co/spaces/Qwen/Qwen2.5-Math-Demo)\]\[[SuperCorrect-llm](https://github.com/YangLing0818/SuperCorrect-llm)\]
      • [Numina 1st Place Solution - numina/aimo-progress-prize](https://github.com/project-numina/aimo-progress-prize)\]\[[How NuminaMath Won the 1st AIMO Progress Prize](https://huggingface.co/blog/winning-aimo-progress-prize)\]\[[NuminaMath-7B-TIR](https://huggingface.co/AI-MO/NuminaMath-7B-TIR)\]\[[AI achieves silver-medal standard solving International Mathematical Olympiad problems](https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/)\]
      • [paper - in-Health/MedLLMsPracticalGuide)\]\[[LLM-for-Healthcare](https://github.com/KaiHe-better/LLM-for-Healthcare)\]\[[GMAI-MMBench](https://github.com/uni-medical/GMAI-MMBench)\]
      • [paper
      • [paper - II](https://github.com/FreedomIntelligence/HuatuoGPT-II)\]\[[Medical_NLP](https://github.com/FreedomIntelligence/Medical_NLP)\]\[[Zhongjing](https://github.com/SupritYoung/Zhongjing)\]\[[MedicalGPT](https://github.com/shibing624/MedicalGPT)\]\[[huatuogpt-vision](https://github.com/freedomintelligence/huatuogpt-vision)\]\[[Chain-of-Diagnosis](https://github.com/FreedomIntelligence/Chain-of-Diagnosis)\]\[[BianCang](https://github.com/QLU-NLP/BianCang)\]
      • [paper - YuanGroup/ChatLaw)\]\[[HK-O1aw](https://github.com/HKAIR-Lab/HK-O1aw)\]
      • [paper - LawLLM)\]
      • [paper - MedLLM)\]
      • [paper
      • [paper - Code-LLM](https://github.com/codefuse-ai/Awesome-Code-LLM)\]\[[MFTCoder](https://github.com/codefuse-ai/MFTCoder)\]\[[Awesome-Code-LLM](https://github.com/huybery/Awesome-Code-LLM)\]
      • [paper
      • [paper - llama/codellama)\]\[[model](https://huggingface.co/codellama)\]\[[llamacoder](https://github.com/Nutlope/llamacoder)\]
      • [blog - media/gemma/codegemma_report.pdf)\]
      • [paper - deepmind/code_contests)\]\[[AlphaCode2_Tech_Report](https://storage.googleapis.com/deepmind-media/AlphaCode2/AlphaCode2_Tech_Report.pdf)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - project/starcoder2)\]\[[starcoder.cpp](https://github.com/bigcode-project/starcoder.cpp)\]
      • [paper
      • [paper - uiuc/magicoder)\]
      • [paper - LLaMA-Alpaca)\]\[[Chinese-LLaMA-Alpaca-2](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2)\]\[[Chinese-LLaMA-Alpaca-3](https://github.com/ymcui/Chinese-LLaMA-Alpaca-3)\]\[[baby-llama2-chinese](https://github.com/DLLXW/baby-llama2-chinese)\]
      • [paper
      • [paper - GSAI/Llama-3-SynE)\]
      • [MiniCPM - MoE](https://github.com/SkyworkAI/Skywork-MoE)\]\[[Orion](https://github.com/OrionStarAI/Orion)\]\[[BELLE](https://github.com/LianjiaTech/BELLE)\]\[[Yuan-2.0](https://github.com/IEIT-Yuan/Yuan-2.0)\]\[[Yuan2.0-M32](https://github.com/IEIT-Yuan/Yuan2.0-M32)\]\[[Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)\]\[[Index-1.9B](https://github.com/bilibili/Index-1.9B)\]\[[Aquila2](https://github.com/FlagAI-Open/Aquila2)\]
      • [LlamaFamily/Llama-Chinese - AI/Chinese-Llama-2-7b](https://github.com/LinkSoul-AI/Chinese-Llama-2-7b)\]\[[llama3-Chinese-chat](https://github.com/CrazyBoyM/llama3-Chinese-chat)\]\[[phi3-Chinese](https://github.com/CrazyBoyM/phi3-Chinese)\]\[[LLM-Chinese](https://github.com/CrazyBoyM/LLM-Chinese)\]\[[Llama3-Chinese-Chat](https://github.com/Shenzhi-Wang/Llama3-Chinese-Chat)\]\[[llama3-chinese](https://github.com/seanzhang-zhichen/llama3-chinese)\]
      • [Firefly - chitchat](https://github.com/yangjianxin1/GPT2-chitchat)\]
      • [paper - CoT)\]
      • [paper - Scientist)\]
      • [paper - Researcher)\]
      • [Awesome-Scientific-Language-Models - husky/gpt_academic)\]\[[ChatPaper](https://github.com/kaixindelele/ChatPaper)\]\[[scispacy](https://github.com/allenai/scispacy)\]\[[awesome-ai4s](https://github.com/hyperai/awesome-ai4s)\]\[[xVal](https://github.com/PolymathicAI/xVal)\]
      • [paper
      • [paper - agent](https://github.com/andrewyng/translation-agent)\]
      • [blog
      • [translation-agent - zero](https://github.com/frdel/agent-zero)\]\[[AgentK](https://github.com/mikekelly/AgentK)\]\[[Twitter Personality](https://github.com/wordware-ai/twitter)\]\[[RD-Agent](https://github.com/microsoft/RD-Agent)\]
      • [paper
      • [paper
      • [paper - ai/geogalactica)\]\[[sciparser](https://github.com/davendw49/sciparser)\]
      • [paper - ZJU/Scientific-LLM-Survey)\]\[[sciknoweval](https://github.com/hicai-zju/sciknoweval)\]
      • [paper
      • [paper - 7B-Chat)\]
      • [paper - research/scFoundation)\]
      • [paper
      • [paper - oval/storm)\]\[[Co-STORM EMNLP 2024](https://www.arxiv.org/abs/2408.15232)\]\[[kiroku](https://github.com/cnunescoelho/kiroku)\]
      • [paper - sea/sea)\]\[[AgentReview](https://github.com/Ahren09/AgentReview)\]
      • [paper
      • [blog
      • [paper
      • [blog - token-context-windows)\]
      • [datatrove - studio](https://github.com/HumanSignal/label-studio)\]\[[autolabel](https://github.com/refuel-ai/autolabel)\]
      • [blog
      • [paper - pile](https://github.com/EleutherAI/the-pile)\]
      • [paper - workshop/data-preparation)\]\[[dataset](https://huggingface.co/bigscience-data)\]
      • [paper - refinedweb)\]
      • [paper - juicer)\]
      • [paper
      • [paper
      • [paper
      • [paper - LLMs-Datasets](https://github.com/lmmlzn/Awesome-LLMs-Datasets)\]
      • [paper - dev/datadreamer)\]
      • [paper - Tan-dmml/LLM4Annotation)\]
      • [paper
      • [paper - a-p/COIG-CQIA)\]
      • [paper
      • [paper - fineweb-v1)\]\[[fineweb](https://huggingface.co/datasets/HuggingFaceFW/fineweb)\]\[[fineweb-edu](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu)\]
      • [paper - 7B-8k](https://huggingface.co/apple/DCLM-7B-8k)\]
      • [paper - ailab/persona-hub)\]
      • [paper - aloha)\]\[[Hardware Code](https://github.com/MarkFzp/mobile-aloha)\]\[[Learning Code](https://github.com/MarkFzp/act-plus-plus)\]\[[UMI](https://github.com/real-stanford/universal_manipulation_interface)\]\[[humanplus](https://github.com/MarkFzp/humanplus)\]\[[TeleVision](https://github.com/OpenTeleVision/TeleVision)\]\[[Surgical Robot Transformer](https://surgical-robot-transformer.github.io/)\]\[[lifelike-agility-and-play](https://github.com/Tencent-RoboticsX/lifelike-agility-and-play)\]\[[ReKep](https://rekep-robot.github.io/)\]\[[Open_Duck_Mini](https://github.com/apirrone/Open_Duck_Mini)\]\[[Learning Visual Parkour from Generated Images](https://lucidsim.github.io/)\]
      • [paper - models/octo)\]\[[BodyTransformer](https://github.com/carlosferrazza/BodyTransformer)\]\[[crossformer](https://github.com/rail-berkeley/crossformer)\]
      • [paper - research/scFoundation)\]
      • [paper
      • [paper - oval/storm)\]
      • [paper - sea/sea)\]
      • [paper - NLP/OpenResearcher)\]\[[Paper Copilot](https://arxiv.org/abs/2409.04593)\]\[[SciAgentsDiscovery](https://github.com/lamm-mit/SciAgentsDiscovery)\]\[[paper-qa](https://github.com/Future-House/paper-qa)\]
      • [paper - eval](https://github.com/openai/human-eval)\]\[[CriticGPT](https://openai.com/index/finding-gpt4s-mistakes-with-gpt-4/)\]\[[On scalable oversight with weak LLMs judging strong LLMs](https://arxiv.org/abs/2407.04622)\]
      • [paper - llama/codellama)\]\[[model](https://huggingface.co/codellama)\]\[[llamacoder](https://github.com/Nutlope/llamacoder)\]
      • [blog - media/gemma/codegemma_report.pdf)\]
      • [paper - deepmind/code_contests)\]\[[AlphaCode2_Tech_Report](https://storage.googleapis.com/deepmind-media/AlphaCode2/AlphaCode2_Tech_Report.pdf)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - project/starcoder)\]\[[bigcode-project](https://github.com/bigcode-project)\]\[[model](https://huggingface.co/bigcode)\]
      • [paper - project/starcoder2)\]\[[starcoder.cpp](https://github.com/bigcode-project/starcoder.cpp)\]
      • [paper
      • [paper - uiuc/magicoder)\]
      • [paper
      • [paper
      • [paper
      • [paper - nlp/SWE-agent)\]\[[swe-bench-technical-report](https://www.cognition-labs.com/post/swe-bench-technical-report)\]\[[CodeR](https://github.com/NL2Code/CodeR)\]\[[Lingma-SWE-GPT](https://github.com/LingmaTongyi/Lingma-SWE-GPT)\]
      • [paper
      • [paper - Hands-AI/OpenHands)\]
      • [paper
      • [paper - Paper-List)\]
      • [Yi-Coder - 7B](https://github.com/aixcoder-plugin/aiXcoder-7B)\]\[[codealpaca](https://github.com/sahil280114/codealpaca)\]
      • [screenshot-to-code - ai/vanna)\]\[[NL2SQL_Handbook](https://github.com/HKUSTDial/NL2SQL_Handbook)\]\[[TAG-Bench](https://github.com/TAG-Research/TAG-Bench)\]\[[Spider2](https://github.com/xlang-ai/Spider2)\]
      • [paper
      • [paper - Foundation/FinRobot)\]
      • [paper - Foundation/FinGPT)\]
      • [paper - Foundation/FinGPT/tree/master/fingpt/FinGPT_RAG/instruct-FinGPT)\]
      • [paper - Foundation/FinRL)\]
      • [paper - Foundation/FinRL-Meta)\]
      • [paper - FinLLM)\]
      • [paper
      • [paper - DI/XuanYuan)\]
      • [paper - FinAI/PIXIU)\]
      • [paper
      • [paper - table-survey](https://github.com/godaai/llm-table-survey)\]\[[table-transformer](https://github.com/microsoft/table-transformer)\]\[[Awesome-Tabular-LLMs](https://github.com/SpursGoZmy/Awesome-Tabular-LLMs)\]\[[Awesome-LLM-Tabular](https://github.com/johnnyhwu/Awesome-LLM-Tabular)\]\[[Table-LLaVA](https://github.com/SpursGoZmy/Table-LLaVA)\]
      • [paper - team/rllm)\]
      • [paper - Copilot)\]
      • [paper
      • [paper - proj/AlphaFin)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper
      • [gpt-investor - quant](https://github.com/goldmansachs/gs-quant)\]\[[stockbot-on-groq](https://github.com/bklieger-groq/stockbot-on-groq)\]\[[Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN](https://github.com/THINK989/Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN)\]
      • [paper - sim](https://github.com/ZhuiyiTechnology/roformer-sim)\]
      • [paper - futuredata/ColBERT)\]\[[RAGatouille](https://github.com/AnswerDotAI/RAGatouille)\]\[[A Reproducibility Study of PLAID](https://arxiv.org/abs/2404.14989)\]\[[Jina-ColBERT-v2](https://arxiv.org/abs/2408.16672)\]
      • [paper - louis/xm-retrievers)\]\[[model](https://huggingface.co/antoinelouis/colbert-xm)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - nlp/SWE-agent)\]\[[swe-bench-technical-report](https://www.cognition-labs.com/post/swe-bench-technical-report)\]\[[CodeR](https://github.com/NL2Code/CodeR)\]
      • [paper
      • [paper - project/bigcodebench)\]\[[LiveCodeBench](https://github.com/LiveCodeBench/LiveCodeBench)\]\[[evalplus](https://github.com/evalplus/evalplus)\]
      • [paper - Hands-AI/OpenHands)\]
      • [paper
      • [paper - Paper-List)\]
      • [Yi-Coder - 7B](https://github.com/aixcoder-plugin/aiXcoder-7B)\]\[[codealpaca](https://github.com/sahil280114/codealpaca)\]
      • [screenshot-to-code - ai/vanna)\]\[[NL2SQL_Handbook](https://github.com/HKUSTDial/NL2SQL_Handbook)\]\[[TAG-Bench](https://github.com/TAG-Research/TAG-Bench)\]
      • [paper
      • [paper
      • [paper - Berry](https://arxiv.org/abs/2410.02884)\]
      • [paper - LLaVA)\]
      • [paper - Math/We-Math)\]
      • [paper
      • [paper - ai/OpenDiLoCo)\]\[[DiLoCo](https://arxiv.org/abs/2311.08105)\]\[[DisTrO](https://github.com/NousResearch/DisTrO)\]
      • [paper - piexl/JailbreakZoo)\]\[[jailbreak_llms](https://github.com/verazuo/jailbreak_llms)\]
      • [paper
      • [paper - platform)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]
      • [wandb
      • [paper - Alignment](https://github.com/PKU-Alignment)\]
      • [paper
      • [paper
      • [paper
      • [paper - LLM-preference-learning)\]
      • [alignment-handbook
      • [paper - instruct)\]\[[open-instruct](https://github.com/allenai/open-instruct)\]\[[Multi-modal-Self-instruct](https://github.com/zwq2018/Multi-modal-Self-instruct)\]\[[evol-instruct](https://github.com/nlpxucan/evol-instruct)\]\[[MMEvol](https://arxiv.org/abs/2409.05840)\]\[[Automatic Instruction Evolving for Large Language Models](https://arxiv.org/abs/2406.00770)\]
      • [paper
      • [paper - align/magpie)\]
      • [hf blog - from-human-preferences)\]\[[alignment blog](https://openai.com/blog/our-approach-to-alignment-research)\]\[[awesome-RLHF](https://github.com/opendilab/awesome-RLHF)\]
      • [MOSS-RLHF
      • [paper - Alignment/safe-rlhf)\]\[[align-anything](https://github.com/PKU-Alignment/align-anything)\]\[[Safe-Policy-Optimization](https://github.com/PKU-Alignment/Safe-Policy-Optimization)\]
      • [paper
      • [paper - Reward-Modeling)\]
      • [paper
      • [paper
      • [paper - mitchell/direct-preference-optimization)\]\[[trl](https://github.com/huggingface/trl)\]\[[dpo_trainer](https://github.com/huggingface/trl/blob/main/trl/trainer/dpo_trainer.py)\]
      • [paper - coai/BPO)\]
      • [paper
      • [paper
      • [paper - level-Direct-Preference-Optimization)\]\[[Step-DPO](https://github.com/dvlab-research/Step-DPO)\]\[[FineGrainedRLHF](https://github.com/allenai/FineGrainedRLHF)\]\[[MCTS-DPO](https://github.com/YuxiXie/MCTS-DPO)\]\[[Critical Tokens Matter](https://arxiv.org/abs/2411.19943)\]
      • [paper - nlp/SimPO)\]
      • [paper - RLAIF](https://github.com/mengdi-li/awesome-RLAIF)\]
      • [paper
      • [paper
      • [paper - handbook)\]
      • [paper - self-play)\]
      • [paper - play Methods in Reinforcement Learning](https://arxiv.org/abs/2408.01072)\]
      • [paper - pytorch](https://github.com/lucidrains/CALM-pytorch)\]
      • [paper - rewarding-lm-pytorch)\]\[[Meta-Rewarding Language Models](https://arxiv.org/abs/2407.19594)\]\[[Self-Taught Evaluators](https://arxiv.org/abs/2408.02666)\]
      • [paper
      • [paper
      • [paper - LM/Xwin-LM)\]
      • [paper - auto-alignment)\]
      • [blog
      • [paper - LLM4IE-Papers)\]\[[UIE](https://github.com/universal-ie/UIE)\]\[[NERRE](https://github.com/LBNLP/NERRE)\]\[[uie_pytorch](https://github.com/HUSTAI/uie_pytorch)\]
      • [paper
      • [paper
      • [paper
      • [paper - NLPIR/GenIR-Survey)\]
      • [paper - ai/D2LLM)\]
      • [paper
      • [paper
      • [paper
      • [link
      • [link
      • [search_with_lepton - oval/storm)\]\[[searxng](https://github.com/searxng/searxng)\]\[[Perplexica](https://github.com/ItzCrazyKns/Perplexica)\]\[[rag-search](https://github.com/thinkany-ai/rag-search)\]\[[sensei](https://github.com/jjleng/sensei)\]
      • [similarities
      • [SearchEngine - labs](https://github.com/elastic/elasticsearch-labs)\]\[[tevatron](https://github.com/texttron/tevatron)\]
      • [paper
      • [paper - compass/MathBench)\]\[[OlympiadBench](https://github.com/OpenBMB/OlympiadBench)\]
      • [paper - Math)\]
      • [paper - ai/DeepSeek-Math)\]\[[DeepSeek-Prover-V1.5](https://github.com/deepseek-ai/DeepSeek-Prover-V1.5)\]
      • [paper - LM/Xwin-LM/tree/main/Xwin-Math)\]
      • [paper - Math)\]
      • [paper - Math-Reasoning/Super_MARIO)\]
      • [paper
      • [paper
      • [paper - LLaVA)\]
      • [paper - Math/We-Math)\]
      • [paper
      • [paper - Math)\]\[[Qwen2.5-Math-Demo](https://huggingface.co/spaces/Qwen/Qwen2.5-Math-Demo)\]
      • [Numina 1st Place Solution - numina/aimo-progress-prize](https://github.com/project-numina/aimo-progress-prize)\]\[[How NuminaMath Won the 1st AIMO Progress Prize](https://huggingface.co/blog/winning-aimo-progress-prize)\]\[[NuminaMath-7B-TIR](https://huggingface.co/AI-MO/NuminaMath-7B-TIR)\]\[[AI achieves silver-medal standard solving International Mathematical Olympiad problems](https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/)\]
      • [paper - in-Health/MedLLMsPracticalGuide)\]\[[LLM-for-Healthcare](https://github.com/KaiHe-better/LLM-for-Healthcare)\]\[[GMAI-MMBench](https://github.com/uni-medical/GMAI-MMBench)\]
      • [paper
      • [paper - II](https://github.com/FreedomIntelligence/HuatuoGPT-II)\]\[[Medical_NLP](https://github.com/FreedomIntelligence/Medical_NLP)\]\[[Zhongjing](https://github.com/SupritYoung/Zhongjing)\]\[[MedicalGPT](https://github.com/shibing624/MedicalGPT)\]\[[huatuogpt-vision](https://github.com/freedomintelligence/huatuogpt-vision)\]\[[Chain-of-Diagnosis](https://github.com/FreedomIntelligence/Chain-of-Diagnosis)\]
      • [paper - YuanGroup/ChatLaw)\]
      • [paper - LawLLM)\]
      • [paper - MedLLM)\]
      • [paper
      • [paper
      • [paper
      • [paper - PaLM)\]
      • [paper
      • [paper - pytorch](https://github.com/lucidrains/AMIE-pytorch)\]
      • [paper
      • [paper
      • [paper - yuexi/AgentCourt)\]
      • [paper - deeplearning](https://github.com/alibaba/x-deeplearning)\]
      • [paper - Torch](https://github.com/shenweichen/DeepCTR-Torch)\]\[[pytorch-mmoe](https://github.com/ZhichenZhao/pytorch-mmoe)\]
      • [paper
      • [paper
      • [paper - GSAI/YuLan-Rec)\]\[[Scaling Law of Large Sequential Recommendation Models](https://arxiv.org/abs/2311.11351)\]
      • [paper - SSLRec-Papers](https://github.com/HKUDS/Awesome-SSLRec-Papers)\]
      • [paper
      • [paper
      • [paper
      • [link
      • [link
      • [search_with_lepton - oval/storm)\]\[[searxng](https://github.com/searxng/searxng)\]\[[Perplexica](https://github.com/ItzCrazyKns/Perplexica)\]\[[rag-search](https://github.com/thinkany-ai/rag-search)\]\[[sensei](https://github.com/jjleng/sensei)\]
      • [similarities
      • [SearchEngine
      • [paper
      • [paper - math/MetaMath)\]
      • [paper - compass/MathBench)\]\[[OlympiadBench](https://github.com/OpenBMB/OlympiadBench)\]
      • [paper - Math)\]
      • [paper - ai/DeepSeek-Math)\]\[[DeepSeek-Prover-V1.5](https://github.com/deepseek-ai/DeepSeek-Prover-V1.5)\]
      • [paper - LM/Xwin-LM/tree/main/Xwin-Math)\]
      • [paper - Math)\]
      • [paper - Math-Reasoning/Super_MARIO)\]
      • [paper
      • [paper
      • [paper - deepmind/opro)\]
      • [paper - Lab/ATLAS)\]
      • [paper - team/appl)\]\[[sammo](https://github.com/microsoft/sammo)\]\[[prompt-poet](https://github.com/character-ai/prompt-poet)\]\[[ell](https://github.com/MadcowD/ell)\]
      • [paper
      • [paper
      • [paper - research/prompt-tuning)\]\[[soft-prompt-tuning](https://github.com/kipgparker/soft-prompt-tuning)\]\[[Prompt-Tuning](https://github.com/mkshing/Prompt-Tuning)\]
      • [paper
      • [paper
      • [paper - demonstrations)\]
      • [paper
      • [paper - machines/pal)\]
      • [paper - instruction-learning)\]
      • [paper
      • [paper - human-preferences)\]\[[lm-human-preference-details](https://github.com/vwxyzjn/lm-human-preference-details)\]
      • [paper - from-feedback)\]
      • [paper - li/Instruction-Tuning-Survey)\]
      • [paper
      • [paper - KGLLM/RAG-Survey)\]\[[Modular RAG](https://arxiv.org/abs/2407.21059)\]
      • [paper - Survey)\]
      • [paper - Augmented Generation for Natural Language Processing: A Survey](https://arxiv.org/abs/2407.13193)\]\[[A Survey on RAG Meeting LLMs](https://arxiv.org/abs/2405.06211)\]\[[A Comprehensive Survey of Retrieval-Augmented Generation](https://arxiv.org/abs/2410.12837)\]
      • [paper
      • [paper - token-nq)\]\[[docs](https://huggingface.co/docs/transformers/main/model_doc/rag)\]\[[FAISS](https://github.com/facebookresearch/faiss)\]
      • [paper - rag)\]\[[CRAG](https://github.com/HuskyInSalt/CRAG)\]\[[Golden-Retriever](https://arxiv.org/abs/2408.00798)\]
      • [paper
      • [paper
      • [paper - pytorch](https://github.com/lucidrains/RETRO-pytorch)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - community/TrustRAG)\]
      • [paper
      • [paper
      • [paper - isf)\]
      • [paper - RAG)\]\[[Adaptive-RAG](https://github.com/starsuzi/Adaptive-RAG)\]\[[Advanced RAG 11: Query Classification and Refinement](https://ai.gopubby.com/advanced-rag-11-query-classification-and-refinement-2aec79f4140b)\]
      • [paper - ecosystem-engineering/Blended-RAG)\]\[[infinity](https://github.com/infiniflow/infinity)\]
      • [paper - nlpir/flashrag)\]\[[FlashRAG-Paddle](https://github.com/RUC-NLPIR/FlashRAG-Paddle)\]\[[Auto-RAG](https://github.com/ictnlp/Auto-RAG)\]
      • [paper - NLP-Group/HippoRAG)\]
      • [paper
      • [paper
      • [paper - PaLM)\]
      • [paper
      • [paper - pytorch](https://github.com/lucidrains/AMIE-pytorch)\]
      • [paper
      • [paper
      • [paper - yuexi/AgentCourt)\]
      • [paper - deeplearning](https://github.com/alibaba/x-deeplearning)\]
      • [paper - Torch](https://github.com/shenweichen/DeepCTR-Torch)\]\[[pytorch-mmoe](https://github.com/ZhichenZhao/pytorch-mmoe)\]
      • [paper
      • [paper
      • [paper - GSAI/YuLan-Rec)\]
      • [paper - SSLRec-Papers](https://github.com/HKUDS/Awesome-SSLRec-Papers)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - recommenders)\]\[[Transformers4Rec](https://github.com/NVIDIA-Merlin/Transformers4Rec)\]
      • [paper - recommendation)\]
      • [paper
      • [recommenders - algorithm)\]\[[Awesome-RSPapers](https://github.com/RUCAIBox/Awesome-RSPapers)\]\[[RecBole](https://github.com/RUCAIBox/RecBole)\]\[[RecSysDatasets](https://github.com/RUCAIBox/RecSysDatasets)\]\[[LLM4Rec-Awesome-Papers](https://github.com/WLiK/LLM4Rec-Awesome-Papers)\]\[[Awesome-LLM-for-RecSys](https://github.com/CHIANGEL/Awesome-LLM-for-RecSys)\]\[[Awesome-LLM4RS-Papers](https://github.com/nancheng58/Awesome-LLM4RS-Papers)\]\[[ReChorus](https://github.com/THUwangcy/ReChorus)\]
      • [fun-rec - RecommenderSystem](https://github.com/zhongqiangwu960812/AI-RecommenderSystem)\]\[[RecSysPapers](https://github.com/tangxyw/RecSysPapers)\]\[[Algorithm-Practice-in-Industry](https://github.com/Doragd/Algorithm-Practice-in-Industry)\]\[[AlgoNotes](https://github.com/shenweichen/AlgoNotes)\]
      • [paper
      • [paper - Tool-Survey)\]
      • [paper - pytorch](https://github.com/lucidrains/toolformer-pytorch)\]\[[conceptofmind/toolformer](https://github.com/conceptofmind/toolformer)\]\[[xrsrke/toolformer](https://github.com/xrsrke/toolformer)\]\[[Graph_Toolformer](https://github.com/jwzhanggy/Graph_Toolformer)\]
      • [paper - MT/StableToolBench)\]
      • [paper
      • [paper - CVC/GPT4Tools)\]
      • [paper - Song793/RestGPT)\]
      • [paper
      • [paper - trial-and-error)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [functionary - tool-llm](https://github.com/zorazrw/awesome-tool-llm)\]
      • [Awesome-LLM-Eval - eval-survey](https://github.com/MLGroupJLU/LLM-eval-survey)\]\[[llm_benchmarks](https://github.com/leobeeson/llm_benchmarks)\]\[[Awesome-LLMs-Evaluation-Papers](https://github.com/tjunlp-lab/Awesome-LLMs-Evaluation-Papers)\]
      • [paper
      • [paper - instructions)\]
      • [paper - crfm/helm)\]
      • [paper - sys/FastChat/tree/main/fastchat/llm_judge)\]
      • [paper - RAG](https://github.com/CLUEbenchmark/SuperCLUE-RAG)\]
      • [paper - nlp/ceval)\]\[[chinese-llm-benchmark](https://github.com/jeinlee1991/chinese-llm-benchmark)\]
      • [paper - li/CMMLU)\]
      • [paper - Benchmark/CMMMU)\]
      • [paper
      • [paper - eval/prometheus-eval)\]
      • [paper - Lab/lmms-eval)\]
      • [paper - Benchmark/MMMU)\]
      • [Open LLM Leaderboard
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - recommenders)\]\[[Transformers4Rec](https://github.com/NVIDIA-Merlin/Transformers4Rec)\]
      • [paper - recommendation)\]
      • [paper
      • [paper
      • [recommenders - algorithm)\]\[[Awesome-RSPapers](https://github.com/RUCAIBox/Awesome-RSPapers)\]\[[RecBole](https://github.com/RUCAIBox/RecBole)\]\[[RecSysDatasets](https://github.com/RUCAIBox/RecSysDatasets)\]\[[LLM4Rec-Awesome-Papers](https://github.com/WLiK/LLM4Rec-Awesome-Papers)\]\[[Awesome-LLM-for-RecSys](https://github.com/CHIANGEL/Awesome-LLM-for-RecSys)\]\[[Awesome-LLM4RS-Papers](https://github.com/nancheng58/Awesome-LLM4RS-Papers)\]\[[ReChorus](https://github.com/THUwangcy/ReChorus)\]
      • [fun-rec - RecommenderSystem](https://github.com/zhongqiangwu960812/AI-RecommenderSystem)\]\[[RecSysPapers](https://github.com/tangxyw/RecSysPapers)\]\[[Algorithm-Practice-in-Industry](https://github.com/Doragd/Algorithm-Practice-in-Industry)\]\[[AlgoNotes](https://github.com/shenweichen/AlgoNotes)\]
      • [paper
      • [paper - Tool-Survey)\]
      • [paper - pytorch](https://github.com/lucidrains/toolformer-pytorch)\]\[[conceptofmind/toolformer](https://github.com/conceptofmind/toolformer)\]\[[xrsrke/toolformer](https://github.com/xrsrke/toolformer)\]\[[Graph_Toolformer](https://github.com/jwzhanggy/Graph_Toolformer)\]
      • [paper - MT/StableToolBench)\]
      • [paper
      • [paper - CVC/GPT4Tools)\]
      • [paper - Song793/RestGPT)\]
      • [paper
      • [paper - ToolMaker)\]
      • [paper - chen/ToolQA)\]\[[toolbench](https://github.com/sambanova/toolbench)\]
      • [paper
      • [paper - llm)\]
      • [paper - Ye/ToolEyes)\]
      • [blog
      • [blog - architecture-blogpost-encoders-prefixlm-denoising)\]\[[New LLM Pre-training and Post-training Paradigms](https://magazine.sebastianraschka.com/p/new-llm-pre-training-and-post-training)\]
      • [Awesome-LLM-System-Papers - production-llm](https://github.com/jihoo-kim/awesome-production-llm)\]
      • [paper - LM)\]\[[GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism](https://arxiv.org/abs/1811.06965)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]
      • [paper
      • [paper
      • [paper - h100-clusters-power-network)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]\[[ByteCheckpoint](https://arxiv.org/abs/2407.20143)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - 101B)\]\[[Tele-FLM](https://huggingface.co/CofeAI/Tele-FLM)\]
      • [paper - Tuning-with-GPT-4/GPT-4-LLM)\]
      • [paper - group/textgrad)\]\[[appl](https://github.com/appl-team/appl)\]
      • [paper
      • [paper - ye/OpenFedLLM)\]
      • [paper - ai/MergeKit)\]\[[DistillKit](https://github.com/arcee-ai/DistillKit)\]\[[A Survey on Collaborative Strategies in the Era of Large Language Models](https://arxiv.org/abs/2407.06089)\]\[[FuseAI](https://github.com/fanqiwan/FuseAI)\]
      • [paper - ConvAI/tree/main/Awesome-Self-Evolution-of-LLM)\]
      • [paper - mini)\]
      • [paper - sys/routellm)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [blog
      • [blog
      • [paper - Quantization-Papers](https://github.com/Zhen-Dong/Awesome-Quantization-Papers)\]\[[awesome-model-quantization](https://github.com/htqin/awesome-model-quantization)\]\[[qllm-eval](https://github.com/thu-nics/qllm-eval)\]
      • [paper
      • [paper
      • [paper - foundation/bitsandbytes)\]
      • [paper - FP4)\]
      • [paper - instruct)\]\[[open-instruct](https://github.com/allenai/open-instruct)\]\[[Multi-modal-Self-instruct](https://github.com/zwq2018/Multi-modal-Self-instruct)\]\[[evol-instruct](https://github.com/nlpxucan/evol-instruct)\]\[[MMEvol](https://arxiv.org/abs/2409.05840)\]\[[Automatic Instruction Evolving for Large Language Models](https://arxiv.org/abs/2406.00770)\]
      • [paper
      • [paper - nlp/deita)\]\[[From Quantity to Quality NAACL'24](https://arxiv.org/abs/2308.12032)\]\[[Reformatted Alignment](https://arxiv.org/abs/2402.12219)\]\[[MAmmoTH2: Scaling Instructions from the Web](https://arxiv.org/abs/2405.03548)\]
      • [paper - align/magpie)\]
      • [hf blog - from-human-preferences)\]\[[alignment blog](https://openai.com/blog/our-approach-to-alignment-research)\]\[[awesome-RLHF](https://github.com/opendilab/awesome-RLHF)\]
      • [MOSS-RLHF
      • [paper - Alignment/safe-rlhf)\]\[[align-anything](https://github.com/PKU-Alignment/align-anything)\]
      • [paper
      • [paper - Reward-Modeling)\]
      • [paper
      • [paper
      • [paper - mitchell/direct-preference-optimization)\]\[[trl](https://github.com/huggingface/trl)\]\[[dpo_trainer](https://github.com/huggingface/trl/blob/main/trl/trainer/dpo_trainer.py)\]
      • [paper - coai/BPO)\]
      • [paper
      • [paper
      • [paper - level-Direct-Preference-Optimization)\]\[[Step-DPO](https://github.com/dvlab-research/Step-DPO)\]\[[FineGrainedRLHF](https://github.com/allenai/FineGrainedRLHF)\]\[[MCTS-DPO](https://github.com/YuxiXie/MCTS-DPO)\]
      • [paper - nlp/SimPO)\]
      • [paper - RLAIF](https://github.com/mengdi-li/awesome-RLAIF)\]
      • [paper
      • [paper
      • [paper - handbook)\]
      • [paper - to-strong)\]\[[weak-to-strong-deception](https://github.com/keven980716/weak-to-strong-deception)\]
      • [paper - self-play)\]
      • [paper - play Methods in Reinforcement Learning](https://arxiv.org/abs/2408.01072)\]
      • [paper - pytorch](https://github.com/lucidrains/CALM-pytorch)\]
      • [paper - rewarding-lm-pytorch)\]\[[Meta-Rewarding Language Models](https://arxiv.org/abs/2407.19594)\]\[[Self-Taught Evaluators](https://arxiv.org/abs/2408.02666)\]
      • [paper
      • [paper
      • [paper
      • [paper - Knowledge-Distillation-of-LLMs)\]
      • [blog
      • [paper - Aligner)\]\[[Nemotron-4 340B Technical Report](https://d1qx31qr3h6wln.cloudfront.net/publications/Nemotron_4_340B_8T.pdf)\]\[[Mistral NeMo](https://mistral.ai/news/mistral-nemo/)\]
      • [paper - LM/Xwin-LM)\]
      • [paper - auto-alignment)\]
      • [blog - verifier-games-improve-legibility-of-llm-outputs/legibility.pdf)\]
      • [blog - rbr-code-and-data)\]
      • [paper - Self-Guide)\]\[[prompt2model](https://github.com/neulab/prompt2model)\]
      • [paper
      • [paper
      • [paper - memory-transformer/tree/aaai24)\]\[[LM-RMT](https://github.com/booydar/LM-RMT)\]
      • [paper - cn/RecurrentGPT)\]
      • [paper
      • [paper
      • [paper - research/LongLoRA)\]
      • [paper - han-lab/streaming-llm)\]\[[SwiftInfer](https://github.com/hpcaitech/SwiftInfer)\]\[[SwiftInfer blog](https://hpc-ai.com/blog/colossal-ai-swiftinfer)\]
      • [paper - attention-pytorch](https://github.com/lucidrains/ring-attention-pytorch)\]\[[local-attention](https://github.com/lucidrains/local-attention)\]\[[tree_attention](https://github.com/Zyphra/tree_attention)\]
      • [paper
      • [paper
      • [paper
      • [paper - LLM-Long-Context-Modeling](https://github.com/Xnhyacinth/Awesome-LLM-Long-Context-Modeling)\]
      • [paper - Context-Data-Engineering)\]
      • [paper - nlp/CEPE)\]
      • [blog - verifier-games-improve-legibility-of-llm-outputs/legibility.pdf)\]
      • [blog - based-rewards-for-language-model-safety.pdf)\]\[[code](https://github.com/openai/safety-rbr-code-and-data)\]
      • [paper - Self-Guide)\]\[[prompt2model](https://github.com/neulab/prompt2model)\]
      • [paper
      • [paper
      • [paper - memory-transformer/tree/aaai24)\]\[[LM-RMT](https://github.com/booydar/LM-RMT)\]
      • [paper - cn/RecurrentGPT)\]
      • [paper
      • [paper
      • [paper - research/LongLoRA)\]
      • [paper - han-lab/streaming-llm)\]\[[SwiftInfer](https://github.com/hpcaitech/SwiftInfer)\]\[[SwiftInfer blog](https://hpc-ai.com/blog/colossal-ai-swiftinfer)\]
      • [paper
      • [paper - attention-pytorch](https://github.com/lucidrains/ring-attention-pytorch)\]\[[local-attention](https://github.com/lucidrains/local-attention)\]\[[tree_attention](https://github.com/Zyphra/tree_attention)\]
      • [paper
      • [paper
      • [paper
      • [paper - LLM-Long-Context-Modeling](https://github.com/Xnhyacinth/Awesome-LLM-Long-Context-Modeling)\]
      • [paper - Context-Data-Engineering)\]
      • [paper - nlp/CEPE)\]
      • [paper
      • [paper
      • [paper - Stars)\]\[[LLMTest_NeedleInAHaystack](https://github.com/gkamradt/LLMTest_NeedleInAHaystack)\]\[[RULER](https://github.com/NVIDIA/RULER)\]\[[LooGLE](https://github.com/bigai-nlco/LooGLE)\]\[[LongBench](https://github.com/THUDM/LongBench)\]\[[google-deepmind/loft](https://github.com/google-deepmind/loft)\]
      • [paper - transformer-pytorch](https://github.com/lucidrains/infini-transformer-pytorch)\]\[[InfiniTransformer](https://github.com/Beomi/InfiniTransformer)\]\[[infini-mini-transformer](https://github.com/jiahe7ay/infini-mini-transformer)\]\[[megalodon](https://github.com/XuezheMax/megalodon)\]
      • [paper
      • [paper
      • [paper
      • [paper - granite/granite-code-models)\]
      • [blog
      • [paper
      • [blog - token-context-windows)\]
      • [datatrove - studio](https://github.com/HumanSignal/label-studio)\]\[[autolabel](https://github.com/refuel-ai/autolabel)\]
      • [blog
      • [paper
      • [paper - workshop/data-preparation)\]\[[dataset](https://huggingface.co/bigscience-data)\]
      • [paper - refinedweb)\]
      • [paper - juicer)\]
      • [paper
      • [paper
      • [paper - Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)\]
      • [paper
      • [paper - LLMs-Datasets](https://github.com/lmmlzn/Awesome-LLMs-Datasets)\]
      • [paper - dev/datadreamer)\]
      • [paper - Tan-dmml/LLM4Annotation)\]
      • [paper
      • [paper - a-p/COIG-CQIA)\]
      • [paper
      • [paper - fineweb-v1)\]\[[fineweb](https://huggingface.co/datasets/HuggingFaceFW/fineweb)\]\[[fineweb-edu](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu)\]
      • [paper - 7B-8k](https://huggingface.co/apple/DCLM-7B-8k)\]
      • [paper - ailab/persona-hub)\]
      • [RedPajama-Data - minigrid-datasets](https://github.com/dunno-lab/xland-minigrid-datasets)\]\[[OmniCorpus](https://github.com/OpenGVLab/OmniCorpus)\]\[[dclm](https://github.com/mlfoundations/dclm)\]\[[Infinity-Instruct](https://github.com/FlagOpen/Infinity-Instruct)\]\[[MNBVC](https://github.com/esbatmop/MNBVC)\]\[[LMSYS-Chat-1M](https://arxiv.org/abs/2309.11998)\]
      • [llm-datasets - LLM-Synthetic-Data](https://github.com/wasiahmad/Awesome-LLM-Synthetic-Data)\]
      • [paper
      • [paper
      • [paper - Stars)\]\[[LLMTest_NeedleInAHaystack](https://github.com/gkamradt/LLMTest_NeedleInAHaystack)\]\[[LooGLE](https://github.com/bigai-nlco/LooGLE)\]\[[LongBench](https://github.com/THUDM/LongBench)\]\[[google-deepmind/loft](https://github.com/google-deepmind/loft)\]
      • [paper - transformer-pytorch](https://github.com/lucidrains/infini-transformer-pytorch)\]\[[InfiniTransformer](https://github.com/Beomi/InfiniTransformer)\]\[[infini-mini-transformer](https://github.com/jiahe7ay/infini-mini-transformer)\]\[[megalodon](https://github.com/XuezheMax/megalodon)\]
      • [paper
      • [paper
      • [paper
      • [paper - granite/granite-code-models)\]
      • [blog - Time Compute Optimally can be More Effective than Scaling Model Parameters](https://arxiv.org/abs/2408.03314)\]\[[Let's Verify Step by Step](https://arxiv.org/abs/2305.20050)\]\[[Thinking LLMs: General Instruction Following with Thought Generation](https://arxiv.org/abs/2410.10630)\]\[[Awesome-LLM-Strawberry](https://github.com/hijkzzz/Awesome-LLM-Strawberry)\]
      • [llm-reasoners - groq/g1)\]\[[Open-O1](https://github.com/Open-Source-O1/Open-O1)\]\[[show-me](https://github.com/marlaman/show-me)\]\[[OpenR](https://github.com/openreasoner/openr)\]
      • [Prompt4ReasoningPapers
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - 2-research](https://github.com/open-thought/system-2-research)\]
      • [paper
      • [paper - LLM](https://github.com/deepseek-ai/DeepSeek-LLM)\]\[[DeepSeek-V2](https://github.com/deepseek-ai/DeepSeek-V2)\]\[[DeepSeek-Coder](https://github.com/deepseek-ai/DeepSeek-Coder)\]
      • [RedPajama-Data - minigrid-datasets](https://github.com/dunno-lab/xland-minigrid-datasets)\]\[[OmniCorpus](https://github.com/OpenGVLab/OmniCorpus)\]\[[dclm](https://github.com/mlfoundations/dclm)\]\[[Infinity-Instruct](https://github.com/FlagOpen/Infinity-Instruct)\]\[[MNBVC](https://github.com/esbatmop/MNBVC)\]\[[LMSYS-Chat-1M](https://arxiv.org/abs/2309.11998)\]
      • [llm-datasets - LLM-Synthetic-Data](https://github.com/wasiahmad/Awesome-LLM-Synthetic-Data)\]
      • [AlpacaEval Leaderboard - lab/alpaca_eval)\]
      • [Chatbot-Arena-Leaderboard - 05-03-arena/)\]\[[FastChat](https://github.com/lm-sys/FastChat)\]\[[arena-hard](https://github.com/lm-sys/arena-hard)\]
      • [lm-evaluation-harness - evals](https://github.com/openai/simple-evals)\]
      • [OpenCompass - Eval](https://github.com/open-compass/GAOKAO-Eval)\]\[[VLMEvalKit](https://github.com/open-compass/VLMEvalKit)\]
      • [llm-colosseum
      • [blog
      • [paper - hallucination-survey)\]
      • [paper - LLM-hallucination)\]\[[Awesome-MLLM-Hallucination](https://github.com/showlab/Awesome-MLLM-Hallucination)\]
      • [paper - 2.0)\]
      • [paper - NLP/factool)\]\[[OlympicArena](https://github.com/GAIR-NLP/OlympicArena)\]\[[FActScore](https://arxiv.org/abs/2305.14251)\]
      • [paper - ai/aiconfig/tree/main/cookbooks/Chain-of-Verification)\]
      • [paper - lab/HallusionBench)\]
      • [paper
      • [paper
      • [paper
      • [paper - deepmind/long-form-factuality)\]
      • [paper - science/RefChecker)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [blog
      • [blog
      • [paper - Quantization-Papers](https://github.com/Zhen-Dong/Awesome-Quantization-Papers)\]\[[awesome-model-quantization](https://github.com/htqin/awesome-model-quantization)\]\[[qllm-eval](https://github.com/thu-nics/qllm-eval)\]
      • [paper
      • [paper
      • [paper
      • [paper - FP4)\]
      • [paper - DASLab/gptq)\]\[[AutoGPTQ](https://github.com/PanQiWei/AutoGPTQ)\]
      • [paper - DASLab/qmoe)\]
      • [paper - han-lab/llm-awq)\]\[[AutoAWQ](https://github.com/casper-hansen/AutoAWQ)\]\[[qserve](https://github.com/mit-han-lab/qserve)\]
      • [paper
      • [paper
      • [paper
      • [paper - IPADS/PowerInfer)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[airllm](https://github.com/lyogavin/airllm)\]\[[PowerInfer-2](https://arxiv.org/abs/2406.06282)\]
      • [paper - AILab/flash-attention)\]
      • [paper - AILab/flash-attention)\]
      • [paper - project/vllm)\]\[[FastChat](https://github.com/lm-sys/FastChat)\]\[[Nanoflow](https://github.com/efeslab/Nanoflow)\]\[[ollama](https://github.com/jmorganca/ollama)\]
      • [blog - project/sglang/)\]
      • [paper
      • [paper - AI-Lab/Sequoia)\]\[[HASS](https://arxiv.org/abs/2408.15766)\]
      • [paper - Bench](https://github.com/hemingkx/Spec-Bench)\]
      • [paper
      • [paper - ai-lab/Consistency_LLM)\]\[[LookaheadDecoding](https://github.com/hao-ai-lab/LookaheadDecoding)\]\[[Lookahead](https://github.com/alipay/PainlessInferenceAcceleration)\]
      • [paper
      • [paper - serve)\]\[[ORCA OSDI 2022](https://www.usenix.org/system/files/osdi22-yu.pdf)\]\[[continuous batching blog](https://www.anyscale.com/blog/continuous-batching-llm-inference)\]
      • [paper
      • [paper - sys/prompt-cache)\]
      • [paper
      • [paper - ai/Mooncake)\]\[[ktransformers](https://github.com/kvcache-ai/ktransformers)\]
      • [TensorRT-LLM - inference-server/server)\]\[[GenerativeAIExamples](https://github.com/NVIDIA/GenerativeAIExamples)\]\[[TensorRT-Model-Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer)\]\[[TensorRT](https://github.com/NVIDIA/TensorRT)\]\[[OpenVINO](https://github.com/openvinotoolkit/openvino)\]
      • [DeepSpeed-MII - FastGen](https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-fastgen)\]\[[ONNX Runtime](https://github.com/microsoft/onnxruntime)\]\[[onnx](https://github.com/onnx/onnx)\]
      • [paper - lab/HallusionBench)\]
      • [paper
      • [paper
      • [paper
      • [paper - deepmind/long-form-factuality)\]
      • [paper - nlpir/flashrag)\]
      • [paper - NLP-Group/HippoRAG)\]
      • [paper - Local-UI](https://github.com/severian42/GraphRAG-Local-UI)\]\[[nano-graphrag](https://github.com/gusye1234/nano-graphrag)\]\[[fast-graphrag](https://github.com/circlemind-ai/fast-graphrag)\]\[[graph-rag](https://github.com/sarthakrastogi/graph-rag)\]\[[llm-graph-builder](https://github.com/neo4j-labs/llm-graph-builder)\]\[[Triplex](https://huggingface.co/SciPhi/Triplex)\]\[[knowledge_graph_maker](https://github.com/rahulnyk/knowledge_graph_maker)\]\[[itext2kg](https://github.com/AuvaLab/itext2kg)\]\[[KG_RAG](https://github.com/BaranziniLab/KG_RAG)\]
      • [paper
      • [paper - NLP/RAG)\]\[[Seven Failure Points When Engineering a Retrieval Augmented Generation System](https://arxiv.org/abs/2401.05856)\]\[[Improving Retrieval Performance in RAG Pipelines with Hybrid Search](https://towardsdatascience.com/improving-retrieval-performance-in-rag-pipelines-with-hybrid-search-c75203c2f2f5)\]\[[15 Advanced RAG Techniques from Pre-Retrieval to Generation](https://www.willowtreeapps.com/guides/advanced-rag-techniques)\]
      • [paper
      • [paper - FiT)\]\[[fastRAG](https://github.com/IntelLabs/fastRAG)\]\[[rag-retrieval-study](https://github.com/intellabs/rag-retrieval-study)\]
      • [paper - science/RAGChecker)\]\[[rageval](https://github.com/gomate-community/rageval)\]\[[CORAL](https://github.com/Ariya12138/CORAL)\]
      • [paper - of-thoughts)\]
      • [paper - teacher)\]
      • [paper
      • [paper - Planner)\]
      • [paper - org/llm-reasoners)\]\[[LLM Reasoners COLM 2024](https://arxiv.org/abs/2404.05221)\]
      • [paper
      • [paper - Impact-of-Reasoning-Step-Length-on-Large-Language-Models)\]
      • [paper - Edgerunners/Plan-and-Solve-Prompting)\]\[[maestro](https://github.com/Doriandarko/maestro)\]
      • [paper - models/llm_multiagent_debate)\]\[[Multi-Agents-Debate](https://github.com/Skytliang/Multi-Agents-Debate)\]
      • [paper - refine)\]\[[MCT Self-Refine](https://github.com/trotsky1997/MathBlackBox)\]\[[SelFee](https://github.com/kaistAI/SelFee)\]
      • [paper
      • [paper
      • [paper
      • [paper - discover)\]\[[SELF-DISCOVER](https://github.com/kailashsp/SELF-DISCOVER)\]
      • [paper
      • [paper
      • [paper
      • [paper - of-thought-llm)\]\[[SymbCoT](https://github.com/Aiden0526/SymbCoT)\]
      • [paper - EM-pytorch)\]
      • [paper
      • [paper
      • [paper - rpm-bench)\]
      • [paper
      • [paper - husky/Husky-v1)\]
      • [paper - System)\]
      • [paper
      • [paper - Shanghai/ICSFSurvey)\]
      • [paper - st)\]
      • [paper - MCTS)\]\[[llm-mcts](https://github.com/1989Ryan/llm-mcts)\]
      • [paper - STaR](https://arxiv.org/abs/2403.09629)\]
      • [paper - han-lab/smoothquant)\]\[[ABQ-LLM](https://github.com/bytedance/ABQ-LLM)\]\[[VPTQ](https://github.com/microsoft/VPTQ)\]\[[ppq](https://github.com/OpenPPL/ppq)\]
      • [paper - MAC](https://github.com/microsoft/T-MAC)\]\[[BitBLAS](https://github.com/microsoft/BitBLAS)\]\[[BiLLM](https://github.com/Aaronhuang-778/BiLLM)\]
      • [paper - DASLab/gptq)\]\[[AutoGPTQ](https://github.com/PanQiWei/AutoGPTQ)\]
      • [paper - DASLab/qmoe)\]
      • [paper - han-lab/llm-awq)\]\[[AutoAWQ](https://github.com/casper-hansen/AutoAWQ)\]\[[qserve](https://github.com/mit-han-lab/qserve)\]
      • [paper
      • [paper
      • [paper
      • [paper - IPADS/PowerInfer)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[airllm](https://github.com/lyogavin/airllm)\]\[[PowerInfer-2](https://arxiv.org/abs/2406.06282)\]
      • [paper - AILab/flash-attention)\]
      • [paper - AILab/flash-attention)\]
      • [paper - AILab/flash-attention)\]
      • [paper - project/vllm)\]\[[FastChat](https://github.com/lm-sys/FastChat)\]\[[Nanoflow](https://github.com/efeslab/Nanoflow)\]
      • [blog - project/sglang/)\]
      • [paper
      • [paper - AI-Lab/Sequoia)\]
      • [paper - Bench](https://github.com/hemingkx/Spec-Bench)\]
      • [paper
      • [paper - ai-lab/Consistency_LLM)\]\[[LookaheadDecoding](https://github.com/hao-ai-lab/LookaheadDecoding)\]\[[Lookahead](https://github.com/alipay/PainlessInferenceAcceleration)\]
      • [paper
      • [paper - serve)\]\[[ORCA OSDI 2022](https://www.usenix.org/system/files/osdi22-yu.pdf)\]\[[continuous batching blog](https://www.anyscale.com/blog/continuous-batching-llm-inference)\]
      • [paper
      • [paper - sys/prompt-cache)\]
      • [paper
      • [paper - ai/Mooncake)\]\[[ktransformers](https://github.com/kvcache-ai/ktransformers)\]
      • [TensorRT-LLM - inference-server/server)\]\[[GenerativeAIExamples](https://github.com/NVIDIA/GenerativeAIExamples)\]\[[TensorRT-Model-Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer)\]\[[TensorRT](https://github.com/NVIDIA/TensorRT)\]\[[OpenVINO](https://github.com/openvinotoolkit/openvino)\]
      • [DeepSpeed-MII - FastGen](https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-fastgen)\]\[[ONNX Runtime](https://github.com/microsoft/onnxruntime)\]\[[onnx](https://github.com/onnx/onnx)\]
      • [OpenLLM - llm](https://github.com/mlc-ai/mlc-llm)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[open-webui](https://github.com/open-webui/open-webui)\]\[[torchchat](https://github.com/pytorch/torchchat)\]
      • [LMDeploy - ai/Mooncake)\]\[[inference](https://github.com/xorbitsai/inference)\]\[[LitServe](https://github.com/Lightning-AI/LitServe)\]
      • [ChuanhuChatGPT - Next-Web](https://github.com/ChatGPTNextWeb/ChatGPT-Next-Web)\]
      • [blog
      • [paper - Implementation](https://github.com/davidmrau/mixture-of-experts)\]
      • [paper - of-experts](https://github.com/lucidrains/mixture-of-experts)\]
      • [paper - futuredata/megablocks)\]
      • [paper
      • [paper
      • [paper - offloading)\]
      • [paper - inference)\]\[[megablocks-public](https://github.com/mistralai/megablocks-public)\]\[[model](https://huggingface.co/mistralai)\]\[[blog](https://mistral.ai/news/mixtral-of-experts/)\]\[[Chinese-Mixtral-8x7B](https://github.com/HIT-SCIR/Chinese-Mixtral-8x7B)\]\[[Chinese-Mixtral](https://github.com/ymcui/Chinese-Mixtral)\]
      • [paper - ai/DeepSeek-MoE)\]
      • [paper - ai/DeepSeek-V2)\]\[[DeepSeek-V2.5](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)\]
      • [paper - ai/ESFT)\]\[[Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts](https://arxiv.org/abs/2408.15664)\]
      • [paper - model-merge)\]
      • [paper - into-MoEs)\]
      • [text-generation-inference - quanto](https://github.com/huggingface/optimum-quanto)\]\[[huggingface-inference-toolkit](https://github.com/huggingface/huggingface-inference-toolkit)\]
      • [OpenLLM - llm](https://github.com/mlc-ai/mlc-llm)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[open-webui](https://github.com/open-webui/open-webui)\]\[[torchchat](https://github.com/pytorch/torchchat)\]
      • [LMDeploy - ai/Mooncake)\]\[[inference](https://github.com/xorbitsai/inference)\]\[[LitServe](https://github.com/Lightning-AI/LitServe)\]
      • [ggml - fast](https://github.com/pytorch-labs/gpt-fast)\]\[[lightllm](https://github.com/ModelTC/lightllm)\]\[[fastllm](https://github.com/ztxz16/fastllm)\]\[[CTranslate2](https://github.com/OpenNMT/CTranslate2)\]\[[ipex-llm](https://github.com/intel-analytics/ipex-llm)\]\[[rtp-llm](https://github.com/alibaba/rtp-llm)\]\[[KsanaLLM](https://github.com/pcg-mlp/KsanaLLM)\]\[[ppl.nn](https://github.com/OpenPPL/ppl.nn)\]
      • [ChuanhuChatGPT - Next-Web](https://github.com/ChatGPTNextWeb/ChatGPT-Next-Web)\]
      • [blog
      • [paper - Implementation](https://github.com/davidmrau/mixture-of-experts)\]
      • [paper - of-experts](https://github.com/lucidrains/mixture-of-experts)\]
      • [paper - futuredata/megablocks)\]
      • [paper
      • [paper
      • [paper - offloading)\]
      • [paper - inference)\]\[[megablocks-public](https://github.com/mistralai/megablocks-public)\]\[[model](https://huggingface.co/mistralai)\]\[[blog](https://mistral.ai/news/mixtral-of-experts/)\]\[[Chinese-Mixtral-8x7B](https://github.com/HIT-SCIR/Chinese-Mixtral-8x7B)\]\[[Chinese-Mixtral](https://github.com/ymcui/Chinese-Mixtral)\]
      • [paper - ai/DeepSeek-MoE)\]
      • [paper - ai/DeepSeek-V2)\]\[[DeepSeek-V2.5](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)\]
      • [paper - ai/ESFT)\]\[[Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts](https://arxiv.org/abs/2408.15664)\]
      • [paper - model-merge)\]
      • [paper - into-MoEs)\]
      • [paper - Survey-on-Mixture-of-Experts)\]
      • [paper
      • [paper
      • [DeepSpeed - us/research/blog/zero-deepspeed-new-system-optimizations-enable-training-models-with-over-100-billion-parameters/)\]
      • [Megatron-LM - DeepSpeed](https://github.com/microsoft/Megatron-DeepSpeed)\]\[[Megatron-DeepSpeed](https://github.com/bigscience-workshop/Megatron-DeepSpeed)\]
      • [torchtune
      • [mergekit - models](https://huggingface.co/blog/mlabonne/merge-models)\]\[[Model Merging](https://huggingface.co/collections/osanseviero/model-merging-65097893623330a3a51ead66)\]\[[OpenChatKit](https://github.com/togethercomputer/OpenChatKit)\]
      • [paper - ai/studios/code-lora-from-scratch)\]\[[lora](https://github.com/cloneofsimo/lora)\]\[[dora](https://github.com/catid/dora)\]\[[MoRA](https://github.com/kongds/MoRA)\]\[[ziplora-pytorch](https://github.com/mkshing/ziplora-pytorch)\]\[[alpaca-lora](https://github.com/tloen/alpaca-lora)\]
      • [paper - LoRA/S-LoRA)\]\[[AdaLoRA](https://github.com/QingruZhang/AdaLoRA)\]\[[LoRAMoE](https://github.com/Ablustrund/LoRAMoE)\]\[[lorahub](https://github.com/sail-sg/lorahub)\]\[[O-LoRA](https://github.com/cmnfriend/O-LoRA)\]\[[qa-lora](https://github.com/yuhuixu1993/qa-lora)\]
      • [paper - GA)\]\[[LoRA-Pro blog](https://kexue.fm/archives/10266)\]\[[dora](https://github.com/catid/dora)\]
      • [paper
      • [paper - research/adapter-bert)\]\[[unify-parameter-efficient-tuning](https://github.com/jxhe/unify-parameter-efficient-tuning)\]
      • [paper - hub/adapters)\]\[[A Survey on LoRA of Large Language Models](https://arxiv.org/abs/2407.11046)\]
      • [paper - Edgerunners/LLM-Adapters)\]
      • [paper - Adapter)\]
      • [paper - Pro)\]
      • [paper - tuning)\]
      • [paper - tuning-v2)\]\[[pet](https://github.com/timoschick/pet)\]\[[PrefixTuning](https://github.com/XiangLi1999/PrefixTuning)\]
      • [paper
      • [paper
      • [paper - Survey-on-Mixture-of-Experts)\]
      • [paper
      • [paper
      • [llama-moe - pytorch](https://github.com/lucidrains/PEER-pytorch)\]\[[GRIN-MoE](https://github.com/microsoft/GRIN-MoE)\]
      • [DeepSpeed - us/research/blog/zero-deepspeed-new-system-optimizations-enable-training-models-with-over-100-billion-parameters/)\]
      • [Megatron-LM - DeepSpeed](https://github.com/microsoft/Megatron-DeepSpeed)\]\[[Megatron-DeepSpeed](https://github.com/bigscience-workshop/Megatron-DeepSpeed)\]
      • [torchtune
      • [PEFT - Factory](https://github.com/hiyouga/LLaMA-Factory)\]\[[LMFlow](https://github.com/OptimalScale/LMFlow)\]\[[unsloth](https://github.com/unslothai/unsloth)\]\[[xtuner](https://github.com/InternLM/xtuner)\]\[[MFTCoder](https://github.com/codefuse-ai/MFTCoder)\]\[[llm-foundry](https://github.com/mosaicml/llm-foundry)\]\[[swift](https://github.com/modelscope/swift)\]\[[Liger-Kernel](https://github.com/linkedin/Liger-Kernel)\]
      • [mergekit - models](https://huggingface.co/blog/mlabonne/merge-models)\]\[[Model Merging](https://huggingface.co/collections/osanseviero/model-merging-65097893623330a3a51ead66)\]\[[OpenChatKit](https://github.com/togethercomputer/OpenChatKit)\]
      • [paper - ai/studios/code-lora-from-scratch)\]\[[lora](https://github.com/cloneofsimo/lora)\]\[[dora](https://github.com/catid/dora)\]\[[MoRA](https://github.com/kongds/MoRA)\]\[[ziplora-pytorch](https://github.com/mkshing/ziplora-pytorch)\]\[[alpaca-lora](https://github.com/tloen/alpaca-lora)\]
      • [paper - qlora](https://github.com/htqin/ir-qlora)\]\[[fsdp_qlora](https://github.com/AnswerDotAI/fsdp_qlora)\]
      • [paper - LoRA/S-LoRA)\]\[[AdaLoRA](https://github.com/QingruZhang/AdaLoRA)\]\[[LoRAMoE](https://github.com/Ablustrund/LoRAMoE)\]\[[lorahub](https://github.com/sail-sg/lorahub)\]\[[O-LoRA](https://github.com/cmnfriend/O-LoRA)\]\[[qa-lora](https://github.com/yuhuixu1993/qa-lora)\]
      • [paper - GA)\]\[[LoRA-Pro blog](https://kexue.fm/archives/10266)\]\[[dora](https://github.com/catid/dora)\]
      • [paper - GaLore](https://github.com/VITA-Group/Q-GaLore)\]\[[WeLore](https://github.com/VITA-Group/WeLore)\]
      • [paper
      • [paper - research/adapter-bert)\]\[[unify-parameter-efficient-tuning](https://github.com/jxhe/unify-parameter-efficient-tuning)\]
      • [paper - hub/adapters)\]\[[A Survey on LoRA of Large Language Models](https://arxiv.org/abs/2407.11046)\]
      • [paper - Edgerunners/LLM-Adapters)\]
      • [paper - Adapter)\]
      • [paper - Pro)\]
      • [paper - tuning)\]
      • [paper - tuning-v2)\]\[[pet](https://github.com/timoschick/pet)\]\[[PrefixTuning](https://github.com/XiangLi1999/PrefixTuning)\]
      • [paper - parameter-efficient-tuning)\]
      • [paper
      • [paper
      • [paper - foundation/bitsandbytes)\]
      • [paper - AMP)\]
      • [paper
      • [paper
      • [paper - Factory)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - basis-bge-zh/)\]
      • [paper - embeddings-v2](https://huggingface.co/jinaai/jina-embeddings-v2-base-en)\]\[[jina-reranker-v2](https://huggingface.co/jinaai/jina-reranker-v2-base-multilingual)\]\[[pe_rank](https://github.com/liuqi6777/pe_rank)\]\[[Jina CLIP](https://arxiv.org/abs/2405.20204)\]\[[jina-embeddings-v3](https://arxiv.org/abs/2409.10173)\]
      • [paper - large-zh)\]\[[gte-Qwen2-7B-instruct](https://huggingface.co/Alibaba-NLP/gte-Qwen2-7B-instruct)\]\[[gte-large-en-v1.5](https://huggingface.co/Alibaba-NLP/gte-large-en-v1.5)\]
      • [BCEmbedding - embedding-base_v1](https://huggingface.co/maidalun1020/bce-embedding-base_v1)\]\[[bce-reranker-base_v1](https://huggingface.co/maidalun1020/bce-reranker-base_v1)\]
      • [CohereV3
      • [paper - ai/instructor-embedding)\]
      • [paper - mistral-7b-instruct)\]\[[llm2vec](https://github.com/McGill-NLP/llm2vec)\]
      • [paper - ai/contrastors)\]
      • [paper
      • [paper
      • [paper - AMP)\]
      • [paper
      • [paper
      • [paper - Factory)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - deepmind/opro)\]
      • [paper - Lab/ATLAS)\]
      • [paper - prompting)\]
      • [paper - team/appl)\]\[[sammo](https://github.com/microsoft/sammo)\]\[[prompt-poet](https://github.com/character-ai/prompt-poet)\]\[[ell](https://github.com/MadcowD/ell)\]
      • [paper
      • [paper
      • [PromptPapers - engineering)\]\[[ChatGPT Prompt Engineering for Developers](https://prompt-engineering.xiniushu.com/)\]\[[Prompt Engineering Guide](https://www.promptingguide.ai/zh)\]\[[k12promptguide](https://www.k12promptguide.com/)\]\[[gpt-prompt-engineer](https://github.com/mshumer/gpt-prompt-engineer)\]\[[awesome-chatgpt-prompts](https://github.com/f/awesome-chatgpt-prompts)\]\[[awesome-chatgpt-prompts-zh](https://github.com/PlexPt/awesome-chatgpt-prompts-zh)\]
      • [paper - demonstrations)\]
      • [paper
      • [paper - machines/pal)\]
      • [paper - instruction-learning)\]
      • [paper
      • [paper - human-preferences)\]
      • [paper - from-feedback)\]
      • [paper
      • [paper - li/Instruction-Tuning-Survey)\]
      • [paper
      • [paper
      • [paper - KGLLM/RAG-Survey)\]\[[Modular RAG](https://arxiv.org/abs/2407.21059)\]
      • [paper - Survey)\]
      • [paper - Augmented Generation for Natural Language Processing: A Survey](https://arxiv.org/abs/2407.13193)\]\[[A Survey on RAG Meeting LLMs](https://arxiv.org/abs/2405.06211)\]
      • [paper
      • [paper - token-nq)\]\[[docs](https://huggingface.co/docs/transformers/main/model_doc/rag)\]\[[FAISS](https://github.com/facebookresearch/faiss)\]
      • [paper - rag)\]\[[CRAG](https://github.com/HuskyInSalt/CRAG)\]\[[Golden-Retriever](https://arxiv.org/abs/2408.00798)\]
      • [paper
      • [paper
      • [paper - pytorch](https://github.com/lucidrains/RETRO-pytorch)\]
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - community/GoMate)\]
      • [paper
      • [paper
      • [paper - isf)\]
      • [paper - RAG)\]\[[Adaptive-RAG](https://github.com/starsuzi/Adaptive-RAG)\]\[[Advanced RAG 11: Query Classification and Refinement](https://ai.gopubby.com/advanced-rag-11-query-classification-and-refinement-2aec79f4140b)\]
      • [paper - ecosystem-engineering/Blended-RAG)\]\[[infinity](https://github.com/infiniflow/infinity)\]
      • [paper - new)\]\[[ind_kdd_2024/](https://www.biendata.net/competition/ind_kdd_2024/)\]\[[KDD2024-WhoIsWho-Top3](https://github.com/yanqiangmiffy/KDD2024-WhoIsWho-Top3)\]
      • [paper
      • [blog
      • [link
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [LangChain - rag/)\]\[[LangChain Hub](https://smith.langchain.com/hub)\]\[[langgraph](https://github.com/langchain-ai/langgraph)\]
      • [LlamaIndex - llama/llama_deploy)\]\[[A Cheat Sheet and Some Recipes For Building Advanced RAG](https://blog.llamaindex.ai/a-cheat-sheet-and-some-recipes-for-building-advanced-rag-803a9d94c41b)\]\[[Fine-Tuning Embeddings for RAG with Synthetic Data](https://www.llamaindex.ai/blog/fine-tuning-embeddings-for-rag-with-synthetic-data-e534409a3971)\]
      • [chatgpt-retrieval-plugin
      • [haystack - Chatchat](https://github.com/chatchat-space/Langchain-Chatchat)\]
      • [paper - Local-UI](https://github.com/severian42/GraphRAG-Local-UI)\]\[[nano-graphrag](https://github.com/gusye1234/nano-graphrag)\]\[[graph-rag](https://github.com/sarthakrastogi/graph-rag)\]\[[llm-graph-builder](https://github.com/neo4j-labs/llm-graph-builder)\]\[[Triplex](https://huggingface.co/SciPhi/Triplex)\]\[[knowledge_graph_maker](https://github.com/rahulnyk/knowledge_graph_maker)\]\[[itext2kg](https://github.com/AuvaLab/itext2kg)\]
      • [paper
      • [paper - NLP/RAG)\]\[[Seven Failure Points When Engineering a Retrieval Augmented Generation System](https://arxiv.org/abs/2401.05856)\]\[[Improving Retrieval Performance in RAG Pipelines with Hybrid Search](https://towardsdatascience.com/improving-retrieval-performance-in-rag-pipelines-with-hybrid-search-c75203c2f2f5)\]\[[15 Advanced RAG Techniques from Pre-Retrieval to Generation](https://www.willowtreeapps.com/guides/advanced-rag-techniques)\]
      • [paper
      • [paper
      • [paper - science/RAGChecker)\]\[[rageval](https://github.com/gomate-community/rageval)\]
      • [paper - new)\]\[[ind_kdd_2024/](https://www.biendata.net/competition/ind_kdd_2024/)\]\[[KDD2024-WhoIsWho-Top3](https://github.com/yanqiangmiffy/KDD2024-WhoIsWho-Top3)\]
      • [paper
      • [blog
      • [link
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [LangChain - rag/)\]\[[LangChain Hub](https://smith.langchain.com/hub)\]\[[langgraph](https://github.com/langchain-ai/langgraph)\]
      • [LlamaIndex - llama/llama_deploy)\]\[[A Cheat Sheet and Some Recipes For Building Advanced RAG](https://blog.llamaindex.ai/a-cheat-sheet-and-some-recipes-for-building-advanced-rag-803a9d94c41b)\]\[[Fine-Tuning Embeddings for RAG with Synthetic Data](https://www.llamaindex.ai/blog/fine-tuning-embeddings-for-rag-with-synthetic-data-e534409a3971)\]
      • [chatgpt-retrieval-plugin
      • [haystack - Chatchat](https://github.com/chatchat-space/Langchain-Chatchat)\]
      • [ragas
      • [vimGPT
      • [QAnything - llm](https://github.com/Mintplex-Labs/anything-llm)\]\[[FastGPT](https://github.com/labring/FastGPT)\]\[[mem0](https://github.com/mem0ai/mem0)\]\[[Memary](https://github.com/kingjulio8238/Memary)\]
      • [ragas
      • [vimGPT
      • [QAnything - llm](https://github.com/Mintplex-Labs/anything-llm)\]\[[FastGPT](https://github.com/labring/FastGPT)\]\[[mem0](https://github.com/mem0ai/mem0)\]\[[Memary](https://github.com/kingjulio8238/Memary)\]
      • [paper - cellar/beir)\]
      • [paper - benchmark/mteb)\]\[[leaderboard](https://huggingface.co/spaces/mteb/leaderboard)\]
      • [paper - transformers)\]\[[model](https://huggingface.co/sentence-transformers)\]\[[vec2text](https://github.com/jxmorris12/vec2text)\]
      • [paper - nlp/SimCSE)\]\[[AnglE ACL 2024](https://github.com/SeanLee97/AnglE)\]
      • [paper - text-and-code-embeddings)\]
      • [paper
      • [m3e-base - embedding-v2](https://huggingface.co/lier007/xiaobu-embedding-v2)\]\[[stella_en_1.5B_v5](https://huggingface.co/dunzhang/stella_en_1.5B_v5)\]
      • [paper - embeddings-v2](https://huggingface.co/jinaai/jina-embeddings-v2-base-en)\]\[[jina-reranker-v2](https://huggingface.co/jinaai/jina-reranker-v2-base-multilingual)\]\[[pe_rank](https://github.com/liuqi6777/pe_rank)\]\[[Jina CLIP](https://arxiv.org/abs/2405.20204)\]\[[jina-embeddings-v3](https://arxiv.org/abs/2409.10173)\]
      • [paper - large-zh)\]\[[gte-Qwen2-7B-instruct](https://huggingface.co/Alibaba-NLP/gte-Qwen2-7B-instruct)\]\[[gte-large-en-v1.5](https://huggingface.co/Alibaba-NLP/gte-large-en-v1.5)\]
      • [BCEmbedding - embedding-base_v1](https://huggingface.co/maidalun1020/bce-embedding-base_v1)\]\[[bce-reranker-base_v1](https://huggingface.co/maidalun1020/bce-reranker-base_v1)\]
      • [paper - MCTS)\]
      • [paper - STaR](https://arxiv.org/abs/2403.09629)\]
      • [paper
      • [paper
      • [paper - aim)\]
      • [paper
      • [paper
      • [paper - zhu.com/part-2-grade-school-math/part-2-1)\]
      • [paper
      • [paper
      • [paper - rep)\]
      • [paper
      • [blog - interpretability)\]\[[transformer-debugger](https://github.com/openai/transformer-debugger)\]
      • [OpenAI Blog - auto-interp](https://github.com/EleutherAI/sae-auto-interp)\]\[[multimodal-sae](https://github.com/EvolvingLMMs-Lab/multimodal-sae)\]
      • [blog
      • [blog
      • [paper
      • [paper - transparency-tool)\]
      • [paper - explainer)\]\[[demo](https://poloclub.github.io/transformer-explainer)\]
      • [paper - dynamics)\]
      • [Transformer Circuits Thread - chapter1-transformer-interp.streamlit.app)\]\[[Awesome-Interpretability-in-Large-Language-Models](https://github.com/ruizheliUOA/Awesome-Interpretability-in-Large-Language-Models)\]\[[TransformerLens](https://github.com/TransformerLensOrg/TransformerLens)\]\[[inseq](https://github.com/inseq-team/inseq)\]
      • [paper
      • [paper
      • [paper
      • [Awesome-Chinese-LLM - LLMs-In-China](https://github.com/wgwang/awesome-LLMs-In-China)\]\[[awesome-LLM-resourses](https://github.com/WangRongsheng/awesome-LLM-resourses)\]
      • [paper
      • [paper - 130B/)\]
      • [paper - inc/Baichuan2)\]\[[BaichuanSEED](https://arxiv.org/abs/2408.15079)\]\[[Baichuan Alignment Technical Report](https://arxiv.org/abs/2410.14940)\]
      • [paper
      • [RAG-Retrieval - Shanghai/PGRAG)\]\[[CRUD_RAG](https://github.com/IAAR-Shanghai/CRUD_RAG)\]\[[PlanRAG](https://github.com/myeon9h/PlanRAG)\]\[[DPA-RAG](https://github.com/dongguanting/DPA-RAG)\]\[[LongRAG](https://github.com/TIGER-AI-Lab/LongRAG)\]\[[Controllable-RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[structured-rag](https://github.com/weaviate/structured-rag)\]\[[RAGLab](https://github.com/fate-ubw/RAGLab)\]\[[autogluon-rag](https://github.com/autogluon/autogluon-rag)\]
      • [paper - cellar/beir)\]
      • [paper - benchmark/mteb)\]\[[leaderboard](https://huggingface.co/spaces/mteb/leaderboard)\]
      • [paper - transformers)\]\[[model](https://huggingface.co/sentence-transformers)\]\[[vec2text](https://github.com/jxmorris12/vec2text)\]
      • [paper - nlp/SimCSE)\]\[[AnglE ACL 2024](https://github.com/SeanLee97/AnglE)\]
      • [paper - text-and-code-embeddings)\]
      • [paper
      • [paper
      • [paper - long.194/)\]\[[llm_reranker](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/llm_reranker)\]\[[FlagEmbedding](https://github.com/FlagOpen/FlagEmbedding)\]
      • [paper - NLP/llm2vec)\]
      • [paper - Embed-v1)\]
      • [paper
      • [JamAIBase
      • [paper - of-thought-hub](https://github.com/FranxYao/chain-of-thought-hub)\]
      • [paper
      • [paper - takeshi188/zero_shot_cot)\]
      • [paper - science/auto-cot)\]
      • [paper - science/mm-cot)\]
      • [paper
      • [paper
      • [paper - REACT)\]
      • [paper - nlp/tree-of-thought-llm)\]\[[Plug in and Play Implementation](https://github.com/kyegomez/tree-of-thoughts)\]\[[tree-of-thought-prompting](https://github.com/dave1010/tree-of-thought-prompting)\]
      • [paper - of-thoughts)\]
      • [paper - ai/cumulative-reasoning)\]\[[On the Diagram of Thought](https://arxiv.org/abs/2409.10038)\]
      • [paper - Of-Thoughts)\]
      • [paper - of-Thoughts-XoT)\]
      • [paper - of-thoughts)\]
      • [paper - teacher)\]
      • [paper
      • [paper - Planner)\]
      • [paper - org/llm-reasoners)\]\[[LLM Reasoners COLM 2024](https://arxiv.org/abs/2404.05221)\]
      • [paper
      • [paper - AI-Lab/Program-of-Thoughts)\]
      • [paper
      • [paper
      • [paper - Impact-of-Reasoning-Step-Length-on-Large-Language-Models)\]
      • [paper - Edgerunners/Plan-and-Solve-Prompting)\]\[[maestro](https://github.com/Doriandarko/maestro)\]
      • [paper - models/llm_multiagent_debate)\]\[[Multi-Agents-Debate](https://github.com/Skytliang/Multi-Agents-Debate)\]
      • [paper - refine)\]\[[MCT Self-Refine](https://github.com/trotsky1997/MathBlackBox)\]
      • [paper
      • [paper
      • [paper
      • [paper - discover)\]\[[SELF-DISCOVER](https://github.com/kailashsp/SELF-DISCOVER)\]
      • [paper
      • [paper
      • [paper
      • [paper - of-thought-llm)\]\[[SymbCoT](https://github.com/Aiden0526/SymbCoT)\]
      • [paper - EM-pytorch)\]
      • [paper
      • [paper - rpm-bench)\]
      • [paper
      • [paper - husky/Husky-v1)\]
      • [paper - System)\]
      • [paper
      • [paper - Shanghai/ICSFSurvey)\]
      • [blog - Time Compute Optimally can be More Effective than Scaling Model Parameters](https://arxiv.org/abs/2408.03314)\]\[[Let's Verify Step by Step](https://arxiv.org/abs/2305.20050)\]\[[Awesome-LLM-Strawberry](https://github.com/hijkzzz/Awesome-LLM-Strawberry)\]
      • [llm-reasoners - groq/g1)\]
      • [Prompt4ReasoningPapers
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper
      • [paper - 2-research](https://github.com/open-thought/system-2-research)\]
      • [paper - AI/Telechat)\]\[[TeleChat2](https://github.com/Tele-AI/TeleChat2)\]\[[Tele-FLM Technical Report](https://arxiv.org/abs/2404.16645)\]\[[Tele-FLM](https://huggingface.co/CofeAI/Tele-FLM)\]\[[Tele-FLM-1T](https://huggingface.co/CofeAI/Tele-FLM-1T)\]
      • [OpenAI Blog - auto-interp](https://github.com/EleutherAI/sae-auto-interp)\]
      • [blog
      • [blog
      • [paper
      • [paper - transparency-tool)\]
      • [paper - explainer)\]\[[demo](https://poloclub.github.io/transformer-explainer)\]
      • [paper - dynamics)\]
      • [Transformer Circuits Thread - chapter1-transformer-interp.streamlit.app)\]\[[Awesome-Interpretability-in-Large-Language-Models](https://github.com/ruizheliUOA/Awesome-Interpretability-in-Large-Language-Models)\]\[[TransformerLens](https://github.com/TransformerLensOrg/TransformerLens)\]\[[inseq](https://github.com/inseq-team/inseq)\]
      • [paper
      • [paper
      • [paper
      • [Awesome-Chinese-LLM - LLMs-In-China](https://github.com/wgwang/awesome-LLMs-In-China)\]\[[awesome-LLM-resourses](https://github.com/WangRongsheng/awesome-LLM-resourses)\]
      • [paper
      • [paper - 130B/)\]
      • [paper - 6B](https://github.com/THUDM/ChatGLM-6B)\]\[[ChatGLM2-6B](https://github.com/THUDM/ChatGLM2-6B)\]\[[ChatGLM3](https://github.com/THUDM/ChatGLM3)\]\[[GLM-4](https://github.com/THUDM/GLM-4)\]\[[AgentTuning](https://github.com/THUDM/AgentTuning)\]\[[AlignBench](https://github.com/THUDM/AlignBench)\]
      • [paper - inc/Baichuan2)\]\[[BaichuanSEED](https://arxiv.org/abs/2408.15079)\]
      • [paper
      • [paper - Agent](https://github.com/QwenLM/Qwen-Agent)\]\[[AutoIF](https://github.com/QwenLM/AutoIF)\]\[[modeling_qwen2.py](https://github.com/huggingface/transformers/blob/main/src/transformers/models/qwen2/modeling_qwen2.py)\]
      • [paper - ai/Yi)\]\[[Yi-1.5](https://github.com/01-ai/Yi-1.5)\]
      • [paper - AI/Telechat)\]\[[Tele-FLM Technical Report](https://arxiv.org/abs/2404.16645)\]\[[Tele-FLM](https://huggingface.co/CofeAI/Tele-FLM)\]\[[Tele-FLM-1T](https://huggingface.co/CofeAI/Tele-FLM-1T)\]
      • [paper
      • [paper - GSAI/Llama-3-SynE)\]
      • [MiniCPM - MoE](https://github.com/SkyworkAI/Skywork-MoE)\]\[[Orion](https://github.com/OrionStarAI/Orion)\]\[[BELLE](https://github.com/LianjiaTech/BELLE)\]\[[Yuan-2.0](https://github.com/IEIT-Yuan/Yuan-2.0)\]\[[Yuan2.0-M32](https://github.com/IEIT-Yuan/Yuan2.0-M32)\]\[[Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)\]\[[Index-1.9B](https://github.com/bilibili/Index-1.9B)\]\[[Aquila2](https://github.com/FlagAI-Open/Aquila2)\]
      • [LlamaFamily/Llama-Chinese - AI/Chinese-Llama-2-7b](https://github.com/LinkSoul-AI/Chinese-Llama-2-7b)\]\[[llama3-Chinese-chat](https://github.com/CrazyBoyM/llama3-Chinese-chat)\]\[[phi3-Chinese](https://github.com/CrazyBoyM/phi3-Chinese)\]\[[LLM-Chinese](https://github.com/CrazyBoyM/LLM-Chinese)\]\[[Llama3-Chinese-Chat](https://github.com/Shenzhi-Wang/Llama3-Chinese-Chat)\]\[[llama3-chinese](https://github.com/seanzhang-zhichen/llama3-chinese)\]
      • [Firefly - chitchat](https://github.com/yangjianxin1/GPT2-chitchat)\]
      • [paper - CoT)\]
      • [paper - Agent](https://github.com/QwenLM/Qwen-Agent)\]\[[AutoIF](https://github.com/QwenLM/AutoIF)\]
      • [paper - ai/Yi)\]\[[Yi-1.5](https://github.com/01-ai/Yi-1.5)\]
      • [paper
      • [paper - LLM](https://github.com/deepseek-ai/DeepSeek-LLM)\]\[[DeepSeek-V2](https://github.com/deepseek-ai/DeepSeek-V2)\]\[[DeepSeek-Coder](https://github.com/deepseek-ai/DeepSeek-Coder)\]
      • [paper - deepmind/synthid-text)\]
      • [paper - NLP-SG/CoI-Agent)\]
      • [paper - NLPIR/LLM4IR-Survey)\]\[[YuLan-IR](https://github.com/RUC-GSAI/YuLan-IR)\]\[[A Survey of Conversational Search](https://arxiv.org/abs/2410.15576)\]
      • [paper - Modal Search](https://arxiv.org/abs/2408.14698)\]\[[M3DocRAG](https://arxiv.org/abs/2411.04952)\]\[[Visualized BGE](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/visual_bge)\]\[[OmniSearch](https://github.com/Alibaba-NLP/OmniSearch)\]
      • [paper - PaLM)\]
      • [paper - nlp/ProLong)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper - han-lab/smoothquant)\]\[[ABQ-LLM](https://github.com/bytedance/ABQ-LLM)\]\[[VPTQ](https://github.com/microsoft/VPTQ)\]\[[ppq](https://github.com/OpenPPL/ppq)\]
      • [ggml - fast](https://github.com/pytorch-labs/gpt-fast)\]\[[lightllm](https://github.com/ModelTC/lightllm)\]\[[fastllm](https://github.com/ztxz16/fastllm)\]\[[CTranslate2](https://github.com/OpenNMT/CTranslate2)\]\[[ipex-llm](https://github.com/intel-analytics/ipex-llm)\]\[[rtp-llm](https://github.com/alibaba/rtp-llm)\]\[[KsanaLLM](https://github.com/pcg-mlp/KsanaLLM)\]\[[ppl.nn](https://github.com/OpenPPL/ppl.nn)\]
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [paper - NLP/O1-Journey)\]\[[O1 Replication Journey -- Part 2](https://arxiv.org/abs/2411.16489)\]\[[LLaMA-O1](https://github.com/SimpleBerry/LLaMA-O1)\]\[[Marco-o1](https://github.com/AIDC-AI/Marco-o1)\]\[[qwq-32b-preview](https://qwenlm.github.io/blog/qwq-32b-preview)\]
      • [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
      • [paper - aloha)\]\[[Hardware Code](https://github.com/MarkFzp/mobile-aloha)\]\[[Learning Code](https://github.com/MarkFzp/act-plus-plus)\]\[[UMI](https://github.com/real-stanford/universal_manipulation_interface)\]\[[humanplus](https://github.com/MarkFzp/humanplus)\]\[[TeleVision](https://github.com/OpenTeleVision/TeleVision)\]\[[Surgical Robot Transformer](https://surgical-robot-transformer.github.io/)\]\[[lifelike-agility-and-play](https://github.com/Tencent-RoboticsX/lifelike-agility-and-play)\]\[[ReKep](https://rekep-robot.github.io/)\]\[[Open_Duck_Mini](https://github.com/apirrone/Open_Duck_Mini)\]\[[Learning Visual Parkour from Generated Images](https://lucidsim.github.io/)\]
      • [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
      • [paper - deepmind/graphcast)\]
      • [paper - project/bigcodebench)\]\[[LiveCodeBench](https://github.com/LiveCodeBench/LiveCodeBench)\]\[[evalplus](https://github.com/evalplus/evalplus)\]
      • [paper - PaLM)\]
      • [alphafold3 - deepmind/alphafold)\]\[[RoseTTAFold](https://github.com/RosettaCommons/RoseTTAFold)\]\[[RFdiffusion](https://github.com/RosettaCommons/RFdiffusion)\]
      • [openfold - pytorch](https://github.com/lucidrains/alphafold3-pytorch)\]\[[Protenix](https://github.com/bytedance/Protenix)\]\[[AlphaFold3](https://github.com/kyegomez/AlphaFold3)\]\[[Ligo-Biosciences/AlphaFold3](https://github.com/Ligo-Biosciences/AlphaFold3)\]\[[LucaOne](https://github.com/LucaOne/LucaOne)\]\[[esm](https://github.com/evolutionaryscale/esm)\]\[[AlphaPPImd](https://github.com/AspirinCode/AlphaPPImd)\]\[[visual-med-alpaca](https://github.com/cambridgeltl/visual-med-alpaca)\]\[[chai-lab](https://github.com/chaidiscovery/chai-lab)\]\[[evo](https://github.com/evo-design/evo)\]
      • [paper - Pruner)\]\[[Awesome-Efficient-LLM](https://github.com/horseee/Awesome-Efficient-LLM)\]
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
      • [PDF-Extract-Kit - tech/colpali)\]\[[localGPT-Vision](https://github.com/PromtEngineer/localGPT-Vision)\]\[[mPLUG-DocOwl](https://github.com/X-PLUG/mPLUG-DocOwl)\]
      • [paper
      • [paper - coai/CharacterGLM-6B)\]
      • [paper - V](https://github.com/OpenBMB/MiniCPM-V)\]
      • [Skywork - MoE](https://github.com/SkyworkAI/Skywork-MoE)\]\[[Orion](https://github.com/OrionStarAI/Orion)\]\[[BELLE](https://github.com/LianjiaTech/BELLE)\]\[[Yuan-2.0](https://github.com/IEIT-Yuan/Yuan-2.0)\]\[[Yuan2.0-M32](https://github.com/IEIT-Yuan/Yuan2.0-M32)\]\[[Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)\]\[[Index-1.9B](https://github.com/bilibili/Index-1.9B)\]\[[Aquila2](https://github.com/FlagAI-Open/Aquila2)\]
      • [paper - instruct)\]
      • [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
      • [paper - AutoML/MobileVLM)\]\[[MobileVLM V2](https://arxiv.org/abs/2402.03766)\]\[[BlueLM-V-3B](https://arxiv.org/abs/2411.10640)\]
      • [open-interpreter
      • [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
      • [paper - deepmind/graphcast)\]
      • [paper
      • [paper - PaLM)\]
      • [openfold - pytorch](https://github.com/lucidrains/alphafold3-pytorch)\]\[[Protenix](https://github.com/bytedance/Protenix)\]\[[AlphaFold3](https://github.com/kyegomez/AlphaFold3)\]\[[Ligo-Biosciences/AlphaFold3](https://github.com/Ligo-Biosciences/AlphaFold3)\]\[[LucaOne](https://github.com/LucaOne/LucaOne)\]\[[esm](https://github.com/evolutionaryscale/esm)\]\[[AlphaPPImd](https://github.com/AspirinCode/AlphaPPImd)\]\[[visual-med-alpaca](https://github.com/cambridgeltl/visual-med-alpaca)\]\[[chai-lab](https://github.com/chaidiscovery/chai-lab)\]\[[evo](https://github.com/evo-design/evo)\]
      • [paper
      • [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
      • [paper
      • [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
      • [PDF-Extract-Kit - tech/colpali)\]\[[localGPT-Vision](https://github.com/PromtEngineer/localGPT-Vision)\]\[[mPLUG-DocOwl](https://github.com/X-PLUG/mPLUG-DocOwl)\]
      • [paper - long.194/)\]\[[llm_reranker](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/llm_reranker)\]\[[FlagEmbedding](https://github.com/FlagOpen/FlagEmbedding)\]
      • [paper
      • [paper - YuanGroup/LLaVA-CoT)\]\[[internvl2.0_mpo](https://github.com/OpenGVLab/InternVL/tree/main/internvl_chat/shell/internvl2.0_mpo)\]\[[Insight-V](https://github.com/dongyh20/Insight-V)\]
    • 1. Word2Vec

    • 2. Seq2Seq