awesome-ai-papers

This repository is used to collect papers and code in the field of AI.
https://github.com/songqiang321/awesome-ai-papers

Last synced: 16 days ago
JSON representation

NLP
- 3. Pretraining
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper
  - [paper
  - [paper - PaLM)\]
  - [blog - Scale Playbook](https://huggingface.co/spaces/nanotron/ultrascale-playbook)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - research.github.io/SpatialLM/)\]
  - [paper - math/MetaMath)\]\[[MathCoder](https://github.com/mathllm/MathCoder)\]
  - [paper - PaLM)\]
  - [paper - Alignment](https://github.com/PKU-Alignment)\]\[[webpage](https://alignmentsurvey.com/)\]
  - [evaluation-guidebook - LLM-Eval](https://github.com/onejune2018/Awesome-LLM-Eval)\]\[[LLM-eval-survey](https://github.com/MLGroupJLU/LLM-eval-survey)\]\[[llm_benchmarks](https://github.com/leobeeson/llm_benchmarks)\]\[[Awesome-LLMs-Evaluation-Papers](https://github.com/tjunlp-lab/Awesome-LLMs-Evaluation-Papers)\]
  - [paper - bench)\]\[[swarm](https://github.com/openai/swarm)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [awesome-llm-interpretability - LLM-Interpretability](https://github.com/cooperleong00/Awesome-LLM-Interpretability)\]
  - [paper - deepmind/synthid-text)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]
  - [paper - fib-lab/ACL24-EconAgent)\]\[[Large Language Models Empowered Agent-based Modeling and Simulation: A Survey and Perspectives](https://arxiv.org/abs/2312.11970)\]
  - [paper - Shanghai/SurveyX)\]\[[SurveyForge](https://github.com/Alpha-Innovator/SurveyForge)\]
  - [paper - project/selfcodealign)\]
  - [paper - CoT)\]\[[alphageometry](https://github.com/google-deepmind/alphageometry)\]\[[AlphaGeometry2](https://arxiv.org/abs/2502.03544)\]\[[MathCritique](https://github.com/WooooDyy/MathCritique)\]\[[PromptCoT](https://github.com/inclusionAI/PromptCoT)\]
  - [paper - PaLM)\]
  - [paper
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - Hunyuan-Large)\]\[[TransMamba](https://arxiv.org/abs/2503.24067)\]\[[FastCuRL](https://arxiv.org/abs/2503.17287)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [paper - coai/Safety-Prompts)\]\[[PurpleLlama](https://github.com/meta-llama/PurpleLlama)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - instruct)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]
  - [paper - PaLM)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - ai/xgrammar)\]\[[mlc-llm](https://github.com/mlc-ai/mlc-llm)\]
  - [paper - foundation/bitsandbytes)\]\[[unsloth](https://github.com/unslothai/unsloth)\]\[[ir-qlora](https://github.com/htqin/ir-qlora)\]\[[fsdp_qlora](https://github.com/AnswerDotAI/fsdp_qlora)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - Agent-Survey](https://github.com/OS-Agent-Survey/OS-Agent-Survey)\]\[[ACU](https://github.com/francedot/acu)\]\[[Large Language Model-Brained GUI Agents: A Survey](https://arxiv.org/abs/2411.18279)\]\[[LLM-Powered GUI Agents in Phone Automation](https://arxiv.org/abs/2504.19838)\]\[[Aguvis](https://arxiv.org/abs/2412.04454)\]\[[awesome-computer-use](https://github.com/ranpox/awesome-computer-use)\]
  - [MOSS - RLHF](https://github.com/OpenLMLab/MOSS-RLHF)\]
  - [paper
  - [paper - ai/letta)\]\[[Agent Workflow Memory](https://github.com/zorazrw/agent-workflow-memory)\]\[[A-mem](https://github.com/agiresearch/A-mem)\]
  - [paper - aloha)\]\[[Hardware Code](https://github.com/MarkFzp/mobile-aloha)\]\[[Learning Code](https://github.com/MarkFzp/act-plus-plus)\]\[[UMI](https://github.com/real-stanford/universal_manipulation_interface)\]\[[humanplus](https://github.com/MarkFzp/humanplus)\]\[[TeleVision](https://github.com/OpenTeleVision/TeleVision)\]\[[Surgical Robot Transformer](https://surgical-robot-transformer.github.io/)\]\[[lifelike-agility-and-play](https://github.com/Tencent-RoboticsX/lifelike-agility-and-play)\]\[[ReKep](https://rekep-robot.github.io/)\]\[[Open_Duck_Mini](https://github.com/apirrone/Open_Duck_Mini)\]\[[Learning Visual Parkour from Generated Images](https://lucidsim.github.io/)\]\[[ASAP](https://github.com/LeCAR-Lab/ASAP)\]\[[UniAct](https://github.com/2toinf/UniAct)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [paper
  - [paper - NLPIR/WebThinker)\]\[[DeerFlow](https://github.com/bytedance/deer-flow)\]
  - [paper
  - [paper - ai/DeepSeek-Math)\]\[[Math-Shepherd](https://arxiv.org/abs/2312.08935)\]\[[DeepSeek-Prover-V1.5](https://github.com/deepseek-ai/DeepSeek-Prover-V1.5)\]\[[DeepSeek-Prover-V2](https://github.com/deepseek-ai/DeepSeek-Prover-V2)\]\[[Kimina-Prover Preview](https://arxiv.org/abs/2504.11354)\]\[[Goedel-Prover](https://github.com/Goedel-LM/Goedel-Prover)\]\[[BFS-Prover](https://arxiv.org/abs/2502.03438)\]
  - [paper - Skills)\]\[[AI Mathematical Olympiad-Progress Prize 2](https://www.kaggle.com/competitions/ai-mathematical-olympiad-progress-prize-2)\]
  - [paper - PaLM)\]
  - [paper - ToolMaker)\]\[[trove](https://github.com/zorazrw/trove)\]\[[CREATOR](https://arxiv.org/abs/2305.14318)\]
  - [paper - chen/ToolQA)\]\[[toolbench](https://github.com/sambanova/toolbench)\]\[[MetaTool Benchmark](https://arxiv.org/abs/2310.03128)\]
  - [paper - cite](https://github.com/MadryLab/context-cite)\]\[[OmniThink](https://github.com/zjunlp/OmniThink)\]\[[SelfCite](https://arxiv.org/abs/2502.09604)\]\[[LLMxMapReduce](https://github.com/thunlp/LLMxMapReduce)\]\[[WriteHERE](https://github.com/principia-ai/WriteHERE)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - R1V)\]\[[Skywork R1V2](https://arxiv.org/abs/2504.16656)\]
  - [Qwen3
  - [Skywork - MoE](https://github.com/SkyworkAI/Skywork-MoE)\]\[[MiniMax-01](https://github.com/MiniMax-AI/MiniMax-01)\]\[[Orion](https://github.com/OrionStarAI/Orion)\]\[[BELLE](https://github.com/LianjiaTech/BELLE)\]\[[Yuan-2.0](https://github.com/IEIT-Yuan/Yuan-2.0)\]\[[Yuan2.0-M32](https://github.com/IEIT-Yuan/Yuan2.0-M32)\]\[[Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)\]\[[Index-1.9B](https://github.com/bilibili/Index-1.9B)\]\[[Aquila2](https://github.com/FlagAI-Open/Aquila2)\]\[[MiMo](https://github.com/XiaomiMiMo/MiMo)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [search_with_lepton - oval/storm)\]\[[searxng](https://github.com/searxng/searxng)\]\[[Perplexica](https://github.com/ItzCrazyKns/Perplexica)\]\[[rag-search](https://github.com/thinkany-ai/rag-search)\]\[[sensei](https://github.com/jjleng/sensei)\]\[[azure-search-openai-demo](https://github.com/Azure-Samples/azure-search-openai-demo)\]\[[Gemini-Search](https://github.com/ammaarreshi/Gemini-Search)\]\[[deep-searcher](https://github.com/zilliztech/deep-searcher)\]
  - [paper - PaLM)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper
  - [paper - Lancer](https://github.com/openai/SWELancer-Benchmark)\]\[[OpenCodeReasoning](https://arxiv.org/abs/2504.01943)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper
  - [paper - PaLM)\]
  - [paper - piexl/JailbreakZoo)\]\[[jailbreak_llms](https://github.com/verazuo/jailbreak_llms)\]\[[llm-attacks](https://github.com/llm-attacks/llm-attacks)\]\[[Awesome-Jailbreak-on-LLMs](https://github.com/yueliu1999/Awesome-Jailbreak-on-LLMs)\]\[[Constitutional Classifiers](https://arxiv.org/abs/2501.18837)\]
  - [hf blog - rl-ppo blog](https://huggingface.co/blog/deep-rl-ppo)\]\[[OpenAI blog](https://openai.com/index/learning-from-human-preferences)\]\[[alignment blog](https://openai.com/blog/our-approach-to-alignment-research)\]\[[awesome-RLHF](https://github.com/opendilab/awesome-RLHF)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [paper - Survey)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - rg/recurrent-pretraining)\]\[[ReasonFlux](https://github.com/Gen-Verse/ReasonFlux)\]\[[Can 1B LLM Surpass 405B LLM](https://arxiv.org/abs/2502.06703)\]\[[SkyThought](https://arxiv.org/abs/2502.07374)\]
  - [paper
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper
  - [paper - Math)\]\[[Qwen2.5-Math-Demo](https://huggingface.co/spaces/Qwen/Qwen2.5-Math-Demo)\]\[[ProcessBench](https://github.com/QwenLM/ProcessBench)\]\[[SuperCorrect-llm](https://github.com/YangLing0818/SuperCorrect-llm)\]\[[The Lessons of Developing Process Reward Models in Mathematical Reasoning](https://arxiv.org/abs/2501.07301)\]
  - [paper - PaLM)\]
  - [paper - quality-chinese-training-datasets-66cfed105f502ece8f29643e)\]\[[MIG](https://arxiv.org/abs/2504.13835)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [LangChain - rag/)\]\[[LangChain Hub](https://smith.langchain.com/hub)\]\[[langgraph](https://github.com/langchain-ai/langgraph)\]\[[executive-ai-assistant](https://github.com/langchain-ai/executive-ai-assistant)\]
  - [paper
  - [blog
  - [paper - Corpus-Indexer-NCI)\]\[[DSI-transformers](https://github.com/ArvinZhuang/DSI-transformers)\]\[[GDR EACL 2024 Oral](https://arxiv.org/abs/2401.10487)\]
  - [paper - PaLM)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [paper - NLP-SG/multimodal_textbook)\]
  - [paper
  - [paper
  - [paper - ml/RoboticsDiffusionTransformer)\]\[[Video Prediction Policy](https://arxiv.org/abs/2412.14803)\]\[[Humanoid-Gym](https://github.com/roboterax/humanoid-gym)\]
  - [paper - PaLM)\]
  - [paper - prompting)\]\[[docs](http://platform.openai.com/docs/guides/prompt-generation?context=structured-output-schema)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [llama-moe - pytorch](https://github.com/lucidrains/PEER-pytorch)\]\[[GRIN-MoE](https://github.com/microsoft/GRIN-MoE)\]\[[MoE-plus-plus](https://github.com/SkyworkAI/MoE-plus-plus)\]\[[MoH](https://github.com/SkyworkAI/MoH)\]
  - [paper - GaLore](https://github.com/VITA-Group/Q-GaLore)\]\[[WeLore](https://github.com/VITA-Group/WeLore)\]\[[Fira](https://github.com/xichen-fy/Fira)\]
  - [paper
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [paper
  - [paper - PaLM)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - deepmind/synthid-text)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]
  - [paper - llm/OpenCoder-llm)\]\[[dataset](https://huggingface.co/collections/OpenCoder-LLM/opencoder-datasets-672e6db6a0fed24bd69ef1c2)\]\[[opc_data_filtering](https://github.com/OpenCoder-llm/opc_data_filtering)\]\[[OpenCodeEval](https://github.com/richardodliu/OpenCodeEval)\]
  - [paper
  - [paper - cs-nlp/LLMsKnow)\]
  - [paper
  - [PromptPapers - engineering)\]\[[ChatGPT Prompt Engineering for Developers](https://prompt-engineering.xiniushu.com/)\]\[[Prompt Engineering Guide](https://www.promptingguide.ai/zh)\]\[[k12promptguide](https://www.k12promptguide.com/)\]\[[gpt-prompt-engineer](https://github.com/mshumer/gpt-prompt-engineer)\]\[[awesome-chatgpt-prompts](https://github.com/f/awesome-chatgpt-prompts)\]\[[awesome-chatgpt-prompts-zh](https://github.com/PlexPt/awesome-chatgpt-prompts-zh)\]\[[Prompt_Engineering](https://github.com/NirDiamant/Prompt_Engineering)\]\[[system-prompts-and-models-of-ai-tools](https://github.com/x1xhlol/system-prompts-and-models-of-ai-tools)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - PaLM)\]
  - [paper - Collection)\]
  - [paper - deepmind/synthid-text)\]
  - [paper - S](https://github.com/simular-ai/Agent-S)\]\[[The Dawn of GUI Agent](https://arxiv.org/abs/2411.10323)\]\[[ShowUI](https://github.com/showlab/ShowUI)\]\[[Aria-UI](https://github.com/AriaUI/Aria-UI)\]\[[aguvis](https://github.com/xlang-ai/aguvis)\]\[[TinyClick](https://github.com/SamsungLabs/TinyClick)\]\[[InfiGUIAgent](https://github.com/Reallm-Labs/InfiGUIAgent)\]\[[autoMate](https://github.com/yuruotong1/autoMate)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [paper - PaLM)\]
  - [paper - Sequence Recommendation Models Need Decoupled Embeddings](https://arxiv.org/abs/2410.02604)\]\[[OneRec](https://arxiv.org/abs/2502.18965)\]\[[MIM](https://arxiv.org/abs/2502.00321)\]
  - [paper - Machine-Learning-Lab/NoteLLM)\]\[[NoteLLM](https://arxiv.org/abs/2403.01744)\]\[[SSD](https://arxiv.org/abs/2107.05204)\]\[[PaRT](https://arxiv.org/abs/2504.20624)\]
  - [paper
  - [paper - embedding-torch](https://github.com/lucidrains/rotary-embedding-torch)\]
  - [paper - HQ](https://arxiv.org/abs/2410.18505)\]\[[LabelLLM](https://github.com/opendatalab/LabelLLM)\]\[[labelU](https://github.com/opendatalab/labelU)\]\[[MinerU](https://github.com/opendatalab/MinerU)\]\[[PDF-Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - han-lab/duo-attention)\]\[[Star-Attention](https://github.com/NVIDIA/Star-Attention)\]
  - [PEFT - advanced](https://github.com/huggingface/autotrain-advanced)\]\[[accelerate](https://github.com/huggingface/accelerate)\]\[[LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory)\]\[[LMFlow](https://github.com/OptimalScale/LMFlow)\]\[[xtuner](https://github.com/InternLM/xtuner)\]\[[MFTCoder](https://github.com/codefuse-ai/MFTCoder)\]\[[llm-foundry](https://github.com/mosaicml/llm-foundry)\]\[[ms-swift](https://github.com/modelscope/ms-swift)\]\[[Liger-Kernel](https://github.com/linkedin/Liger-Kernel)\]
  - [paper
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]
  - [paper - PaLM)\]
  - [paper - SJTU/MING)\]\[[EmoLLM](https://github.com/SmartFlowAI/EmoLLM)\]
  - [paper - Web/AIPress-code)\]
  - [paper
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [paper - 4](https://huggingface.co/collections/microsoft/phi-4-677e9380e514feb5577a40e4)\]\[[SmolLM](https://huggingface.co/blog/smollm)\]\[[SmolLM2](https://arxiv.org/abs/2502.02737)\]\[[SmolVLM](https://arxiv.org/abs/2504.05299)\]\[[Computational Bottlenecks of Training Small-scale Large Language Models](https://arxiv.org/abs/2410.19456)\]\[[SLMs-Survey](https://github.com/FairyFali/SLMs-Survey)\]\[[MiniLLM](https://arxiv.org/abs/2306.08543)\]\[[aligning_tinystories](https://philliphaeusler.com/posts/aligning_tinystories/)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [blog
  - [blog - agents-python](https://github.com/openai/openai-agents-python)\]\[[openai-cua-sample-app](https://github.com/openai/openai-cua-sample-app)\]\[[swarm](https://github.com/openai/swarm)\]\[[A practical guide to building agents](https://cdn.openai.com/business-guides-and-resources/a-practical-guide-to-building-agents.pdf)\]\[[adk-python](https://github.com/google/adk-python)\]\[[agents-deep-research](https://github.com/qx-labs/agents-deep-research)\]
  - [crewAI - llama/llama_deploy)\]\[[gpt-computer-assistant](https://github.com/onuratakan/gpt-computer-assistant)\]\[[agentic_patterns](https://github.com/neural-maze/agentic_patterns)\]\[[pydantic-ai](https://github.com/pydantic/pydantic-ai)\]\[[PocketFlow](https://github.com/The-Pocket/PocketFlow)\]\[[suna](https://github.com/kortix-ai/suna)\]
  - [paper
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [paper - PaLM)\]
  - [functionary - tool-llm](https://github.com/zorazrw/awesome-tool-llm)\]\[[agents-json](https://github.com/wild-card-ai/agents-json)\]\[[langgraph-bigtool](https://github.com/langchain-ai/langgraph-bigtool)\]\[[octotools](https://github.com/octotools/octotools)\]
  - [paper - oryx/Awesome-LLM-Post-training)\]\[[A Survey on Post-training of Large Language Models](https://arxiv.org/abs/2503.06072)\]\[[Sailing AI by the Stars](https://arxiv.org/abs/2505.02686)\]
  - [paper - M](https://github.com/OpenRLHF/OpenRLHF-M)\]\[[Unraveling RLHF and Its Variants](https://hijkzzz.notion.site/unraveling-rlhf-and-its-variants-engineering-insights)\]\[[Does RLHF Scale](https://arxiv.org/abs/2412.06000)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - infra-index](https://github.com/deepseek-ai/open-infra-index)\]
  - [paper
  - [paper - ai/VisualThinker-R1-Zero)\]\[[VLM-R1](https://github.com/om-ai-lab/VLM-R1)\]\[[open-r1-multimodal](https://github.com/EvolvingLMMs-Lab/open-r1-multimodal)\]\[[R1-Onevision](https://github.com/Fancy-MLLM/R1-Onevision)\]\[[Visual-RFT](https://github.com/Liuziyu77/Visual-RFT)\]\[[R1-VL](https://github.com/jingyi0000/R1-VL)\]\[[R1-Omni](https://github.com/HumanMLLM/R1-Omni)\]\[[Vision-R1](https://github.com/Osilly/Vision-R1)\]\[[Open-R1-Video](https://github.com/Wang-Xiaodong1899/Open-R1-Video)\]\[[OpenVLThinker](https://github.com/yihedeng9/OpenVLThinker)\]\[[R1-Zero-VSI](https://github.com/zhijie-group/R1-Zero-VSI)\]\[[VLAA-Thinking](https://github.com/UCSC-VLAA/VLAA-Thinking)\]\[[NoisyRollout](https://github.com/John-AI-Lab/NoisyRollout)\]\[[VisuLogic-Train](https://github.com/VisuLogic-Benchmark/VisuLogic-Train)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - r1)\]\[[MM-EUREKA](https://github.com/ModalMinds/MM-EUREKA)\]\[[Vision-R1](https://arxiv.org/abs/2503.18013)\]\[[Perception-R1](https://github.com/linkangheng/PR1)\]
  - [Agent-R1 - AI/RAGEN)\]\[[VAGEN](https://github.com/RAGEN-AI/VAGEN)\]\[[OpenManus-RL](https://github.com/OpenManus/OpenManus-RL)\]\[[SWEET-RL](https://arxiv.org/abs/2503.15478)\]\[[APIGen-MT](https://arxiv.org/abs/2504.03601)\]
  - [paper - law/steplaw)\]
  - [blog - llama/llama3)\]\[[llama-models](https://github.com/meta-llama/llama-models)\]\[[llama-recipes](https://github.com/meta-llama/llama-recipes)\]\[[LLM Adaptation](https://ai.meta.com/blog/adapting-large-language-models-llms/)\]\[[Llama-Nemotron: Efficient Reasoning Models](https://arxiv.org/abs/2505.00949)\]\[[llama3-from-scratch](https://github.com/naklecha/llama3-from-scratch)\]\[[nano-llama31](https://github.com/karpathy/nano-llama31)\]\[[minimind](https://github.com/jingyaogong/minimind)\]\[[felafax](https://github.com/felafax/felafax)\]
  - [blog - llama/llama-32-66f448ffc8c32f949b04c8cf)\]\[[llama-stack](https://github.com/meta-llama/llama-stack)\]\[[llama-stack-apps](https://github.com/meta-llama/llama-stack-apps)\]\[[lingua](https://github.com/facebookresearch/lingua)\]\[[llama-assistant](https://github.com/vietanhdev/llama-assistant)\]\[[minimind-v](https://github.com/jingyaogong/minimind-v)\]\[[nanoVLM](https://github.com/huggingface/nanoVLM)\]\[[Llama3.2-Vision-Finetune](https://github.com/2U1/Llama3.2-Vision-Finetune)\]
  - [ray - ray](https://github.com/antgroup/ant-ray)\]\[[dask](https://github.com/dask/dask)\]\[[TaskingAI](https://github.com/TaskingAI/TaskingAI)\]\[[gpt4all](https://github.com/nomic-ai/gpt4all)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[mindsdb](https://github.com/mindsdb/mindsdb)\]\[[bisheng](https://github.com/dataelement/bisheng)\]\[[phidata](https://github.com/phidatahq/phidata)\]\[[guidance](https://github.com/guidance-ai/guidance)\]\[[outlines](https://github.com/outlines-dev/outlines)\]\[[jsonformer](https://github.com/1rgs/jsonformer)\]\[[fabric](https://github.com/danielmiessler/fabric)\]\[[mem0](https://github.com/mem0ai/mem0)\]\[[taipy](https://github.com/Avaiga/taipy)\]\[[langflow](https://github.com/langflow-ai/langflow)\]
  - [paper - AI4Code/HyperAgent)\]\[[Seeker](https://github.com/XMZhangAI/Seeker)\]\[[AutoKaggle](https://github.com/multimodal-art-projection/AutoKaggle)\]\[[Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level](https://arxiv.org/abs/2411.03562)\]
  - [paper - wu/PMC-LLaMA)\]\[[MMedLM](https://github.com/MAGIC-AI4Med/MMedLM)\]
  - [paper - PaLM)\]
  - [Awesome-LegalAI-Resources - compass/LawBench)\]
  - [paper
  - [paper - Infinite](https://arxiv.org/abs/2308.16137)\]
  - [paper - NLP/ProX)\]
  - [paper - Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)\]\[[DocLayout-YOLO](https://github.com/opendatalab/DocLayout-YOLO)\]\[[OmniDocBench](https://github.com/opendatalab/OmniDocBench)\]\[[Document Parsing Unveiled](https://arxiv.org/abs/2410.21169)\]\[[Docling Technical Report](https://arxiv.org/abs/2408.09869)\]\[[markitdown](https://github.com/microsoft/markitdown)\]\[[pandoc](https://github.com/jgm/pandoc)\]
  - [paper
  - [paper - cross-capabilities)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - Honesty-Survey)\]
  - [paper - AILab/flash-attention)\]\[[xformers](https://github.com/facebookresearch/xformers)\]\[[SageAttention](https://github.com/thu-ml/SageAttention)\]\[[SpargeAttn](https://github.com/thu-ml/SpargeAttn)\]
  - [text-generation-inference - embeddings-inference](https://github.com/huggingface/text-embeddings-inference)\]\[[quantization](https://huggingface.co/docs/transformers/main/en/quantization)\]\[[optimum-quanto](https://github.com/huggingface/optimum-quanto)\]\[[huggingface-inference-toolkit](https://github.com/huggingface/huggingface-inference-toolkit)\]\[[torchao](https://github.com/pytorch/ao)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [m3e-base - embedding-v2](https://huggingface.co/lier007/xiaobu-embedding-v2)\]\[[stella_en_1.5B_v5](https://huggingface.co/dunzhang/stella_en_1.5B_v5)\]\[[Conan-embedding-v2](https://huggingface.co/TencentBAC/Conan-embedding-v2)\]
  - [paper - NLP/VinePPO)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - gpt4s-mistakes-with-gpt-4)\]\[[Heimdall](https://arxiv.org/abs/2504.10337)\]\[[Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning](https://arxiv.org/abs/2410.08146)\]\[[Agentic Reward Modeling](https://arxiv.org/abs/2502.19328)\]\[[Reward Hacking in Reinforcement Learning](https://lilianweng.github.io/posts/2024-11-28-reward-hacking)\]\[[DeepSeek-GRM](https://arxiv.org/abs/2504.02495)\]\[[RM-R1](https://arxiv.org/abs/2505.02387)\]
  - [paper - 6B](https://github.com/THUDM/ChatGLM-6B)\]\[[ChatGLM2-6B](https://github.com/THUDM/ChatGLM2-6B)\]\[[ChatGLM3](https://github.com/THUDM/ChatGLM3)\]\[[GLM-4](https://github.com/THUDM/GLM-4)\]\[[modeling_chatglm.py](https://huggingface.co/THUDM/glm-4-9b-chat/blob/main/modeling_chatglm.py)\]\[[AgentTuning](https://github.com/THUDM/AgentTuning)\]\[[AlignBench](https://github.com/THUDM/AlignBench)\]\[[GLM-Edge](https://github.com/THUDM/GLM-Edge)\]
  - [paper
  - [paper - PaLM)\]
  - [alphafold
  - [paper - science/RefChecker)\]\[[HaluAgent](https://github.com/RUCAIBox/HaluAgent)\]\[[LLMsKnow](https://github.com/technion-cs-nlp/LLMsKnow)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [paper - GSAI/YuLan-Chat)\]\[[Yulan-GARDEN](https://github.com/RUC-GSAI/Yulan-GARDEN)\]\[[YuLan-Mini](https://github.com/RUC-GSAI/YuLan-Mini)\]
  - [paper
  - [paper - transformer-lm)\]
  - [paper - 2)\]\[[llm.c](https://github.com/karpathy/llm.c)\]
  - [paper
  - [paper - llm/automix)\]
  - [paper - benchmark)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - NLP/OpenResearcher)\]\[[Paper Copilot](https://arxiv.org/abs/2409.04593)\]\[[SciAgentsDiscovery](https://github.com/lamm-mit/SciAgentsDiscovery)\]\[[paper-qa](https://github.com/Future-House/paper-qa)\]\[[GraphReasoning](https://github.com/lamm-mit/GraphReasoning)\]
  - [paper - Researcher)\]
  - [Awesome-Scientific-Language-Models - husky/gpt_academic)\]\[[ChatPaper](https://github.com/kaixindelele/ChatPaper)\]\[[scispacy](https://github.com/allenai/scispacy)\]\[[awesome-ai4s](https://github.com/hyperai/awesome-ai4s)\]\[[xVal](https://github.com/PolymathicAI/xVal)\]
  - [link
  - [paper - Code-LLM](https://github.com/codefuse-ai/Awesome-Code-LLM)\]\[[MFTCoder](https://github.com/codefuse-ai/MFTCoder)\]\[[Awesome-Code-LLM](https://github.com/huybery/Awesome-Code-LLM)\]\[[CodeFuse-muAgent](https://github.com/codefuse-ai/CodeFuse-muAgent)\]\[[Awesome-Code-Intelligence](https://github.com/QiushiSun/Awesome-Code-Intelligence)\]
  - [paper
  - [paper
  - [paper - science/PAE)\]
  - [paper - research/Eureka)\]\[[DrEureka](https://github.com/eureka-research/DrEureka)\]\[[MM-EUREKA](https://github.com/ModalMinds/MM-EUREKA)\]
  - [paper
  - [paper
  - [paper - llm)\]
  - [paper - Ye/ToolEyes)\]
  - [paper
  - [paper - trial-and-error)\]
  - [paper
  - [paper - Bank](https://arxiv.org/abs/2304.08244)\]\[[ToolHop](https://arxiv.org/abs/2501.02506)\]\[[ComplexFuncBench](https://github.com/THUDM/ComplexFuncBench)\]\[[tool-retrieval-benchmark](https://github.com/mangopy/tool-retrieval-benchmark)\]
  - [paper
  - [paper - Wang/ToolGen)\]
  - [blog - architecture-blogpost-encoders-prefixlm-denoising)\]\[[New LLM Pre-training and Post-training Paradigms](https://magazine.sebastianraschka.com/p/new-llm-pre-training-and-post-training)\]
  - [paper - pytorch-fully-sharded-data-parallel-api/)\]\[[pytorch-fsdp](https://github.com/huggingface/blog/blob/main/zh/pytorch-fsdp.md)\]
  - [paper
  - [paper - h100-clusters-power-network)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]\[[ByteCheckpoint](https://arxiv.org/abs/2407.20143)\]\[[HybridFlow](https://arxiv.org/abs/2409.19256)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - 101B)\]\[[Tele-FLM](https://huggingface.co/CofeAI/Tele-FLM)\]
  - [paper - Tuning-with-GPT-4/GPT-4-LLM)\]
  - [paper - language-RL](https://github.com/waterhorse1/Natural-language-RL)\]
  - [paper - ye/OpenFedLLM)\]
  - [paper - ConvAI/tree/main/Awesome-Self-Evolution-of-LLM)\]
  - [paper - sys/routellm)\]\[[RouterDC](https://github.com/shuhao02/RouterDC)\]\[[masrouter](https://github.com/yanweiyue/masrouter)\]\[[RouterEval](https://github.com/MilkThink-Lab/RouterEval)\]
  - [paper
  - [paper - ai/OpenDiLoCo)\]\[[Prime](https://github.com/PrimeIntellect-ai/Prime)\]\[[DiLoCo](https://arxiv.org/abs/2311.08105)\]\[[DisTrO](https://github.com/NousResearch/DisTrO)\]\[[Streaming DiLoCo](https://arxiv.org/abs/2501.18512)\]\[[Eager Updates For Overlapped Communication and Computation in DiLoCo](https://arxiv.org/abs/2502.12996)\]\[[Scaling Laws for DiLoCo](https://arxiv.org/abs/2503.09799)\]
  - [paper
  - [paper - platform)\]\[[open-infra-index](https://github.com/deepseek-ai/open-infra-index)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]
  - [wandb
  - [paper
  - [paper
  - [paper
  - [paper - LLM-preference-learning)\]
  - [alignment-handbook - Chat](https://github.com/deepspeedai/DeepSpeedExamples/blob/master/applications/DeepSpeed-Chat/README.md)\]\[[OpenRLHF](https://github.com/OpenRLHF/OpenRLHF)\]\[[verl](https://github.com/volcengine/verl)\]\[[AReaL](https://github.com/inclusionAI/AReaL)\]
  - [tokenizer_summary - Tokenizer](https://github.com/NVIDIA/Cosmos-Tokenizer)\]\[[tiktokenizer](https://github.com/dqbd/tiktokenizer)\]
  - [paper - zh](https://llmbook-zh.github.io/)\]\[[LLMsPracticalGuide](https://github.com/Mooler0410/LLMsPracticalGuide)\]\[[Foundations-of-LLMs](https://github.com/ZJU-LLMs/Foundations-of-LLMs)\]
  - [paper - MLSys-Lab/Efficient-LLMs-Survey)\]
  - [paper
  - [paper
  - [paper - survey](https://github.com/ulab-uiuc/AGI-survey)\]
  - [paper
  - [paper
  - [paper - rlhf)\]
  - [paper - cn/agents)\]\[[Symbolic Learning Enables Self-Evolving Agents](https://arxiv.org/abs/2406.18532)\]
  - [paper
  - [paper
  - [paper - AGI/AutoAgents)\]
  - [paper
  - [paper
  - [paper - agent/digirl)\]\[[Android-Lab](https://github.com/THUDM/Android-Lab)\]\[[AppAgentX](https://github.com/Westlake-AGI-Lab/AppAgentX)\]
  - [paper - Agent](https://github.com/microsoft/RD-Agent)\]\[[TinyTroupe](https://github.com/microsoft/TinyTroupe)\]
  - [paper - ai/camel)\]\[[crab](https://github.com/camel-ai/crab)\]\[[oasis](https://github.com/camel-ai/oasis)\]\[[owl](https://github.com/camel-ai/owl)\]
  - [paper - RL](https://github.com/OpenManus/OpenManus-RL)\]
  - [paper
  - [paper
  - [paper - transformer-lm)\]
  - [paper - 2)\]\[[llm.c](https://github.com/karpathy/llm.c)\]
  - [paper - 3)\]\[[nanoGPT](https://github.com/karpathy/nanoGPT)\]\[[build-nanogpt](https://github.com/karpathy/build-nanogpt)\]\[[gpt-fast](https://github.com/pytorch-labs/gpt-fast)\]\[[modded-nanogpt](https://github.com/KellerJordan/modded-nanogpt)\]\[[nanotron](https://github.com/huggingface/nanotron)\]
  - [paper - RLHF](https://github.com/OpenLMLab/MOSS-RLHF)\]
  - [paper - research/bert)\]\[[BERT-pytorch](https://github.com/codertimo/BERT-pytorch)\]\[[bert4torch](https://github.com/Tongjilibo/bert4torch)\]\[[bert4keras](https://github.com/bojone/bert4keras)\]\[[ModernBERT](https://github.com/AnswerDotAI/ModernBERT)\]\[[What Should We Learn From ModernBERT](https://jina.ai/news/what-should-we-learn-from-modernbert)\]
  - [paper - BERT-wwm](https://github.com/ymcui/Chinese-BERT-wwm)\]
  - [paper - analysis)\]
  - [paper
  - [paper
  - [paper - mll/jiant)\]
  - [paper - Classification)\]
  - [LLM101n - course](https://github.com/mlabonne/llm-course)\]\[[intro-llm](https://intro-llm.github.io/)\]\[[llm-cookbook](https://github.com/datawhalechina/llm-cookbook)\]\[[hugging-llm](https://github.com/datawhalechina/hugging-llm)\]\[[generative-ai-for-beginners](https://github.com/microsoft/generative-ai-for-beginners)\]\[[awesome-generative-ai-guide](https://github.com/aishwaryanr/awesome-generative-ai-guide)\]\[[LLMs-from-scratch](https://github.com/rasbt/LLMs-from-scratch)\]\[[llm-action](https://github.com/liguodongiot/llm-action)\]\[[llms_idx](https://dongnian.icu/llms/llms_idx/)\]\[[tiny-universe](https://github.com/datawhalechina/tiny-universe)\]\[[AISystem](https://github.com/chenzomi12/AISystem)\]
  - [paper - deepmind/gemma](https://github.com/google-deepmind/gemma)\]\[[gemma.cpp](https://github.com/google/gemma.cpp)\]\[[model](https://ai.google.dev/gemma)\]\[[paligemma](https://github.com/google-research/big_vision/tree/main/big_vision/configs/proj/paligemma)\]\[[gemma-cookbook](https://github.com/google-gemini/gemma-cookbook)\]
  - [paper - watermarking)\]\[[MarkLLM](https://github.com/THU-BPM/MarkLLM)\]\[[Watermarked_LLM_Identification](https://github.com/THU-BPM/Watermarked_LLM_Identification)\]\[[Awesome-LLM-Watermark](https://github.com/hzy312/Awesome-LLM-Watermark)\]
  - [paper
  - [paper - agent/digirl)\]
  - [paper
  - [paper - instructions)\]
  - [paper - crfm/helm)\]
  - [paper
  - [paper - Foundation/FinGPT)\]
  - [Awesome-LLM-Eval - eval-survey](https://github.com/MLGroupJLU/LLM-eval-survey)\]\[[llm_benchmarks](https://github.com/leobeeson/llm_benchmarks)\]\[[Awesome-LLMs-Evaluation-Papers](https://github.com/tjunlp-lab/Awesome-LLMs-Evaluation-Papers)\]
  - [paper
  - [paper - sys/FastChat/tree/main/fastchat/llm_judge)\]
  - [paper - RAG](https://github.com/CLUEbenchmark/SuperCLUE-RAG)\]
  - [paper - nlp/ceval)\]\[[chinese-llm-benchmark](https://github.com/jeinlee1991/chinese-llm-benchmark)\]
  - [paper - li/CMMLU)\]
  - [paper - Benchmark/CMMMU)\]
  - [paper
  - [paper - eval/prometheus-eval)\]\[[prometheus](https://github.com/prometheus-eval/prometheus)\]\[[prometheus-vision](https://github.com/prometheus-eval/prometheus-vision)\]
  - [paper - Lab/lmms-eval)\]\[[VLMEvalKit](https://github.com/open-compass/VLMEvalKit)\]\[[VideoMMMU](https://github.com/EvolvingLMMs-Lab/VideoMMMU)\]
  - [paper - Benchmark/MMMU)\]
  - [AlpacaEval Leaderboard - lab/alpaca_eval)\]
  - [Chatbot-Arena-Leaderboard - 05-03-arena/)\]\[[FastChat](https://github.com/lm-sys/FastChat)\]\[[arena-hard](https://github.com/lm-sys/arena-hard)\]
  - [lm-evaluation-harness - evals](https://github.com/openai/simple-evals)\]
  - [OpenCompass - Eval](https://github.com/open-compass/GAOKAO-Eval)\]\[[VLMEvalKit](https://github.com/open-compass/VLMEvalKit)\]
  - [llm-colosseum - org/GamingAgent)\]\[[UltraEval](https://github.com/OpenBMB/UltraEval)\]\[[Humanity's Last Exam](https://github.com/centerforaisafety/hle)\]
  - [paper - hallucination-survey)\]
  - [paper - LLM-hallucination)\]\[[Awesome-MLLM-Hallucination](https://github.com/showlab/Awesome-MLLM-Hallucination)\]
  - [paper - 2.0)\]
  - [paper - NLP/factool)\]\[[OlympicArena](https://github.com/GAIR-NLP/OlympicArena)\]\[[FActScore](https://arxiv.org/abs/2305.14251)\]
  - [paper - ai/aiconfig/tree/main/cookbooks/Chain-of-Verification)\]
  - [paper - ai/DB-GPT)\]\[[DocsGPT](https://github.com/arc53/DocsGPT)\]\[[privateGPT](https://github.com/imartinez/privateGPT)\]\[[localGPT](https://github.com/PromtEngineer/localGPT)\]
  - [paper
  - [paper - research/generative_agents)\]\[[genagents](https://github.com/joonspk-research/genagents)\]\[[GPTeam](https://github.com/101dotxyz/GPTeam)\]
  - [paper - 9b-20241220-technical-report)\]
  - [paper - ai/OpenAgents)\]
  - [paper - ai/DeepSeek-Coder)\]
  - [paper - LLaMA-Alpaca)\]\[[Chinese-LLaMA-Alpaca-2](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2)\]\[[Chinese-LLaMA-Alpaca-3](https://github.com/ymcui/Chinese-LLaMA-Alpaca-3)\]\[[baby-llama2-chinese](https://github.com/DLLXW/baby-llama2-chinese)\]
  - [paper
  - [paper - ai/DB-GPT)\]\[[DocsGPT](https://github.com/arc53/DocsGPT)\]\[[privateGPT](https://github.com/imartinez/privateGPT)\]\[[localGPT](https://github.com/PromtEngineer/localGPT)\]
  - [paper
  - [paper - interpreter](https://github.com/e2b-dev/code-interpreter)\]\[[open-interpreter](https://github.com/KillianLucas/open-interpreter)\]
  - [paper
  - [paper
  - [paper - ai/camel)\]\[[crab](https://github.com/camel-ai/crab)\]
  - [paper
  - [paper
  - [paper
  - [AutoGPT - Engineer](https://github.com/gpt-engineer-org/gpt-engineer)\]\[[AgentGPT](https://github.com/reworkd/AgentGPT)\]\[[OpenManus](https://github.com/FoundationAgents/OpenManus)\]\[[owl](https://github.com/camel-ai/owl)\]\[[langmanus](https://github.com/langmanus/langmanus)\]\[[DeerFlow](https://github.com/bytedance/deer-flow)\]
  - [BabyAGI
  - [blog
  - [translation-agent - zero](https://github.com/frdel/agent-zero)\]\[[AgentK](https://github.com/mikekelly/AgentK)\]\[[evolving-agents](https://github.com/matiasmolinas/evolving-agents)\]\[[Twitter Personality](https://github.com/wordware-ai/twitter)\]\[[RD-Agent](https://github.com/microsoft/RD-Agent)\]\[[TinyTroupe](https://github.com/microsoft/TinyTroupe)\]
  - [paper
  - [paper
  - [paper - ai/geogalactica)\]\[[sciparser](https://github.com/davendw49/sciparser)\]\[[GeoGPT](https://github.com/GeoGPT-Research-Project/GeoGPT)\]
  - [paper - ZJU/Scientific-LLM-Survey)\]\[[sciknoweval](https://github.com/hicai-zju/sciknoweval)\]
  - [paper
  - [paper - 7B-Chat)\]
  - [paper - ai/AlphaCodium)\]\[[pr-agent](https://github.com/Codium-ai/pr-agent)\]\[[cover-agent](https://github.com/Codium-ai/cover-agent)\]
  - [paper - ai/DeepSeek-Coder)\]
  - [paper - ai/DeepSeek-Coder-V2)\]\[[DeepSeek-V2.5](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)\]\[[Ling-Coder-lite](https://arxiv.org/abs/2503.17793)\]
  - [paper - Coder)\]\[[CodeArena](https://arxiv.org/abs/2412.05210)\]\[[CodeElo](https://arxiv.org/abs/2501.01257)\]
  - [paper
  - [paper - 3)\]\[[nanoGPT](https://github.com/karpathy/nanoGPT)\]\[[build-nanogpt](https://github.com/karpathy/build-nanogpt)\]\[[gpt-fast](https://github.com/pytorch-labs/gpt-fast)\]\[[modded-nanogpt](https://github.com/KellerJordan/modded-nanogpt)\]
  - [paper - RLHF](https://github.com/OpenLMLab/MOSS-RLHF)\]
  - [paper - research/bert)\]\[[BERT-pytorch](https://github.com/codertimo/BERT-pytorch)\]\[[bert4torch](https://github.com/Tongjilibo/bert4torch)\]\[[bert4keras](https://github.com/bojone/bert4keras)\]
  - [paper - BERT-wwm](https://github.com/ymcui/Chinese-BERT-wwm)\]
  - [paper - analysis)\]
  - [paper - mll/jiant)\]
  - [paper - Classification)\]
  - [LLM101n - course](https://github.com/mlabonne/llm-course)\]\[[intro-llm](https://intro-llm.github.io/)\]\[[llm-cookbook](https://github.com/datawhalechina/llm-cookbook)\]\[[hugging-llm](https://github.com/datawhalechina/hugging-llm)\]\[[generative-ai-for-beginners](https://github.com/microsoft/generative-ai-for-beginners)\]\[[awesome-generative-ai-guide](https://github.com/aishwaryanr/awesome-generative-ai-guide)\]\[[LLMs-from-scratch](https://github.com/rasbt/LLMs-from-scratch)\]\[[llm-action](https://github.com/liguodongiot/llm-action)\]\[[llms_idx](https://dongnian.icu/llms/llms_idx/)\]\[[tiny-universe](https://github.com/datawhalechina/tiny-universe)\]
  - [cs230-code-examples - template](https://github.com/victoresque/pytorch-template)\]\[[songquanpeng/pytorch-template](https://github.com/songquanpeng/pytorch-template)\]\[[Academic-project-page-template](https://github.com/eliahuhorwitz/Academic-project-page-template)\]\[[WritingAIPaper](https://github.com/hzwer/WritingAIPaper)\]
  - [tokenizer_summary
  - [paper - zh](https://llmbook-zh.github.io/)\]\[[LLMsPracticalGuide](https://github.com/Mooler0410/LLMsPracticalGuide)\]
  - [paper - MLSys-Lab/Efficient-LLMs-Survey)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - cdn.anthropic.com/fed9cc193a14b84131812372d8d5857f8f304c52/Model_Card_Claude_3_Addendum.pdf)\]
  - [paper - workshop)\]\[[model](https://huggingface.co/bigscience)\]
  - [paper
  - [paper
  - [paper
  - [paper - neox)\]
  - [paper - media/gemini/gemini_1_report.pdf)\]\[[Gemini 1.5](https://arxiv.org/abs/2403.05530)\]\[[Unofficial Implementation](https://github.com/kyegomez/Gemini)\]\[[MiniGemini](https://github.com/dvlab-research/MGM)\]
  - [paper - deepmind/gemma](https://github.com/google-deepmind/gemma)\]\[[gemma.cpp](https://github.com/google/gemma.cpp)\]\[[model](https://ai.google.dev/gemma)\]\[[paligemma](https://github.com/google-research/big_vision/tree/main/big_vision/configs/proj/paligemma)\]\[[gemma-cookbook](https://github.com/google-gemini/gemma-cookbook)\]
  - [paper - gemma-2/)\]\[[Advancing Responsible AI with Gemma](https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma/)\]\[[Gemma Scope](https://arxiv.org/abs/2408.05147)\]\[[ShieldGemma](https://arxiv.org/abs/2407.21772)\]\[[Gemma-2-9B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Gemma-2-9B-Chinese-Chat)\]
  - [paper
  - [paper - 4o](https://openai.com/index/hello-gpt-4o/)\]\[[GPT-4o System Card](https://arxiv.org/abs/2410.21276)\]
  - [paper
  - [paper - ai/guidance)\]
  - [paper - rlhf-pytorch](https://github.com/conceptofmind/LaMDA-rlhf-pytorch)\]
  - [paper - llama/llama/tree/llama_v1)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[llamafile](https://github.com/Mozilla-Ocho/llamafile)\]
  - [paper - llama/llama)\]\[[llama2.c](https://github.com/karpathy/llama2.c)\]\[[lit-llama](https://github.com/Lightning-AI/lit-llama)\]\[[litgpt](https://github.com/Lightning-AI/litgpt)\]
  - [paper - inference)\]\[[model](https://huggingface.co/mistralai)\]\[[mistral-finetune](https://github.com/mistralai/mistral-finetune)\]
  - [paper
  - [cs230-code-examples - template](https://github.com/victoresque/pytorch-template)\]\[[songquanpeng/pytorch-template](https://github.com/songquanpeng/pytorch-template)\]\[[Academic-project-page-template](https://github.com/eliahuhorwitz/Academic-project-page-template)\]\[[WritingAIPaper](https://github.com/hzwer/WritingAIPaper)\]
  - [paper - llm/automix)\]
  - [paper - benchmark)\]
  - [paper
  - [paper - ai/OSWorld)\]\[[aguvis](https://github.com/xlang-ai/aguvis)\]\[[Large Action Models](https://arxiv.org/abs/2412.10047)\]
  - [paper
  - [paper
  - [blog
  - [paper
  - [paper
  - [paper
  - [paper - LLMs-on-device](https://github.com/NexaAI/Awesome-LLMs-on-device)\]
  - [blog
  - [paper
  - [paper
  - [paper - research/generative_agents)\]\[[GPTeam](https://github.com/101dotxyz/GPTeam)\]
  - [paper
  - [paper - ai/OpenAgents)\]
  - [CohereV3
  - [paper - ai/instructor-embedding)\]
  - [paper - ai/contrastors)\]\[[nomic-embed-vision-v1.5](https://huggingface.co/nomic-ai/nomic-embed-vision-v1.5)\]\[[nomic-embed-text-v2-moe](https://huggingface.co/nomic-ai/nomic-embed-text-v2-moe)\]
  - [paper
  - [paper - Embed-v1)\]\[[nv-ingest](https://github.com/NVIDIA/nv-ingest)\]
  - [paper
  - [paper - of-thought-hub](https://github.com/FranxYao/chain-of-thought-hub)\]
  - [paper
  - [paper - takeshi188/zero_shot_cot)\]
  - [paper - science/auto-cot)\]
  - [paper - science/mm-cot)\]
  - [paper
  - [paper
  - [paper - REACT)\]\[[AutoAct](https://github.com/zjunlp/AutoAct)\]
  - [paper - nlp/tree-of-thought-llm)\]\[[Plug in and Play Implementation](https://github.com/kyegomez/tree-of-thoughts)\]\[[tree-of-thought-prompting](https://github.com/dave1010/tree-of-thought-prompting)\]
  - [paper - of-thoughts)\]
  - [paper - ai/cumulative-reasoning)\]\[[On the Diagram of Thought](https://arxiv.org/abs/2409.10038)\]
  - [paper - Of-Thoughts)\]
  - [paper - of-Thoughts-XoT)\]
  - [paper
  - [paper - aim)\]\[[An Empirical Study of Autoregressive Pre-training from Videos](https://arxiv.org/abs/2501.05453)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - models)\]\[[The Geometry of Concepts: Sparse Autoencoder Feature Structure](https://arxiv.org/abs/2410.19750)\]
  - [paper - rep)\]
  - [paper
  - [blog - interpretability)\]\[[transformer-debugger](https://github.com/openai/transformer-debugger)\]
  - [paper - gemma-2/)\]\[[Advancing Responsible AI with Gemma](https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma/)\]\[[Gemma Scope](https://arxiv.org/abs/2408.05147)\]\[[ShieldGemma](https://arxiv.org/abs/2407.21772)\]\[[Gemma-2-9B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Gemma-2-9B-Chinese-Chat)\]
  - [paper
  - [paper
  - [paper
  - [paper - ai/guidance)\]
  - [paper - rlhf-pytorch](https://github.com/conceptofmind/LaMDA-rlhf-pytorch)\]
  - [paper - llama/llama/tree/llama_v1)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[llamafile](https://github.com/Mozilla-Ocho/llamafile)\]
  - [paper - llama/llama)\]\[[llama2.c](https://github.com/karpathy/llama2.c)\]\[[lit-llama](https://github.com/Lightning-AI/lit-llama)\]\[[litgpt](https://github.com/Lightning-AI/litgpt)\]
  - [blog - llama/llama3)\]\[[llama-models](https://github.com/meta-llama/llama-models)\]\[[llama-recipes](https://github.com/meta-llama/llama-recipes)\]\[[llama-agentic-system](https://github.com/meta-llama/llama-agentic-system)\]\[[LLM Adaptation](https://ai.meta.com/blog/adapting-large-language-models-llms/)\]\[[llama3-from-scratch](https://github.com/naklecha/llama3-from-scratch)\]\[[nano-llama31](https://github.com/karpathy/nano-llama31)\]\[[minimind](https://github.com/jingyaogong/minimind)\]
  - [paper - inference)\]\[[model](https://huggingface.co/mistralai)\]\[[mistral-finetune](https://github.com/mistralai/mistral-finetune)\]
  - [paper
  - [paper
  - [paper - pytorch](https://github.com/lucidrains/PaLM-pytorch)\]\[[PaLM-rlhf-pytorch](https://github.com/lucidrains/PaLM-rlhf-pytorch)\]\[[PaLM](https://github.com/conceptofmind/PaLM)\]
  - [paper
  - [paper - E)\]
  - [paper - research/text-to-text-transfer-transformer)\]\[[t5-pytorch](https://github.com/conceptofmind/t5-pytorch)\]\[[t5-pegasus-pytorch](https://github.com/renmada/t5-pegasus-pytorch)\]\[[nanoT5](https://github.com/PiotrNawrot/nanoT5)\]
  - [paper
  - [paper - research/t5x/blob/main/docs/models.md#flan-t5-checkpoints)\]
  - [paper - xl)\]
  - [paper
  - [paper - MARCO-Web-Search](https://github.com/microsoft/MS-MARCO-Web-Search)\]\[[WebWalker](https://github.com/Alibaba-NLP/WebWalker)\]
  - [blog - org/grok-1)\]\[[model](https://huggingface.co/xai-org/grok-1)\]\[[modelscope](https://modelscope.cn/models/AI-ModelScope/grok-1/summary)\]\[[hpcai-tech/grok-1](https://huggingface.co/hpcai-tech/grok-1)\]\[[dbrx](https://github.com/databricks/dbrx)\]\[[Command R+](https://huggingface.co/CohereForAI/c4ai-command-r-plus)\]\[[Command A](https://arxiv.org/abs/2504.00698)\]\[[snowflake-arctic](https://github.com/Snowflake-Labs/snowflake-arctic)\]
  - [paper - watermarking)\]\[[MarkLLM](https://github.com/THU-BPM/MarkLLM)\]
  - [paper
  - [paper
  - [blog
  - [paper
  - [paper
  - [paper
  - [paper - LLMs-on-device](https://github.com/NexaAI/Awesome-LLMs-on-device)\]
  - [paper - cdn.anthropic.com/fed9cc193a14b84131812372d8d5857f8f304c52/Model_Card_Claude_3_Addendum.pdf)\]
  - [paper - workshop)\]\[[model](https://huggingface.co/bigscience)\]
  - [paper
  - [paper
  - [paper
  - [paper - neox)\]
  - [paper - media/gemini/gemini_1_report.pdf)\]\[[Gemini 1.5](https://arxiv.org/abs/2403.05530)\]\[[Unofficial Implementation](https://github.com/kyegomez/Gemini)\]\[[MiniGemini](https://github.com/dvlab-research/MGM)\]
  - [paper - wpy/SeqXGPT)\]\[[llm-detect-ai](https://github.com/yanqiangmiffy/llm-detect-ai)\]\[[detect-gpt](https://github.com/eric-mitchell/detect-gpt)\]\[[fast-detect-gpt](https://github.com/baoguangsheng/fast-detect-gpt)\]\[[ImBD](https://github.com/Jiaqi-Chen-00/ImBD)\]\[[MAGE](https://github.com/yafuly/MAGE)\]
  - [paper
  - [paper
  - [paper
  - [paper - interpreter](https://github.com/e2b-dev/code-interpreter)\]
  - [paper
  - [paper
  - [paper - Shanghai/CTGSurvey)\]\[[guidance](https://github.com/guidance-ai/guidance)\]\[[outlines](https://github.com/outlines-dev/outlines)\]\[[instructor](https://github.com/instructor-ai/instructor)\]\[[marvin](https://github.com/PrefectHQ/marvin)\]
  - [awesome-llm-apps - Domain-LLM](https://github.com/luban-agi/Awesome-Domain-LLM)\]\[[agents](https://github.com/livekit/agents)\]\[[ai-app-lab](https://github.com/volcengine/ai-app-lab)\]
  - [paper - Agent-Paper-List)\]\[[Advances and Challenges in Foundation Agents](https://arxiv.org/abs/2504.01990)\]\[[awesome-foundation-agents](https://github.com/FoundationAgents/awesome-foundation-agents)\]
  - [paper
  - [paper
  - [paper
  - [paper - research/Eureka)\]\[[DrEureka](https://github.com/eureka-research/DrEureka)\]
  - [paper
  - [paper - arena-x/webarena)\]\[[visualwebarena](https://github.com/web-arena-x/visualwebarena)\]\[[agent-workflow-memory](https://github.com/zorazrw/agent-workflow-memory)\]\[[WindowsAgentArena](https://github.com/microsoft/WindowsAgentArena)\]
  - [paper - NLP-Group/SeeAct)\]\[[WebDreamer](https://github.com/OSU-NLP-Group/WebDreamer)\]
  - [paper - Agents/Cradle)\]
  - [paper - agent](https://github.com/modelscope/modelscope-agent)\]
  - [paper
  - [paper
  - [paper - agent](https://github.com/andrewyng/translation-agent)\]
  - [paper - zero](https://github.com/frdel/agent-zero)\]\[[AgentK](https://github.com/mikekelly/AgentK)\]\[[AFlow: Automating Agentic Workflow Generation](https://arxiv.org/abs/2410.10762)\]
  - [paper - survey/Awesome-Robotics-Foundation-Models)\]\[[Awesome-Implicit-NeRF-Robotics](https://github.com/zubair-irshad/Awesome-Implicit-NeRF-Robotics)\]
  - [paper - SYSU/Embodied_AI_Paper_List)\]
  - [paper - research/robotics_transformer)\]\[[IRASim](https://github.com/bytedance/IRASim)\]
  - [paper - deepmind/open_x_embodiment)\]
  - [blog - robotics-brings-ai-into-the-physical-world)\]
  - [paper - Embodied-AI/RoboGen)\]\[[Genesis](https://github.com/Genesis-Embodied-AI/Genesis)\]
  - [paper
  - [paper
  - [paper - pytorch](https://github.com/lucidrains/PaLM-pytorch)\]\[[PaLM-rlhf-pytorch](https://github.com/lucidrains/PaLM-rlhf-pytorch)\]\[[PaLM](https://github.com/conceptofmind/PaLM)\]
  - [paper
  - [paper - E)\]
  - [paper - research/text-to-text-transfer-transformer)\]\[[t5-pytorch](https://github.com/conceptofmind/t5-pytorch)\]\[[t5-pegasus-pytorch](https://github.com/renmada/t5-pegasus-pytorch)\]
  - [paper
  - [paper
  - [paper - research/t5x/blob/main/docs/models.md#flan-t5-checkpoints)\]
  - [paper - xl)\]
  - [paper
  - [paper - MARCO-Web-Search](https://github.com/microsoft/MS-MARCO-Web-Search)\]
  - [blog
  - [paper - Shanghai/CTGSurvey)\]\[[guidance](https://github.com/guidance-ai/guidance)\]\[[outlines](https://github.com/outlines-dev/outlines)\]
  - [ray - ai/gpt4all)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[dify](https://github.com/langgenius/dify)\]\[[mindsdb](https://github.com/mindsdb/mindsdb)\]\[[bisheng](https://github.com/dataelement/bisheng)\]\[[phidata](https://github.com/phidatahq/phidata)\]\[[guidance](https://github.com/guidance-ai/guidance)\]\[[outlines](https://github.com/outlines-dev/outlines)\]\[[jsonformer](https://github.com/1rgs/jsonformer)\]\[[fabric](https://github.com/danielmiessler/fabric)\]\[[mem0](https://github.com/mem0ai/mem0)\]
  - [awesome-llm-apps - Domain-LLM](https://github.com/luban-agi/Awesome-Domain-LLM)\]
  - [paper - Agent-Survey)\]\[[LLM-Agent-Paper-Digest](https://github.com/XueyangFeng/LLM-Agent-Paper-Digest)\]\[[awesome-lifelong-llm-agent](https://github.com/qianlima-lab/awesome-lifelong-llm-agent)\]
  - [paper - Agent-Paper-List)\]
  - [paper - 3846.12832)\]\[[code](https://github.com/yya518/FinBERT)\]\[[finBERT](https://github.com/ProsusAI/finBERT)\]\[[valuesimplex/FinBERT](https://github.com/valuesimplex/FinBERT)\]
  - [paper - Foundation/FinRobot)\]
  - [paper - Foundation/FinGPT)\]
  - [paper - Foundation/FinGPT/tree/master/fingpt/FinGPT_RAG/instruct-FinGPT)\]
  - [paper - Foundation/FinRL)\]
  - [paper - Foundation/FinRL-Meta)\]
  - [paper - FinLLM)\]
  - [paper
  - [paper - DI/XuanYuan)\]
  - [paper - FinAI/PIXIU)\]
  - [paper
  - [paper - team/rllm)\]
  - [paper - Copilot)\]
  - [paper
  - [paper - proj/AlphaFin)\]
  - [paper
  - [paper
  - [paper
  - [paper - sim](https://github.com/ZhuiyiTechnology/roformer-sim)\]
  - [paper - louis/xm-retrievers)\]\[[model](https://huggingface.co/antoinelouis/colbert-xm)\]
  - [paper
  - [paper
  - [paper - rag-learning](https://github.com/mangopy/direct-rag-learning)\]
  - [paper - LLM4IE-Papers)\]\[[UIE](https://github.com/universal-ie/UIE)\]\[[NERRE](https://github.com/LBNLP/NERRE)\]\[[uie_pytorch](https://github.com/HUSTAI/uie_pytorch)\]
  - [paper
  - [paper
  - [paper
  - [paper - NLPIR/GenIR-Survey)\]\[[Alipay Search](https://arxiv.org/abs/2503.21098)\]
  - [paper - ai/D2LLM)\]
  - [paper
  - [paper
  - [paper - ai/OSWorld)\]\[[AgentGym](https://github.com/WooooDyy/AgentGym)\]\[[Agent-as-a-Judge](https://arxiv.org/abs/2410.10934)\]\[[intellagent](https://github.com/plurai-ai/intellagent)\]\[[Survey on Evaluation of LLM-based Agents](https://arxiv.org/abs/2503.16416)\]\[[AgentRewardBench](https://github.com/McGill-NLP/agent-reward-bench)\]
  - [paper - cn/agents)\]
  - [paper - AGI/AutoAgents)\]
  - [paper
  - [paper - NLP-Group/Mind2Web)\]\[[AutoWebGLM](https://github.com/THUDM/AutoWebGLM)\]
  - [paper - arena-x/webarena)\]\[[visualwebarena](https://github.com/web-arena-x/visualwebarena)\]\[[agent-workflow-memory](https://github.com/zorazrw/agent-workflow-memory)\]\[[WindowsAgentArena](https://github.com/microsoft/WindowsAgentArena)\]
  - [paper - NLP-Group/SeeAct)\]
  - [paper - Agents/Cradle)\]
  - [paper - agent](https://github.com/modelscope/modelscope-agent)\]
  - [paper
  - [paper - zero](https://github.com/frdel/agent-zero)\]\[[AgentK](https://github.com/mikekelly/AgentK)\]
  - [paper - survey/Awesome-Robotics-Foundation-Models)\]
  - [paper - SYSU/Embodied_AI_Paper_List)\]
  - [paper - research/robotics_transformer)\]\[[IRASim](https://github.com/bytedance/IRASim)\]
  - [paper - 2)\]\[[RT-H: Action Hierarchies Using Language](https://arxiv.org/abs/2403.01823)\]\[[RoboMamba](https://arxiv.org/abs/2406.04339)\]
  - [paper - deepmind/open_x_embodiment)\]
  - [blog
  - [paper - Embodied-AI/RoboGen)\]
  - [paper
  - [paper - models/octo)\]\[[BodyTransformer](https://github.com/carlosferrazza/BodyTransformer)\]\[[crossformer](https://github.com/rail-berkeley/crossformer)\]\[[VideoMimic](https://arxiv.org/abs/2505.03729)\]
  - [paper
  - [AutoGPT - Engineer](https://github.com/gpt-engineer-org/gpt-engineer)\]\[[AgentGPT](https://github.com/reworkd/AgentGPT)\]
  - [BabyAGI
  - [paper
  - [Numina 1st Place Solution - numina/aimo-progress-prize](https://github.com/project-numina/aimo-progress-prize)\]\[[How NuminaMath Won the 1st AIMO Progress Prize](https://huggingface.co/blog/winning-aimo-progress-prize)\]\[[NuminaMath-7B-TIR](https://huggingface.co/AI-MO/NuminaMath-7B-TIR)\]\[[AI achieves silver-medal standard solving International Mathematical Olympiad problems](https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/)\]
  - [paper - in-Health/MedLLMsPracticalGuide)\]\[[LLM-for-Healthcare](https://github.com/KaiHe-better/LLM-for-Healthcare)\]\[[GMAI-MMBench](https://github.com/uni-medical/GMAI-MMBench)\]
  - [paper
  - [paper - YuanGroup/ChatLaw)\]\[[HK-O1aw](https://github.com/HKAIR-Lab/HK-O1aw)\]
  - [paper - LawLLM)\]
  - [paper - MedLLM)\]
  - [paper
  - [paper - Code-LLM](https://github.com/codefuse-ai/Awesome-Code-LLM)\]\[[MFTCoder](https://github.com/codefuse-ai/MFTCoder)\]\[[Awesome-Code-LLM](https://github.com/huybery/Awesome-Code-LLM)\]
  - [paper
  - [paper - llama/codellama)\]\[[model](https://huggingface.co/codellama)\]\[[llamacoder](https://github.com/Nutlope/llamacoder)\]
  - [blog
  - [paper - deepmind/code_contests)\]\[[AlphaCode2_Tech_Report](https://storage.googleapis.com/deepmind-media/AlphaCode2/AlphaCode2_Tech_Report.pdf)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - project/starcoder2)\]\[[starcoder.cpp](https://github.com/bigcode-project/starcoder.cpp)\]
  - [paper
  - [paper - uiuc/magicoder)\]
  - [paper - LLaMA-Alpaca)\]\[[Chinese-LLaMA-Alpaca-2](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2)\]\[[Chinese-LLaMA-Alpaca-3](https://github.com/ymcui/Chinese-LLaMA-Alpaca-3)\]\[[baby-llama2-chinese](https://github.com/DLLXW/baby-llama2-chinese)\]
  - [paper
  - [paper - GSAI/Llama-3-SynE)\]
  - [MiniCPM - MoE](https://github.com/SkyworkAI/Skywork-MoE)\]\[[Orion](https://github.com/OrionStarAI/Orion)\]\[[BELLE](https://github.com/LianjiaTech/BELLE)\]\[[Yuan-2.0](https://github.com/IEIT-Yuan/Yuan-2.0)\]\[[Yuan2.0-M32](https://github.com/IEIT-Yuan/Yuan2.0-M32)\]\[[Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)\]\[[Index-1.9B](https://github.com/bilibili/Index-1.9B)\]\[[Aquila2](https://github.com/FlagAI-Open/Aquila2)\]
  - [LlamaFamily/Llama-Chinese - AI/Chinese-Llama-2-7b](https://github.com/LinkSoul-AI/Chinese-Llama-2-7b)\]\[[llama3-Chinese-chat](https://github.com/CrazyBoyM/llama3-Chinese-chat)\]\[[phi3-Chinese](https://github.com/CrazyBoyM/phi3-Chinese)\]\[[LLM-Chinese](https://github.com/CrazyBoyM/LLM-Chinese)\]\[[Llama3-Chinese-Chat](https://github.com/Shenzhi-Wang/Llama3-Chinese-Chat)\]\[[llama3-chinese](https://github.com/seanzhang-zhichen/llama3-chinese)\]
  - [Firefly - chitchat](https://github.com/yangjianxin1/GPT2-chitchat)\]
  - [paper - CoT)\]
  - [paper - Researcher)\]
  - [Awesome-Scientific-Language-Models - husky/gpt_academic)\]\[[ChatPaper](https://github.com/kaixindelele/ChatPaper)\]\[[scispacy](https://github.com/allenai/scispacy)\]\[[awesome-ai4s](https://github.com/hyperai/awesome-ai4s)\]\[[xVal](https://github.com/PolymathicAI/xVal)\]
  - [paper
  - [paper - agent](https://github.com/andrewyng/translation-agent)\]
  - [blog
  - [translation-agent - zero](https://github.com/frdel/agent-zero)\]\[[AgentK](https://github.com/mikekelly/AgentK)\]\[[Twitter Personality](https://github.com/wordware-ai/twitter)\]\[[RD-Agent](https://github.com/microsoft/RD-Agent)\]
  - [paper
  - [paper
  - [paper - ai/geogalactica)\]\[[sciparser](https://github.com/davendw49/sciparser)\]
  - [paper - ZJU/Scientific-LLM-Survey)\]\[[sciknoweval](https://github.com/hicai-zju/sciknoweval)\]
  - [paper
  - [paper - 7B-Chat)\]
  - [paper - research/scFoundation)\]
  - [paper
  - [paper - sea/sea)\]\[[AgentReview](https://github.com/Ahren09/AgentReview)\]\[[Researcher](https://github.com/zhu-minjun/Researcher)\]
  - [paper
  - [blog
  - [blog - token-context-windows)\]
  - [blog
  - [paper - pile](https://github.com/EleutherAI/the-pile)\]
  - [paper - workshop/data-preparation)\]\[[dataset](https://huggingface.co/bigscience-data)\]
  - [paper - refinedweb)\]
  - [paper - juicer)\]
  - [paper
  - [paper
  - [paper
  - [paper - LLMs-Datasets](https://github.com/lmmlzn/Awesome-LLMs-Datasets)\]
  - [paper - dev/datadreamer)\]
  - [paper - Tan-dmml/LLM4Annotation)\]
  - [paper - sg/regmix)\]\[[CLIMB](https://arxiv.org/abs/2504.13161)\]\[[QuaDMix](https://arxiv.org/abs/2504.16511)\]
  - [paper - a-p/COIG-CQIA)\]
  - [paper
  - [paper - fineweb-v1)\]\[[fineweb](https://huggingface.co/datasets/HuggingFaceFW/fineweb)\]\[[fineweb-edu](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu)\]
  - [paper - ailab/persona-hub)\]\[[MAGA](https://arxiv.org/abs/2502.04235)\]
  - [paper - models/octo)\]\[[BodyTransformer](https://github.com/carlosferrazza/BodyTransformer)\]\[[crossformer](https://github.com/rail-berkeley/crossformer)\]
  - [paper - research/scFoundation)\]
  - [paper
  - [paper - sea/sea)\]
  - [paper - NLP/OpenResearcher)\]\[[Paper Copilot](https://arxiv.org/abs/2409.04593)\]\[[SciAgentsDiscovery](https://github.com/lamm-mit/SciAgentsDiscovery)\]\[[paper-qa](https://github.com/Future-House/paper-qa)\]
  - [paper - llama/codellama)\]\[[model](https://huggingface.co/codellama)\]\[[llamacoder](https://github.com/Nutlope/llamacoder)\]
  - [blog - media/gemma/codegemma_report.pdf)\]
  - [paper - deepmind/code_contests)\]\[[AlphaCode2_Tech_Report](https://storage.googleapis.com/deepmind-media/AlphaCode2/AlphaCode2_Tech_Report.pdf)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - project/starcoder)\]\[[bigcode-project](https://github.com/bigcode-project)\]\[[model](https://huggingface.co/bigcode)\]
  - [paper - project/starcoder2)\]\[[starcoder.cpp](https://github.com/bigcode-project/starcoder.cpp)\]
  - [paper
  - [paper - uiuc/magicoder)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - Hands-AI/OpenHands)\]\[[open-operator](https://github.com/All-Hands-AI/open-operator)\]\[[potpie](https://github.com/potpie-ai/potpie)\]
  - [paper - Paper-List)\]\[[Challenges and Paths Towards AI for Software Engineering](https://arxiv.org/abs/2503.22625)\]
  - [Yi-Coder - 7B](https://github.com/aixcoder-plugin/aiXcoder-7B)\]\[[codealpaca](https://github.com/sahil280114/codealpaca)\]
  - [screenshot-to-code - ai/vanna)\]\[[NL2SQL_Handbook](https://github.com/HKUSTDial/NL2SQL_Handbook)\]\[[TAG-Bench](https://github.com/TAG-Research/TAG-Bench)\]\[[Spider2](https://github.com/xlang-ai/Spider2)\]\[[WrenAI](https://github.com/Canner/WrenAI)\]
  - [paper
  - [paper - Foundation/FinRobot)\]
  - [paper - Foundation/FinGPT)\]
  - [paper - Foundation/FinGPT/tree/master/fingpt/FinGPT_RAG/instruct-FinGPT)\]
  - [paper - Foundation/FinRL)\]
  - [paper - Foundation/FinRL-Meta)\]
  - [paper - FinLLM)\]
  - [paper
  - [paper - DI/XuanYuan)\]
  - [paper - FinAI/PIXIU)\]
  - [paper
  - [paper - team/rllm)\]
  - [paper - Copilot)\]
  - [paper
  - [paper - proj/AlphaFin)\]
  - [paper
  - [paper
  - [paper
  - [gpt-investor - quant](https://github.com/goldmansachs/gs-quant)\]\[[stockbot-on-groq](https://github.com/bklieger-groq/stockbot-on-groq)\]\[[Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN](https://github.com/THINK989/Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN)\]\[[openbb-agents](https://github.com/OpenBB-finance/openbb-agents)\]\[[ai-hedge-fund](https://github.com/virattt/ai-hedge-fund)\]\[[ai-financial-agent](https://github.com/virattt/ai-financial-agent)\]\[[Finance](https://github.com/shashankvemuri/Finance)\]
  - [paper - sim](https://github.com/ZhuiyiTechnology/roformer-sim)\]
  - [paper - futuredata/ColBERT)\]\[[RAGatouille](https://github.com/AnswerDotAI/RAGatouille)\]\[[rerankers](https://github.com/AnswerDotAI/rerankers)\]\[[Rankify](https://github.com/DataScienceUIBK/Rankify)\]\[[A Reproducibility Study of PLAID](https://arxiv.org/abs/2404.14989)\]\[[Jina-ColBERT-v2](https://arxiv.org/abs/2408.16672)\]
  - [paper - louis/xm-retrievers)\]\[[model](https://huggingface.co/antoinelouis/colbert-xm)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - project/bigcodebench)\]\[[LiveCodeBench](https://github.com/LiveCodeBench/LiveCodeBench)\]\[[evalplus](https://github.com/evalplus/evalplus)\]\[[BigOBench](https://arxiv.org/abs/2503.15242)\]
  - [paper - Hands-AI/OpenHands)\]
  - [paper - MCTS](https://github.com/DIRECT-BIT/SRA-MCTS)\]
  - [paper - Paper-List)\]
  - [Yi-Coder - 7B](https://github.com/aixcoder-plugin/aiXcoder-7B)\]\[[codealpaca](https://github.com/sahil280114/codealpaca)\]
  - [screenshot-to-code - ai/vanna)\]\[[NL2SQL_Handbook](https://github.com/HKUSTDial/NL2SQL_Handbook)\]\[[TAG-Bench](https://github.com/TAG-Research/TAG-Bench)\]
  - [paper
  - [paper
  - [paper - Berry](https://arxiv.org/abs/2410.02884)\]
  - [paper - LLaVA)\]
  - [paper - Math/We-Math)\]\[[URSA](https://arxiv.org/abs/2501.04686)\]
  - [paper
  - [paper - ai/OpenDiLoCo)\]\[[DiLoCo](https://arxiv.org/abs/2311.08105)\]\[[DisTrO](https://github.com/NousResearch/DisTrO)\]
  - [paper
  - [paper - platform)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]
  - [wandb
  - [paper - Alignment](https://github.com/PKU-Alignment)\]
  - [paper
  - [paper
  - [paper
  - [paper - LLM-preference-learning)\]
  - [alignment-handbook
  - [paper - instruct)\]\[[open-instruct](https://github.com/allenai/open-instruct)\]\[[Multi-modal-Self-instruct](https://github.com/zwq2018/Multi-modal-Self-instruct)\]\[[evol-instruct](https://github.com/nlpxucan/evol-instruct)\]\[[MMEvol](https://arxiv.org/abs/2409.05840)\]\[[Automatic Instruction Evolving for Large Language Models](https://arxiv.org/abs/2406.00770)\]
  - [paper
  - [paper - align/magpie)\]\[[Condor](https://arxiv.org/abs/2501.12273)\]
  - [MOSS-RLHF - NLP/OctoThinker)\]\[[limit-of-RLVR](https://arxiv.org/abs/2504.13837)\]
  - [paper - Alignment/safe-rlhf)\]\[[align-anything](https://github.com/PKU-Alignment/align-anything)\]\[[Safe-Policy-Optimization](https://github.com/PKU-Alignment/Safe-Policy-Optimization)\]
  - [paper
  - [paper - Reward-Modeling)\]\[[Online-RLHF](https://github.com/RLHFlow/Online-RLHF)\]\[[Online-DPO-R1](https://github.com/RLHFlow/Online-DPO-R1)\]\[[Minimal-RL](https://github.com/RLHFlow/Minimal-RL)\]
  - [paper - NLP/LIMO)\]\[[LIMR](https://github.com/GAIR-NLP/LIMR)\]
  - [paper - mitchell/direct-preference-optimization)\]\[[trl](https://github.com/huggingface/trl)\]\[[dpo_trainer](https://github.com/huggingface/trl/blob/main/trl/trainer/dpo_trainer.py)\]
  - [paper - coai/BPO)\]
  - [paper
  - [paper - level-Direct-Preference-Optimization)\]\[[Step-DPO](https://github.com/dvlab-research/Step-DPO)\]\[[FineGrainedRLHF](https://github.com/allenai/FineGrainedRLHF)\]\[[MCTS-DPO](https://github.com/YuxiXie/MCTS-DPO)\]\[[Critical Tokens Matter](https://arxiv.org/abs/2411.19943)\]
  - [paper - nlp/SimPO)\]
  - [paper - RLAIF](https://github.com/mengdi-li/awesome-RLAIF)\]
  - [paper
  - [paper
  - [paper - handbook)\]
  - [paper - self-play)\]
  - [paper - play Methods in Reinforcement Learning](https://arxiv.org/abs/2408.01072)\]
  - [paper - pytorch](https://github.com/lucidrains/CALM-pytorch)\]
  - [paper - rewarding-lm-pytorch)\]\[[Meta-Rewarding Language Models](https://arxiv.org/abs/2407.19594)\]\[[Self-Taught Evaluators](https://arxiv.org/abs/2408.02666)\]
  - [paper
  - [paper
  - [paper - LM/Xwin-LM)\]
  - [paper - auto-alignment)\]
  - [blog
  - [paper - LLM4IE-Papers)\]\[[UIE](https://github.com/universal-ie/UIE)\]\[[NERRE](https://github.com/LBNLP/NERRE)\]\[[uie_pytorch](https://github.com/HUSTAI/uie_pytorch)\]
  - [paper
  - [paper
  - [paper
  - [paper - NLPIR/GenIR-Survey)\]
  - [paper - ai/D2LLM)\]
  - [paper
  - [paper - NLP/WebWalker)\]
  - [paper
  - [link
  - [link
  - [similarities - dev/leettools)\]
  - [SearchEngine - labs](https://github.com/elastic/elasticsearch-labs)\]\[[tevatron](https://github.com/texttron/tevatron)\]\[[txtai](https://github.com/neuml/txtai)\]
  - [paper
  - [paper - compass/MathBench)\]\[[OlympiadBench](https://github.com/OpenBMB/OlympiadBench)\]\[[Math-Verify](https://github.com/huggingface/Math-Verify)\]\[[MathPile](https://github.com/GAIR-NLP/MathPile)\]\[[DeepMath-103K](https://arxiv.org/abs/2504.11456)\]\[[VCBench](https://arxiv.org/abs/2504.18589)\]
  - [paper - Math)\]
  - [paper - LM/Xwin-LM/tree/main/Xwin-Math)\]
  - [paper - Math)\]
  - [paper - Math-Reasoning/Super_MARIO)\]
  - [paper
  - [paper
  - [paper - LLaVA)\]
  - [paper - Math/We-Math)\]
  - [paper
  - [Numina 1st Place Solution - numina/aimo-progress-prize](https://github.com/project-numina/aimo-progress-prize)\]\[[How NuminaMath Won the 1st AIMO Progress Prize](https://huggingface.co/blog/winning-aimo-progress-prize)\]\[[NuminaMath-7B-TIR](https://huggingface.co/AI-MO/NuminaMath-7B-TIR)\]\[[AI achieves silver-medal standard solving International Mathematical Olympiad problems](https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/)\]
  - [paper - in-Health/MedLLMsPracticalGuide)\]\[[LLM-for-Healthcare](https://github.com/KaiHe-better/LLM-for-Healthcare)\]\[[GMAI-MMBench](https://github.com/uni-medical/GMAI-MMBench)\]
  - [paper
  - [paper - YuanGroup/ChatLaw)\]
  - [paper - LawLLM)\]
  - [paper - MedLLM)\]
  - [paper
  - [paper
  - [paper - PaLM)\]
  - [paper
  - [paper - pytorch](https://github.com/lucidrains/AMIE-pytorch)\]
  - [paper
  - [paper
  - [paper - yuexi/AgentCourt)\]
  - [paper - deeplearning](https://github.com/alibaba/x-deeplearning)\]
  - [paper - Torch](https://github.com/shenweichen/DeepCTR-Torch)\]\[[pytorch-mmoe](https://github.com/ZhichenZhao/pytorch-mmoe)\]
  - [paper
  - [paper
  - [paper - GSAI/YuLan-Rec)\]\[[Scaling Law of Large Sequential Recommendation Models](https://arxiv.org/abs/2311.11351)\]
  - [paper - SSLRec-Papers](https://github.com/HKUDS/Awesome-SSLRec-Papers)\]
  - [paper
  - [paper
  - [paper
  - [link
  - [link
  - [similarities
  - [SearchEngine
  - [paper
  - [paper - math/MetaMath)\]
  - [paper - compass/MathBench)\]\[[OlympiadBench](https://github.com/OpenBMB/OlympiadBench)\]
  - [paper - Math)\]
  - [paper - ai/DeepSeek-Math)\]\[[DeepSeek-Prover-V1.5](https://github.com/deepseek-ai/DeepSeek-Prover-V1.5)\]
  - [paper - LM/Xwin-LM/tree/main/Xwin-Math)\]
  - [paper - Math)\]
  - [paper - Math-Reasoning/Super_MARIO)\]
  - [paper
  - [paper
  - [paper - deepmind/opro)\]
  - [paper - Lab/ATLAS)\]
  - [paper - team/appl)\]\[[sammo](https://github.com/microsoft/sammo)\]\[[prompt-poet](https://github.com/character-ai/prompt-poet)\]\[[ell](https://github.com/MadcowD/ell)\]
  - [paper
  - [paper
  - [paper - research/prompt-tuning)\]\[[soft-prompt-tuning](https://github.com/kipgparker/soft-prompt-tuning)\]\[[Prompt-Tuning](https://github.com/mkshing/Prompt-Tuning)\]
  - [paper
  - [paper
  - [paper - demonstrations)\]
  - [paper
  - [paper - machines/pal)\]\[[CodeAct](https://github.com/xingyaoww/code-act)\]
  - [paper - instruction-learning)\]
  - [paper
  - [paper - human-preferences)\]\[[lm-human-preference-details](https://github.com/vwxyzjn/lm-human-preference-details)\]
  - [paper - from-feedback)\]
  - [paper - li/Instruction-Tuning-Survey)\]
  - [paper
  - [paper - KGLLM/RAG-Survey)\]\[[Modular RAG](https://arxiv.org/abs/2407.21059)\]
  - [paper - Survey)\]
  - [paper - Augmented Generation for Natural Language Processing: A Survey](https://arxiv.org/abs/2407.13193)\]\[[A Survey on RAG Meeting LLMs](https://arxiv.org/abs/2405.06211)\]\[[A Comprehensive Survey of Retrieval-Augmented Generation](https://arxiv.org/abs/2410.12837)\]
  - [paper
  - [paper - token-nq)\]\[[docs](https://huggingface.co/docs/transformers/main/model_doc/rag)\]\[[FAISS](https://github.com/facebookresearch/faiss)\]
  - [paper - rag)\]\[[CRAG](https://github.com/HuskyInSalt/CRAG)\]\[[Golden-Retriever](https://arxiv.org/abs/2408.00798)\]
  - [paper
  - [paper
  - [paper - pytorch](https://github.com/lucidrains/RETRO-pytorch)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - community/TrustRAG)\]
  - [paper
  - [paper
  - [paper - isf)\]
  - [paper - RAG)\]\[[Adaptive-RAG](https://github.com/starsuzi/Adaptive-RAG)\]\[[Advanced RAG 11: Query Classification and Refinement](https://ai.gopubby.com/advanced-rag-11-query-classification-and-refinement-2aec79f4140b)\]
  - [paper - ecosystem-engineering/Blended-RAG)\]\[[infinity](https://github.com/infiniflow/infinity)\]
  - [paper - NLP-Group/HippoRAG)\]\[[HippoRAG 2](https://arxiv.org/abs/2502.14802)\]
  - [paper - harvard/TxAgent)\]\[[MedAgent-Pro](https://arxiv.org/abs/2503.18968)\]
  - [paper
  - [paper - PaLM)\]
  - [paper
  - [paper - pytorch](https://github.com/lucidrains/AMIE-pytorch)\]
  - [paper
  - [paper
  - [paper - yuexi/AgentCourt)\]
  - [paper - deeplearning](https://github.com/alibaba/x-deeplearning)\]
  - [paper - Torch](https://github.com/shenweichen/DeepCTR-Torch)\]\[[pytorch-mmoe](https://github.com/ZhichenZhao/pytorch-mmoe)\]
  - [paper
  - [paper
  - [paper - GSAI/YuLan-Rec)\]
  - [paper - SSLRec-Papers](https://github.com/HKUDS/Awesome-SSLRec-Papers)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - recommendation)\]\[[Towards An Efficient LLM Training Paradigm for CTR Prediction](https://arxiv.org/abs/2503.01001)\]
  - [paper
  - [paper
  - [paper - Tool-Survey)\]
  - [paper - pytorch](https://github.com/lucidrains/toolformer-pytorch)\]\[[conceptofmind/toolformer](https://github.com/conceptofmind/toolformer)\]\[[xrsrke/toolformer](https://github.com/xrsrke/toolformer)\]\[[Graph_Toolformer](https://github.com/jwzhanggy/Graph_Toolformer)\]
  - [paper - MT/StableToolBench)\]
  - [paper - calling)\]
  - [paper - CVC/GPT4Tools)\]
  - [paper - Song793/RestGPT)\]
  - [paper
  - [paper - trial-and-error)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [functionary - tool-llm](https://github.com/zorazrw/awesome-tool-llm)\]
  - [Awesome-LLM-Eval - eval-survey](https://github.com/MLGroupJLU/LLM-eval-survey)\]\[[llm_benchmarks](https://github.com/leobeeson/llm_benchmarks)\]\[[Awesome-LLMs-Evaluation-Papers](https://github.com/tjunlp-lab/Awesome-LLMs-Evaluation-Papers)\]
  - [paper
  - [paper - instructions)\]
  - [paper - crfm/helm)\]
  - [paper - sys/FastChat/tree/main/fastchat/llm_judge)\]
  - [paper - RAG](https://github.com/CLUEbenchmark/SuperCLUE-RAG)\]
  - [paper - nlp/ceval)\]\[[chinese-llm-benchmark](https://github.com/jeinlee1991/chinese-llm-benchmark)\]
  - [paper - li/CMMLU)\]
  - [paper - Benchmark/CMMMU)\]
  - [paper
  - [paper - eval/prometheus-eval)\]
  - [paper - Lab/lmms-eval)\]
  - [paper - Benchmark/MMMU)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - recommendation)\]
  - [paper
  - [paper
  - [fun-rec - RecommenderSystem](https://github.com/zhongqiangwu960812/AI-RecommenderSystem)\]\[[RecSysPapers](https://github.com/tangxyw/RecSysPapers)\]\[[Algorithm-Practice-in-Industry](https://github.com/Doragd/Algorithm-Practice-in-Industry)\]\[[AlgoNotes](https://github.com/shenweichen/AlgoNotes)\]\[[torch-rechub](https://github.com/datawhalechina/torch-rechub)\]
  - [paper
  - [paper - Tool-Survey)\]
  - [paper - pytorch](https://github.com/lucidrains/toolformer-pytorch)\]\[[conceptofmind/toolformer](https://github.com/conceptofmind/toolformer)\]\[[xrsrke/toolformer](https://github.com/xrsrke/toolformer)\]\[[Graph_Toolformer](https://github.com/jwzhanggy/Graph_Toolformer)\]
  - [paper - MT/StableToolBench)\]
  - [paper
  - [paper - CVC/GPT4Tools)\]
  - [paper - Song793/RestGPT)\]
  - [paper
  - [paper - ToolMaker)\]
  - [paper - chen/ToolQA)\]\[[toolbench](https://github.com/sambanova/toolbench)\]
  - [paper
  - [paper - llm)\]
  - [paper - Ye/ToolEyes)\]
  - [blog
  - [blog - architecture-blogpost-encoders-prefixlm-denoising)\]\[[New LLM Pre-training and Post-training Paradigms](https://magazine.sebastianraschka.com/p/new-llm-pre-training-and-post-training)\]
  - [paper
  - [paper
  - [paper - h100-clusters-power-network)\]\[[Parameter Server OSDI 2014](https://www.usenix.org/system/files/conference/osdi14/osdi14-paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]\[[ByteCheckpoint](https://arxiv.org/abs/2407.20143)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - 101B)\]\[[Tele-FLM](https://huggingface.co/CofeAI/Tele-FLM)\]
  - [paper - Tuning-with-GPT-4/GPT-4-LLM)\]
  - [paper
  - [paper - ye/OpenFedLLM)\]
  - [paper - ai/MergeKit)\]\[[DistillKit](https://github.com/arcee-ai/DistillKit)\]\[[A Survey on Collaborative Strategies in the Era of Large Language Models](https://arxiv.org/abs/2407.06089)\]\[[FuseAI](https://github.com/fanqiwan/FuseAI)\]\[[MergeLM](https://github.com/yule-BUAA/MergeLM)\]\[[Long-to-Short-via-Model-Merging](https://github.com/hahahawu/Long-to-Short-via-Model-Merging)\]
  - [paper - ConvAI/tree/main/Awesome-Self-Evolution-of-LLM)\]
  - [paper - mini)\]\[[Adam](https://arxiv.org/abs/1412.6980)\]\[[AdamW](https://arxiv.org/abs/1711.05101)\]
  - [paper - sys/routellm)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [blog
  - [paper - Quantization-Papers](https://github.com/Zhen-Dong/Awesome-Quantization-Papers)\]\[[Awesome-Model-Quantization](https://github.com/Efficient-ML/Awesome-Model-Quantization)\]\[[qllm-eval](https://github.com/thu-nics/qllm-eval)\]
  - [paper
  - [paper - LLM-Inference](https://github.com/xlite-dev/Awesome-LLM-Inference)\]\[[A Survey on Inference Engines for Large Language Models](https://arxiv.org/abs/2505.01658)\]
  - [paper - foundation/bitsandbytes)\]
  - [paper - FP4)\]
  - [paper - instruct)\]\[[open-instruct](https://github.com/allenai/open-instruct)\]\[[Multi-modal-Self-instruct](https://github.com/zwq2018/Multi-modal-Self-instruct)\]\[[evol-instruct](https://github.com/nlpxucan/evol-instruct)\]\[[MMEvol](https://arxiv.org/abs/2409.05840)\]\[[Automatic Instruction Evolving for Large Language Models](https://arxiv.org/abs/2406.00770)\]
  - [paper
  - [paper - align/magpie)\]
  - [hf blog - from-human-preferences)\]\[[alignment blog](https://openai.com/blog/our-approach-to-alignment-research)\]\[[awesome-RLHF](https://github.com/opendilab/awesome-RLHF)\]
  - [MOSS-RLHF
  - [paper - Alignment/safe-rlhf)\]\[[align-anything](https://github.com/PKU-Alignment/align-anything)\]
  - [paper
  - [paper - Reward-Modeling)\]
  - [paper
  - [paper
  - [paper - mitchell/direct-preference-optimization)\]\[[trl](https://github.com/huggingface/trl)\]\[[dpo_trainer](https://github.com/huggingface/trl/blob/main/trl/trainer/dpo_trainer.py)\]
  - [paper - coai/BPO)\]
  - [paper
  - [paper - level-Direct-Preference-Optimization)\]\[[Step-DPO](https://github.com/dvlab-research/Step-DPO)\]\[[FineGrainedRLHF](https://github.com/allenai/FineGrainedRLHF)\]\[[MCTS-DPO](https://github.com/YuxiXie/MCTS-DPO)\]
  - [paper - nlp/SimPO)\]
  - [paper - RLAIF](https://github.com/mengdi-li/awesome-RLAIF)\]
  - [paper
  - [paper
  - [paper - handbook)\]
  - [paper - self-play)\]
  - [paper - play Methods in Reinforcement Learning](https://arxiv.org/abs/2408.01072)\]
  - [paper - pytorch](https://github.com/lucidrains/CALM-pytorch)\]
  - [paper - rewarding-lm-pytorch)\]\[[Meta-Rewarding Language Models](https://arxiv.org/abs/2407.19594)\]\[[Self-Taught Evaluators](https://arxiv.org/abs/2408.02666)\]
  - [paper
  - [paper
  - [paper
  - [paper - Knowledge-Distillation-of-LLMs)\]
  - [blog
  - [paper - Aligner)\]\[[NeMo-Curator](https://github.com/NVIDIA/NeMo-Curator)\]\[[Nemotron-4 340B Technical Report](https://d1qx31qr3h6wln.cloudfront.net/publications/Nemotron_4_340B_8T.pdf)\]\[[Mistral NeMo](https://mistral.ai/news/mistral-nemo/)\]\[[SparseLLM](https://github.com/BaiTheBest/SparseLLM)\]\[[MaskLLM](https://github.com/NVlabs/MaskLLM)\]\[[HelpSteer2-Preference](https://arxiv.org/abs/2410.01257)\]
  - [paper - LM/Xwin-LM)\]
  - [paper - auto-alignment)\]
  - [blog - verifier-games-improve-legibility-of-llm-outputs/legibility.pdf)\]
  - [blog - rbr-code-and-data)\]
  - [paper - Self-Guide)\]\[[prompt2model](https://github.com/neulab/prompt2model)\]
  - [paper
  - [paper
  - [paper - memory-transformer/tree/aaai24)\]\[[LM-RMT](https://github.com/booydar/LM-RMT)\]
  - [paper - cn/RecurrentGPT)\]
  - [paper
  - [paper
  - [paper - research/LongLoRA)\]
  - [paper - han-lab/streaming-llm)\]\[[SwiftInfer](https://github.com/hpcaitech/SwiftInfer)\]\[[SwiftInfer blog](https://hpc-ai.com/blog/colossal-ai-swiftinfer)\]
  - [paper
  - [paper
  - [paper
  - [paper - LLM-Long-Context-Modeling](https://github.com/Xnhyacinth/Awesome-LLM-Long-Context-Modeling)\]
  - [paper - Context-Data-Engineering)\]
  - [paper - nlp/CEPE)\]
  - [blog - verifier-games-improve-legibility-of-llm-outputs/legibility.pdf)\]
  - [blog - based-rewards-for-language-model-safety.pdf)\]\[[code](https://github.com/openai/safety-rbr-code-and-data)\]
  - [paper - Self-Guide)\]\[[prompt2model](https://github.com/neulab/prompt2model)\]
  - [paper
  - [paper
  - [paper - memory-transformer/tree/aaai24)\]\[[LM-RMT](https://github.com/booydar/LM-RMT)\]
  - [paper - cn/RecurrentGPT)\]
  - [paper
  - [paper
  - [paper - research/LongLoRA)\]
  - [paper - han-lab/streaming-llm)\]\[[SwiftInfer](https://github.com/hpcaitech/SwiftInfer)\]\[[SwiftInfer blog](https://hpc-ai.com/blog/colossal-ai-swiftinfer)\]
  - [paper
  - [paper - attention-pytorch](https://github.com/lucidrains/ring-attention-pytorch)\]\[[ring-flash-attention](https://github.com/zhuzilin/ring-flash-attention)\]\[[local-attention](https://github.com/lucidrains/local-attention)\]\[[tree_attention](https://github.com/Zyphra/tree_attention)\]
  - [paper
  - [paper
  - [paper
  - [paper - LLM-Long-Context-Modeling](https://github.com/Xnhyacinth/Awesome-LLM-Long-Context-Modeling)\]
  - [paper - Context-Data-Engineering)\]
  - [paper - nlp/CEPE)\]
  - [paper
  - [paper
  - [paper - Stars)\]\[[LLMTest_NeedleInAHaystack](https://github.com/gkamradt/LLMTest_NeedleInAHaystack)\]\[[RULER](https://github.com/NVIDIA/RULER)\]\[[LooGLE](https://github.com/bigai-nlco/LooGLE)\]\[[LongBench](https://github.com/THUDM/LongBench)\]\[[google-deepmind/loft](https://github.com/google-deepmind/loft)\]
  - [paper - transformer-pytorch](https://github.com/lucidrains/infini-transformer-pytorch)\]\[[InfiniTransformer](https://github.com/Beomi/InfiniTransformer)\]\[[infini-mini-transformer](https://github.com/jiahe7ay/infini-mini-transformer)\]\[[megalodon](https://github.com/XuezheMax/megalodon)\]\[[InfiniteHiP](https://arxiv.org/abs/2502.08910)\]
  - [paper
  - [paper
  - [paper
  - [blog
  - [blog - token-context-windows)\]
  - [blog
  - [paper
  - [paper - workshop/data-preparation)\]\[[dataset](https://huggingface.co/bigscience-data)\]
  - [paper - refinedweb)\]
  - [paper - juicer)\]
  - [paper
  - [paper
  - [paper - Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)\]
  - [paper
  - [paper - LLMs-Datasets](https://github.com/lmmlzn/Awesome-LLMs-Datasets)\]
  - [paper - dev/datadreamer)\]
  - [paper - Tan-dmml/LLM4Annotation)\]
  - [paper
  - [paper - a-p/COIG-CQIA)\]
  - [paper
  - [paper - fineweb-v1)\]\[[fineweb](https://huggingface.co/datasets/HuggingFaceFW/fineweb)\]\[[fineweb-edu](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu)\]
  - [paper - 7B-8k](https://huggingface.co/apple/DCLM-7B-8k)\]\[[data-agora](https://github.com/neulab/data-agora)\]\[[Data Selection via Optimal Control for Language Models](https://arxiv.org/abs/2410.07064)\]
  - [paper - ailab/persona-hub)\]
  - [llm-datasets - LLM-Synthetic-Data](https://github.com/wasiahmad/Awesome-LLM-Synthetic-Data)\]
  - [paper
  - [paper
  - [paper - Stars)\]\[[LLMTest_NeedleInAHaystack](https://github.com/gkamradt/LLMTest_NeedleInAHaystack)\]\[[LooGLE](https://github.com/bigai-nlco/LooGLE)\]\[[LongBench](https://github.com/THUDM/LongBench)\]\[[google-deepmind/loft](https://github.com/google-deepmind/loft)\]
  - [paper - transformer-pytorch](https://github.com/lucidrains/infini-transformer-pytorch)\]\[[InfiniTransformer](https://github.com/Beomi/InfiniTransformer)\]\[[infini-mini-transformer](https://github.com/jiahe7ay/infini-mini-transformer)\]\[[megalodon](https://github.com/XuezheMax/megalodon)\]
  - [paper
  - [paper
  - [paper
  - [paper - granite/granite-code-models)\]\[[granite-3.1-language-models](https://github.com/ibm-granite/granite-3.1-language-models)\]
  - [llm-reasoners - groq/g1)\]\[[Open-O1](https://github.com/Open-Source-O1/Open-O1)\]\[[show-me](https://github.com/marlaman/show-me)\]\[[OpenR](https://github.com/openreasoner/openr)\]
  - [Prompt4ReasoningPapers
  - [paper - 3 to o3](https://cameronrwolfe.substack.com/p/llm-scaling-laws)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - LLM](https://github.com/deepseek-ai/DeepSeek-LLM)\]\[[DeepSeek-V2](https://github.com/deepseek-ai/DeepSeek-V2)\]\[[DeepSeek-V3](https://github.com/deepseek-ai/DeepSeek-V3)\]\[[DeepSeek-Coder](https://github.com/deepseek-ai/DeepSeek-Coder)\]
  - [llm-datasets - LLM-Synthetic-Data](https://github.com/wasiahmad/Awesome-LLM-Synthetic-Data)\]
  - [AlpacaEval Leaderboard - lab/alpaca_eval)\]
  - [Chatbot-Arena-Leaderboard - 05-03-arena/)\]\[[FastChat](https://github.com/lm-sys/FastChat)\]\[[arena-hard](https://github.com/lm-sys/arena-hard)\]
  - [lm-evaluation-harness - evals](https://github.com/openai/simple-evals)\]
  - [OpenCompass - Eval](https://github.com/open-compass/GAOKAO-Eval)\]\[[VLMEvalKit](https://github.com/open-compass/VLMEvalKit)\]
  - [llm-colosseum
  - [blog - leaderboard](https://github.com/vectara/hallucination-leaderboard)\]
  - [paper - hallucination-survey)\]
  - [paper - LLM-hallucination)\]\[[Awesome-MLLM-Hallucination](https://github.com/showlab/Awesome-MLLM-Hallucination)\]
  - [paper - 2.0)\]
  - [paper - NLP/factool)\]\[[OlympicArena](https://github.com/GAIR-NLP/OlympicArena)\]\[[FActScore](https://arxiv.org/abs/2305.14251)\]
  - [paper - ai/aiconfig/tree/main/cookbooks/Chain-of-Verification)\]
  - [paper - lab/HallusionBench)\]
  - [paper - MLLM/Woodpecker)\]
  - [paper
  - [paper
  - [paper - deepmind/long-form-factuality)\]
  - [paper - science/RefChecker)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [blog - llm-inference-backends)\]\[[CSE 234: Data Systems for Machine Learning](https://hao-ai-lab.github.io/cse234-w25/index.html)\]\[[CS 598: Systems for Generative AI](https://github.com/fanlai0990/CS598)\]
  - [blog
  - [paper - Quantization-Papers](https://github.com/Zhen-Dong/Awesome-Quantization-Papers)\]\[[awesome-model-quantization](https://github.com/htqin/awesome-model-quantization)\]\[[qllm-eval](https://github.com/thu-nics/qllm-eval)\]
  - [paper
  - [paper
  - [paper
  - [paper - FP4)\]
  - [paper - DASLab/qmoe)\]
  - [paper - han-lab/llm-awq)\]\[[AutoAWQ](https://github.com/casper-hansen/AutoAWQ)\]\[[smoothquant](https://github.com/mit-han-lab/smoothquant)\]\[[omniserve](https://github.com/mit-han-lab/omniserve)\]
  - [paper
  - [paper
  - [paper
  - [paper - IPADS/PowerInfer)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[airllm](https://github.com/lyogavin/airllm)\]\[[PowerInfer-2](https://arxiv.org/abs/2406.06282)\]\[[PowerServe](https://github.com/powerserve-project/PowerServe)\]\[[prima.cpp](https://github.com/Lizonghang/prima.cpp)\]
  - [paper - AILab/flash-attention)\]
  - [paper - AILab/flash-attention)\]
  - [paper - Factory](https://github.com/Zefan-Cai/KVCache-Factory)\]\[[SpecInfer](https://arxiv.org/abs/2305.09781)\]
  - [paper - 2](https://arxiv.org/abs/2406.16858)\]\[[EAGLE-3](https://arxiv.org/abs/2503.01840)\]\[[LLMSpeculativeSampling](https://github.com/feifeibear/LLMSpeculativeSampling)\]\[[Sequoia](https://github.com/Infini-AI-Lab/Sequoia)\]\[[HASS](https://arxiv.org/abs/2408.15766)\]\[[LongSpec](https://github.com/sail-sg/LongSpec)\]
  - [paper - Bench](https://github.com/hemingkx/Spec-Bench)\]
  - [paper
  - [paper - ai-lab/Consistency_LLM)\]\[[LookaheadDecoding](https://github.com/hao-ai-lab/LookaheadDecoding)\]\[[Lookahead](https://github.com/alipay/PainlessInferenceAcceleration)\]
  - [paper
  - [paper
  - [paper - ai/Mooncake)\]\[[ktransformers](https://github.com/kvcache-ai/ktransformers)\]
  - [paper - lab/HallusionBench)\]
  - [paper
  - [paper
  - [paper
  - [paper - deepmind/long-form-factuality)\]
  - [paper - NLP-Group/HippoRAG)\]
  - [paper - NLP/RAG)\]\[[Seven Failure Points When Engineering a Retrieval Augmented Generation System](https://arxiv.org/abs/2401.05856)\]\[[Improving Retrieval Performance in RAG Pipelines with Hybrid Search](https://towardsdatascience.com/improving-retrieval-performance-in-rag-pipelines-with-hybrid-search-c75203c2f2f5)\]\[[15 Advanced RAG Techniques from Pre-Retrieval to Generation](https://www.willowtreeapps.com/guides/advanced-rag-techniques)\]
  - [paper
  - [paper - FiT)\]\[[fastRAG](https://github.com/IntelLabs/fastRAG)\]\[[rag-retrieval-study](https://github.com/intellabs/rag-retrieval-study)\]
  - [paper - of-thoughts)\]
  - [paper - teacher)\]
  - [paper
  - [paper - Planner)\]
  - [paper
  - [paper - Impact-of-Reasoning-Step-Length-on-Large-Language-Models)\]
  - [paper - Edgerunners/Plan-and-Solve-Prompting)\]\[[maestro](https://github.com/Doriandarko/maestro)\]
  - [paper - models/llm_multiagent_debate)\]\[[Multi-Agents-Debate](https://github.com/Skytliang/Multi-Agents-Debate)\]
  - [paper - refine)\]\[[MCT Self-Refine](https://github.com/trotsky1997/MathBlackBox)\]\[[SelFee](https://github.com/kaistAI/SelFee)\]
  - [paper
  - [paper
  - [paper
  - [paper - discover)\]\[[SELF-DISCOVER](https://github.com/kailashsp/SELF-DISCOVER)\]
  - [paper
  - [paper
  - [paper - of-thought-llm)\]\[[SymbCoT](https://github.com/Aiden0526/SymbCoT)\]
  - [paper - EM-pytorch)\]
  - [paper
  - [paper
  - [paper - rpm-bench)\]
  - [paper
  - [paper - husky/Husky-v1)\]\[[QueryAgent](https://github.com/cdhx/QueryAgent)\]\[[OctoTools](https://github.com/octotools/octotools)\]\[[START](https://arxiv.org/abs/2503.04625)\]
  - [paper - System)\]
  - [paper
  - [paper - Shanghai/ICSFSurvey)\]
  - [paper - st)\]
  - [paper - Math](https://arxiv.org/abs/2501.04519)\]\[[Orca 2](https://arxiv.org/abs/2311.11045)\]\[[STaR](https://arxiv.org/abs/2203.14465)\]\[[Quiet-STaR](https://arxiv.org/abs/2403.09629)\]
  - [paper - LLM](https://github.com/bytedance/ABQ-LLM)\]\[[VPTQ](https://github.com/microsoft/VPTQ)\]\[[ppq](https://github.com/OpenPPL/ppq)\]
  - [paper - DASLab/qmoe)\]
  - [paper - han-lab/llm-awq)\]\[[AutoAWQ](https://github.com/casper-hansen/AutoAWQ)\]\[[qserve](https://github.com/mit-han-lab/qserve)\]
  - [paper
  - [paper
  - [paper
  - [paper - IPADS/PowerInfer)\]\[[llama.cpp](https://github.com/ggerganov/llama.cpp)\]\[[airllm](https://github.com/lyogavin/airllm)\]\[[PowerInfer-2](https://arxiv.org/abs/2406.06282)\]
  - [paper - AILab/flash-attention)\]
  - [paper - AILab/flash-attention)\]
  - [paper - AILab/flash-attention)\]
  - [paper - project/vllm)\]\[[FastChat](https://github.com/lm-sys/FastChat)\]\[[ollama](https://github.com/jmorganca/ollama)\]
  - [blog - project/sglang)\]\[[sgl-learning-materials](https://github.com/sgl-project/sgl-learning-materials)\]\[[PD Disaggregation and Large-scale Expert Parallelism](https://lmsys.org/blog/2025-05-05-large-scale-ep/)\]
  - [paper
  - [paper - AI-Lab/Sequoia)\]
  - [paper - Bench](https://github.com/hemingkx/Spec-Bench)\]
  - [paper
  - [paper - ai-lab/Consistency_LLM)\]\[[LookaheadDecoding](https://github.com/hao-ai-lab/LookaheadDecoding)\]\[[Lookahead](https://github.com/alipay/PainlessInferenceAcceleration)\]
  - [paper
  - [paper - serve)\]\[[SARATHI](https://arxiv.org/abs/2308.16369)\]\[[ORCA OSDI 2022](https://www.usenix.org/system/files/osdi22-yu.pdf)\]\[[continuous batching blog](https://www.anyscale.com/blog/continuous-batching-llm-inference)\]\[[vattention](https://github.com/microsoft/vattention)\]
  - [paper
  - [paper - sys/prompt-cache)\]\[[FastServe](https://arxiv.org/abs/2305.05920)\]
  - [paper - ai/Mooncake)\]\[[ktransformers](https://github.com/kvcache-ai/ktransformers)\]
  - [OpenLLM - llm](https://github.com/mlc-ai/mlc-llm)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[open-webui](https://github.com/open-webui/open-webui)\]\[[torchchat](https://github.com/pytorch/torchchat)\]
  - [LMDeploy - AI/LitServe)\]
  - [ChuanhuChatGPT - Next-Web](https://github.com/ChatGPTNextWeb/ChatGPT-Next-Web)\]
  - [blog
  - [paper - Implementation](https://github.com/davidmrau/mixture-of-experts)\]
  - [paper - of-experts](https://github.com/lucidrains/mixture-of-experts)\]
  - [paper - futuredata/megablocks)\]
  - [paper
  - [paper
  - [paper - offloading)\]
  - [paper - inference)\]\[[megablocks-public](https://github.com/mistralai/megablocks-public)\]\[[model](https://huggingface.co/mistralai)\]\[[blog](https://mistral.ai/news/mixtral-of-experts/)\]\[[Chinese-Mixtral-8x7B](https://github.com/HIT-SCIR/Chinese-Mixtral-8x7B)\]\[[Chinese-Mixtral](https://github.com/ymcui/Chinese-Mixtral)\]
  - [paper - ai/DeepSeek-MoE)\]
  - [paper - ai/DeepSeek-V2)\]\[[DeepSeek-V2.5](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)\]
  - [paper - model-merge)\]
  - [paper - into-MoEs)\]
  - [text-generation-inference - quanto](https://github.com/huggingface/optimum-quanto)\]\[[huggingface-inference-toolkit](https://github.com/huggingface/huggingface-inference-toolkit)\]
  - [OpenLLM - llm](https://github.com/mlc-ai/mlc-llm)\]\[[ollama](https://github.com/jmorganca/ollama)\]\[[open-webui](https://github.com/open-webui/open-webui)\]\[[torchchat](https://github.com/pytorch/torchchat)\]
  - [LMDeploy - ai/Mooncake)\]\[[inference](https://github.com/xorbitsai/inference)\]\[[LitServe](https://github.com/Lightning-AI/LitServe)\]
  - [ChuanhuChatGPT - Next-Web](https://github.com/ChatGPTNextWeb/ChatGPT-Next-Web)\]
  - [blog
  - [paper - Implementation](https://github.com/davidmrau/mixture-of-experts)\]
  - [paper - of-experts](https://github.com/lucidrains/mixture-of-experts)\]
  - [paper - futuredata/megablocks)\]
  - [paper
  - [paper
  - [paper - offloading)\]
  - [paper - inference)\]\[[megablocks-public](https://github.com/mistralai/megablocks-public)\]\[[model](https://huggingface.co/mistralai)\]\[[blog](https://mistral.ai/news/mixtral-of-experts/)\]\[[Chinese-Mixtral-8x7B](https://github.com/HIT-SCIR/Chinese-Mixtral-8x7B)\]\[[Chinese-Mixtral](https://github.com/ymcui/Chinese-Mixtral)\]
  - [paper - ai/DeepSeek-MoE)\]
  - [paper - ai/DeepSeek-V2)\]\[[DeepSeek-V2.5](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)\]
  - [paper - ai/ESFT)\]\[[Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts](https://arxiv.org/abs/2408.15664)\]\[[On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models](https://arxiv.org/abs/2501.11873)\]
  - [paper - model-merge)\]
  - [paper - into-MoEs)\]
  - [paper - Survey-on-Mixture-of-Experts)\]
  - [paper
  - [paper
  - [torchtune
  - [mergekit - models](https://huggingface.co/blog/mlabonne/merge-models)\]\[[Model Merging](https://huggingface.co/collections/osanseviero/model-merging-65097893623330a3a51ead66)\]\[[OpenChatKit](https://github.com/togethercomputer/OpenChatKit)\]
  - [paper - ai/studios/code-lora-from-scratch)\]\[[lora](https://github.com/cloneofsimo/lora)\]\[[dora](https://github.com/catid/dora)\]\[[MoRA](https://github.com/kongds/MoRA)\]\[[ziplora-pytorch](https://github.com/mkshing/ziplora-pytorch)\]\[[alpaca-lora](https://github.com/tloen/alpaca-lora)\]\[[lorax](https://github.com/predibase/lorax)\]
  - [paper - LoRA/S-LoRA)\]\[[AdaLoRA](https://github.com/QingruZhang/AdaLoRA)\]\[[LoRAMoE](https://github.com/Ablustrund/LoRAMoE)\]\[[lorahub](https://github.com/sail-sg/lorahub)\]\[[O-LoRA](https://github.com/cmnfriend/O-LoRA)\]\[[qa-lora](https://github.com/yuhuixu1993/qa-lora)\]
  - [paper - GA)\]\[[LoRA-Pro blog](https://kexue.fm/archives/10266)\]\[[dora](https://github.com/catid/dora)\]
  - [paper
  - [paper - research/adapter-bert)\]\[[unify-parameter-efficient-tuning](https://github.com/jxhe/unify-parameter-efficient-tuning)\]
  - [paper - hub/adapters)\]\[[A Survey on LoRA of Large Language Models](https://arxiv.org/abs/2407.11046)\]
  - [paper - Edgerunners/LLM-Adapters)\]
  - [paper - Adapter)\]
  - [paper - Pro)\]
  - [paper - tuning)\]
  - [paper - tuning-v2)\]\[[pet](https://github.com/timoschick/pet)\]\[[PrefixTuning](https://github.com/XiangLi1999/PrefixTuning)\]
  - [paper
  - [paper
  - [paper - Survey-on-Mixture-of-Experts)\]
  - [paper
  - [paper
  - [llama-moe - pytorch](https://github.com/lucidrains/PEER-pytorch)\]\[[GRIN-MoE](https://github.com/microsoft/GRIN-MoE)\]
  - [Megatron-LM - DeepSpeed](https://github.com/deepspeedai/Megatron-DeepSpeed)\]\[[Megatron-DeepSpeed](https://github.com/bigscience-workshop/Megatron-DeepSpeed)\]\[[Pai-Megatron-Patch](https://github.com/alibaba/Pai-Megatron-Patch)\]
  - [torchtune
  - [PEFT - Factory](https://github.com/hiyouga/LLaMA-Factory)\]\[[LMFlow](https://github.com/OptimalScale/LMFlow)\]\[[unsloth](https://github.com/unslothai/unsloth)\]\[[xtuner](https://github.com/InternLM/xtuner)\]\[[MFTCoder](https://github.com/codefuse-ai/MFTCoder)\]\[[llm-foundry](https://github.com/mosaicml/llm-foundry)\]\[[swift](https://github.com/modelscope/swift)\]\[[Liger-Kernel](https://github.com/linkedin/Liger-Kernel)\]
  - [mergekit - models](https://huggingface.co/blog/mlabonne/merge-models)\]\[[Model Merging](https://huggingface.co/collections/osanseviero/model-merging-65097893623330a3a51ead66)\]\[[OpenChatKit](https://github.com/togethercomputer/OpenChatKit)\]
  - [paper - ai/studios/code-lora-from-scratch)\]\[[lora](https://github.com/cloneofsimo/lora)\]\[[dora](https://github.com/catid/dora)\]\[[MoRA](https://github.com/kongds/MoRA)\]\[[ziplora-pytorch](https://github.com/mkshing/ziplora-pytorch)\]\[[alpaca-lora](https://github.com/tloen/alpaca-lora)\]
  - [paper - qlora](https://github.com/htqin/ir-qlora)\]\[[fsdp_qlora](https://github.com/AnswerDotAI/fsdp_qlora)\]
  - [paper - LoRA/S-LoRA)\]\[[AdaLoRA](https://github.com/QingruZhang/AdaLoRA)\]\[[LoRAMoE](https://github.com/Ablustrund/LoRAMoE)\]\[[lorahub](https://github.com/sail-sg/lorahub)\]\[[O-LoRA](https://github.com/cmnfriend/O-LoRA)\]\[[qa-lora](https://github.com/yuhuixu1993/qa-lora)\]
  - [paper - GA)\]\[[LoRA-Pro blog](https://kexue.fm/archives/10266)\]\[[dora](https://github.com/catid/dora)\]
  - [paper - GaLore](https://github.com/VITA-Group/Q-GaLore)\]\[[WeLore](https://github.com/VITA-Group/WeLore)\]
  - [paper
  - [paper - research/adapter-bert)\]\[[unify-parameter-efficient-tuning](https://github.com/jxhe/unify-parameter-efficient-tuning)\]
  - [paper - hub/adapters)\]\[[A Survey on LoRA of Large Language Models](https://arxiv.org/abs/2407.11046)\]
  - [paper - Edgerunners/LLM-Adapters)\]
  - [paper - Adapter)\]
  - [paper - Pro)\]
  - [paper - tuning)\]
  - [paper - tuning-v2)\]\[[pet](https://github.com/timoschick/pet)\]\[[PrefixTuning](https://github.com/XiangLi1999/PrefixTuning)\]
  - [paper - parameter-efficient-tuning)\]
  - [paper
  - [paper
  - [paper - foundation/bitsandbytes)\]
  - [paper - AMP)\]
  - [paper
  - [paper
  - [paper - Factory)\]\[[360-LLaMA-Factory](https://github.com/Qihoo360/360-LLaMA-Factory)\]\[[EasyR1](https://github.com/hiyouga/EasyR1)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - basis-bge-zh/)\]
  - [paper - large-zh)\]\[[gte-Qwen2-7B-instruct](https://huggingface.co/Alibaba-NLP/gte-Qwen2-7B-instruct)\]\[[gte-large-en-v1.5](https://huggingface.co/Alibaba-NLP/gte-large-en-v1.5)\]
  - [BCEmbedding - embedding-base_v1](https://huggingface.co/maidalun1020/bce-embedding-base_v1)\]\[[bce-reranker-base_v1](https://huggingface.co/maidalun1020/bce-reranker-base_v1)\]
  - [CohereV3
  - [paper - ai/instructor-embedding)\]
  - [paper - ai/contrastors)\]
  - [paper
  - [paper
  - [paper - AMP)\]
  - [paper
  - [paper
  - [paper - Factory)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - deepmind/opro)\]
  - [paper - Lab/ATLAS)\]
  - [paper - prompting)\]
  - [paper - team/appl)\]\[[sammo](https://github.com/microsoft/sammo)\]\[[prompt-poet](https://github.com/character-ai/prompt-poet)\]\[[ell](https://github.com/MadcowD/ell)\]
  - [paper
  - [paper
  - [PromptPapers - engineering)\]\[[ChatGPT Prompt Engineering for Developers](https://prompt-engineering.xiniushu.com/)\]\[[Prompt Engineering Guide](https://www.promptingguide.ai/zh)\]\[[k12promptguide](https://www.k12promptguide.com/)\]\[[gpt-prompt-engineer](https://github.com/mshumer/gpt-prompt-engineer)\]\[[awesome-chatgpt-prompts](https://github.com/f/awesome-chatgpt-prompts)\]\[[awesome-chatgpt-prompts-zh](https://github.com/PlexPt/awesome-chatgpt-prompts-zh)\]
  - [paper - demonstrations)\]
  - [paper
  - [paper - machines/pal)\]
  - [paper - instruction-learning)\]
  - [paper
  - [paper - human-preferences)\]
  - [paper - from-feedback)\]
  - [paper
  - [paper - li/Instruction-Tuning-Survey)\]
  - [paper
  - [paper
  - [paper - KGLLM/RAG-Survey)\]\[[Modular RAG](https://arxiv.org/abs/2407.21059)\]
  - [paper - Survey)\]
  - [paper - Augmented Generation for Natural Language Processing: A Survey](https://arxiv.org/abs/2407.13193)\]\[[A Survey on RAG Meeting LLMs](https://arxiv.org/abs/2405.06211)\]
  - [paper
  - [paper - token-nq)\]\[[docs](https://huggingface.co/docs/transformers/main/model_doc/rag)\]\[[FAISS](https://github.com/facebookresearch/faiss)\]
  - [paper - rag)\]\[[CRAG](https://github.com/HuskyInSalt/CRAG)\]\[[Golden-Retriever](https://arxiv.org/abs/2408.00798)\]
  - [paper
  - [paper
  - [paper - pytorch](https://github.com/lucidrains/RETRO-pytorch)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - community/GoMate)\]
  - [paper
  - [paper
  - [paper - isf)\]
  - [paper - RAG)\]\[[Adaptive-RAG](https://github.com/starsuzi/Adaptive-RAG)\]\[[Advanced RAG 11: Query Classification and Refinement](https://ai.gopubby.com/advanced-rag-11-query-classification-and-refinement-2aec79f4140b)\]
  - [paper - ecosystem-engineering/Blended-RAG)\]\[[infinity](https://github.com/infiniflow/infinity)\]
  - [paper - new)\]\[[ind_kdd_2024/](https://www.biendata.net/competition/ind_kdd_2024/)\]\[[KDD2024-WhoIsWho-Top3](https://github.com/yanqiangmiffy/KDD2024-WhoIsWho-Top3)\]
  - [link
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [LlamaIndex - llama/llama_deploy)\]\[[A Cheat Sheet and Some Recipes For Building Advanced RAG](https://blog.llamaindex.ai/a-cheat-sheet-and-some-recipes-for-building-advanced-rag-803a9d94c41b)\]\[[Fine-Tuning Embeddings for RAG with Synthetic Data](https://www.llamaindex.ai/blog/fine-tuning-embeddings-for-rag-with-synthetic-data-e534409a3971)\]
  - [paper - NLP/RAG)\]\[[Seven Failure Points When Engineering a Retrieval Augmented Generation System](https://arxiv.org/abs/2401.05856)\]\[[Improving Retrieval Performance in RAG Pipelines with Hybrid Search](https://towardsdatascience.com/improving-retrieval-performance-in-rag-pipelines-with-hybrid-search-c75203c2f2f5)\]\[[15 Advanced RAG Techniques from Pre-Retrieval to Generation](https://www.willowtreeapps.com/guides/advanced-rag-techniques)\]
  - [paper
  - [paper
  - [paper - new)\]\[[ind_kdd_2024/](https://www.biendata.net/competition/ind_kdd_2024/)\]\[[KDD2024-WhoIsWho-Top3](https://github.com/yanqiangmiffy/KDD2024-WhoIsWho-Top3)\]
  - [paper - io/memobase)\]\[[A-MEM](https://arxiv.org/abs/2502.12110)\]\[[cognee](https://github.com/topoteretes/cognee)\]
  - [link
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [LangChain - rag/)\]\[[LangChain Hub](https://smith.langchain.com/hub)\]\[[langgraph](https://github.com/langchain-ai/langgraph)\]
  - [LlamaIndex - llama/llama_deploy)\]\[[A Cheat Sheet and Some Recipes For Building Advanced RAG](https://blog.llamaindex.ai/a-cheat-sheet-and-some-recipes-for-building-advanced-rag-803a9d94c41b)\]\[[Fine-Tuning Embeddings for RAG with Synthetic Data](https://www.llamaindex.ai/blog/fine-tuning-embeddings-for-rag-with-synthetic-data-e534409a3971)\]
  - [chatgpt-retrieval-plugin - LLM-RAG-Application](https://github.com/lizhe2004/Awesome-LLM-RAG-Application)\]
  - [ragas - community/rageval)\]
  - [vimGPT
  - [QAnything - llm](https://github.com/Mintplex-Labs/anything-llm)\]\[[FastGPT](https://github.com/labring/FastGPT)\]\[[mem0](https://github.com/mem0ai/mem0)\]\[[Memary](https://github.com/kingjulio8238/Memary)\]
  - [ragas
  - [vimGPT
  - [QAnything - llm](https://github.com/Mintplex-Labs/anything-llm)\]\[[FastGPT](https://github.com/labring/FastGPT)\]\[[mem0](https://github.com/mem0ai/mem0)\]\[[Memary](https://github.com/kingjulio8238/Memary)\]
  - [paper - cellar/beir)\]\[[AIR-Bench](https://github.com/AIR-Bench/AIR-Bench)\]
  - [paper - benchmark/mteb)\]\[[leaderboard](https://huggingface.co/spaces/mteb/leaderboard)\]\[[MMTEB](https://arxiv.org/abs/2502.13595)\]\[[MIEB](https://arxiv.org/abs/2504.10471)\]
  - [paper - transformers)\]\[[model](https://huggingface.co/sentence-transformers)\]\[[vec2text](https://github.com/jxmorris12/vec2text)\]
  - [paper - nlp/SimCSE)\]\[[AnglE ACL 2024](https://github.com/SeanLee97/AnglE)\]
  - [paper - text-and-code-embeddings)\]
  - [paper
  - [m3e-base - embedding-v2](https://huggingface.co/lier007/xiaobu-embedding-v2)\]\[[stella_en_1.5B_v5](https://huggingface.co/dunzhang/stella_en_1.5B_v5)\]
  - [paper - large-zh)\]\[[gte-Qwen2-7B-instruct](https://huggingface.co/Alibaba-NLP/gte-Qwen2-7B-instruct)\]\[[gte-large-en-v1.5](https://huggingface.co/Alibaba-NLP/gte-large-en-v1.5)\]
  - [BCEmbedding - embedding-base_v1](https://huggingface.co/maidalun1020/bce-embedding-base_v1)\]\[[bce-reranker-base_v1](https://huggingface.co/maidalun1020/bce-reranker-base_v1)\]
  - [paper - STaR](https://arxiv.org/abs/2403.09629)\]
  - [paper
  - [paper - Efficient Model Ladders](https://arxiv.org/abs/2412.04403)\]\[[Inference Scaling Laws](https://arxiv.org/abs/2408.00724)\]\[[Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models](https://arxiv.org/abs/2501.12370)\]\[[Distillation Scaling Laws](https://arxiv.org/abs/2502.08606)\]
  - [paper - aim)\]
  - [paper
  - [paper
  - [paper - zhu.com/part-2-grade-school-math/part-2-1)\]
  - [paper
  - [paper
  - [paper - rep)\]
  - [paper
  - [blog - interpretability)\]\[[transformer-debugger](https://github.com/openai/transformer-debugger)\]
  - [blog
  - [blog - thoughts-language-model](https://www.anthropic.com/research/tracing-thoughts-language-model)\]
  - [paper
  - [paper - transparency-tool)\]\[[LLM-Microscope](https://arxiv.org/abs/2502.15007)\]
  - [paper - explainer)\]\[[demo](https://poloclub.github.io/transformer-explainer)\]
  - [paper - dynamics)\]
  - [Transformer Circuits Thread - chapter1-transformer-interp.streamlit.app)\]\[[Awesome-Interpretability-in-Large-Language-Models](https://github.com/ruizheliUOA/Awesome-Interpretability-in-Large-Language-Models)\]\[[TransformerLens](https://github.com/TransformerLensOrg/TransformerLens)\]\[[inseq](https://github.com/inseq-team/inseq)\]
  - [paper
  - [paper
  - [paper
  - [Awesome-Chinese-LLM - LLMs-In-China](https://github.com/wgwang/awesome-LLMs-In-China)\]\[[awesome-LLM-resourses](https://github.com/WangRongsheng/awesome-LLM-resourses)\]
  - [paper
  - [paper - 130B)\]
  - [paper
  - [paper - cellar/beir)\]
  - [paper - benchmark/mteb)\]\[[leaderboard](https://huggingface.co/spaces/mteb/leaderboard)\]
  - [paper - transformers)\]\[[model](https://huggingface.co/sentence-transformers)\]\[[vec2text](https://github.com/jxmorris12/vec2text)\]
  - [paper - nlp/SimCSE)\]\[[AnglE ACL 2024](https://github.com/SeanLee97/AnglE)\]
  - [paper - text-and-code-embeddings)\]
  - [paper
  - [paper
  - [paper - long.194/)\]\[[llm_reranker](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/llm_reranker)\]\[[FlagEmbedding](https://github.com/FlagOpen/FlagEmbedding)\]
  - [paper - NLP/llm2vec)\]\[[VLM2Vec](https://github.com/TIGER-AI-Lab/VLM2Vec)\]\[[Gemini Embedding](https://arxiv.org/abs/2503.07891)\]
  - [paper - Embed-v1)\]
  - [paper
  - [JamAIBase - Retrieval](https://github.com/NovaSearch-Team/RAG-Retrieval)\]\[[model2vec](https://github.com/MinishLab/model2vec)\]
  - [paper - of-thought-hub](https://github.com/FranxYao/chain-of-thought-hub)\]
  - [paper
  - [paper - takeshi188/zero_shot_cot)\]
  - [paper - science/auto-cot)\]
  - [paper - science/mm-cot)\]
  - [paper
  - [paper
  - [paper - REACT)\]
  - [paper - nlp/tree-of-thought-llm)\]\[[Plug in and Play Implementation](https://github.com/kyegomez/tree-of-thoughts)\]\[[tree-of-thought-prompting](https://github.com/dave1010/tree-of-thought-prompting)\]
  - [paper - of-thoughts)\]
  - [paper - ai/cumulative-reasoning)\]\[[On the Diagram of Thought](https://arxiv.org/abs/2409.10038)\]
  - [paper - Of-Thoughts)\]
  - [paper - of-Thoughts-XoT)\]
  - [paper - of-thoughts)\]
  - [paper - teacher)\]
  - [paper
  - [paper - Planner)\]
  - [paper - org/llm-reasoners)\]\[[LLM Reasoners COLM 2024](https://arxiv.org/abs/2404.05221)\]\[[AgentGen KDD 2025](https://arxiv.org/abs/2408.00764)\]
  - [paper
  - [paper - AI-Lab/Program-of-Thoughts)\]
  - [paper
  - [paper
  - [paper - Impact-of-Reasoning-Step-Length-on-Large-Language-Models)\]
  - [paper - Edgerunners/Plan-and-Solve-Prompting)\]\[[maestro](https://github.com/Doriandarko/maestro)\]
  - [paper - models/llm_multiagent_debate)\]\[[Multi-Agents-Debate](https://github.com/Skytliang/Multi-Agents-Debate)\]
  - [paper - refine)\]\[[MCT Self-Refine](https://github.com/trotsky1997/MathBlackBox)\]
  - [paper
  - [paper
  - [paper
  - [paper - discover)\]\[[SELF-DISCOVER](https://github.com/kailashsp/SELF-DISCOVER)\]
  - [paper
  - [paper
  - [paper - RL/PRIME)\]
  - [paper - of-thought-llm)\]\[[SymbCoT](https://github.com/Aiden0526/SymbCoT)\]
  - [paper - EM-pytorch)\]
  - [paper
  - [paper - rpm-bench)\]
  - [paper
  - [paper - husky/Husky-v1)\]
  - [paper - System)\]
  - [paper
  - [paper - Shanghai/ICSFSurvey)\]
  - [llm-reasoners - groq/g1)\]
  - [Prompt4ReasoningPapers
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - AI/Telechat)\]\[[TeleChat2](https://github.com/Tele-AI/TeleChat2)\]\[[Tele-FLM Technical Report](https://arxiv.org/abs/2404.16645)\]\[[Tele-FLM](https://huggingface.co/CofeAI/Tele-FLM)\]\[[Tele-FLM-1T](https://huggingface.co/CofeAI/Tele-FLM-1T)\]
  - [blog
  - [blog
  - [paper
  - [paper - transparency-tool)\]
  - [paper - explainer)\]\[[demo](https://poloclub.github.io/transformer-explainer)\]
  - [paper - dynamics)\]
  - [Transformer Circuits Thread - chapter1-transformer-interp.streamlit.app)\]\[[Awesome-Interpretability-in-Large-Language-Models](https://github.com/ruizheliUOA/Awesome-Interpretability-in-Large-Language-Models)\]\[[TransformerLens](https://github.com/TransformerLensOrg/TransformerLens)\]\[[inseq](https://github.com/inseq-team/inseq)\]
  - [paper
  - [paper
  - [paper
  - [Awesome-Chinese-LLM - LLMs-In-China](https://github.com/wgwang/awesome-LLMs-In-China)\]\[[awesome-LLM-resourses](https://github.com/WangRongsheng/awesome-LLM-resourses)\]
  - [paper
  - [paper - 130B/)\]
  - [paper - 6B](https://github.com/THUDM/ChatGLM-6B)\]\[[ChatGLM2-6B](https://github.com/THUDM/ChatGLM2-6B)\]\[[ChatGLM3](https://github.com/THUDM/ChatGLM3)\]\[[GLM-4](https://github.com/THUDM/GLM-4)\]\[[AgentTuning](https://github.com/THUDM/AgentTuning)\]\[[AlignBench](https://github.com/THUDM/AlignBench)\]
  - [paper
  - [paper - Agent](https://github.com/QwenLM/Qwen-Agent)\]\[[AutoIF](https://github.com/QwenLM/AutoIF)\]\[[modeling_qwen2.py](https://github.com/huggingface/transformers/blob/main/src/transformers/models/qwen2/modeling_qwen2.py)\]
  - [paper - ai/Yi)\]\[[Yi-1.5](https://github.com/01-ai/Yi-1.5)\]
  - [paper - AI/Telechat)\]\[[Tele-FLM Technical Report](https://arxiv.org/abs/2404.16645)\]\[[Tele-FLM](https://huggingface.co/CofeAI/Tele-FLM)\]\[[Tele-FLM-1T](https://huggingface.co/CofeAI/Tele-FLM-1T)\]
  - [paper
  - [paper - GSAI/Llama-3-SynE)\]
  - [MiniCPM - MoE](https://github.com/SkyworkAI/Skywork-MoE)\]\[[Orion](https://github.com/OrionStarAI/Orion)\]\[[BELLE](https://github.com/LianjiaTech/BELLE)\]\[[Yuan-2.0](https://github.com/IEIT-Yuan/Yuan-2.0)\]\[[Yuan2.0-M32](https://github.com/IEIT-Yuan/Yuan2.0-M32)\]\[[Fengshenbang-LM](https://github.com/IDEA-CCNL/Fengshenbang-LM)\]\[[Index-1.9B](https://github.com/bilibili/Index-1.9B)\]\[[Aquila2](https://github.com/FlagAI-Open/Aquila2)\]
  - [LlamaFamily/Llama-Chinese - AI/Chinese-Llama-2-7b](https://github.com/LinkSoul-AI/Chinese-Llama-2-7b)\]\[[llama3-Chinese-chat](https://github.com/CrazyBoyM/llama3-Chinese-chat)\]\[[phi3-Chinese](https://github.com/CrazyBoyM/phi3-Chinese)\]\[[LLM-Chinese](https://github.com/CrazyBoyM/LLM-Chinese)\]\[[Llama3-Chinese-Chat](https://github.com/Shenzhi-Wang/Llama3-Chinese-Chat)\]\[[llama3-chinese](https://github.com/seanzhang-zhichen/llama3-chinese)\]
  - [Firefly - chitchat](https://github.com/yangjianxin1/GPT2-chitchat)\]
  - [paper - CoT)\]
  - [paper - Agent](https://github.com/QwenLM/Qwen-Agent)\]\[[AutoIF](https://github.com/QwenLM/AutoIF)\]
  - [paper - ai/Yi)\]\[[Yi-1.5](https://github.com/01-ai/Yi-1.5)\]
  - [paper
  - [paper - LLM](https://github.com/deepseek-ai/DeepSeek-LLM)\]\[[DeepSeek-V2](https://github.com/deepseek-ai/DeepSeek-V2)\]\[[DeepSeek-Coder](https://github.com/deepseek-ai/DeepSeek-Coder)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Copilot/OS-Genesis)\]\[[homepage](https://qiushisun.github.io/OS-Genesis-Home/)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper - PaLM)\]
  - [paper - nlp/deita)\]
  - [paper - nlp/deita)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - deepmind/synthid-text)\]
  - [paper - NLP-SG/CoI-Agent)\]\[[AI-Researcher](https://github.com/HKUDS/AI-Researcher)\]\[[Researcher](https://github.com/zhu-minjun/Researcher)\]
  - [paper - PaLM)\]
  - [paper - nlp/ProLong)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - han-lab/smoothquant)\]\[[ABQ-LLM](https://github.com/bytedance/ABQ-LLM)\]\[[VPTQ](https://github.com/microsoft/VPTQ)\]\[[ppq](https://github.com/OpenPPL/ppq)\]
  - [ggml - ai/ktransformers)\]\[[gpt-fast](https://github.com/pytorch-labs/gpt-fast)\]\[[fastllm](https://github.com/ztxz16/fastllm)\]\[[CTranslate2](https://github.com/OpenNMT/CTranslate2)\]\[[ipex-llm](https://github.com/intel-analytics/ipex-llm)\]\[[rtp-llm](https://github.com/alibaba/rtp-llm)\]\[[KsanaLLM](https://github.com/pcg-mlp/KsanaLLM)\]\[[ppl.nn](https://github.com/OpenPPL/ppl.nn)\]\[[ZhiLight](https://github.com/zhihu/ZhiLight)\]\[[WeChat-TFCC](https://github.com/Tencent/WeChat-TFCC)\]\[[ncnn](https://github.com/Tencent/ncnn)\]\[[llumnix](https://github.com/AlibabaPAI/llumnix)\]\[[dash-infer](https://github.com/modelscope/dash-infer)\]\[[truss](https://github.com/basetenlabs/truss)\]\[[chitu](https://github.com/thu-pacman/chitu)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [paper - NLP/O1-Journey)\]\[[O1 Replication Journey -- Part 2](https://arxiv.org/abs/2411.16489)\]\[[O1 Replication Journey -- Part 3](https://arxiv.org/abs/2501.06458)\]\[[Scaling of Search and Learning](https://arxiv.org/abs/2412.14135)\]\[[Revisiting the Test-Time Scaling of o1-like Models](https://arxiv.org/abs/2502.12215)\]\[[LLaMA-O1](https://github.com/SimpleBerry/LLaMA-O1)\]\[[Marco-o1](https://github.com/AIDC-AI/Marco-o1)\]\[[QwQ-32B](https://qwenlm.github.io/blog/qwq-32b)\]\[[qvq-72b-preview](https://qwenlm.github.io/blog/qvq-72b-preview)\]\[[QwQ-Max-Preview](https://qwenlm.github.io/blog/qwq-max-preview)\]\[[SkyThought](https://github.com/NovaSky-AI/SkyThought)\]\[[On the Overthinking of o1-Like LLMs](https://arxiv.org/abs/2412.21187)\]\[[On the Underthinking of o1-Like LLMs](https://arxiv.org/abs/2501.18585)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [Seed-Coder
  - [blog - ai/Kevin-32B)\]\[[deepwiki](https://cognition.ai/blog/deepwiki)\]
  - [paper
  - [paper - PaLM)\]
  - [paper - recommenders)\]\[[ExFM](https://arxiv.org/abs/2502.17494)\]\[[KuaiFormer](https://arxiv.org/abs/2411.10057)\]\[[Transformers4Rec](https://github.com/NVIDIA-Merlin/Transformers4Rec)\]\[[torchrec](https://github.com/pytorch/torchrec)\]\[[LlamaRec](https://github.com/Yueeeeeeee/LlamaRec)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - YuanGroup/LLaVA-CoT)\]\[[internvl2.0_mpo](https://github.com/OpenGVLab/InternVL/tree/main/internvl_chat/shell/internvl2.0_mpo)\]\[[Insight-V](https://github.com/dongyh20/Insight-V)\]\[[VisVM](https://arxiv.org/abs/2412.03704)\]\[[Mulberry](https://github.com/HJYao00/Mulberry)\]\[[AR-MCTS](https://arxiv.org/abs/2412.14835)\]\[[Virgo](https://arxiv.org/abs/2501.01904)\]\[[Virgo](https://github.com/RUCAIBox/Virgo)\]\[[LlamaV-o1](https://arxiv.org/abs/2501.06186)\]\[[Image-Generation-CoT](https://github.com/ZiyuGuo99/Image-Generation-CoT)\]\[[Awesome-MLLM-Reasoning](https://github.com/WillDreamer/Awesome-MLLM-Reasoning)\]\[[Multimodal Chain-of-Thought Reasoning](https://arxiv.org/abs/2503.12605)\]\[[A Survey on Large Multimodal Reasoning Models](https://arxiv.org/abs/2505.04921)\]
  - [paper - AI/RAGEN)\]\[[VAGEN](https://github.com/RAGEN-AI/VAGEN)\]\[[Agent-R1](https://github.com/0russwest0/Agent-R1)\]\[[OpenManus-RL](https://github.com/OpenManus/OpenManus-RL)\]\[[SWEET-RL](https://arxiv.org/abs/2503.15478)\]\[[APIGen-MT](https://arxiv.org/abs/2504.03601)\]\[[ARTIST](https://arxiv.org/abs/2505.01441)\]\[[SkyRL](https://github.com/NovaSky-AI/SkyRL)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper
  - [paper
  - [paper - PaLM)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]
  - [paper - project/bigcodebench)\]\[[LiveCodeBench](https://github.com/LiveCodeBench/LiveCodeBench)\]\[[evalplus](https://github.com/evalplus/evalplus)\]
  - [paper - PaLM)\]
  - [alphafold3 - deepmind/alphafold)\]\[[RoseTTAFold](https://github.com/RosettaCommons/RoseTTAFold)\]\[[RFdiffusion](https://github.com/RosettaCommons/RFdiffusion)\]
  - [openfold - pytorch](https://github.com/lucidrains/alphafold3-pytorch)\]\[[Protenix](https://github.com/bytedance/Protenix)\]\[[AlphaFold3](https://github.com/kyegomez/AlphaFold3)\]\[[Ligo-Biosciences/AlphaFold3](https://github.com/Ligo-Biosciences/AlphaFold3)\]\[[LucaOne](https://github.com/LucaOne/LucaOne)\]\[[esm](https://github.com/evolutionaryscale/esm)\]\[[AlphaPPImd](https://github.com/AspirinCode/AlphaPPImd)\]\[[visual-med-alpaca](https://github.com/cambridgeltl/visual-med-alpaca)\]\[[chai-lab](https://github.com/chaidiscovery/chai-lab)\]\[[evo](https://github.com/evo-design/evo)\]\[[evo2](https://github.com/ArcInstitute/evo2)\]\[[AIRS](https://github.com/divelab/AIRS)\]\[[OpenBioMed](https://github.com/PharMolix/OpenBioMed)\]
  - [paper - Pruner)\]\[[Awesome-Efficient-LLM](https://github.com/horseee/Awesome-Efficient-LLM)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]
  - [PDF-Extract-Kit - tech/colpali)\]\[[localGPT-Vision](https://github.com/PromtEngineer/localGPT-Vision)\]\[[mPLUG-DocOwl](https://github.com/X-PLUG/mPLUG-DocOwl)\]\[[nv-ingest](https://github.com/NVIDIA/nv-ingest)\]
  - [paper
  - [paper - coai/CharacterGLM-6B)\]
  - [paper - o](https://github.com/OpenBMB/MiniCPM-o)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Copilot/OS-Copilot)\]\[[OS-Atlas](https://github.com/OS-Copilot/OS-Atlas)\]\[[OS-Genesis](https://github.com/OS-Copilot/OS-Genesis)\]\[[SeeClick](https://github.com/njucckevin/SeeClick)\]\[[WindowsAgentArena](https://github.com/microsoft/WindowsAgentArena)\]
  - [paper - Agent-Survey)\]\[[LLM-Agent-Paper-Digest](https://github.com/XueyangFeng/LLM-Agent-Paper-Digest)\]\[[awesome-lifelong-llm-agent](https://github.com/qianlima-lab/awesome-lifelong-llm-agent)\]
  - [paper - PLUG/MobileAgent)\]\[[Mobile-Agent-v2](https://arxiv.org/abs/2406.01014)\]\[[LiMAC](https://arxiv.org/abs/2410.17883)\]\[[Mobile-Agent-E](https://arxiv.org/abs/2501.11733)\]\[[Mobile-Agent-V](https://arxiv.org/abs/2502.17110)\]\[[PC-Agent](https://arxiv.org/abs/2502.14282)\]
  - [paper - pilot](https://github.com/Pythagora-io/gpt-pilot)\]\[[Scaling Large-Language-Model-based Multi-Agent Collaboration](https://arxiv.org/abs/2406.07155)\]\[[ProactiveAgent](https://github.com/thunlp/ProactiveAgent)\]\[[FilmAgent](https://github.com/HITsz-TMG/FilmAgent)\]
  - [paper - 2)\]\[[RT-H: Action Hierarchies Using Language](https://arxiv.org/abs/2403.01823)\]\[[RoboMamba](https://arxiv.org/abs/2406.04339)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper
  - [paper - oval/storm)\]\[[Co-STORM EMNLP 2024](https://www.arxiv.org/abs/2408.15232)\]\[[WikiChat](https://github.com/stanford-oval/WikiChat)\]\[[kiroku](https://github.com/cnunescoelho/kiroku)\]\[[gpt-researcher](https://github.com/assafelovic/gpt-researcher)\]\[[OmniThink](https://github.com/zjunlp/OmniThink)\]
  - [paper - SQL)\]\[[vanna](https://github.com/vanna-ai/vanna)\]\[[NL2SQL_Handbook](https://github.com/HKUSTDial/NL2SQL_Handbook)\]\[[Spider2](https://github.com/xlang-ai/Spider2)\]\[[WrenAI](https://github.com/Canner/WrenAI)\]
  - [paper
  - [paper - futuredata/ColBERT)\]\[[RAGatouille](https://github.com/AnswerDotAI/RAGatouille)\]\[[rerankers](https://github.com/AnswerDotAI/rerankers)\]\[[A Reproducibility Study of PLAID](https://arxiv.org/abs/2404.14989)\]\[[Jina-ColBERT-v2](https://arxiv.org/abs/2408.16672)\]
  - [paper
  - [paper - II](https://github.com/FreedomIntelligence/HuatuoGPT-II)\]\[[Medical_NLP](https://github.com/FreedomIntelligence/Medical_NLP)\]\[[Zhongjing](https://github.com/SupritYoung/Zhongjing)\]\[[MedicalGPT](https://github.com/shibing624/MedicalGPT)\]\[[huatuogpt-vision](https://github.com/freedomintelligence/huatuogpt-vision)\]\[[Chain-of-Diagnosis](https://github.com/FreedomIntelligence/Chain-of-Diagnosis)\]\[[BianCang](https://github.com/QLU-NLP/BianCang)\]\[[Llama3-OpenBioLLM-70B](https://huggingface.co/aaditya/Llama3-OpenBioLLM-70B)\]\[[CareGPT](https://github.com/WangRongsheng/CareGPT)\]\[[HealthGPT](https://github.com/DCDmllm/HealthGPT)\]
  - [paper - PaLM)\]
  - [recommenders - algorithm)\]\[[Awesome-RSPapers](https://github.com/RUCAIBox/Awesome-RSPapers)\]\[[RecBole](https://github.com/RUCAIBox/RecBole)\]\[[RecSysDatasets](https://github.com/RUCAIBox/RecSysDatasets)\]\[[LLM4Rec-Awesome-Papers](https://github.com/WLiK/LLM4Rec-Awesome-Papers)\]\[[Awesome-LLM-for-RecSys](https://github.com/CHIANGEL/Awesome-LLM-for-RecSys)\]\[[Awesome-LLM4RS-Papers](https://github.com/nancheng58/Awesome-LLM4RS-Papers)\]\[[DA-CL-4Rec](https://github.com/KingGugu/DA-CL-4Rec)\]\[[ReChorus](https://github.com/THUwangcy/ReChorus)\]\[[Transformers4Rec](https://github.com/NVIDIA-Merlin/Transformers4Rec)\]\[[torchrec](https://github.com/pytorch/torchrec)\]
  - [fun-rec - RecommenderSystem](https://github.com/zhongqiangwu960812/AI-RecommenderSystem)\]\[[RecSysPapers](https://github.com/tangxyw/RecSysPapers)\]\[[Algorithm-Practice-in-Industry](https://github.com/Doragd/Algorithm-Practice-in-Industry)\]\[[AlgoNotes](https://github.com/shenweichen/AlgoNotes)\]\[[torch-rechub](https://github.com/datawhalechina/torch-rechub)\]
  - [Awesome-LLM-System-Papers - production-llm](https://github.com/jihoo-kim/awesome-production-llm)\]\[[Awesome-MLSys-Blogger](https://mlsys-learner-resources.github.io/Awesome-MLSys-Blogger)\]\[[Awesome-ML-SYS-Tutorial](https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial)\]\[[how-to-learn-deep-learning-framework](https://github.com/BBuf/how-to-learn-deep-learning-framework)\]\[[PKU-DAIR/Starter-Guide](https://github.com/PKU-DAIR/Starter-Guide/tree/main/docs/systems)\]\[[open-infra-index](https://github.com/deepseek-ai/open-infra-index)\]\[[CutlassAcademy](https://github.com/MekkCyber/CutlassAcademy)\]\[[Triton-Puzzles](https://github.com/srush/Triton-Puzzles)\]\[[CUDA-Learn-Notes](https://github.com/xlite-dev/CUDA-Learn-Notes)\]
  - [paper - to-strong)\]\[[weak-to-strong-deception](https://github.com/keven980716/weak-to-strong-deception)\]\[[Evolving Alignment via Asymmetric Self-Play](https://arxiv.org/abs/2411.00062)\]\[[easy-to-hard](https://github.com/Edward-Sun/easy-to-hard)\]\[[Debate Helps Weak-to-Strong Generalization](https://arxiv.org/abs/2501.13124)\]\[[Detecting misbehavior in frontier reasoning models](https://openai.com/index/chain-of-thought-monitoring)\]
  - [paper - 3's Context Ten-Fold Overnight](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/Long_LLM/longllm_qlora)\]\[[From 128K to 4M](https://arxiv.org/abs/2504.06214)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - project/vllm)\]\[[FastChat](https://github.com/lm-sys/FastChat)\]\[[ollama](https://github.com/jmorganca/ollama)\]
  - [blog - project/sglang)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - ai/flashinfer)\]
  - [paper - serve)\]\[[SARATHI](https://arxiv.org/abs/2308.16369)\]\[[ORCA OSDI 2022](https://www.usenix.org/system/files/osdi22-yu.pdf)\]\[[continuous batching blog](https://www.anyscale.com/blog/continuous-batching-llm-inference)\]
  - [paper - sys/prompt-cache)\]\[[FastServe](https://arxiv.org/abs/2305.05920)\]
  - [paper - ai/ESFT)\]\[[Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts](https://arxiv.org/abs/2408.15664)\]\[[On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models](https://arxiv.org/abs/2501.11873)\]
  - [paper
  - [paper - NLPIR/FlashRAG)\]\[[FlashRAG-Paddle](https://github.com/RUC-NLPIR/FlashRAG-Paddle)\]\[[Auto-RAG](https://github.com/ictnlp/Auto-RAG)\]\[[flexrag](https://github.com/ictnlp/flexrag)\]\[[LevelRAG](https://github.com/ictnlp/LevelRAG)\]
  - [paper - io/memobase)\]
  - [paper - Chunking](https://github.com/IAAR-Shanghai/Meta-Chunking)\]\[[chonkie](https://github.com/chonkie-inc/chonkie)\]\[[chonky](https://github.com/mirth/chonky)\]\[[PageIndex](https://github.com/VectifyAI/PageIndex)\]
  - [paper - augmented-visual-question-answering)\]\[[ViDoRAG](https://github.com/Alibaba-NLP/ViDoRAG)\]\[[Gurubase](https://github.com/Gurubase/gurubase)\]
  - [chatgpt-retrieval-plugin - LLM-RAG-Application](https://github.com/lizhe2004/Awesome-LLM-RAG-Application)\]
  - [paper - embeddings-v2](https://huggingface.co/jinaai/jina-embeddings-v2-base-en)\]\[[jina-reranker-v2](https://huggingface.co/jinaai/jina-reranker-v2-base-multilingual)\]\[[reader-lm-1.5b](https://huggingface.co/jinaai/reader-lm-1.5b)\]\[[ReaderLM-v2](https://huggingface.co/jinaai/ReaderLM-v2)\]\[[pe_rank](https://github.com/liuqi6777/pe_rank)\]\[[Jina CLIP](https://arxiv.org/abs/2405.20204)\]\[[jina-embeddings-v3](https://arxiv.org/abs/2409.10173)\]
  - [paper - MCTS)\]\[[llm-mcts](https://github.com/1989Ryan/llm-mcts)\]\[[LightZero](https://github.com/opendilab/LightZero)\]\[[Agent-R](https://github.com/bytedance/Agent-R)\]\[[atom](https://github.com/qixucen/atom)\]
  - [paper - evolution-pytorch](https://github.com/lucidrains/mind-evolution-pytorch)\]
  - [paper - k1.5)\]\[[demystify-long-cot](https://github.com/eddycmu/demystify-long-cot)\]
  - [paper - lab/stanford_alpaca)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - 1)\]\[[Moto](https://github.com/TencentARC/Moto)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper - Scientist)\]\[[AI-Scientist-v2](https://github.com/SakanaAI/AI-Scientist-v2)\]\[[AI-Scientist-ICLR2025-Workshop-Experiment](https://github.com/SakanaAI/AI-Scientist-ICLR2025-Workshop-Experiment)\]\[[Zochi Technical Report](https://www.intology.ai/blog/zochi-tech-report)\]\[[Social_Science](https://github.com/RenqiChen/Social_Science)\]\[[SocialAgent](https://github.com/FudanDISC/SocialAgent)\]\[[game_theory](https://github.com/Wenyueh/game_theory)\]\[[hypothesis-generation](https://github.com/ChicagoHAI/hypothesis-generation)\]\[[Towards an AI co-scientist](https://arxiv.org/abs/2502.18864)\]\[[CodeScientist](https://github.com/allenai/codescientist)\]\[[TinyScientist](https://github.com/ulab-uiuc/tiny-scientist)\]
  - [paper - MCTS](https://github.com/DIRECT-BIT/SRA-MCTS)\]
  - [paper - CR](https://arxiv.org/abs/2501.15134)\]
  - [paper - BJTU/O1-CODER)\]
  - [paper - datasets](https://github.com/virattt/financial-datasets)\]\[[LLMs-in-Finance](https://github.com/hananedupouy/LLMs-in-Finance)\]
  - [paper - Modal Search](https://arxiv.org/abs/2408.14698)\]\[[M3DocRAG](https://arxiv.org/abs/2411.04952)\]\[[Visualized BGE](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/visual_bge)\]\[[OmniSearch](https://github.com/Alibaba-NLP/OmniSearch)\]\[[StreamRAG](https://github.com/video-db/StreamRAG)\]\[[VisRAG](https://github.com/OpenBMB/VisRAG)\]
  - [paper - PaLM)\]
  - [paper - 7B-8k](https://huggingface.co/apple/DCLM-7B-8k)\]\[[data-agora](https://github.com/neulab/data-agora)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - Efficient Model Ladders](https://arxiv.org/abs/2412.04403)\]
  - [OpenAI Blog - auto-interp](https://github.com/EleutherAI/sae-auto-interp)\]\[[multimodal-sae](https://github.com/EvolvingLMMs-Lab/multimodal-sae)\]\[[Language-Model-SAEs](https://github.com/OpenMOSS/Language-Model-SAEs)\]\[[SAE-Reasoning](https://github.com/AIRI-Institute/SAE-Reasoning)\]
  - [paper - inc/Baichuan2)\]\[[BaichuanSEED](https://arxiv.org/abs/2408.15079)\]\[[Baichuan Alignment Technical Report](https://arxiv.org/abs/2410.14940)\]\[[KV Shifting Attention Enhances Language Modeling](https://arxiv.org/abs/2411.19574)\]\[[Baichuan-M1](https://arxiv.org/abs/2502.12671)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [paper
  - [paper - PaLM)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper - oval/storm)\]\[[Co-STORM EMNLP 2024](https://www.arxiv.org/abs/2408.15232)\]\[[WikiChat](https://github.com/stanford-oval/WikiChat)\]\[[kiroku](https://github.com/cnunescoelho/kiroku)\]
  - [paper - datasets](https://github.com/virattt/financial-datasets)\]\[[LLMs-in-Finance](https://github.com/hananedupouy/LLMs-in-Finance)\]
  - [paper - lab/stanford_alpaca)\]\[[Alpaca-Lora](https://github.com/tloen/alpaca-lora)\]\[[OpenAlpaca](https://github.com/yxuansu/OpenAlpaca)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [search_with_lepton - oval/storm)\]\[[searxng](https://github.com/searxng/searxng)\]\[[Perplexica](https://github.com/ItzCrazyKns/Perplexica)\]\[[rag-search](https://github.com/thinkany-ai/rag-search)\]\[[sensei](https://github.com/jjleng/sensei)\]\[[azure-search-openai-demo](https://github.com/Azure-Samples/azure-search-openai-demo)\]
  - [paper - Math)\]\[[Qwen2.5-Math-Demo](https://huggingface.co/spaces/Qwen/Qwen2.5-Math-Demo)\]\[[ProcessBench](https://github.com/QwenLM/ProcessBench)\]\[[SuperCorrect-llm](https://github.com/YangLing0818/SuperCorrect-llm)\]
  - [paper - PaLM)\]
  - [paper
  - [paper - granite/granite-code-models)\]\[[granite-3.1-language-models](https://github.com/ibm-granite/granite-3.1-language-models)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - Transformers](https://github.com/Beomi/BitNet-Transformers)\]\[[BitNet b1.58](https://arxiv.org/abs/2402.17764)\]\[[BitNet a4.8](https://arxiv.org/abs/2411.04965)\]\[[BitNet b1.58 2B4T](https://arxiv.org/abs/2504.12285)\]\[[BitNet v2](https://arxiv.org/abs/2504.18415)\]\[[T-MAC](https://github.com/microsoft/T-MAC)\]\[[BitBLAS](https://github.com/microsoft/BitBLAS)\]\[[BiLLM](https://github.com/Aaronhuang-778/BiLLM)\]\[[decoupleQ](https://github.com/bytedance/decoupleQ)\]
  - [paper - DASLab/gptq)\]\[[AutoGPTQ](https://github.com/PanQiWei/AutoGPTQ)\]\[[QMoE](https://github.com/IST-DASLab/qmoe)\]\[[llmc](https://github.com/ModelTC/llmc)\]
  - [blog - recurrent-drafter)\]\[[A Hitchhiker's Guide to Speculative Decoding](https://pytorch.org/blog/hitchhikers-guide-speculative-decoding)\]
  - [ggml - fast](https://github.com/pytorch-labs/gpt-fast)\]\[[lightllm](https://github.com/ModelTC/lightllm)\]\[[fastllm](https://github.com/ztxz16/fastllm)\]\[[CTranslate2](https://github.com/OpenNMT/CTranslate2)\]\[[ipex-llm](https://github.com/intel-analytics/ipex-llm)\]\[[rtp-llm](https://github.com/alibaba/rtp-llm)\]\[[KsanaLLM](https://github.com/pcg-mlp/KsanaLLM)\]\[[ppl.nn](https://github.com/OpenPPL/ppl.nn)\]\[[ZhiLight](https://github.com/zhihu/ZhiLight)\]\[[WeChat-TFCC](https://github.com/Tencent/WeChat-TFCC)\]
  - [paper
  - [paper
  - [paper - s-k/VARAG)\]\[[docling](https://github.com/DS4SD/docling)\]\[[M3DocRAG](https://arxiv.org/abs/2411.04952)\]\[[Visualized BGE](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/visual_bge)\]\[[OmniSearch](https://github.com/Alibaba-NLP/OmniSearch)\]\[[nv-ingest](https://github.com/NVIDIA/nv-ingest)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [RAG-Retrieval - Shanghai/PGRAG)\]\[[CRUD_RAG](https://github.com/IAAR-Shanghai/CRUD_RAG)\]\[[PlanRAG](https://github.com/myeon9h/PlanRAG)\]\[[DPA-RAG](https://github.com/dongguanting/DPA-RAG)\]\[[FollowRAG](https://github.com/dongguanting/FollowRAG)\]\[[LongRAG](https://github.com/TIGER-AI-Lab/LongRAG)\]\[[structured-rag](https://github.com/weaviate/structured-rag)\]\[[RAGLab](https://github.com/fate-ubw/RAGLab)\]\[[autogluon-rag](https://github.com/autogluon/autogluon-rag)\]\[[VARAG](https://github.com/adithya-s-k/VARAG)\]\[[PAI-RAG](https://github.com/aigc-apps/PAI-RAG)\]\[[RagVL](https://github.com/IDEA-FinAI/RagVL)\]\[[AutoRAG](https://github.com/Marker-Inc-Korea/AutoRAG)\]\[[RetroLLM](https://github.com/sunnynexus/RetroLLM)\]\[[RAG-Instruct](https://github.com/FreedomIntelligence/RAG-Instruct)\]\[[RapidRAG](https://github.com/RapidAI/RapidRAG)\]\[[UltraRAG](https://github.com/OpenBMB/UltraRAG)\]
  - [blog - of-thought-monitoring)\]\[[Agent Q](https://arxiv.org/abs/2408.07199)\]\[[Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters](https://arxiv.org/abs/2408.03314)\]\[[search-and-learn](https://github.com/huggingface/search-and-learn)\]\[[Let's Verify Step by Step](https://arxiv.org/abs/2305.20050)\]\[[Thinking LLMs: General Instruction Following with Thought Generation](https://arxiv.org/abs/2410.10630)\]\[[Awesome-LLM-Strawberry](https://github.com/hijkzzz/Awesome-LLM-Strawberry)\]\[[Awesome-LLM-Reasoning](https://github.com/atfortes/Awesome-LLM-Reasoning)\]\[[Claude's extended thinking](https://www.anthropic.com/research/visible-extended-thinking)\]
  - [paper - guided Tree Search](https://arxiv.org/abs/2411.11694)\]\[[An Empirical Study on Eliciting and Improving R1-like Reasoning Models](https://arxiv.org/abs/2503.04548)\]\[[Towards Large Reasoning Models](https://arxiv.org/abs/2501.09686)\]\[[Reasoning models don't always say what they think](https://www.anthropic.com/research/reasoning-models-dont-say-think)\]\[[Monitoring Reasoning Models](https://arxiv.org/abs/2503.11926)\]\[[Understanding Reasoning LLMs](https://sebastianraschka.com/blog/2025/understanding-reasoning-llms.html)\]\[[The State of LLM Reasoning Models](https://sebastianraschka.com/blog/2025/state-of-llm-reasoning-and-inference-scaling.html)\]
  - [OpenAI Blog - auto-interp](https://github.com/EleutherAI/sae-auto-interp)\]\[[multimodal-sae](https://github.com/EvolvingLMMs-Lab/multimodal-sae)\]\[[Language-Model-SAEs](https://github.com/OpenMOSS/Language-Model-SAEs)\]
  - [paper - inc/Baichuan2)\]\[[BaichuanSEED](https://arxiv.org/abs/2408.15079)\]\[[Baichuan Alignment Technical Report](https://arxiv.org/abs/2410.14940)\]\[[KV Shifting Attention Enhances Language Modeling](https://arxiv.org/abs/2411.19574)\]
  - [paper - instruct)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - AutoML/MobileVLM)\]\[[MobileVLM V2](https://arxiv.org/abs/2402.03766)\]\[[BlueLM-V-3B](https://arxiv.org/abs/2411.10640)\]\[[XiaoMi/mobilevlm](https://github.com/XiaoMi/mobilevlm)\]
  - [open-interpreter
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]
  - [paper - deep-researcher](https://github.com/langchain-ai/ollama-deep-researcher)\]\[[PaSa](https://arxiv.org/abs/2501.10120)\]\[[ScholarCopilot](https://github.com/TIGER-AI-Lab/ScholarCopilot)\]\[[Ai2 ScholarQA](https://github.com/allenai/ai2-scholarqa-lib)\]
  - [paper - PaLM)\]
  - [openfold - pytorch](https://github.com/lucidrains/alphafold3-pytorch)\]\[[Protenix](https://github.com/bytedance/Protenix)\]\[[AlphaFold3](https://github.com/kyegomez/AlphaFold3)\]\[[Ligo-Biosciences/AlphaFold3](https://github.com/Ligo-Biosciences/AlphaFold3)\]\[[LucaOne](https://github.com/LucaOne/LucaOne)\]\[[esm](https://github.com/evolutionaryscale/esm)\]\[[AlphaPPImd](https://github.com/AspirinCode/AlphaPPImd)\]\[[visual-med-alpaca](https://github.com/cambridgeltl/visual-med-alpaca)\]\[[chai-lab](https://github.com/chaidiscovery/chai-lab)\]\[[evo](https://github.com/evo-design/evo)\]
  - [paper
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [PDF-Extract-Kit - tech/colpali)\]\[[localGPT-Vision](https://github.com/PromtEngineer/localGPT-Vision)\]\[[mPLUG-DocOwl](https://github.com/X-PLUG/mPLUG-DocOwl)\]
  - [paper - long.194/)\]\[[llm_reranker](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/llm_reranker)\]\[[FlagEmbedding](https://github.com/FlagOpen/FlagEmbedding)\]
  - [paper - VL-base](https://huggingface.co/BAAI/BGE-VL-base)\]
  - [paper - group/textgrad)\]\[[appl](https://github.com/appl-team/appl)\]\[[okhat/blog](https://github.com/okhat/blog/blob/main/2024.09.impact.md)\]\[[PromptWizard](https://github.com/microsoft/PromptWizard)\]\[[SPO](https://arxiv.org/abs/2502.06855)\]
  - [paper - grpo](https://github.com/open-thought/tiny-grpo)\]\[[simple_GRPO](https://github.com/lsdefine/simple_GRPO)\]\[[GRPO-Zero](https://github.com/policy-gradient/GRPO-Zero)\]\[[grpo-flat](https://github.com/XU-YIJIE/grpo-flat)\]\[[kl-rel-to-ref-in-rl-zh](https://tongyx361.github.io/blogs/posts/kl-rel-to-ref-in-rl-zh)\]
  - [paper - cite](https://github.com/MadryLab/context-cite)\]\[[OmniThink](https://github.com/zjunlp/OmniThink)\]\[[SelfCite](https://arxiv.org/abs/2502.09604)\]
  - [paper
  - [paper
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [paper - Trustworthy-Retrieval-Augmented-Generation)\]\[[TrustRAG](https://github.com/gomate-community/TrustRAG)\]
  - [paper - RAG](https://github.com/microsoft/PIKE-RAG)\]\[[GraphRAG-Local-UI](https://github.com/severian42/GraphRAG-Local-UI)\]\[[nano-graphrag](https://github.com/gusye1234/nano-graphrag)\]\[[fast-graphrag](https://github.com/circlemind-ai/fast-graphrag)\]\[[graph-rag](https://github.com/sarthakrastogi/graph-rag)\]\[[llm-graph-builder](https://github.com/neo4j-labs/llm-graph-builder)\]\[[Triplex](https://huggingface.co/SciPhi/Triplex)\]\[[knowledge_graph_maker](https://github.com/rahulnyk/knowledge_graph_maker)\]\[[itext2kg](https://github.com/AuvaLab/itext2kg)\]\[[KG_RAG](https://github.com/BaranziniLab/KG_RAG)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - OR1](https://github.com/SkyworkAI/Skywork-OR1)\]
  - [paper - nlp/CodeIO)\]\[[simpleRL-reason](https://github.com/hkust-nlp/simpleRL-reason)\]
  - [paper - RL)\]\[[FastCuRL](https://github.com/nick7nlp/FastCuRL)\]\[[SRPO](https://arxiv.org/abs/2504.14286)\]\[[Tina](https://arxiv.org/abs/2504.15777)\]
  - [Open-Reasoner-Zero
  - [Moonlight
  - [paper - AI/MiniMax-01)\]\[[Linear-MoE](https://github.com/OpenSparseLLMs/Linear-MoE)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - hwh/AutoCrawler)\]\[[gpt-crawler](https://github.com/BuilderIO/gpt-crawler)\]\[[webllama](https://github.com/McGill-NLP/webllama)\]\[[gpt-researcher](https://github.com/assafelovic/gpt-researcher)\]\[[skyvern](https://github.com/Skyvern-AI/skyvern)\]\[[Scrapegraph-ai](https://github.com/VinciGit00/Scrapegraph-ai)\]\[[crawl4ai](https://github.com/unclecode/crawl4ai)\]\[[crawlee-python](https://github.com/apify/crawlee-python)\]\[[Agent-E](https://github.com/EmergenceAI/Agent-E)\]\[[CyberScraper-2077](https://github.com/itsOwen/CyberScraper-2077)\]\[[browser-use](https://github.com/browser-use/browser-use)\]\[[nova-act](https://github.com/aws/nova-act)\]\[[ReaderLM-v2](https://huggingface.co/jinaai/ReaderLM-v2)\]
  - [paper - PLUG/MobileAgent)\]\[[Mobile-Agent-v2](https://arxiv.org/abs/2406.01014)\]\[[LiMAC](https://arxiv.org/abs/2410.17883)\]\[[Mobile-Agent-E](https://arxiv.org/abs/2501.11733)\]
  - [paper - pilot](https://github.com/Pythagora-io/gpt-pilot)\]\[[Scaling Large-Language-Model-based Multi-Agent Collaboration](https://arxiv.org/abs/2406.07155)\]\[[ProactiveAgent](https://github.com/thunlp/ProactiveAgent)\]\[[FilmAgent](https://github.com/HITsz-TMG/FilmAgent)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper
  - [paper
  - [paper - II](https://github.com/FreedomIntelligence/HuatuoGPT-II)\]\[[Medical_NLP](https://github.com/FreedomIntelligence/Medical_NLP)\]\[[Zhongjing](https://github.com/SupritYoung/Zhongjing)\]\[[MedicalGPT](https://github.com/shibing624/MedicalGPT)\]\[[huatuogpt-vision](https://github.com/freedomintelligence/huatuogpt-vision)\]\[[Chain-of-Diagnosis](https://github.com/FreedomIntelligence/Chain-of-Diagnosis)\]\[[BianCang](https://github.com/QLU-NLP/BianCang)\]\[[Llama3-OpenBioLLM-70B](https://huggingface.co/aaditya/Llama3-OpenBioLLM-70B)\]
  - [paper - PaLM)\]
  - [paper - embeddings-v2](https://huggingface.co/jinaai/jina-embeddings-v2-base-en)\]\[[jina-reranker-v2](https://huggingface.co/jinaai/jina-reranker-v2-base-multilingual)\]\[[reader-lm-1.5b](https://huggingface.co/jinaai/reader-lm-1.5b)\]\[[ReaderLM-v2](https://huggingface.co/jinaai/ReaderLM-v2)\]\[[pe_rank](https://github.com/liuqi6777/pe_rank)\]\[[Jina CLIP](https://arxiv.org/abs/2405.20204)\]\[[jina-embeddings-v3](https://arxiv.org/abs/2409.10173)\]
  - [Open LLM Leaderboard
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - TARS)\]\[[UI-TARS-desktop](https://github.com/bytedance/UI-TARS-desktop)\]\[[midscene](https://github.com/web-infra-dev/midscene)\]\[[browser-use](https://github.com/browser-use/browser-use)\]\[[computer_use_ootb](https://github.com/showlab/computer_use_ootb)\]\[[Agent-S](https://github.com/simular-ai/Agent-S)\]\[[open-operator](https://github.com/All-Hands-AI/open-operator)\]\[[STEVE-R1](https://github.com/FanbinLu/STEVE-R1)\]\[[UI-R1](https://arxiv.org/abs/2503.21620)\]\[[InfiGUI-R1](https://github.com/Reallm-Labs/InfiGUI-R1)\]\[[r1-computer-use](https://github.com/agentsea/r1-computer-use)\]
  - [paper - GR00T)\]\[[IsaacLab](https://github.com/isaac-sim/IsaacLab)\]\[[IsaacGymEnvs](https://github.com/isaac-sim/IsaacGymEnvs)\]\[[OmniIsaacGymEnvs](https://github.com/isaac-sim/OmniIsaacGymEnvs)\]\[[MuJoCo Playground](https://playground.mujoco.org)\]
  - [LeRobot - Embodied-AI/Genesis)\]\[[DORA](https://github.com/dora-rs/dora)\]\[[awesome-ai-agents](https://github.com/e2b-dev/awesome-ai-agents)\]\[[IsaacLab](https://github.com/isaac-sim/IsaacLab)\]\[[IsaacGymEnvs](https://github.com/isaac-sim/IsaacGymEnvs)\]\[[OmniIsaacGymEnvs](https://github.com/isaac-sim/OmniIsaacGymEnvs)\]\[[Isaac-GR00T](https://github.com/NVIDIA/Isaac-GR00T)\]\[[Awesome-Robotics-3D](https://github.com/zubair-irshad/Awesome-Robotics-3D)\]\[[AimRT](https://github.com/AimRT/AimRT)\]\[[agibot_x1_train](https://github.com/AgibotTech/agibot_x1_train)\]\[[Agibot-World](https://github.com/OpenDriveLab/Agibot-World)\]\[[unitree_IL_lerobot](https://github.com/unitreerobotics/unitree_IL_lerobot)\]\[[unitree_rl_gym](https://github.com/unitreerobotics/unitree_rl_gym)\]\[[openpi](https://github.com/Physical-Intelligence/openpi)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [paper
  - [paper - AIFLM-Lab/Fin-R1)\]\[[FinRL-DeepSeek](https://github.com/benstaf/FinRL_DeepSeek)\]\[[DianJin-R1](https://arxiv.org/abs/2504.15716)\]
  - [trt-llm-rag-windows - crawler](https://github.com/BuilderIO/gpt-crawler)\]\[[R2R](https://github.com/SciPhi-AI/R2R)\]\[[rag-notebook-to-microservices](https://github.com/wenqiglantz/rag-notebook-to-microservices)\]\[[MaxKB](https://github.com/1Panel-dev/MaxKB)\]\[[Verba](https://github.com/weaviate/Verba)\]\[[cognita](https://github.com/truefoundry/cognita)\]\[[llmware](https://github.com/llmware-ai/llmware)\]\[[quivr](https://github.com/QuivrHQ/quivr)\]\[[kotaemon](https://github.com/Cinnamon/kotaemon)\]\[[RAGMeUp](https://github.com/AI-Commandos/RAGMeUp)\]\[[pandas-ai](https://github.com/sinaptik-ai/pandas-ai)\]\[[DeepSeek-RAG-Chatbot](https://github.com/SaiAkhil066/DeepSeek-RAG-Chatbot)\]
  - [paper - PaLM)\]
  - [datatrove - studio](https://github.com/HumanSignal/label-studio)\]\[[autolabel](https://github.com/refuel-ai/autolabel)\]\[[synthetic-data-generator](https://github.com/hitsz-ids/synthetic-data-generator)\]\[[NeMo-Curator](https://github.com/NVIDIA/NeMo-Curator)\]\[[distilabel](https://github.com/argilla-io/distilabel)\]\[[easy-dataset](https://github.com/ConardLi/easy-dataset)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - NLP/VinePPO)\]\[[OpenRFT](https://github.com/ADaM-BJTU/OpenRFT)\]\[[SoRFT](https://arxiv.org/abs/2502.20127)\]\[[MRT](https://cohenqu.github.io/mrt.github.io/)\]\[[Visual-RFT](https://github.com/Liuziyu77/Visual-RFT)\]\[[AdaRFT](https://github.com/uscnlp-lime/verl)\]
  - [paper - Long-Chain-of-Thought-Reasoning)\]\[[Awesome-System2-Reasoning-LLM](https://github.com/zzli2022/Awesome-System2-Reasoning-LLM)\]\[[Stop Overthinking](https://arxiv.org/abs/2503.16419)\]\[[Awesome_Efficient_LRM_Reasoning](https://github.com/XiaoYee/Awesome_Efficient_LRM_Reasoning)\]\[[A Survey on Test-Time Scaling in Large Language Models](https://arxiv.org/abs/2503.24235)\]\[[Awesome-RL-Reasoning-Recipes](https://github.com/TsinghuaC3I/Awesome-RL-Reasoning-Recipes)\]\[[Generative AI Act II](https://arxiv.org/abs/2504.13828)\]\[[100 Days After DeepSeek-R1](https://arxiv.org/abs/2505.00551)\]\[[A Sober Look at Progress in Language Model Reasoning](https://arxiv.org/abs/2504.07086)\]
  - [paper - ai-lab/LookaheadDecoding)\]\[[Consistency_LLM](https://github.com/hao-ai-lab/Consistency_LLM)\]\[[Lookahead](https://github.com/alipay/PainlessInferenceAcceleration)\]
  - [MOSS - RLHF](https://github.com/OpenLMLab/MOSS-RLHF)\]
  - [blog
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - research/circuit_training)\]\[[semikong](https://github.com/aitomatic/semikong)\]\[[Automating GPU Kernel Generation](https://developer.nvidia.com/blog/automating-gpu-kernel-generation-with-deepseek-r1-and-inference-time-scaling)\]
  - [paper - Deep-Research](https://github.com/HKUDS/Auto-Deep-Research)\]
  - [swarm - AI/AgentStack)\]\[[multi-agent-orchestrator](https://github.com/awslabs/multi-agent-orchestrator)\]\[[smolagents](https://github.com/huggingface/smolagents)\]\[[agent-service-toolkit](https://github.com/JoshuaC215/agent-service-toolkit)\]\[[agno](https://github.com/agno-agi/agno)\]\[[ANUS](https://github.com/nikmcfly/ANUS)\]\[[AutoAgent](https://github.com/HKUDS/AutoAgent)\]\[[AgentIQ](https://github.com/NVIDIA/AgentIQ)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper - nlp/SWE-agent)\]\[[swe-bench-technical-report](https://www.cognition-labs.com/post/swe-bench-technical-report)\]\[[SWE-smith](https://github.com/SWE-bench/SWE-smith)\]\[[CodeR](https://github.com/NL2Code/CodeR)\]\[[Lingma-SWE-GPT](https://github.com/LingmaTongyi/Lingma-SWE-GPT)\]\[[SWE-Gym](https://github.com/SWE-Gym/SWE-Gym)\]\[[MarsCode Agent](https://arxiv.org/abs/2409.00899)\]\[[SWE-Fixer](https://github.com/InternLM/SWE-Fixer)\]\[[SWE-RL](https://arxiv.org/abs/2502.18449)\]\[[SWE-Lancer](https://github.com/openai/SWELancer-Benchmark)\]\[[multi-swe-bench](https://github.com/multi-swe-bench/multi-swe-bench)\]
  - [paper - AI/SkyThought)\]\[[DeepCoder](https://pretty-radio-b75.notion.site/DeepCoder-A-Fully-Open-Source-14B-Coder-at-O3-mini-Level-1cf81902c14680b3bee5eb349a512a51)\]
  - [paper
  - [paper - FinAI/Fino1)\]\[[PIXIU](https://github.com/The-FinAI/PIXIU)\]\[[FLAG-Trader](https://arxiv.org/abs/2502.11433)\]\[[FinAudio](https://arxiv.org/abs/2503.20990)\]
  - [gpt-investor - quant](https://github.com/goldmansachs/gs-quant)\]\[[stockbot-on-groq](https://github.com/bklieger-groq/stockbot-on-groq)\]\[[Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN](https://github.com/THINK989/Real-Time-Stock-Market-Prediction-using-Ensemble-DL-and-Rainbow-DQN)\]\[[openbb-agents](https://github.com/OpenBB-finance/openbb-agents)\]\[[ai-hedge-fund](https://github.com/virattt/ai-hedge-fund)\]\[[ai-financial-agent](https://github.com/virattt/ai-financial-agent)\]\[[Finance](https://github.com/shashankvemuri/Finance)\]
  - [paper
  - [paper - PaLM)\]
  - [paper - 460M-1T)\]\[[MobiLlama](https://github.com/mbzuai-oryx/MobiLlama)\]\[[Steel-LLM](https://github.com/zhanshijinwat/Steel-LLM)\]\[[minimind](https://github.com/jingyaogong/minimind)\]\[[tiny-llm-zh](https://github.com/wdndev/tiny-llm-zh)\]\[[SkyLadder](https://github.com/sail-sg/SkyLadder)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - ai/OSWorld)\]\[[aguvis](https://github.com/xlang-ai/aguvis)\]\[[Large Action Models](https://arxiv.org/abs/2412.10047)\]
  - [paper - Copilot/OS-Copilot)\]\[[OS-Atlas](https://github.com/OS-Copilot/OS-Atlas)\]\[[OS-Genesis](https://github.com/OS-Copilot/OS-Genesis)\]\[[WindowsAgentArena](https://github.com/microsoft/WindowsAgentArena)\]
  - [datatrove - studio](https://github.com/HumanSignal/label-studio)\]\[[autolabel](https://github.com/refuel-ai/autolabel)\]\[[synthetic-data-generator](https://github.com/hitsz-ids/synthetic-data-generator)\]
  - [RedPajama-Data - minigrid-datasets](https://github.com/dunno-lab/xland-minigrid-datasets)\]\[[OmniCorpus](https://github.com/OpenGVLab/OmniCorpus)\]\[[dclm](https://github.com/mlfoundations/dclm)\]\[[Infinity-Instruct](https://github.com/FlagOpen/Infinity-Instruct)\]\[[MNBVC](https://github.com/esbatmop/MNBVC)\]\[[LMSYS-Chat-1M](https://arxiv.org/abs/2309.11998)\]\[[kangas](https://github.com/comet-ml/kangas)\]\[[openwebtext](https://github.com/jcpeterson/openwebtext)\]\[[open-thoughts](https://github.com/open-thoughts/open-thoughts)\]\[[Bespoke-Stratos-17k](https://huggingface.co/datasets/HuggingFaceH4/Bespoke-Stratos-17k)\]\[[dolphin-r1](https://huggingface.co/datasets/cognitivecomputations/dolphin-r1)\]\[[reasoning-gym](https://github.com/open-thought/reasoning-gym)\]
  - [paper - Role-Play-Papers)\]\[[RPBench-Auto](https://boson.ai/rpbench-blog/)\]\[[Hermes 3 Technical Report](https://arxiv.org/abs/2408.11857)\]\[[From Persona to Personalization: A Survey on Role-Playing Language Agents](https://arxiv.org/abs/2404.18231)\]\[[MMRole](https://github.com/YanqiDai/MMRole)\]\[[OpenCharacter](https://arxiv.org/abs/2501.15427)\]
  - [blog - Second-Half/)\]\[[LLMAgentPapers](https://github.com/zjunlp/LLMAgentPapers)\]\[[LLM-Agents-Papers](https://github.com/AGI-Edgerunners/LLM-Agents-Papers)\]\[[awesome-language-agents](https://github.com/ysymyth/awesome-language-agents)\]\[[Awesome-Papers-Autonomous-Agent](https://github.com/lafmdp/Awesome-Papers-Autonomous-Agent)\]\[[GUI-Agents-Paper-List](https://github.com/OSU-NLP-Group/GUI-Agents-Paper-List)\]\[[ai-agent-white-paper](https://arthurchiao.art/blog/ai-agent-white-paper-zh)\]\[[Building effective agents](https://www.anthropic.com/research/building-effective-agents)\]\[[EMNLP 2024 Tutorial: Language Agents](https://language-agent-tutorial.github.io/)\]
  - [paper - Lab](https://github.com/THUDM/Android-Lab)\]
  - [paper - NLP/PC-Agent)\]\[[PPTAgent](https://github.com/icip-cas/PPTAgent)\]
  - [paper - Agent-Survey](https://github.com/OS-Agent-Survey/OS-Agent-Survey)\]\[[ACU](https://github.com/francedot/acu)\]
  - [paper - 2-a-large-scale-foundation-world-model)\]\[[genie2-pytorch](https://github.com/lucidrains/genie2-pytorch)\]\[[GameNGen](https://arxiv.org/abs/2408.14837)\]\[[GameGen-X](https://github.com/GameGen-X/GameGen-X)\]\[[GameFactory](https://github.com/KwaiVGI/GameFactory)\]\[[Unbounded](https://arxiv.org/abs/2410.18975)\]\[[open-oasis](https://github.com/etched-ai/open-oasis)\]\[[DIAMOND](https://diamond-wm.github.io)\]\[[WHAM](https://huggingface.co/microsoft/wham)\]\[[dreamerv3](https://www.nature.com/articles/s41586-025-08744-2)\]\[[AssistanceZero](https://github.com/cassidylaidlaw/minecraft-building-assistance-game)\]\[[MineWorld](https://github.com/microsoft/MineWorld)\]\[[Multiverse](https://github.com/EnigmaLabsAI/multiverse)\]
  - [LeRobot - Embodied-AI/Genesis)\]\[[DORA](https://github.com/dora-rs/dora)\]\[[awesome-ai-agents](https://github.com/e2b-dev/awesome-ai-agents)\]\[[IsaacLab](https://github.com/isaac-sim/IsaacLab)\]\[[IsaacGymEnvs](https://github.com/isaac-sim/IsaacGymEnvs)\]\[[OmniIsaacGymEnvs](https://github.com/isaac-sim/OmniIsaacGymEnvs)\]\[[Awesome-Robotics-3D](https://github.com/zubair-irshad/Awesome-Robotics-3D)\]\[[AimRT](https://github.com/AimRT/AimRT)\]\[[agibot_x1_train](https://github.com/AgibotTech/agibot_x1_train)\]\[[unitree_IL_lerobot](https://github.com/unitreerobotics/unitree_IL_lerobot)\]\[[unitree_rl_gym](https://github.com/unitreerobotics/unitree_rl_gym)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper - Coder)\]\[[CodeArena](https://arxiv.org/abs/2412.05210)\]\[[CodeElo](https://arxiv.org/abs/2501.01257)\]
  - [paper - nlp/SWE-agent)\]\[[swe-bench-technical-report](https://www.cognition-labs.com/post/swe-bench-technical-report)\]\[[CodeR](https://github.com/NL2Code/CodeR)\]\[[Lingma-SWE-GPT](https://github.com/LingmaTongyi/Lingma-SWE-GPT)\]\[[SWE-Gym](https://github.com/SWE-Gym/SWE-Gym)\]
  - [paper - Touchstone](https://github.com/IDEA-FinAI/Golden-Touchstone)\]\[[financebench](https://github.com/patronus-ai/financebench)\]\[[OmniEval](https://github.com/RUC-NLPIR/OmniEval)\]\[[FLAME](https://github.com/FLAME-ruc/FLAME)\]\[[FinEval](https://github.com/SUFE-AIFLM-Lab/FinEval)\]\[[CFBenchmark](https://github.com/TongjiFinLab/CFBenchmark)\]\[[MME-Finance](https://github.com/HiThink-Research/MME-Finance)\]
  - [paper
  - [paper - o1)\]\[[MedVLM-R1](https://arxiv.org/abs/2502.19634)\]\[[m1](https://github.com/UCSC-VLAA/m1)\]\[[MedReason](https://github.com/UCSC-VLAA/MedReason)\]\[[X-Reasoner](https://arxiv.org/abs/2505.03981)\]
  - [paper - PaLM)\]
  - [paper
  - [paper - paper-li_mu.pdf)\]\[[ps-lite](https://github.com/dmlc/ps-lite)\]\[[Zero Bubble Pipeline Parallelism](https://arxiv.org/abs/2401.10241)\]\[[DualPipe](https://github.com/deepseek-ai/DualPipe)\]
  - [paper - to-strong)\]\[[weak-to-strong-deception](https://github.com/keven980716/weak-to-strong-deception)\]\[[Evolving Alignment via Asymmetric Self-Play](https://arxiv.org/abs/2411.00062)\]\[[easy-to-hard](https://github.com/Edward-Sun/easy-to-hard)\]
  - [evaluate - guidebook](https://github.com/huggingface/evaluation-guidebook)\]\[[EvalScope](https://github.com/modelscope/evalscope)\]\[[llmperf](https://github.com/ray-project/llmperf)\]\[[OpenEvals](https://github.com/langchain-ai/openevals)\]\[[Awesome-LLM-Eval](https://github.com/onejune2018/Awesome-LLM-Eval)\]\[[LLM-eval-survey](https://github.com/MLGroupJLU/LLM-eval-survey)\]\[[llm_benchmarks](https://github.com/leobeeson/llm_benchmarks)\]\[[Awesome-LLMs-Evaluation-Papers](https://github.com/tjunlp-lab/Awesome-LLMs-Evaluation-Papers)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [blog - llm-inference-backends)\]
  - [paper - Factory](https://github.com/Zefan-Cai/KVCache-Factory)\]\[[InfiniGen](https://github.com/snu-comparch/InfiniGen)\]\[[kvpress](https://github.com/NVIDIA/kvpress)\]
  - [TensorRT-LLM - inference-server/server)\]\[[Dynamo](https://github.com/ai-dynamo/dynamo)\]\[[GenerativeAIExamples](https://github.com/NVIDIA/GenerativeAIExamples)\]\[[TensorRT-Model-Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer)\]\[[TensorRT](https://github.com/NVIDIA/TensorRT)\]\[[kvpress](https://github.com/NVIDIA/kvpress)\]\[[OpenVINO](https://github.com/openvinotoolkit/openvino)\]
  - [paper - ai/DeepSeek-V3)\]\[[DeepSeek-R1](https://github.com/deepseek-ai/DeepSeek-R1)\]\[[DeepEP](https://github.com/deepseek-ai/DeepEP)\]
  - [Megatron-LM - DeepSpeed](https://github.com/microsoft/Megatron-DeepSpeed)\]\[[Megatron-DeepSpeed](https://github.com/bigscience-workshop/Megatron-DeepSpeed)\]\[[Pai-Megatron-Patch](https://github.com/alibaba/Pai-Megatron-Patch)\]
  - [paper
  - [paper - nlpir/flashrag)\]\[[FlashRAG-Paddle](https://github.com/RUC-NLPIR/FlashRAG-Paddle)\]\[[Auto-RAG](https://github.com/ictnlp/Auto-RAG)\]\[[flexrag](https://github.com/ictnlp/flexrag)\]
  - [paper - Augmented Generation with Graphs](https://arxiv.org/abs/2501.00309)\]\[[code](https://github.com/Graph-RAG/GraphRAG)\]\[[Awesome-GraphRAG](https://github.com/DEEP-PolyU/Awesome-GraphRAG)\]
  - [paper - Augmented Generation: A Survey](https://arxiv.org/abs/2405.07437)\]\[[ragas](https://github.com/explodinggradients/ragas)\]\[[RAGChecker](https://github.com/amazon-science/RAGChecker)\]\[[rageval](https://github.com/gomate-community/rageval)\]\[[CORAL](https://github.com/Ariya12138/CORAL)\]\[[WebWalker](https://github.com/Alibaba-NLP/WebWalker)\]
  - [blog - Augmented Generation for Large Language Models](https://arxiv.org/abs/2409.13385)\]\[[ContextRAG](https://arxiv.org/abs/2502.14759)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [haystack - Chatchat](https://github.com/chatchat-space/Langchain-Chatchat)\]\[[ragflow](https://github.com/infiniflow/ragflow)\]\[[infinity](https://github.com/infiniflow/infinity)\]
  - [RAG-Retrieval - Shanghai/PGRAG)\]\[[CRUD_RAG](https://github.com/IAAR-Shanghai/CRUD_RAG)\]\[[PlanRAG](https://github.com/myeon9h/PlanRAG)\]\[[DPA-RAG](https://github.com/dongguanting/DPA-RAG)\]\[[FollowRAG](https://github.com/dongguanting/FollowRAG)\]\[[LongRAG](https://github.com/TIGER-AI-Lab/LongRAG)\]\[[structured-rag](https://github.com/weaviate/structured-rag)\]\[[RAGLab](https://github.com/fate-ubw/RAGLab)\]\[[autogluon-rag](https://github.com/autogluon/autogluon-rag)\]\[[VARAG](https://github.com/adithya-s-k/VARAG)\]\[[PAI-RAG](https://github.com/aigc-apps/PAI-RAG)\]\[[Meta-Chunking](https://github.com/IAAR-Shanghai/Meta-Chunking)\]\[[chonkie](https://github.com/chonkie-ai/chonkie)\]\[[RagVL](https://github.com/IDEA-FinAI/RagVL)\]\[[AutoRAG](https://github.com/Marker-Inc-Korea/AutoRAG)\]\[[RetroLLM](https://github.com/sunnynexus/RetroLLM)\]
  - [paper - mistral-7b-instruct)\]\[[llm2vec](https://github.com/McGill-NLP/llm2vec)\]\[[When Text Embedding Meets Large Language Model: A Comprehensive Survey](https://arxiv.org/abs/2412.09165)\]
  - [paper - org/llm-reasoners)\]\[[LLM Reasoners COLM 2024](https://arxiv.org/abs/2404.05221)\]\[[AgentGen KDD 2025](https://arxiv.org/abs/2408.00764)\]
  - [paper - RL/PRIME)\]
  - [paper - MCTS)\]\[[llm-mcts](https://github.com/1989Ryan/llm-mcts)\]\[[LightZero](https://github.com/opendilab/LightZero)\]
  - [paper - 2-research](https://github.com/open-thought/system-2-research)\]\[[Test-time Computing: from System-1 Thinking to System-2 Thinking](https://arxiv.org/abs/2501.02497)\]\[[Towards System 2 Reasoning in LLMs](https://arxiv.org/abs/2501.04682)\]\[[Awesome-System2-Reasoning-LLM](https://github.com/zzli2022/Awesome-System2-Reasoning-LLM)\]
  - [paper - 1M Technical Report](https://arxiv.org/abs/2501.15383)\]\[[QwQ](https://github.com/QwenLM/QwQ)\]
  - [paper - 4](https://huggingface.co/collections/microsoft/phi-4-677e9380e514feb5577a40e4)\]\[[SmolLM](https://huggingface.co/blog/smollm)\]\[[SmolLM2](https://arxiv.org/abs/2502.02737)\]\[[Computational Bottlenecks of Training Small-scale Large Language Models](https://arxiv.org/abs/2410.19456)\]\[[SLMs-Survey](https://github.com/FairyFali/SLMs-Survey)\]\[[MiniLLM](https://arxiv.org/abs/2306.08543)\]
  - [paper - GSAI/LLaDA)\]\[[Diffusion-LM](https://github.com/XiangLi1999/Diffusion-LM)\]\[[BD3-LM](https://github.com/kuleshov-group/bd3lms)\]\[[mdlm](https://github.com/kuleshov-group/mdlm)\]\[[Dream](https://github.com/HKUNLP/Dream)\]\[[d1](https://github.com/dllm-reasoning/d1)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - wpy/SeqXGPT)\]\[[llm-detect-ai](https://github.com/yanqiangmiffy/llm-detect-ai)\]\[[detect-gpt](https://github.com/eric-mitchell/detect-gpt)\]\[[fast-detect-gpt](https://github.com/baoguangsheng/fast-detect-gpt)\]\[[ImBD](https://github.com/Jiaqi-Chen-00/ImBD)\]\[[MAGE](https://github.com/yafuly/MAGE)\]
  - [paper - research/circuit_training)\]\[[semikong](https://github.com/aitomatic/semikong)\]\[[Automating GPU Kernel Generation](https://developer.nvidia.com/blog/automating-gpu-kernel-generation-with-deepseek-r1-and-inference-time-scaling)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper
  - [paper - Scientist)\]\[[Social_Science](https://github.com/RenqiChen/Social_Science)\]\[[SocialAgent](https://github.com/FudanDISC/SocialAgent)\]\[[game_theory](https://github.com/Wenyueh/game_theory)\]\[[hypothesis-generation](https://github.com/ChicagoHAI/hypothesis-generation)\]\[[Towards an AI co-scientist](https://arxiv.org/abs/2502.18864)\]
  - [paper
  - [paper - NLPIR/LLM4IR-Survey)\]\[[YuLan-IR](https://github.com/RUC-GSAI/YuLan-IR)\]\[[A Survey of Conversational Search](https://arxiv.org/abs/2410.15576)\]\[[A Survey of Model Architectures in Information Retrieval](https://arxiv.org/abs/2502.14822)\]\[[A Survey of Query Optimization in Large Language Models](https://arxiv.org/abs/2412.17558)\]
  - [paper - rewarding-reasoning-LLM)\]
  - [paper - PaLM)\]
  - [paper - hwh/AutoCrawler)\]\[[gpt-crawler](https://github.com/BuilderIO/gpt-crawler)\]\[[webllama](https://github.com/McGill-NLP/webllama)\]\[[gpt-researcher](https://github.com/assafelovic/gpt-researcher)\]\[[skyvern](https://github.com/Skyvern-AI/skyvern)\]\[[Scrapegraph-ai](https://github.com/VinciGit00/Scrapegraph-ai)\]\[[crawl4ai](https://github.com/unclecode/crawl4ai)\]\[[crawlee-python](https://github.com/apify/crawlee-python)\]\[[Agent-E](https://github.com/EmergenceAI/Agent-E)\]\[[CyberScraper-2077](https://github.com/itsOwen/CyberScraper-2077)\]\[[browser-use](https://github.com/browser-use/browser-use)\]\[[ReaderLM-v2](https://huggingface.co/jinaai/ReaderLM-v2)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - group/textgrad)\]\[[appl](https://github.com/appl-team/appl)\]\[[okhat/blog](https://github.com/okhat/blog/blob/main/2024.09.impact.md)\]\[[PromptWizard](https://github.com/microsoft/PromptWizard)\]\[[SPO](https://arxiv.org/abs/2502.06855)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - RAG](https://github.com/microsoft/PIKE-RAG)\]\[[GraphRAG-Local-UI](https://github.com/severian42/GraphRAG-Local-UI)\]\[[nano-graphrag](https://github.com/gusye1234/nano-graphrag)\]\[[fast-graphrag](https://github.com/circlemind-ai/fast-graphrag)\]\[[graph-rag](https://github.com/sarthakrastogi/graph-rag)\]\[[llm-graph-builder](https://github.com/neo4j-labs/llm-graph-builder)\]\[[Triplex](https://huggingface.co/SciPhi/Triplex)\]\[[knowledge_graph_maker](https://github.com/rahulnyk/knowledge_graph_maker)\]\[[itext2kg](https://github.com/AuvaLab/itext2kg)\]\[[KG_RAG](https://github.com/BaranziniLab/KG_RAG)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [blog - Time Compute Optimally can be More Effective than Scaling Model Parameters](https://arxiv.org/abs/2408.03314)\]\[[search-and-learn](https://github.com/huggingface/search-and-learn)\]\[[Let's Verify Step by Step](https://arxiv.org/abs/2305.20050)\]\[[Thinking LLMs: General Instruction Following with Thought Generation](https://arxiv.org/abs/2410.10630)\]\[[Awesome-LLM-Strawberry](https://github.com/hijkzzz/Awesome-LLM-Strawberry)\]\[[Awesome-LLM-Reasoning](https://github.com/atfortes/Awesome-LLM-Reasoning)\]\[[Claude's extended thinking](https://www.anthropic.com/research/visible-extended-thinking)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - DASLab/gptq)\]\[[AutoGPTQ](https://github.com/PanQiWei/AutoGPTQ)\]\[[QMoE](https://github.com/IST-DASLab/qmoe)\]\[[llmc](https://github.com/ModelTC/llmc)\]
  - [paper - Factory](https://github.com/Zefan-Cai/KVCache-Factory)\]\[[InfiniGen](https://github.com/snu-comparch/InfiniGen)\]
  - [TensorRT-LLM - inference-server/server)\]\[[GenerativeAIExamples](https://github.com/NVIDIA/GenerativeAIExamples)\]\[[TensorRT-Model-Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer)\]\[[TensorRT](https://github.com/NVIDIA/TensorRT)\]\[[TransformerEngine](https://github.com/NVIDIA/TransformerEngine)\]\[[OpenVINO](https://github.com/openvinotoolkit/openvino)\]
  - [paper
  - [paper - Augmented Generation with Graphs](https://arxiv.org/abs/2501.00309)\]
  - [paper - Augmented Generation: A Survey](https://arxiv.org/abs/2405.07437)\]\[[ragas](https://github.com/explodinggradients/ragas)\]\[[RAGChecker](https://github.com/amazon-science/RAGChecker)\]\[[rageval](https://github.com/gomate-community/rageval)\]\[[CORAL](https://github.com/Ariya12138/CORAL)\]
  - [blog - Augmented Generation for Large Language Models](https://arxiv.org/abs/2409.13385)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [haystack - Chatchat](https://github.com/chatchat-space/Langchain-Chatchat)\]\[[ragflow](https://github.com/infiniflow/ragflow)\]
  - [paper - mistral-7b-instruct)\]\[[llm2vec](https://github.com/McGill-NLP/llm2vec)\]\[[When Text Embedding Meets Large Language Model: A Comprehensive Survey](https://arxiv.org/abs/2412.09165)\]
  - [paper - 2-research](https://github.com/open-thought/system-2-research)\]\[[Test-time Computing: from System-1 Thinking to System-2 Thinking](https://arxiv.org/abs/2501.02497)\]\[[Towards System 2 Reasoning in LLMs](https://arxiv.org/abs/2501.04682)\]
  - [unsloth - ai/oumi)\]\[[VeOmni](https://github.com/ByteDance-Seed/VeOmni)\]
  - [paper
  - [paper - Parameter-Efficient-Fine-Tuning-for-Foundation-Models)\]
  - [paper - db/StreamRAG)\]\[[VideoRAG](https://arxiv.org/abs/2501.05874)\]\[[Ask in Any Modality](https://arxiv.org/abs/2502.08826)\]
  - [paper - o1)\]\[[WebThinker](https://github.com/RUC-NLPIR/WebThinker)\]\[[CoRAG](https://arxiv.org/abs/2501.14342)\]\[[DeepRAG](https://arxiv.org/abs/2502.01142)\]\[[StructRAG](https://github.com/icip-cas/StructRAG)\]\[[ReAG](https://github.com/superagent-ai/reag)\]\[[Search-R1](https://github.com/PeterGriffinJin/Search-R1)\]\[[r1-reasoning-rag](https://github.com/deansaco/r1-reasoning-rag)\]\[[R1-Searcher](https://github.com/RUCAIBox/R1-Searcher)\]\[[MCTS-RAG](https://arxiv.org/abs/2503.20757)\]\[[ReaRAG](https://arxiv.org/abs/2503.21729)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [RAG-Retrieval - Shanghai/PGRAG)\]\[[CRUD_RAG](https://github.com/IAAR-Shanghai/CRUD_RAG)\]\[[PlanRAG](https://github.com/myeon9h/PlanRAG)\]\[[DPA-RAG](https://github.com/dongguanting/DPA-RAG)\]\[[FollowRAG](https://github.com/dongguanting/FollowRAG)\]\[[LongRAG](https://github.com/TIGER-AI-Lab/LongRAG)\]\[[structured-rag](https://github.com/weaviate/structured-rag)\]\[[RAGLab](https://github.com/fate-ubw/RAGLab)\]\[[autogluon-rag](https://github.com/autogluon/autogluon-rag)\]\[[VARAG](https://github.com/adithya-s-k/VARAG)\]\[[PAI-RAG](https://github.com/aigc-apps/PAI-RAG)\]\[[RagVL](https://github.com/IDEA-FinAI/RagVL)\]\[[AutoRAG](https://github.com/Marker-Inc-Korea/AutoRAG)\]\[[RetroLLM](https://github.com/sunnynexus/RetroLLM)\]\[[RAG-Instruct](https://github.com/FreedomIntelligence/RAG-Instruct)\]\[[RapidRAG](https://github.com/RapidAI/RapidRAG)\]\[[UltraRAG](https://github.com/OpenBMB/UltraRAG)\]\[[MMOA-RAG](https://github.com/chenyiqun/MMOA-RAG)\]\[[EasyRAG](https://github.com/BUAADreamer/EasyRAG)\]\[[HiRAG](https://github.com/hhy-huang/HiRAG)\]
  - [paper - NLP/llm2vec)\]\[[VLM2Vec](https://github.com/TIGER-AI-Lab/VLM2Vec)\]
  - [paper - Embed/blob/main/modeling_nvmmembed.py)\]\[[magiclens](https://github.com/google-deepmind/magiclens)\]\[[E5-V](https://github.com/kongds/E5-V)\]\[[Visualized BGE](https://github.com/FlagOpen/FlagEmbedding/tree/master/research/visual_bge)\]\[[VLM2Vec](https://github.com/TIGER-AI-Lab/VLM2Vec)\]\[[GME-Qwen2-VL](https://arxiv.org/abs/2412.16855)\]\[[mmE5](https://github.com/haon-chen/mmE5)\]\[[LLaVE](https://github.com/DeepLearnXMU/LLaVE)\]\[[perception_models](https://github.com/facebookresearch/perception_models)\]
  - [JamAIBase - Retrieval](https://github.com/NovaSearch-Team/RAG-Retrieval)\]\[[model2vec](https://github.com/MinishLab/model2vec)\]
  - [paper - baichuan-mllm/bc-omni)\]\[[Baichuan-Omni-1.5 Technical Report](https://arxiv.org/abs/2501.15368)\]\[[Baichuan-Omni-1.5](https://github.com/baichuan-inc/Baichuan-Omni-1.5)\]
  - [blog - models](https://github.com/meta-llama/llama-models)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper
  - [paper - mini)\]\[[Adam](https://arxiv.org/abs/1412.6980)\]\[[AdamW](https://arxiv.org/abs/1711.05101)\]
  - [paper - Learning-Enhanced-LLMs-A-Survey)\]
  - [paper - sia.github.io)\]\[[VC-PPO](https://arxiv.org/abs/2503.01491)\]\[[Pre-PPO](https://arxiv.org/abs/2503.22230)\]\[[VAPO](https://arxiv.org/abs/2504.05118)\]\[[EMPO](https://arxiv.org/abs/2504.05812)\]
  - [paper - grpo](https://github.com/open-thought/tiny-grpo)\]\[[simple_GRPO](https://github.com/lsdefine/simple_GRPO)\]\[[GRPO-Zero](https://github.com/policy-gradient/GRPO-Zero)\]\[[grpo-flat](https://github.com/XU-YIJIE/grpo-flat)\]\[[kl-rel-to-ref-in-rl-zh](https://tongyx361.github.io/blogs/posts/kl-rel-to-ref-in-rl-zh)\]
  - [paper - 460M-1T)\]\[[MobiLlama](https://github.com/mbzuai-oryx/MobiLlama)\]\[[Steel-LLM](https://github.com/zhanshijinwat/Steel-LLM)\]\[[minimind](https://github.com/jingyaogong/minimind)\]\[[tiny-llm-zh](https://github.com/wdndev/tiny-llm-zh)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - Role-Play-Papers)\]\[[RPBench-Auto](https://boson.ai/rpbench-blog/)\]\[[Hermes 3 Technical Report](https://arxiv.org/abs/2408.11857)\]\[[From Persona to Personalization: A Survey on Role-Playing Language Agents](https://arxiv.org/abs/2404.18231)\]
  - [blog - Agents-Papers](https://github.com/AGI-Edgerunners/LLM-Agents-Papers)\]\[[awesome-language-agents](https://github.com/ysymyth/awesome-language-agents)\]\[[Awesome-Papers-Autonomous-Agent](https://github.com/lafmdp/Awesome-Papers-Autonomous-Agent)\]\[[GUI-Agents-Paper-List](https://github.com/OSU-NLP-Group/GUI-Agents-Paper-List)\]\[[ai-agent-white-paper](https://arthurchiao.art/blog/ai-agent-white-paper-zh)\]\[[Building effective agents](https://www.anthropic.com/research/building-effective-agents)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper - Curieous/Curie)\]\[[Paper2Code](https://github.com/going-doer/Paper2Code)\]
  - [paper - table-survey](https://github.com/godaai/llm-table-survey)\]\[[table-transformer](https://github.com/microsoft/table-transformer)\]\[[Awesome-Tabular-LLMs](https://github.com/SpursGoZmy/Awesome-Tabular-LLMs)\]\[[Awesome-LLM-Tabular](https://github.com/johnnyhwu/Awesome-LLM-Tabular)\]\[[Table-LLaVA](https://github.com/SpursGoZmy/Table-LLaVA)\]\[[tablegpt-agent](https://github.com/tablegpt/tablegpt-agent)\]\[[TableLLM](https://github.com/RUCKBReasoning/TableLLM)\]\[[OmniSQL](https://github.com/RUCKBReasoning/OmniSQL)\]\[[ChartVLM](https://github.com/UniModal4Reasoning/ChartVLM)\]\[[OmniCaptioner](https://github.com/Alpha-Innovator/OmniCaptioner)\]
  - [paper
  - [paper - Touchstone](https://github.com/IDEA-FinAI/Golden-Touchstone)\]\[[financebench](https://github.com/patronus-ai/financebench)\]\[[OmniEval](https://github.com/RUC-NLPIR/OmniEval)\]\[[FLAME](https://github.com/FLAME-ruc/FLAME)\]
  - [paper - RL/PRIME)\]
  - [paper - PaLM)\]
  - [paper - recommenders)\]\[[Transformers4Rec](https://github.com/NVIDIA-Merlin/Transformers4Rec)\]\[[torchrec](https://github.com/pytorch/torchrec)\]
  - [recommenders - algorithm)\]\[[Awesome-RSPapers](https://github.com/RUCAIBox/Awesome-RSPapers)\]\[[RecBole](https://github.com/RUCAIBox/RecBole)\]\[[RecSysDatasets](https://github.com/RUCAIBox/RecSysDatasets)\]\[[LLM4Rec-Awesome-Papers](https://github.com/WLiK/LLM4Rec-Awesome-Papers)\]\[[Awesome-LLM-for-RecSys](https://github.com/CHIANGEL/Awesome-LLM-for-RecSys)\]\[[Awesome-LLM4RS-Papers](https://github.com/nancheng58/Awesome-LLM4RS-Papers)\]\[[ReChorus](https://github.com/THUwangcy/ReChorus)\]\[[Transformers4Rec](https://github.com/NVIDIA-Merlin/Transformers4Rec)\]\[[torchrec](https://github.com/pytorch/torchrec)\]
  - [paper - Aligner)\]\[[NeMo-Curator](https://github.com/NVIDIA/NeMo-Curator)\]\[[Nemotron-4 340B Technical Report](https://d1qx31qr3h6wln.cloudfront.net/publications/Nemotron_4_340B_8T.pdf)\]\[[Mistral NeMo](https://mistral.ai/news/mistral-nemo/)\]\[[SparseLLM](https://github.com/BaiTheBest/SparseLLM)\]\[[MaskLLM](https://github.com/NVlabs/MaskLLM)\]\[[HelpSteer2-Preference](https://arxiv.org/abs/2410.01257)\]
  - [paper - attention-pytorch](https://github.com/lucidrains/ring-attention-pytorch)\]\[[ring-flash-attention](https://github.com/zhuzilin/ring-flash-attention)\]\[[local-attention](https://github.com/lucidrains/local-attention)\]\[[tree_attention](https://github.com/Zyphra/tree_attention)\]
  - [RedPajama-Data - minigrid-datasets](https://github.com/dunno-lab/xland-minigrid-datasets)\]\[[OmniCorpus](https://github.com/OpenGVLab/OmniCorpus)\]\[[dclm](https://github.com/mlfoundations/dclm)\]\[[Infinity-Instruct](https://github.com/FlagOpen/Infinity-Instruct)\]\[[MNBVC](https://github.com/esbatmop/MNBVC)\]\[[LMSYS-Chat-1M](https://arxiv.org/abs/2309.11998)\]\[[kangas](https://github.com/comet-ml/kangas)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [chatgpt-on-wechat - on-wechat](https://github.com/hanfangyuan4396/dify-on-wechat)\]\[[LLM-As-Chatbot](https://github.com/deep-diver/LLM-As-Chatbot)\]\[[NextChat](https://github.com/ChatGPTNextWeb/NextChat)\]\[[chatbox](https://github.com/Bin-Huang/chatbox)\]\[[cherry-studio](https://github.com/CherryHQ/cherry-studio)\]\[[ChatWise](https://chatwise.app)\]\[[khoj](https://github.com/khoj-ai/khoj)\]\[[HuixiangDou](https://github.com/InternLM/HuixiangDou)\]\[[Streamer-Sales](https://github.com/PeterH0323/Streamer-Sales)\]\[[Tianji](https://github.com/SocialAI-tianji/Tianji)\]\[[metahuman-stream](https://github.com/lipku/metahuman-stream)\]\[[aiavatarkit](https://github.com/uezo/aiavatarkit)\]\[[ai-getting-started](https://github.com/a16z-infra/ai-getting-started)\]\[[chatnio](https://github.com/zmh-program/chatnio)\]\[[VideoChat](https://github.com/Henry-23/VideoChat)\]\[[livetalking](https://github.com/lipku/livetalking)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [paper
  - [paper - eval](https://github.com/openai/human-eval)\]\[[CriticGPT](https://openai.com/index/finding-gpt4s-mistakes-with-gpt-4/)\]\[[On scalable oversight with weak LLMs judging strong LLMs](https://arxiv.org/abs/2407.04622)\]\[[OpenAI Codex CLI](https://github.com/openai/codex)\]
  - [paper - ai/OSWorld)\]\[[AgentGym](https://github.com/WooooDyy/AgentGym)\]\[[Agent-as-a-Judge](https://arxiv.org/abs/2410.10934)\]\[[intellagent](https://github.com/plurai-ai/intellagent)\]\[[Survey on Evaluation of LLM-based Agents](https://arxiv.org/abs/2503.16416)\]\[[AgentRewardBench](https://github.com/McGill-NLP/agent-reward-bench)\]
  - [paper - 2-a-large-scale-foundation-world-model)\]\[[genie2-pytorch](https://github.com/lucidrains/genie2-pytorch)\]\[[GameNGen](https://arxiv.org/abs/2408.14837)\]\[[GameGen-X](https://github.com/GameGen-X/GameGen-X)\]\[[GameFactory](https://github.com/KwaiVGI/GameFactory)\]\[[Unbounded](https://arxiv.org/abs/2410.18975)\]\[[open-oasis](https://github.com/etched-ai/open-oasis)\]\[[DIAMOND](https://diamond-wm.github.io)\]\[[WHAM](https://huggingface.co/microsoft/wham)\]\[[dreamerv3](https://www.nature.com/articles/s41586-025-08744-2)\]\[[AssistanceZero](https://github.com/cassidylaidlaw/minecraft-building-assistance-game)\]\[[MineWorld](https://github.com/microsoft/MineWorld)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - ai/letta)\]\[[Zep](https://arxiv.org/abs/2501.13956)\]\[[zep-python](https://github.com/getzep/zep-python)\]\[[graphiti](https://github.com/getzep/graphiti)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [OpenDeepResearcher - DeepResearch](https://github.com/jina-ai/node-DeepResearch)\]\[[open-deep-research](https://github.com/nickscamara/open-deep-research)\]\[[open-deep-research blog](https://huggingface.co/blog/open-deep-research)\]\[[open_deep_research](https://github.com/huggingface/smolagents/tree/main/examples/open_deep_research)\]\[[deep-research](https://github.com/dzhng/deep-research)\]\[[Auto-Deep-Research](https://github.com/HKUDS/Auto-Deep-Research)\]\[[deep-searcher](https://github.com/zilliztech/deep-searcher)\]\[[local-deep-research](https://github.com/LearningCircuit/local-deep-research)\]\[[local-deep-researcher](https://github.com/langchain-ai/local-deep-researcher)\]\[[open_deep_research](https://github.com/langchain-ai/open_deep_research)\]\[[Agentic-Reasoning](https://github.com/theworldofagents/Agentic-Reasoning)\]
  - [paper - ai/DeepSeek-Coder-V2)\]\[[DeepSeek-V2.5](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)\]\[[Ling-Coder-lite](https://arxiv.org/abs/2503.17793)\]
  - [paper
  - [paper - agi/OpenDeepSearch)\]\[[deep-searcher](https://github.com/zilliztech/deep-searcher)\]
  - [paper - PaLM)\]
  - [blog - agents-python/mcp/)\]\[[fastmcp](https://github.com/jlowin/fastmcp)\]\[[MCP.so](https://mcp.so)\]\[[mcpagents](https://mcpagents.dev)\]\[[ModelScope MCP](https://www.modelscope.cn/mcp)\]\[[mcp-agent](https://github.com/lastmile-ai/mcp-agent)\]\[[awesome-mcp-servers](https://github.com/punkpeye/awesome-mcp-servers)\]\[[Awesome-MCP-ZH](https://github.com/yzfly/Awesome-MCP-ZH)\]\[[mcp](https://github.com/awslabs/mcp)\]\[[A2A](https://github.com/google/A2A)\]\[[adk-python](https://github.com/google/adk-python)\]
  - [paper - LM)\]\[[Megatron-2](https://arxiv.org/abs/2104.04473)\]\[[megatron sequence parallelism](https://arxiv.org/abs/2205.05198)\]\[[Scaling Language Model Training to a Trillion Parameters Using Megatron](https://developer.nvidia.com/blog/scaling-language-model-training-to-a-trillion-parameters-using-megatron)\]\[[picotron](https://github.com/huggingface/picotron)\]\[[nanotron](https://github.com/huggingface/nanotron)\]\[[DeepEP](https://github.com/deepseek-ai/DeepEP)\]
  - [paper - ai/MergeKit)\]\[[DistillKit](https://github.com/arcee-ai/DistillKit)\]\[[A Survey on Collaborative Strategies in the Era of Large Language Models](https://arxiv.org/abs/2407.06089)\]\[[FuseAI](https://github.com/fanqiwan/FuseAI)\]\[[MergeLM](https://github.com/yule-BUAA/MergeLM)\]\[[Long-to-Short-via-Model-Merging](https://github.com/hahahawu/Long-to-Short-via-Model-Merging)\]
  - [paper - Coder-lite](https://arxiv.org/abs/2503.17793)\]\[[DLRover](https://github.com/intelligent-machine-learning/dlrover)\]
  - [The Effect of Prompt Tokens on Instruction Tuning
  - [paper
  - [paper - RL/ReCall)\]\[[Synergizing RAG and Reasoning](https://arxiv.org/abs/2504.15909)\]\[[Agentic-RAG-R1](https://github.com/jiangxinke/Agentic-RAG-R1)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - nlp/simpleRL-reason)\]\[[CodeIO](https://github.com/hkust-nlp/CodeIO)\]\[[B-STaR](https://arxiv.org/abs/2412.17256)\]
  - [paper - R1)\]\[[SEED-Bench-R1](https://github.com/TencentARC/SEED-Bench-R1)\]\[[TinyLLaVA-Video-R1](https://github.com/ZhangXJ199/TinyLLaVA-Video-R1)\]
  - [paper - table-survey](https://github.com/godaai/llm-table-survey)\]\[[table-transformer](https://github.com/microsoft/table-transformer)\]\[[Awesome-Tabular-LLMs](https://github.com/SpursGoZmy/Awesome-Tabular-LLMs)\]\[[Awesome-LLM-Tabular](https://github.com/johnnyhwu/Awesome-LLM-Tabular)\]\[[Table-LLaVA](https://github.com/SpursGoZmy/Table-LLaVA)\]\[[tablegpt-agent](https://github.com/tablegpt/tablegpt-agent)\]\[[TableLLM](https://github.com/RUCKBReasoning/TableLLM)\]\[[OmniSQL](https://github.com/RUCKBReasoning/OmniSQL)\]\[[ChartVLM](https://github.com/UniModal4Reasoning/ChartVLM)\]\[[OmniCaptioner](https://github.com/Alpha-Innovator/OmniCaptioner)\]
  - [paper
  - [paper - Searcher)\]\[[Search-o1](https://github.com/sunnynexus/Search-o1)\]\[[WebThinker](https://github.com/RUC-NLPIR/WebThinker)\]\[[ReSearch](https://github.com/Agent-RL/ReCall)\]\[[Auto-Deep-Research](https://github.com/HKUDS/Auto-Deep-Research)\]\[[deep-searcher](https://github.com/zilliztech/deep-searcher)\]
  - [paper - R1)\]\[[DeepRetrieval](https://github.com/pat-jj/DeepRetrieval)\]\[[DeepResearcher](https://github.com/GAIR-NLP/DeepResearcher)\]\[[ZeroSearch](https://arxiv.org/abs/2505.04588)\]
  - [paper - harvard/TxAgent)\]\[[MedAgent-Pro](https://arxiv.org/abs/2503.18968)\]
  - [paper - PaLM)\]
  - [paper
  - [paper - NLP/ToRL)\]\[[ReCall](https://github.com/Agent-RL/ReCall)\]\[[ReTool](https://arxiv.org/abs/2504.11536)\]\[[ToolRL](https://arxiv.org/abs/2504.13958)\]\[[OTC](https://arxiv.org/abs/2504.14870)\]\[[Improving Multi-Turn Tool Use with RL](https://www.bespokelabs.ai/blog/improving-multi-turn-tool-use-with-reinforcement-learning)\]
  - [Awesome-LLM-System-Papers - production-llm](https://github.com/jihoo-kim/awesome-production-llm)\]\[[Awesome-MLSys-Blogger](https://mlsys-learner-resources.github.io/Awesome-MLSys-Blogger)\]\[[Awesome-ML-SYS-Tutorial](https://github.com/zhaochenyang20/Awesome-ML-SYS-Tutorial)\]\[[how-to-learn-deep-learning-framework](https://github.com/BBuf/how-to-learn-deep-learning-framework)\]\[[PKU-DAIR/Starter-Guide](https://github.com/PKU-DAIR/Starter-Guide/tree/main/docs/systems)\]\[[open-infra-index](https://github.com/deepseek-ai/open-infra-index)\]\[[CutlassAcademy](https://github.com/MekkCyber/CutlassAcademy)\]\[[Triton-Puzzles](https://github.com/srush/Triton-Puzzles)\]\[[CUDA-Learn-Notes](https://github.com/xlite-dev/CUDA-Learn-Notes)\]
  - [blog - pytorch-fully-sharded-data-parallel-api/)\]\[[pytorch-fsdp](https://github.com/huggingface/blog/blob/main/zh/pytorch-fsdp.md)\]
  - [paper - ulysses)\]\[[unofficial code](https://github.com/feifeibear/long-context-attention/blob/main/yunchang/ulysses/attn_layer.py)\]\[[MagiAttention](https://github.com/SandAI-org/MagiAttention)\]
  - [paper
  - [paper - guidebook](https://github.com/huggingface/evaluation-guidebook)\]
  - [blog - leaderboard](https://github.com/vectara/hallucination-leaderboard)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - Transformers](https://github.com/Beomi/BitNet-Transformers)\]\[[BitNet b1.58](https://arxiv.org/abs/2402.17764)\]\[[BitNet a4.8](https://arxiv.org/abs/2411.04965)\]\[[BitNet b1.58 2B4T](https://arxiv.org/abs/2504.12285)\]\[[T-MAC](https://github.com/microsoft/T-MAC)\]\[[BitBLAS](https://github.com/microsoft/BitBLAS)\]\[[BiLLM](https://github.com/Aaronhuang-778/BiLLM)\]\[[decoupleQ](https://github.com/bytedance/decoupleQ)\]
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper - RL/PRIME)\]\[[TTRL](https://arxiv.org/abs/2504.16084)\]\[[Free Process Rewards without Process Labels](https://arxiv.org/abs/2412.01981)\]\[[OREAL](https://github.com/InternLM/OREAL)\]\[[VisualPRM](https://arxiv.org/abs/2503.10291)\]\[[Crossing the Reward Bridge](https://arxiv.org/abs/2503.23829)\]\[[GenPRM](https://arxiv.org/abs/2504.00891)\]
  - [paper - ai/DeepSeek-R1)\]\[[Open-R1](https://github.com/huggingface/open-r1)\]\[[TinyZero](https://github.com/Jiayi-Pan/TinyZero)\]\[[simpleRL-reason](https://github.com/hkust-nlp/simpleRL-reason)\]\[[Logic-RL](https://github.com/Unakar/Logic-RL)\]\[[DeepScaleR](https://github.com/agentica-project/rllm)\]\[[oat-zero](https://oatllm.notion.site/oat-zero)\]\[[understand-r1-zero](https://github.com/sail-sg/understand-r1-zero)\]\[[X-R1](https://github.com/dhcode-cpp/X-R1)\]\[[nano-aha-moment](https://github.com/McGill-NLP/nano-aha-moment)\]\[[Light-R1](https://github.com/Qihoo360/Light-R1)\]\[[ragen](https://github.com/ZihanWang314/ragen)\]\[[R1-V](https://github.com/Deep-Agent/R1-V)\]\[[VLM-R1](https://github.com/om-ai-lab/VLM-R1)\]\[[open-r1-multimodal](https://github.com/EvolvingLMMs-Lab/open-r1-multimodal)\]\[[R1-Onevision](https://github.com/Fancy-MLLM/R1-Onevision)\]\[[Visual-RFT](https://github.com/Liuziyu77/Visual-RFT)\]\[[VisualThinker-R1-Zero](https://github.com/turningpoint-ai/VisualThinker-R1-Zero)\]\[[the-illustrated-deepseek-r1](https://newsletter.languagemodels.co/p/the-illustrated-deepseek-r1)\]\[[Understanding Reasoning LLMs](https://sebastianraschka.com/blog/2025/understanding-reasoning-llms.html)\]
  - [paper
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - aloha)\]\[[Hardware Code](https://github.com/MarkFzp/mobile-aloha)\]\[[Learning Code](https://github.com/MarkFzp/act-plus-plus)\]\[[UMI](https://github.com/real-stanford/universal_manipulation_interface)\]\[[humanplus](https://github.com/MarkFzp/humanplus)\]\[[TeleVision](https://github.com/OpenTeleVision/TeleVision)\]\[[Surgical Robot Transformer](https://surgical-robot-transformer.github.io/)\]\[[lifelike-agility-and-play](https://github.com/Tencent-RoboticsX/lifelike-agility-and-play)\]\[[ReKep](https://rekep-robot.github.io/)\]\[[Open_Duck_Mini](https://github.com/apirrone/Open_Duck_Mini)\]\[[Learning Visual Parkour from Generated Images](https://lucidsim.github.io/)\]\[[ASAP](https://github.com/LeCAR-Lab/ASAP)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]
  - [paper
  - [paper
  - [paper
  - [paper - PaLM)\]
  - [paper - piexl/JailbreakZoo)\]\[[jailbreak_llms](https://github.com/verazuo/jailbreak_llms)\]\[[llm-attacks](https://github.com/llm-attacks/llm-attacks)\]\[[Awesome-Jailbreak-on-LLMs](https://github.com/yueliu1999/Awesome-Jailbreak-on-LLMs)\]\[[Constitutional Classifiers](https://arxiv.org/abs/2501.18837)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [Seed-Thinking-v1.5 - 4-reasoning Technical Report](https://arxiv.org/abs/2504.21318)\]\[[Llama-Nemotron: Efficient Reasoning Models](https://arxiv.org/abs/2505.00949)\]
  - [paper - Time Compute Optimally can be More Effective than Scaling Model Parameters](https://arxiv.org/abs/2408.03314)\]\[[probabilistic-inference-scaling](https://github.com/probabilistic-inference-scaling/probabilistic-inference-scaling)\]\[[LIMO](https://arxiv.org/abs/2502.03387)\]\[[OpenThinker-32B](https://www.open-thoughts.ai/blog/scale)\]\[[L1](https://github.com/cmu-l3/l1)\]\[[Z1](https://github.com/efficientscaling/Z1)\]\[[Reasoning Models Can Be Effective Without Thinking](https://arxiv.org/abs/2504.09858)\]
  - [paper - Reasoner-Zero/Open-Reasoner-Zero)\]\[[One-Shot-RLVR](https://github.com/ypwang61/One-Shot-RLVR)\]\[[Absolute-Zero-Reasoner](https://github.com/LeapLabTHU/Absolute-Zero-Reasoner)\]
  - [paper - ai-lab/VLM-R1)\]\[[R1-V](https://github.com/Deep-Agent/R1-V)\]
  - [paper - VL)\]\[[Moonlight](https://github.com/MoonshotAI/Moonlight)\]\[[APOLLO](https://github.com/zhuhanqing/APOLLO)\]
  - [paper - ultra](https://github.com/pangu-tech/pangu-ultra)\]\[[Pangu Ultra MoE](https://arxiv.org/abs/2505.04519)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [paper - ai/letta)\]\[[Zep](https://arxiv.org/abs/2501.13956)\]\[[zep-python](https://github.com/getzep/zep-python)\]\[[graphiti](https://github.com/getzep/graphiti)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
  - [paper
  - [paper
  - [paper - PaLM)\]
  - [paper - LM)\]\[[picotron](https://github.com/huggingface/picotron)\]\[[Megatron-2](https://arxiv.org/abs/2104.04473)\]\[[megatron sequence parallelism](https://arxiv.org/abs/2205.05198)\]\[[Scaling Language Model Training to a Trillion Parameters Using Megatron](https://developer.nvidia.com/blog/scaling-language-model-training-to-a-trillion-parameters-using-megatron)\]\[[DeepEP](https://github.com/deepseek-ai/DeepEP)\]
  - [paper - hallucination-detection](https://github.com/patronus-ai/Lynx-hallucination-detection)\]
  - [DeepSpeed-MII - FastGen](https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-fastgen)\]\[[ONNX Runtime](https://github.com/microsoft/onnxruntime)\]\[[onnx](https://github.com/onnx/onnx)\]\[[Nanoflow](https://github.com/efeslab/Nanoflow)\]
  - [DeepSpeed - us/research/blog/zero-deepspeed-new-system-optimizations-enable-training-models-with-over-100-billion-parameters/)\]
  - [paper
  - [Advanced RAG Techniques: an Illustrated Overview - RAG-Agent](https://github.com/NirDiamant/Controllable-RAG-Agent)\]\[[GenAI_Agents](https://github.com/NirDiamant/GenAI_Agents)\]\[[bRAG-langchain](https://github.com/bRAGAI/bRAG-langchain)\]\[[GenAI-Showcase](https://github.com/mongodb-developer/GenAI-Showcase)\]
  - [paper
  - [OpenDevin - code-rover](https://github.com/nus-apr/auto-code-rover)\]\[[developer](https://github.com/smol-ai/developer)\]\[[aider](https://github.com/paul-gauthier/aider)\]\[[claude-engineer](https://github.com/Doriandarko/claude-engineer)\]\[[SuperCoder](https://github.com/TransformerOptimus/SuperCoder)\]\[[AIDE](https://github.com/WecoAI/aideml)\]\[[vulnhuntr](https://github.com/protectai/vulnhuntr)\]\[[devin.cursorrules](https://github.com/grapeot/devin.cursorrules)\]
  - [paper
  - [paper - PaLM)\]
  - [paper - deepmind/synthid-text)\]\[[watermark-anything](https://github.com/facebookresearch/watermark-anything)\]
  - [chatgpt-on-wechat - on-wechat](https://github.com/hanfangyuan4396/dify-on-wechat)\]\[[LLM-As-Chatbot](https://github.com/deep-diver/LLM-As-Chatbot)\]\[[NextChat](https://github.com/ChatGPTNextWeb/NextChat)\]\[[chatbox](https://github.com/Bin-Huang/chatbox)\]\[[cherry-studio](https://github.com/CherryHQ/cherry-studio)\]\[[ChatWise](https://chatwise.app)\]\[[khoj](https://github.com/khoj-ai/khoj)\]\[[HuixiangDou](https://github.com/InternLM/HuixiangDou)\]\[[Streamer-Sales](https://github.com/PeterH0323/Streamer-Sales)\]\[[Tianji](https://github.com/SocialAI-tianji/Tianji)\]\[[metahuman-stream](https://github.com/lipku/metahuman-stream)\]\[[aiavatarkit](https://github.com/uezo/aiavatarkit)\]\[[ai-getting-started](https://github.com/a16z-infra/ai-getting-started)\]\[[chatnio](https://github.com/zmh-program/chatnio)\]\[[VideoChat](https://github.com/Henry-23/VideoChat)\]\[[livetalking](https://github.com/lipku/livetalking)\]
  - [paper - Weather)\]\[[arxiv](https://arxiv.org/abs/2211.02556)\]
  - [paper - deepmind/graphcast)\]\[[OpenCastKit](https://github.com/HFAiLab/OpenCastKit)\]\[[GenCast](https://deepmind.google/discover/blog/gencast-predicts-weather-and-the-risks-of-extreme-conditions-with-sota-accuracy/)\]\[[PhiFlow](https://github.com/tum-pbs/PhiFlow)\]
- 1. Word2Vec
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - nmt)\]
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper
  - [paper - nmt)\]
  - [paper
- 2. Seq2Seq
  - [paper
  - [paper
  - [paper - groundhog/GroundHog)\]
  - [paper
  - [paper
  - [fairseq - seq2seq](https://github.com/IBM/pytorch-seq2seq)\]
  - [paper
  - [paper
  - [paper - groundhog/GroundHog)\]
  - [paper
  - [paper
  - [fairseq - seq2seq](https://github.com/IBM/pytorch-seq2seq)\]

Programming Languages

Python 100 Jupyter Notebook 13 C++ 4 TypeScript 3 HTML 2 SCSS 1 Shell 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

awesome-ai-papers

NLP

3. Pretraining

1. Word2Vec

2. Seq2Seq