Projects in Awesome Lists by TIGER-AI-Lab
A curated list of projects in awesome lists by TIGER-AI-Lab .
https://github.com/tiger-ai-lab/theoremexplainagent
Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025]
llm-agents manim manim-animations manim-video rag
Last synced: 13 Jun 2025
https://github.com/TIGER-AI-Lab/TheoremExplainAgent
Official Repo for "TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding"
llm-agents manim manim-animations manim-video rag
Last synced: 01 Apr 2025
https://github.com/tiger-ai-lab/anyv2v
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)
deep-learning generative-ai image-editing image-to-video-generation pytorch video-editing video-synthesis
Last synced: 13 Jun 2025
https://github.com/TIGER-AI-Lab/OpenResearcher
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Last synced: 05 Mar 2026
https://github.com/TIGER-AI-Lab/MAmmoTH
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]
Last synced: 18 Jul 2025
https://github.com/tiger-ai-lab/clawbench
Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.
agent-evaluation agentic-ai ai-agent-benchmark ai-agents benchmark browser-agent browser-automation browser-use chrome-agent chrome-extension computer-use dataset evaluation everyday-tasks llm llm-evaluation online-tasks real-world-benchmark web-agent web-agents
Last synced: 31 May 2026
https://github.com/tiger-ai-lab/program-of-thoughts
Data and Code for Program of Thoughts (TMLR 2023)
Last synced: 22 Aug 2025
https://github.com/tiger-ai-lab/mammoth
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)
Last synced: 13 Jun 2025
https://github.com/TIGER-AI-Lab/Program-of-Thoughts
Data and Code for Program of Thoughts (TMLR 2023)
Last synced: 31 Mar 2025
https://github.com/tiger-ai-lab/vlm2vec
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]
benchmark contrastive-learning embedding image-retrieval mmeb multimodal rag representation-learning video-retrieval visual-document-retrieval vlm
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/mmlu-pro
The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
Last synced: 08 Oct 2025
https://github.com/tiger-ai-lab/consisti2v
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)
diffusion-models image-to-video-generation video-generation video-synthesis
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/mantis
Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR2024]
fuyu language llava-llama3 lmm mantis mllm multi-image-understanding multimodal video vision vlm
Last synced: 13 Jun 2025
https://tiger-ai-lab.github.io/Mantis/
Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR2024]
fuyu language llava-llama3 lmm mantis mllm multi-image-understanding multimodal video vision vlm
Last synced: 12 Apr 2025
https://tiger-ai-lab.github.io/ConsistI2V/
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)
diffusion-models image-to-video-generation video-generation video-synthesis
Last synced: 28 Mar 2025
https://github.com/tiger-ai-lab/verl-tool
A version of verl to support tool use
Last synced: 24 Dec 2025
https://github.com/TIGER-AI-Lab/MMLU-Pro
The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
Last synced: 23 Apr 2025
https://github.com/tiger-ai-lab/imagenhub
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)
deep-learning diffusion-models generative-ai generative-art image-editing image-generation pytorch stable-diffusion
Last synced: 12 Apr 2025
https://github.com/tiger-ai-lab/longrag
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/critiquefinetuning
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"
Last synced: 13 Jun 2025
https://github.com/TIGER-AI-Lab/ImagenHub
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)
deep-learning diffusion-models generative-ai generative-art image-editing image-generation pytorch stable-diffusion
Last synced: 18 Jul 2025
https://github.com/tiger-ai-lab/mammoth2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/general-reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/pixel-reasoner
Pixel-Level Reasoning Model trained with RL
Last synced: 13 Jun 2025
https://tiger-ai-lab.github.io/MAmmoTH2/
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
Last synced: 11 Apr 2025
https://github.com/tiger-ai-lab/scholarcopilot
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/editreward
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]
Last synced: 14 Feb 2026
https://github.com/tiger-ai-lab/omniedit
Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]
diffusionmodel imageediting imagegeneration
Last synced: 29 Jan 2026
https://github.com/tiger-ai-lab/vl-rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/longiclbench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/acecoder
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis"
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/videoscore
official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/vamba
Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers"
Last synced: 22 Jun 2025
https://github.com/tiger-ai-lab/uniir
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers"
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/structlm
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/mega-bench
This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]
benchmark evaluation multimodal
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/kb-binder
"Few-shot In-context Learning for Knowledge Base Question Answering" [ACL2023]
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/videogenhub
A one-stop library to standardize the inference and evaluation of all the conditional video generation models.
deep-learning diffusion-models generative-ai pytorch video-generation
Last synced: 17 Oct 2025
https://github.com/tiger-ai-lab/tablecot
The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023]
Last synced: 30 Jul 2025
https://github.com/TIGER-AI-Lab/TableCoT
The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023]
Last synced: 08 Jul 2025
https://github.com/tiger-ai-lab/quickvideo
Quick Long Video Understanding
llm multimodal multimodal-learning video
Last synced: 13 Jun 2025
https://github.com/TIGER-AI-Lab/ImagenWorld
Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]
Last synced: 16 Jun 2026
https://github.com/tiger-ai-lab/theoremqa
The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)
Last synced: 06 Sep 2025
https://tiger-ai-lab.github.io/ImagenWorld/
Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]
Last synced: 16 Jun 2026
https://github.com/tiger-ai-lab/tigerscore
"TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks" [TMLR 2024]
evaluation language-model llm metrics
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/viescore
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024 main)
computer-vision gpt4vision image-editing image-generation visual-question-answering
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/visualwebinstruct
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/imagenworld
Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks
Last synced: 13 Feb 2026
https://github.com/tiger-ai-lab/pixelworld
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
Last synced: 30 Jan 2026
https://github.com/tiger-ai-lab/browseragent
An agent that can interact with browser to complete tasks
Last synced: 25 Oct 2025
https://github.com/tiger-ai-lab/vista
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/abc
ABC: Achieving Better Control of Multimodal Embeddings using VLMs
information-retrieval multimodal
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/videoeval-pro
More reliable Video Understanding Evaluation
evaluation multimodal understanding video
Last synced: 13 Jun 2025
https://github.com/tiger-ai-lab/viscoder
The official code of "VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation"
Last synced: 18 Jul 2025
https://github.com/tiger-ai-lab/genai-bench
Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models"
diffusion evaluation generative-ai
Last synced: 28 Oct 2025
https://github.com/tiger-ai-lab/disprotedit
Official Repo for "DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing" [ICMLW 2025]
Last synced: 03 Feb 2026
https://github.com/tiger-ai-lab/one-shot-cft
The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem”
Last synced: 13 Jun 2025