Projects in Awesome Lists by TIGER-AI-Lab

https://github.com/tiger-ai-lab/theoremexplainagent

Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025]

llm-agents manim manim-animations manim-video rag

Last synced: 13 Jun 2025

https://github.com/TIGER-AI-Lab/TheoremExplainAgent

Official Repo for "TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding"

llm-agents manim manim-animations manim-video rag

Last synced: 01 Apr 2025

https://github.com/tiger-ai-lab/anyv2v

Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)

deep-learning generative-ai image-editing image-to-video-generation pytorch video-editing video-synthesis

Last synced: 13 Jun 2025

https://github.com/TIGER-AI-Lab/OpenResearcher

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

deep-research llm retrieval

Last synced: 05 Mar 2026

https://github.com/TIGER-AI-Lab/MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]

llm math reasoning

Last synced: 18 Jul 2025

https://github.com/tiger-ai-lab/clawbench

Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.

agent-evaluation agentic-ai ai-agent-benchmark ai-agents benchmark browser-agent browser-automation browser-use chrome-agent chrome-extension computer-use dataset evaluation everyday-tasks llm llm-evaluation online-tasks real-world-benchmark web-agent web-agents

Last synced: 31 May 2026

https://github.com/tiger-ai-lab/program-of-thoughts

Data and Code for Program of Thoughts (TMLR 2023)

llm program prompting

Last synced: 22 Aug 2025

https://github.com/tiger-ai-lab/mammoth

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Last synced: 13 Jun 2025

https://github.com/TIGER-AI-Lab/Program-of-Thoughts

Data and Code for Program of Thoughts (TMLR 2023)

llm program prompting

Last synced: 31 Mar 2025

https://github.com/tiger-ai-lab/vlm2vec

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]

benchmark contrastive-learning embedding image-retrieval mmeb multimodal rag representation-learning video-retrieval visual-document-retrieval vlm

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/mmlu-pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

evaluation llm

Last synced: 08 Oct 2025

https://github.com/tiger-ai-lab/consisti2v

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)

diffusion-models image-to-video-generation video-generation video-synthesis

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR2024]

fuyu language llava-llama3 lmm mantis mllm multi-image-understanding multimodal video vision vlm

Last synced: 13 Jun 2025

https://tiger-ai-lab.github.io/Mantis/

Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR2024]

fuyu language llava-llama3 lmm mantis mllm multi-image-understanding multimodal video vision vlm

Last synced: 12 Apr 2025

https://tiger-ai-lab.github.io/ConsistI2V/

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)

diffusion-models image-to-video-generation video-generation video-synthesis

Last synced: 28 Mar 2025

https://github.com/tiger-ai-lab/verl-tool

A version of verl to support tool use

learning llm reinforcement

Last synced: 24 Dec 2025

https://github.com/TIGER-AI-Lab/MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

evaluation llm

Last synced: 23 Apr 2025

https://github.com/tiger-ai-lab/imagenhub

A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)

deep-learning diffusion-models generative-ai generative-art image-editing image-generation pytorch stable-diffusion

Last synced: 12 Apr 2025

https://github.com/tiger-ai-lab/longrag

Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".

llm rag

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/critiquefinetuning

Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"

fine-tuning languagemodel

Last synced: 13 Jun 2025

https://github.com/TIGER-AI-Lab/ImagenHub

A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)

deep-learning diffusion-models generative-ai generative-art image-editing image-generation pytorch stable-diffusion

Last synced: 18 Jul 2025

https://github.com/tiger-ai-lab/mammoth2

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

language math reasoning

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/general-reasoner

General Reasoner: Advancing LLM Reasoning Across All Domains

llm reasoning rl

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/pixel-reasoner

Pixel-Level Reasoning Model trained with RL

llm multimodal reasoning

Last synced: 13 Jun 2025

https://tiger-ai-lab.github.io/MAmmoTH2/

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

language math reasoning

Last synced: 11 Apr 2025

https://github.com/tiger-ai-lab/scholarcopilot

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

copilot lm rag

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/editreward

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]

diffusion editing evaluation

Last synced: 14 Feb 2026

https://github.com/tiger-ai-lab/omniedit

Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]

diffusionmodel imageediting imagegeneration

Last synced: 29 Jan 2026

https://github.com/tiger-ai-lab/vl-rethinker

The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"

llm multimodal reasoning vlm

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/longiclbench

Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]

large-language-models

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/acecoder

The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis"

code codellm llm

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/videoscore

Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]

language machine-learning

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/vamba

Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers"

llm video vlm

Last synced: 22 Jun 2025

https://github.com/tiger-ai-lab/uniir

Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers"

language-model retrieval

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/structlm

Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/mega-bench

This repo contains the code for "MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]

benchmark evaluation multimodal

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/kb-binder

"Few-shot In-context Learning for Knowledge Base Question Answering" [ACL2023]

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/videogenhub

A one-stop library to standardize the inference and evaluation of all the conditional video generation models.

deep-learning diffusion-models generative-ai pytorch video-generation

Last synced: 17 Oct 2025

https://github.com/tiger-ai-lab/tablecot

The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023]

Last synced: 30 Jul 2025

https://github.com/TIGER-AI-Lab/TableCoT

The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023]

Last synced: 08 Jul 2025

https://github.com/tiger-ai-lab/quickvideo

Quick Long Video Understanding

llm multimodal multimodal-learning video

Last synced: 13 Jun 2025

https://github.com/TIGER-AI-Lab/ImagenWorld

Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]

genai generation image

Last synced: 16 Jun 2026

https://github.com/tiger-ai-lab/theoremqa

The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)

lm math theorem

Last synced: 06 Sep 2025

https://tiger-ai-lab.github.io/ImagenWorld/

Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]

genai generation image

Last synced: 16 Jun 2026

https://github.com/tiger-ai-lab/tigerscore

"TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks" [TMLR 2024]

evaluation language-model llm metrics

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/viescore

Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024 main)

computer-vision gpt4vision image-editing image-generation visual-question-answering

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/visualwebinstruct

The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"

llm vlm

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/imagenworld

Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

genai generation image

Last synced: 13 Feb 2026

https://github.com/tiger-ai-lab/pixelworld

The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]

llm vlm

Last synced: 30 Jan 2026

https://github.com/tiger-ai-lab/browseragent

An agent that can interact with a browser to complete tasks

agent browser llm

Last synced: 25 Oct 2025

https://github.com/tiger-ai-lab/vista

The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]

lm multimodal vlm

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/abc

ABC: Achieving Better Control of Multimodal Embeddings using VLMs

information-retrieval multimodal

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/videoeval-pro

More reliable Video Understanding Evaluation

evaluation multimodal understanding video

Last synced: 13 Jun 2025

https://github.com/tiger-ai-lab/viscoder

The official code of "VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation"

Last synced: 18 Jul 2025

https://github.com/tiger-ai-lab/genai-bench

Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models"

diffusion evaluation generative-ai

Last synced: 28 Oct 2025

https://github.com/tiger-ai-lab/disprotedit

Official Repo for "DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing" [ICMLW 2025]

protein protein-editing

Last synced: 03 Feb 2026