Projects in Awesome Lists by EvolvingLMMs-Lab
A curated list of projects in awesome lists by EvolvingLMMs-Lab .
https://github.com/evolvinglmms-lab/lmms-eval
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
agi audio-evaluation benchmark evaluation large-language-models llm-evaluation multimodal multimodal-evaluation video-understanding vision-language-model vlm
Last synced: 28 Feb 2026
https://github.com/evolvinglmms-lab/otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
artificial-inteligence chatgpt deep-learning embodied-ai foundation-models gpt-4 instruction-tuning large-scale-models machine-learning multi-modality visual-language-learning
Last synced: 13 Dec 2025
https://github.com/EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Last synced: 05 Sep 2025
https://github.com/EvolvingLMMs-Lab/open-r1-multimodal
A fork to add multimodal model training to open-r1
Last synced: 10 Apr 2025
https://github.com/evolvinglmms-lab/open-r1-multimodal
A fork to add multimodal model training to open-r1
Last synced: 11 Apr 2025
https://github.com/EvolvingLMMs-Lab/RelateAnything
Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
Last synced: 02 May 2025
https://github.com/evolvinglmms-lab/relateanything
Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
Last synced: 05 Apr 2025
https://github.com/evolvinglmms-lab/llava-onevision-1.5
Fully Open Framework for Democratized Multimodal Training
llava llm mllm qwen3 vision-language-model
Last synced: 26 Dec 2025
https://github.com/evolvinglmms-lab/longva
Long Context Transfer from Language to Vision
Last synced: 12 Apr 2025
https://github.com/EvolvingLMMs-Lab/LongVA
Long Context Transfer from Language to Vision
Last synced: 07 May 2025
https://github.com/evolvinglmms-lab/egolife
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
egocentric-vision omnimodal rag
Last synced: 05 Apr 2025
https://github.com/EvolvingLMMs-Lab/EgoLife
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
egocentric-vision omnimodal rag
Last synced: 01 Apr 2025
https://github.com/evolvinglmms-lab/lmms-lab-writer
Agentic LaTeX Writer - Local-first editor for AI-assisted academic writing
academic-writing ai editor latex writing
Last synced: 31 May 2026
https://github.com/evolvinglmms-lab/multimodal-sae
Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
Last synced: 06 Apr 2025
https://github.com/evolvinglmms-lab/neo
NEO Series: Native Vision-Language Models from First Principles
Last synced: 27 Oct 2025
https://github.com/evolvinglmms-lab/mgpo
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
Last synced: 06 Mar 2026
https://github.com/evolvinglmms-lab/sae
A framework that allows you to apply Sparse AutoEncoder on any models
Last synced: 20 Jul 2025
https://github.com/evolvinglmms-lab/engram
Privacy-first AI memory layer - Signal for AI Memory. E2EE, local-first, works with Claude, Cursor, and any MCP-compatible AI.
ai claude cursor e2ee encryption llm local-first mcp memory privacy
Last synced: 19 Feb 2026
https://github.com/evolvinglmms-lab/homebrew-tap
Homebrew tap for LMMs-Lab applications
Last synced: 19 Feb 2026