Projects in Awesome Lists by EvolvingLMMs-Lab

https://github.com/evolvinglmms-lab/lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

agi audio-evaluation benchmark evaluation large-language-models llm-evaluation multimodal multimodal-evaluation video-understanding vision-language-model vlm

Last synced: 28 Feb 2026

https://github.com/evolvinglmms-lab/otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

artificial-inteligence chatgpt deep-learning embodied-ai foundation-models gpt-4 instruction-tuning large-scale-models machine-learning multi-modality visual-language-learning

Last synced: 13 Dec 2025

https://github.com/EvolvingLMMs-Lab/lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Last synced: 05 Sep 2025

https://github.com/EvolvingLMMs-Lab/open-r1-multimodal

A fork to add multimodal model training to open-r1

Last synced: 10 Apr 2025

https://github.com/evolvinglmms-lab/open-r1-multimodal

A fork to add multimodal model training to open-r1

Last synced: 11 Apr 2025

https://github.com/EvolvingLMMs-Lab/RelateAnything

Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.

Last synced: 02 May 2025

https://github.com/evolvinglmms-lab/relateanything

Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.

Last synced: 05 Apr 2025

https://github.com/evolvinglmms-lab/llava-onevision-1.5

Fully Open Framework for Democratized Multimodal Training

llava llm mllm qwen3 vision-language-model

Last synced: 26 Dec 2025

https://github.com/evolvinglmms-lab/longva

Long Context Transfer from Language to Vision

Last synced: 12 Apr 2025

https://github.com/EvolvingLMMs-Lab/LongVA

Long Context Transfer from Language to Vision

Last synced: 07 May 2025

https://github.com/evolvinglmms-lab/egolife

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

egocentric-vision omnimodal rag

Last synced: 05 Apr 2025

https://github.com/EvolvingLMMs-Lab/EgoLife

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

egocentric-vision omnimodal rag

Last synced: 01 Apr 2025

https://github.com/evolvinglmms-lab/lmms-lab-writer

Agentic LaTeX Writer - Local-first editor for AI-assisted academic writing

academic-writing ai editor latex writing

Last synced: 31 May 2026

https://github.com/evolvinglmms-lab/multimodal-sae

Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.

Last synced: 06 Apr 2025

https://github.com/evolvinglmms-lab/neo

NEO Series: Native Vision-Language Models from First Principles

Last synced: 27 Oct 2025

https://github.com/evolvinglmms-lab/aero-1

Last synced: 06 Jul 2025

https://github.com/evolvinglmms-lab/videommmu

Last synced: 07 Sep 2025

https://github.com/evolvinglmms-lab/mgpo

High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning

Last synced: 06 Mar 2026

https://github.com/evolvinglmms-lab/sae

A framework that allows you to apply a Sparse AutoEncoder to any model

Last synced: 20 Jul 2025

https://github.com/evolvinglmms-lab/engram

Privacy-first AI memory layer - Signal for AI Memory. E2EE, local-first, works with Claude, Cursor, and any MCP-compatible AI.

ai claude cursor e2ee encryption llm local-first mcp memory privacy

Last synced: 19 Feb 2026
