https://github.com/Yuan-ManX/ai-game-devtools
Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥
https://github.com/Yuan-ManX/ai-game-devtools
List: ai-game-devtools
ai-platform ai-toolkit aigc artificial-intelligence awesome-list deep-learning game-ai game-development game-engine mechine-learing unity
Last synced: about 1 year ago
JSON representation
Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥
- Host: GitHub
- URL: https://github.com/Yuan-ManX/ai-game-devtools
- Owner: Yuan-ManX
- License: mit
- Created: 2023-03-21T03:01:17.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2025-04-07T06:29:37.000Z (about 1 year ago)
- Last Synced: 2025-05-07T02:02:01.747Z (about 1 year ago)
- Topics: ai-platform, ai-toolkit, aigc, artificial-intelligence, awesome-list, deep-learning, game-ai, game-development, game-engine, mechine-learing, unity
- Homepage: https://yuan-manx.github.io/ai-game-devtools/
- Size: 1.78 MB
- Stars: 825
- Watchers: 29
- Forks: 80
- Open Issues: 5
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- Awesome-Gamedev - link
- awesome_ai_agents - Ai-Game-Devtools - Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, An… (Building / Tools)
- ultimate-awesome - ai-game-devtools - Here we will keep track of the latest AI Game Development Tools, including LLM, World Model, Agent, Code, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥. (Programming Language Lists / Python Lists)
- awesome-opensource-unity - AI Game DevTools (AI-GDT) - the latest AI Game Development Tools, including LLM, World Model, Agent, Code, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics (Open Source Packages / AI Tools)
- awesome-openclaw-skills - Yuan-ManX/ai-game-devtools
README
# AI Game DevTools (AI-GDT) 🎮
Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥
## Table of Contents
* [Tool (AI LLM)](#tool)
* [Game (Agent)](#game)
* [Code](#code)
* [Writer](#writer)
* [Image](#image)
* [Texture](#texture)
* [Shader](#shader)
* [3D Model](#model)
* [Avatar](#avatar)
* [Animation](#animation)
* [Visual](#visual)
* [Video](#video)
* [Audio](#audio)
* [Music](#music)
* [Singing Voice](#voice)
* [Speech](#speech)
* [Analytics](#analytics)
## Project List
### Tool (AI LLM)
| Source | Description | Paper | Game Engine | Type |
| :------------------------------------------------------------------------------------------ | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :-----------: | :-----------: | :-------: |
| [AgentGPT](https://github.com/reworkd/AgentGPT) | 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. | | | Tool |
| [AICommand](https://github.com/keijiro/AICommand) | ChatGPT integration with Unity Editor. | | Unity | Tool |
| [AIOS](https://github.com/agiresearch/AIOS) | LLM Agent Operating System. | | | Tool |
| [AI Scientist](https://github.com/SakanaAI/AI-Scientist) | The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery. |[arXiv](https://arxiv.org/abs/2408.06292) | | Tool |
| [Assistant CLI](https://github.com/diciaup/assistant-cli) | A comfortable CLI tool to use ChatGPT service🔥 | | | Tool |
| [Auto-GPT](https://github.com/Significant-Gravitas/Auto-GPT) | An experimental open-source attempt to make GPT-4 fully autonomous. | | | Tool |
| [BabyAGI](https://github.com/yoheinakajima/babyagi) | This Python script is an example of an AI-powered task management system. | | | Tool |
| [👶🤖🖥️ BabyAGI UI](https://github.com/miurla/babyagi-ui) | BabyAGI UI is designed to make it easier to run and develop with babyagi in a web app, like a ChatGPT. | | | Tool |
| [baichuan-7B](https://github.com/baichuan-inc/baichuan-7B) | A large-scale 7B pretraining language model developed by Baichuan. | | | Tool |
| [Baichuan-13B](https://github.com/baichuan-inc/Baichuan-13B) | A 13B large language model developed by Baichuan Intelligent Technology. | | | Tool |
| [Baichuan 2](https://github.com/baichuan-inc/Baichuan2) | A series of large language models developed by Baichuan Intelligent Technology. | | | Tool |
| [Bisheng](https://github.com/dataelement/bisheng) | Bisheng is an open LLM devops platform for next generation AI applications. | | | Tool |
| [Character-LLM](https://github.com/choosewhatulike/trainable-agents) | A Trainable Agent for Role-Playing. |[arXiv](https://arxiv.org/abs/2310.10158) | | Tool |
| [ChatDev](https://github.com/OpenBMB/ChatDev) | Communicative Agents for Software Development. |[arXiv](https://arxiv.org/abs/2307.07924) | | Tool |
| [ChatGPT-API-unity](https://github.com/mochi-neko/ChatGPT-API-unity) | Binds ChatGPT chat completion API to pure C# on Unity. | | Unity | Tool |
| [ChatGPTForUnity](https://github.com/sunsvip/ChatGPTForUnity) | ChatGPT for unity. | | Unity | Tool |
| [ChatRWKV](https://github.com/BlinkDL/ChatRWKV) | ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. | | | Tool |
| [ChatYuan](https://github.com/clue-ai/ChatYuan) | Large Language Model for Dialogue in Chinese and English. | | | Tool |
| [Chinese-LLaMA-Alpaca-3](https://github.com/ymcui/Chinese-LLaMA-Alpaca-3) | (Chinese Llama-3 LLMs) developed from Meta Llama 3. | | | Tool |
| [Chrome-GPT](https://github.com/richardyc/Chrome-GPT) | An AutoGPT agent that controls Chrome on your desktop. | | | Tool |
| [CogVLM](https://www.modelscope.cn/models/ZhipuAI/CogVLM/summary) | CogVLM, a powerful open-source visual language foundation model. |[arXiv](https://arxiv.org/abs/2311.03079) | | Tool |
| [CoreNet](https://github.com/apple/corenet) | A library for training deep neural networks. | | | Tool |
| [Cosmos](https://github.com/NVIDIA/Cosmos) | Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. | | | LLM |
| [DBRX](https://github.com/databricks/dbrx) | DBRX is a large language model trained by Databricks. | | | Tool |
| [DCLM](https://github.com/mlfoundations/dclm) | DataComp for Language Models. |[arXiv](https://arxiv.org/abs/2406.11794) | | Tool |
| [DeepSeek-R1](https://github.com/deepseek-ai/DeepSeek-R1) | DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning. | | | LLM |
| [DeepSeek-V3](https://github.com/deepseek-ai/DeepSeek-V3) | DeepSeek-V3 is a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. |[arXiv](https://arxiv.org/abs/2412.19437) | | LLM |
| [DemoGPT](https://github.com/melih-unsal/DemoGPT) | Auto Gen-AI App Generator with the Power of Llama 2 | | | Tool |
| [Design2Code](https://github.com/NoviScl/Design2Code) | Automating Front-End Engineering | | | Tool |
| [Devika](https://github.com/stitionai/devika) | Devika is an Agentic AI Software Engineer. | | | Tool |
| [Devon](https://github.com/entropy-research/Devon) | An open-source pair programmer. | | | Tool |
| [Dora](https://www.dora.run/ai) | Generating powerful websites, one prompt at a time. | | | Tool |
| [Flowise](https://github.com/FlowiseAI/Flowise) | Drag & drop UI to build your customized LLM flow using LangchainJS. | | | Tool |
| [Gemini](https://deepmind.google/technologies/gemini) | Gemini is built from the ground up for multimodality — reasoning seamlessly across text, images, video, audio, and code. | | | Tool |
| [Gemma](https://github.com/google/gemma_pytorch) | Gemma is a family of lightweight, state-of-the art open models built from research and technology used to create Google Gemini models. | | | Tool |
| [gemma.cpp](https://github.com/google/gemma.cpp) | lightweight, standalone C++ inference engine for Google's Gemma models. | | | Tool |
| [GLM-4](https://github.com/THUDM/GLM-4) | GLM-4-9B is the open-source version of the latest generation of pre-trained models in the GLM-4 series launched by Zhipu AI. | | | Tool |
| [GPT4All](https://github.com/nomic-ai/gpt4all) | A chatbot trained on a massive collection of clean assistant data including code, stories and dialogue. | | | Tool |
| [GPT-4o](https://openai.com/index/hello-gpt-4o/) | GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. | | | Tool |
| [GPTScript](https://github.com/gptscript-ai/gptscript) | Develop LLM Apps in Natural Language. | | | Tool |
| [Grok-1](https://x.ai/blog/grok-os) | The weights and architecture of our 314 billion parameter Mixture-of-Experts model, Grok-1. | | | Tool |
| [HuggingChat](https://huggingface.co/chat/) | Making the community's best AI chat models available to everyone. | | | Tool |
| [Hugging Face API Unity Integration](https://github.com/huggingface/unity-api) | This Unity package provides an easy-to-use integration for the Hugging Face Inference API, allowing developers to access and use Hugging Face AI models within their Unity projects. | | Unity | Tool |
| [ImageBind](https://github.com/facebookresearch/ImageBind) | ImageBind One Embedding Space to Bind Them All. |[arXiv](https://arxiv.org/abs/2305.05665) | | Tool |
| [Index-1.9B](https://github.com/bilibili/Index-1.9B) | A SOTA lightweight multilingual LLM. | | | Tool |
| [InteractML-Unity](https://github.com/Interactml/iml-unity) | InteractML, an Interactive Machine Learning Visual Scripting framework for Unity3D. | | Unity | Tool |
| [InteractML-Unreal Engine](https://github.com/Interactml/iml-ue4) | Bringing Machine Learning to Unreal Engine. | | Unreal Engine | Tool |
| [InternLM](https://github.com/InternLM/InternLM) | InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system. |[arXiv](https://arxiv.org/abs/2403.17297) | | Tool |
| [InternLM-XComposer](https://github.com/InternLM/InternLM-XComposer) | InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension. |[arXiv](https://arxiv.org/abs/2404.06512) | | Tool |
| [Jan](https://github.com/janhq/jan) | Bring AI to your Desktop. | | | Tool |
| [Janus](https://github.com/deepseek-ai/Janus) | Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation. |[arXiv](https://arxiv.org/abs/2410.13848) | | LLM |
| [Lamini](https://github.com/lamini-ai/lamini) | Lamini allows any engineering team to outperform general purpose LLMs through RLHF and fine- tuning on their own data. | | | Tool |
| [LaMini-LM](https://github.com/mbzuai-nlp/LaMini-LM) | LaMini-LM is a collection of small-sized, efficient language models distilled from ChatGPT and trained on a large-scale dataset of 2.58M instructions. | | | Tool |
| [LangChain](https://github.com/hwchase17/langchain) | LangChain is a framework for developing applications powered by language models. | | | Tool |
| [LangFlow](https://github.com/logspace-ai/langflow) | ⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows. | | | Tool |
| [LaVague](https://github.com/lavague-ai/LaVague) | Automate automation with Large Action Model framework. | | | Tool |
| [Lemur](https://github.com/OpenLemur/Lemur) | Open Foundation Models for Language Agents. | | | Tool |
| [Lepton AI](https://github.com/leptonai/leptonai) | A Pythonic framework to simplify AI service building. | | | Tool |
| [Lit-LLaMA](https://github.com/Lightning-AI/lit-llama) | Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. | | | Tool |
| [llama2-webui](https://github.com/liltom-eth/llama2-webui) | Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). | | | Tool |
| [Llama 3](https://github.com/meta-llama/llama3) | The official Meta Llama 3 GitHub site. | | | Tool |
| [Llama 3.1](https://github.com/meta-llama/llama-models) | Llama is an accessible, open large language model (LLM) designed for developers, researchers, and businesses to build, experiment, and responsibly scale their generative AI ideas. | | | Tool |
| [LLaSM](https://github.com/LinkSoul-AI/LLaSM) | Large Language and Speech Model. | | | Tool |
| [LLM Answer Engine](https://github.com/developersdigest/llm-answer-engine) | Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper. | | | Tool |
| [llm.c](https://github.com/karpathy/llm.c) | LLM training in simple, raw C/CUDA. | | | Tool |
| [LLMUnity](https://github.com/undreamai/LLMUnity) | Create characters in Unity with LLMs! | | Unity | Tool |
| [LLocalSearch](https://github.com/nilsherzig/LLocalSearch) | LLocalSearch is a completely locally running search engine using LLM Agents. | | | Tool |
| [LogicGamesSolver](https://github.com/fabridigua/LogicGamesSolver) | A Python tool to solve logic games with AI, Deep Learning and Computer Vision. | | | Tool |
| [LongWriter](https://github.com/THUDM/LongWriter) | LongWriter: Unleashing 10,000+ Word Generation From Long Context LLMs. |[arXiv](https://arxiv.org/abs/2408.07055) | | Tool |
| [Large World Model (LWM)](https://github.com/LargeWorldModel/LWM) | Large World Model (LWM) is a general-purpose large-context multimodal autoregressive model. |[arXiv](https://arxiv.org/abs/2402.08268) | | Tool |
| [Lumina-T2X](https://github.com/Alpha-VLLM/Lumina-T2X) | Lumina-T2X is a unified framework for Text to Any Modality Generation. |[arXiv](https://arxiv.org/abs/2405.05945) | | Tool |
| [MetaGPT](https://github.com/geekan/MetaGPT) | The Multi-Agent Framework | | | Tool |
| [MiniCPM-2B](https://github.com/OpenBMB/MiniCPM) | An end-side LLM outperforms Llama2-13B. | | | Tool |
| [MiniGPT-4](https://github.com/Vision-CAIR/MiniGPT-4) | Enhancing Vision-language Understanding with Advanced Large Language Models. |[arXiv](https://arxiv.org/abs/2304.10592) | | Tool |
| [MiniGPT-5](https://github.com/eric-ai-lab/MiniGPT-5) | Interleaved Vision-and-Language Generation via Generative Vokens. |[arXiv](https://arxiv.org/abs/2310.02239) | | Tool |
| [MiniMax-01](https://github.com/MiniMax-AI/MiniMax-01) | MiniMax-01: Scaling Foundation Models with Lightning Attention. |[arXiv](https://arxiv.org/abs/2501.08313) | | LLM |
| [Mixtral 8x7B](https://mistral.ai/news/mixtral-of-experts/) | A high quality Sparse Mixture-of-Experts. |[arXiv](https://arxiv.org/abs/2401.04088) | | Tool |
| [Mistral 7B](https://mistral.ai/news/announcing-mistral-7b/) | The best 7B model to date, Apache 2.0. | | | Tool |
| [Mistral Large](https://mistral.ai/news/mistral-large/) | Mistral Large is a new cutting-edge text generation model. It reaches top-tier reasoning capabilities. | | | Tool |
| [MLC LLM](https://github.com/mlc-ai/mlc-llm) | Enable everyone to develop, optimize and deploy AI models natively on everyone's devices. | | | Tool |
| [MobiLlama](https://github.com/mbzuai-oryx/MobiLlama) | Towards Accurate and Lightweight Fully Transparent GPT. |[arXiv](https://arxiv.org/abs/2402.16840) | | Tool |
| [MoE-LLaVA](https://github.com/PKU-YuanGroup/MoE-LLaVA) | Mixture of Experts for Large Vision-Language Models. |[arXiv](https://arxiv.org/abs/2401.15947) | | Tool |
| [Moshi](https://www.moshi.chat/?queue_id=talktomoshi) | Moshi is an experimental conversational AI. | | | Tool |
| [Moshi](https://github.com/kyutai-labs/moshi) | Moshi: a speech-text foundation model for real time dialogue. | | | Tool |
| [MOSS](https://github.com/OpenLMLab/MOSS) | An open-source tool-augmented conversational language model from Fudan University. | | | Tool |
| [mPLUG-Owl🦉](https://github.com/X-PLUG/mPLUG-Owl) | Modularization Empowers Large Language Models with Multimodality. |[arXiv](https://arxiv.org/abs/2304.14178) | | Tool |
| [Nemotron-4](https://arxiv.org/abs/2402.16819) | A 15-billion-parameter large multilingual language model trained on 8 trillion text tokens. |[arXiv](https://arxiv.org/abs/2402.16819) | | Tool |
| [NExT-GPT](https://github.com/NExT-GPT/NExT-GPT) | Any-to-Any Multimodal Large Language Model. | | | Tool |
| [OLMo](https://github.com/allenai/OLMo) | Open Language Model |[arXiv](https://arxiv.org/abs/2402.00838) | | Tool |
| [OmniLMM](https://github.com/OpenBMB/OmniLMM) | Large multi-modal models for strong performance and efficient deployment. | | | Tool |
| [OneLLM](https://github.com/csuhan/OneLLM) | One Framework to Align All Modalities with Language. |[arXiv](https://arxiv.org/abs/2312.03700) | | Tool |
| [Open-Assistant](https://github.com/LAION-AI/Open-Assistant) | OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so. | | | Tool |
| [Open Deep Research](https://github.com/dzhng/deep-research) | An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. | | | LLM |
| [OpenDevin](https://github.com/OpenDevin/OpenDevin) | An autonomous AI software engineer. | | | Tool |
| [Orion-14B](https://github.com/OrionStarAI/Orion) | Orion-14B is a family of models includes a 14B foundation LLM, and a series of models. |[arXiv](https://arxiv.org/abs/2401.12246) | | Tool |
| [Panda](https://github.com/dandelionsllm/pandallm) | Overseas Chinese open source large language model, based on Llama-7B, -13B, -33B, -65B for continuous pre-training in the Chinese field. | | | Tool |
| [Perplexica](https://github.com/ItzCrazyKns/Perplexica) | An AI-powered search engine. | | | Tool |
| [Pi](https://heypi.com/talk) | AI chatbot designed for personal assistance and emotional support. | | | Tool |
| [Qwen1.5](https://github.com/QwenLM/Qwen1.5) | Qwen1.5 is the improved version of Qwen. | | | Tool |
| [Qwen2](https://github.com/QwenLM/Qwen2) | Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud. | | | LLM |
| [Qwen2.5-Coder](https://github.com/QwenLM/Qwen2.5-Coder) | Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud. |[arXiv](https://arxiv.org/abs/2409.12186) | | LLM |
| [Qwen-7B](https://github.com/QwenLM/Qwen-7B) | The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud. | | | LLM |
| [RepoAgent](https://github.com/OpenBMB/RepoAgent) | RepoAgent is an Open-Source project driven by Large Language Models(LLMs) that aims to provide an intelligent way to document projects. |[arXiv](https://arxiv.org/abs/2402.16667) | | Tool |
| [s1](https://github.com/simplescaling/s1) | s1: Simple test-time scaling. |[arXiv](https://arxiv.org/abs/2501.19393) | | LLM |
| [Sanity AI Engine](https://github.com/tosos/SanityEngine) | Sanity AI Engine for the Unity Game Development Tool. | | Unity | Tool |
| [SearchGPT](https://github.com/tobiasbueschel/search-gpt) | 🌳 Connecting ChatGPT with the Internet | | | Tool |
| [ShareGPT4V](https://sharegpt4v.github.io/) | Improving Large Multi-Modal Models with Better Captions. | | | Tool |
| [SkyThought](https://github.com/NovaSky-AI/SkyThought) | Sky-T1: Train your own O1 preview model within $450. | | | LLM |
| [Skywork](https://github.com/SkyworkAI/Skywork) | Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. | | | Tool |
| [StableLM](https://github.com/Stability-AI/StableLM) | Stability AI Language Models. |[arXiv](https://arxiv.org/abs/2402.17834) | | Tool |
| [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca) | An Instruction-following LLaMA Model. | | | LLM |
| [Text generation web UI](https://github.com/oobabooga/text-generation-webui) | A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, OPT, and GALACTICA. | | | Tool |
| [TinyChatEngine](https://github.com/mit-han-lab/TinyChatEngine) | On-Device LLM Inference Library. | | | Tool |
| [ToolBench](https://github.com/openbmb/toolbench) | An open platform for training, serving, and evaluating large language model for tool learning. | | | Tool |
| [Unity ChatGPT](https://github.com/dilmerv/UnityChatGPT) | Unity ChatGPT Experiments. | | Unity | Tool |
| [Unity OpenAI-API Integration](https://github.com/himanshuskyrockets/Unity_OpenAI) | Integrate openai GPT-3 language model and ChatGPT API into a Unity project. | | Unity | Tool |
| [Unreal Engine 5 Llama LoRA](https://github.com/bublint/ue5-llama-lora) | A proof-of-concept project that showcases the potential for using small, locally trainable LLMs to create next-generation documentation tools. | | Unreal Engine | Tool |
| [UnrealGPT](https://github.com/TREE-Ind/UnrealGPT) | A collection of Unreal Engine 5 Editor Utility widgets powered by GPT3/4. | | Unreal Engine | Tool |
| [Video-LLaVA](https://github.com/PKU-YuanGroup/Video-LLaVA) | Learning United Visual Representation by Alignment Before Projection. |[arXiv](https://arxiv.org/abs/2311.10122) | | Tool |
| [WebGPT](https://github.com/0hq/WebGPT) | Run GPT model on the browser with WebGPU. | | | Tool |
| [Web3-GPT](https://github.com/Markeljan/Web3GPT) | Deploy smart contracts with AI | | | Tool |
| [WordGPT](https://github.com/filippofinke/WordGPT) | 🤖 Bring the power of ChatGPT to Microsoft Word | | | Tool |
| [XAgent](https://github.com/OpenBMB/XAgent) | An Autonomous LLM Agent for Complex Task Solving. | | | Tool |
| [Yi](https://github.com/01-ai/Yi) | A series of large language models trained from scratch by developers. | | | Tool |
| [01 Project](https://github.com/OpenInterpreter/01) | The open-source language model computer. | | | Tool |
## Game (Agent)
| Source | Description | Paper | Game Engine | Type |
| :------------------------------------------------------------------------------------------ | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :-----------: | :-----------: | :-------: |
| [AgentBench](https://github.com/thudm/agentbench) | A Comprehensive Benchmark to Evaluate LLMs as Agents. |[arXiv](https://arxiv.org/abs/2308.03688) | | Agent |
| [Agent Group Chat](https://github.com/MikeGu721/AgentGroup) | An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior. |[arXiv](https://arxiv.org/abs/2403.13433) | | Agent |
| [Agent K](https://github.com/mikekelly/AgentK) | An autoagentic AGI that is self-evolving and modular. | | | Agent |
| [Agent Laboratory](https://github.com/SamuelSchmidgall/AgentLaboratory) | Agent Laboratory: Using LLM Agents as Research Assistants. |[arXiv](https://arxiv.org/abs/2501.04227) | | Agent |
| [AgentScope](https://github.com/modelscope/agentscope) | Start building LLM-empowered multi-agent applications in an easier way. |[arXiv](https://arxiv.org/abs/2402.14034) | | Agent |
| [AgentSims](https://github.com/py499372727/AgentSims/) | An Open-Source Sandbox for Large Language Model Evaluation. | | | Agent |
| [AI Town](https://github.com/a16z-infra/ai-town) | AI Town is a virtual town where AI characters live, chat and socialize. | | | Agent |
| [anime.gf](https://github.com/cyanff/anime.gf) | Local & Open Source Alternative to CharacterAI. | | | Game |
| [Astrocade](https://www.astrocade.com/) | Create games with AI | | | Game |
| [Atomic Agents](https://github.com/KennyVaneetvelde/atomic_agents) | The Atomic Agents framework is designed to be modular, extensible, and easy to use. | | | Agent |
| [AutoAgents](https://github.com/Link-AGI/AutoAgents) | A Framework for Automatic Agent Generation. | | | Agent |
| [AutoGen](https://github.com/microsoft/autogen) | Enable Next-Gen Large Language Model Applications. |[arXiv](https://arxiv.org/abs/2308.08155) | | Agent |
| [behaviac](https://github.com/Tencent/behaviac) | Behaviac is a framework of the game AI development. | | | Framework |
| [Biomes](https://github.com/ill-inc/biomes-game) | Biomes is an open source sandbox MMORPG built for the web using web technologies such as Next.js, Typescript, React and WebAssembly. | | | Game |
| [Buffer of Thoughts](https://github.com/YangLing0818/buffer-of-thought-llm) | Thought-Augmented Reasoning with Large Language Models. |[arXiv](https://arxiv.org/abs/2406.04271) | | Agent |
| [Byzer-Agent](https://github.com/allwefantasy/byzer-agent) | Easy, fast, and distributed agent framework for everyone. | | | Agent |
| [Cat Town](https://github.com/ykhli/cat-town) | A C(h)atGPT-powered simulation with cats. | | | Agent |
| [Cat Town](https://github.com/ykhli/cat-town) | A C(h)atGPT-powered simulation with cats. | | | Agent |
| [CharacterGLM](https://github.com/thu-coai/CharacterGLM-6B) | Customizing Chinese Conversational AI Characters with Large Language Models. |[arXiv](https://arxiv.org/abs/2311.16832) | | Agent |
| [ChatDev](https://github.com/OpenBMB/ChatDev) | Communicative Agents for Software Development. |[arXiv](https://arxiv.org/abs/2405.04219) | | Agent |
| [CogAgent](https://modelscope.cn/models/ZhipuAI/cogagent-chat/summary) | CogAgent is an open-source visual language model improved based on CogVLM. |[arXiv](https://arxiv.org/abs/2312.08914) | | Agent |
| [Cradle](https://github.com/BAAI-Agents/Cradle) | Towards General Computer Control. | | | Agent |
| [crewAI](https://github.com/joaomdmoura/crewAI) | Framework for orchestrating role-playing, autonomous AI agents. | | | Agent |
| [Dify](https://github.com/langgenius/dify) | Dify is an open-source LLM app building platform. | | | Agent |
| [Digital Life Project](https://digital-life-project.com/) | Autonomous 3D Characters with Social Intelligence. |[arXiv](https://arxiv.org/abs/2312.04547) | | Agent |
| [everything-ai](https://github.com/AstraBert/everything-ai) | Your fully proficient, AI-powered and local chatbot assistant🤖. | | | Agent |
| [fabric](https://github.com/danielmiessler/fabric) | fabric is an open-source framework for augmenting humans using AI. | | | Agent |
| [FastGPT](https://github.com/labring/FastGPT) | FastGPT is a knowledge-based platform built on the LLM. | | | Agent |
| [fastRAG](https://github.com/IntelLabs/fastRAG) | Efficient Retrieval Augmentation and Generation Framework. | | | Agent |
| [GameAISDK](https://github.com/Tencent/GameAISDK) | Image-based game AI automation framework. | | | Framework |
| [GameNGen](https://gamengen.github.io/) | Diffusion Models Are Real-Time Game Engines. |[arXiv](https://arxiv.org/abs/2408.14837) | | Game |
| [GameGen-O](https://github.com/GameGen-O/GameGen-O) | GameGen-O: Open-world Video Game Generation. | | | Game |
| [GenAgent](https://github.com/xxyQwQ/GenAgent) | GenAgent: Build Collaborative AI Systems with Automated Workflow Generation - Case Studies on ComfyUI. |[arXiv](https://arxiv.org/abs/2409.01392) | | Agent |
| [Generative Agents](https://github.com/joonspk-research/generative_agents) | Interactive Simulacra of Human Behavior. |[arXiv](https://arxiv.org/abs/2304.03442) | | Agent |
| [Genesis](https://github.com/Genesis-Embodied-AI/Genesis) | Genesis: A Generative and Universal Physics Engine for Robotics and Beyond. | | | Game |
| [Genie](https://sites.google.com/view/genie-2024/home) | Generative Interactive Environments. | | | Game |
| [gigax](https://github.com/GigaxGames/gigax) | Runtime, LLM-powered NPCs. | | | Game |
| [HippoRAG](https://github.com/OSU-NLP-Group/HippoRAG) | Neurobiologically Inspired Long-Term Memory for Large Language Models. |[arXiv](https://arxiv.org/abs/2405.14831) | | Agent |
| [Interactive LLM Powered NPCs](https://github.com/AkshitIreddy/Interactive-LLM-Powered-NPCs) | Interactive LLM Powered NPCs, is an open-source project that completely transforms your interaction with non-player characters (NPCs) in any game! | | | Game |
| [IoA](https://github.com/OpenBMB/IoA) | An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity. | | | Agent |
| [KwaiAgents](https://github.com/KwaiKEG/KwaiAgents) | A generalized information-seeking agent system with Large Language Models (LLMs). |[arXiv](https://arxiv.org/abs/2312.04889) | | Agent |
| [LangChain](https://github.com/langchain-ai/langchain) | Get your LLM application from prototype to production. | | | Agent |
| [Langflow](https://github.com/logspace-ai/langflow) | Langflow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows. | | | Agent |
| [LangGraph Studio](https://github.com/langchain-ai/langgraph-studio) | LangGraph Studio offers a new way to develop LLM applications by providing a specialized agent IDE that enables visualization, interaction, and debugging of complex agentic applications. | | | Agent |
| [LARP](https://github.com/MiAO-AI-Lab/LARP) | Language-Agent Role Play for open-world games. |[arXiv](https://arxiv.org/abs/2312.17653) | | Agent |
| [LLama Agentic System](https://github.com/meta-llama/llama-agentic-system) | Agentic components of the Llama Stack APIs. | | | Agent |
| [LlamaIndex](https://github.com/run-llama/llama_index) | LlamaIndex is a data framework for your LLM application. | | | Agent |
| [MindSearch](https://github.com/InternLM/MindSearch) | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT). | | | Agent |
| [Mixture of Agents (MoA)](https://github.com/togethercomputer/MoA) | Mixture-of-Agents Enhances Large Language Model Capabilities. |[arXiv](https://arxiv.org/abs/2406.04692) | | Agent |
| [MMRole](https://github.com/YanqiDai/MMRole) | MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents. |[arXiv](https://arxiv.org/abs/2408.04203v1) | | Agent |
| [Moonlander.ai](https://www.moonlander.ai/) | Start building 3D games without any coding using generative AI. | | | Framework |
| [MuG Diffusion](https://github.com/Keytoyze/Mug-Diffusion) | MuG Diffusion is a charting AI for rhythm games based on Stable Diffusion (one of the most powerful AIGC models) with a large modification to incorporate audio waves. | | | Game |
| [Oasis](https://github.com/etched-ai/open-oasis) | Oasis is an interactive world model developed by Decart and Etched. Based on diffusion transformers, Oasis takes in user keyboard input and generates gameplay in an autoregressive manner. | | | Game |
| [OmAgent](https://github.com/om-ai-lab/OmAgent) | A multimodal agent framework for solving complex tasks. | | | Agent |
| [OpenAgents](https://github.com/xlang-ai/OpenAgents) | An Open Platform for Language Agents in the Wild. | | | Agent |
| [Opus](https://opus.ai/) | An AI app that turns text into a video game. | | | Game |
| [Pipecat](https://github.com/pipecat-ai/pipecat) | Open Source framework for voice and multimodal conversational AI. | | | Agent |
| [Qwen-Agent](https://github.com/QwenLM/Qwen-Agent) | Qwen-Agent is a framework for developing LLM applications based on the instruction following, tool usage, planning, and memory capabilities of Qwen. | | | Agent |
| [Ragas](https://github.com/explodinggradients/ragas) | Ragas is a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines. | | | Agent |
| [RPBench-Auto](https://github.com/boson-ai/RPBench-Auto) | An automated pipeline for evaluating LLMs for role-playing. | | | Game |
| [SIMA](https://deepmind.google/discover/blog/sima-generalist-ai-agent-for-3d-virtual-environments/) | A generalist AI agent for 3D virtual environments. | | | Agent |
| [StoryGames.ai](https://storygames.buildbox.com/) | AI for Dreamers Make Games. | | | Game |
| [SWE-agent](https://github.com/princeton-nlp/SWE-agent) | Agent Computer Interfaces Enable Software Engineering Language Models. |[arXiv](https://arxiv.org/abs/2405.15793) | | Agent |
| [TaskGen](https://github.com/simbianai/taskgen) | A Task-based agentic framework building on StrictJSON outputs by LLM agents. | | | Agent |
| [TEN Agent](https://github.com/TEN-framework/TEN-Agent) | TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities. | | | Agent |
| [Translation Agent](https://github.com/andrewyng/translation-agent) | Agentic translation using reflection workflow. | | | Agent |
| [Twitter](https://github.com/wordware-ai/twitter) | Twitter Personality is a web application that analyzes your Twitter handle to create a personalized personality profile using Wordware AI Agent. | | | Agent |
| [Unbounded](https://generative-infinite-game.github.io/) | Unbounded: A Generative Infinite Game of Character Life Simulation. |[arXiv](https://arxiv.org/abs/2410.18975) | | Game |
| [Video2Game](https://github.com/video2game/video2game) | Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video. |[arXiv](https://arxiv.org/abs/2404.09833) | | Game |
| [V-IRL](https://virl-platform.github.io/) | Grounding Virtual Intelligence in Real Life. |[arXiv](https://arxiv.org/abs/2402.03310) | | Agent |
| [WebDesignAgent](https://github.com/DAMO-NLP-SG/WebDesignAgent) | An agent used for webdesign. | | | Agent |
| [XAgent](https://github.com/OpenBMB/XAgent) | An Autonomous LLM Agent for Complex Task Solving. | | | Agent |
## Code
| Source | Description | Paper | Game Engine | Type |
| :------------------------------------------------------------------------------------------ | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :-----------: | :-----------: | :-------: |
| [AI Code Translator](https://github.com/mckaywrigley/ai-code-translator) | Use AI to translate code from one language to another. | | | Code |
| [aiXcoder-7B](https://github.com/aixcoder-plugin/aiXcoder-7B) | aiXcoder-7B Code Large Language Model. | | | Code |
| [bloop](https://github.com/BloopAI/bloop) | bloop is a fast code search engine written in Rust. | | | Code |
| [Chapyter](https://github.com/chapyter/chapyter) | ChatGPT Code Interpreter in Jupyter Notebooks. | | | Code |
| [CodeGeeX](https://github.com/THUDM/CodeGeeX) | An Open Multilingual Code Generation Model. |[arXiv](https://arxiv.org/abs/2303.17568) | | Code |
| [CodeGeeX2](https://github.com/THUDM/CodeGeeX2) | A More Powerful Multilingual Code Generation Model. | | | Code |
| [CodeGeeX4](https://github.com/THUDM/CodeGeeX4) | CodeGeeX4: Open Multilingual Code Generation Model. | | | Code |
| [CodeGen](https://github.com/salesforce/CodeGen) | CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex. |[arXiv](https://arxiv.org/abs/2203.13474) | | Code |
| [CodeGen2](https://github.com/salesforce/CodeGen2) | CodeGen2 models for program synthesis. |[arXiv](https://arxiv.org/abs/2305.02309) | | Code |
| [Code Llama](https://github.com/facebookresearch/codellama) | Code Llama is a large language models for code based on Llama 2. | | | Code |
| [CodeTF](https://github.com/salesforce/codetf) | One-stop Transformer Library for State-of-the-art Code LLM. | | | Code |
| [CodeT5](https://github.com/salesforce/codet5) | Open Code LLMs for Code Understanding and Generation. | | | Code |
| [Cursor](https://www.cursor.so/) | Write, edit, and chat about your code with GPT-4 in a new type of editor. | | | Code |
| [DeepSeek Coder](https://github.com/deepseek-ai/DeepSeek-Coder) | DeepSeek Coder: Let the Code Write Itself. |[arXiv](https://arxiv.org/abs/2401.14196) | | Code |
| [OpenAI Codex](https://openai.com/blog/openai-codex) | OpenAI Codex is a descendant of GPT-3. | | | Code |
| [PandasAI](https://github.com/gventuri/pandas-ai) | Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational. | | | Code |
| [RobloxScripterAI](https://www.haddock.ai/search?platform=Roblox) | RobloxScripterAI is an AI-powered code generation tool for Roblox. | | Roblox | Code |
| [Scikit-LLM](https://github.com/iryna-kondr/scikit-llm) | Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks. | | | Code |
| [SoTaNa](https://github.com/DeepSoftwareAnalytics/SoTaNa) | The Open-Source Software Development Assistant. |[arXiv](https://arxiv.org/abs/2308.13416) | | Code |
| [Stable Code 3B](https://bit.ly/3O4oGWW) | Coding on the Edge. | | | Code |
| [StarCoder](https://github.com/bigcode-project/starcoder) | 💫 StarCoder is a language model (LM) trained on source code and natural language text. |[arXiv](https://arxiv.org/abs/2305.06161) | | Code |
| [StarCoder 2](https://github.com/bigcode-project/starcoder2) | StarCoder2 is a family of code generation models (3B, 7B, and 15B), trained on 600+ programming languages from The Stack v2 and some natural language text such as Wikipedia, Arxiv, and GitHub issues. |[arXiv](https://arxiv.org/abs/2402.19173) | | Code |
| [UnityGen AI](https://github.com/himanshuskyrockets/UnityGen-AI) | UnityGen AI is an AI-powered code generation plugin for Unity. | | Unity | Code |
| [Void](https://github.com/voideditor/void) | Void is an open source Cursor alternative. Write code with the best AI tools, retain full control over your data, and access powerful AI features. | | | Code |
## Writer
| Source | Description | Paper | Game Engine | Type |
| :------------------------------------------------------------------------------------------ | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :-----------: | :-----------: | :-------: |
| [AI-Writer](https://github.com/BlinkDL/AI-Writer) | AI writes novels, generates fantasy and romance web articles, etc. Chinese pre-trained generative model. | | | Writer |
| [Notebook.ai](https://github.com/indentlabs/notebook) | Notebook.ai is a set of tools for writers, game designers, and roleplayers to create magnificent universes – and everything within them. | | | Writer |
| [Novel](https://github.com/steven-tey/novel) | Notion-style WYSIWYG editor with AI-powered autocompletions. | | | Writer |
| [NovelAI](https://novelai.net/) | Driven by AI, painlessly construct unique stories, thrilling tales, seductive romances, or just fool around. | | | Writer |
## Image
| Source | Description | Paper | Game Engine | Type |
| :------------------------------------------------------------------------------------------ | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | :-----------: | :-----------: | :-------: |
| [AnyDoor](https://ali-vilab.github.io/AnyDoor-Page/) | Zero-shot Object-level Image Customization. |[arXiv](https://arxiv.org/abs/2307.09481) | | Image |
| [AnyText](https://github.com/tyxsspa/AnyText) | Multilingual Visual Text Generation And Editing. |[arXiv](https://arxiv.org/abs/2311.03054) | | Image |
| [AutoStudio](https://github.com/donahowe/AutoStudio) | Crafting Consistent Subjects in Multi-turn Interactive Image Generation. |[arXiv](https://arxiv.org/abs/2406.01388) | | Image |
| [Blender-ControlNet](https://github.com/coolzilj/Blender-ControlNet) | Using ControlNet right in Blender. | | Blender | Image |
| [BriVL](https://github.com/BAAI-WuDao/BriVL) | Bridging Vision and Language Model. |[arXiv](https://arxiv.org/abs/2103.06561) | | Image |
| [CatVTON](https://github.com/Zheng-Chong/CatVTON) | CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models. |[arXiv](https://arxiv.org/abs/2407.15886) | | Image |
| [CLIPasso](https://github.com/yael-vinker/CLIPasso) | A method for converting an image of an object to a sketch, allowing for varying levels of abstraction. |[arXiv](https://arxiv.org/abs/2202.05822) | | Image |
| [ClipDrop](https://clipdrop.co/) | Create stunning visuals in seconds. | | | Image |
| [ComfyUI](https://github.com/comfyanonymous/ComfyUI) | A powerful and modular stable diffusion GUI with a graph/nodes interface. | | | Image |
| [ConceptLab](https://github.com/kfirgoldberg/ConceptLab) | Creative Generation using Diffusion Prior Constraints. |[arXiv](https://arxiv.org/abs/2308.02669) | | Image |
| [ControlNet](https://github.com/lllyasviel/ControlNet) | ControlNet is a neural network structure to control diffusion models by adding extra conditions. |[arXiv](https://arxiv.org/abs/2302.05543) | | Image |
| [CSGO](https://github.com/instantX-research/CSGO) | CSGO: Content-Style Composition in Text-to-Image Generation. |[arXiv](https://arxiv.org/abs/2408.16766) | | Image |
| [DALL·E 2](https://openai.com/product/dall-e-2) | DALL·E 2 is an AI system that can create realistic images and art from a description in natural language. | | | Image |
| [Dashtoon Studio](https://www.dashtoon.ai/) | Dashtoon Studio is an AI powered comic creation platform. | | | Comic |
| [DeepAI](https://deepai.org/) | DeepAI offers a suite of tools that use AI to enhance your creativity. | | | Image |
| [DeepFloyd IF](https://github.com/deep-floyd/IF) | IF by DeepFloyd Lab at StabilityAI. | | | Image |
| [Depth Anything V2](https://github.com/DepthAnything/Depth-Anything-V2) | Depth Anything V2 |[arXiv](https://arxiv.org/abs/2406.09414) | | Image |
| [Depth map library and poser](https://github.com/jexom/sd-webui-depth-lib) | Depth map library for use with the Control Net extension for Automatic1111/stable-diffusion-webui. | | | Image |
| [Diffuse to Choose](https://diffuse2choose.github.io/) | Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All. |[arXiv](https://arxiv.org/abs/2401.13795) | | Image |
| [Disco Diffusion](https://github.com/alembics/disco-diffusion) | A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations. | | | Image |
| [DragGAN](https://github.com/XingangPan/DragGAN) | Interactive Point-based Manipulation on the Generative Image Manifold. |[arXiv](https://arxiv.org/abs/2305.10973) | | Image |
| [Draw Things](https://drawthings.ai/) | AI- assisted image generation in Your Pocket. | | | Image |
| [DWPose](https://github.com/idea-research/dwpose) | Effective Whole-body Pose Estimation with Two-stages Distillation. |[arXiv](https://arxiv.org/abs/2307.15880) | | Image |
| [EasyPhoto](https://github.com/aigc-apps/sd-webui-EasyPhoto) | Your Smart AI Photo Generator. | | | Image |
| [Flux](https://github.com/black-forest-labs/flux) | This repo contains minimal inference code to run text-to-image and image-to-image with our Flux latent rectified flow transformers. | | | Image |
| [Follow-Your-Click](https://github.com/mayuelala/FollowYourClick) | Open-domain Regional Image Animation via Short Prompts. |[arXiv](https://arxiv.org/abs/2403.08268) | | Image |
| [Fooocus](https://github.com/lllyasviel/Fooocus) | Focus on prompting and generating. | | | Image |
| [GIFfusion](https://github.com/DN6/giffusion) | Create GIFs and Videos using Stable Diffusion. | | | Image |
| [Grounded-Segment-Anything](https://github.com/IDEA-Research/Grounded-Segment-Anything) | Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs. |[arXiv](https://arxiv.org/abs/2401.14159) | | Image |
| [HivisionIDPhotos](https://github.com/Zeyi-Lin/HivisionIDPhotos) | HivisionIDPhotos: a lightweight and efficient AI ID photos tools. | | | Image |
| [Hua](https://github.com/BlinkDL/Hua) | Hua is an AI image editor with Stable Diffusion (and more). | | | Image |
| [Hunyuan-DiT](https://github.com/Tencent/HunyuanDiT) | A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding. |[arXiv](https://arxiv.org/abs/2405.08748) | | Image |
| [IC-Light](https://github.com/lllyasviel/IC-Light) | IC-Light is a project to manipulate the illumination of images. | | | Image |
| [Ideogram](https://ideogram.ai/login) | Helping people become more creative. | | | Image |
| [Imagen](https://imagen.research.google/) | Imagen is an AI system that creates photorealistic images from input text. | | | Image |
| [img2img-turbo](https://github.com/GaParmar/img2img-turbo) | One-Step Image-to-Image with SD-Turbo. | | | Image |
| [Img2Prompt](https://www.img2prompt.io/) | Get prompts from stable diffusion generated images. | | | Image |
| [Infinity](https://github.com/FoundationVision/Infinity) | Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis. |[arXiv](https://arxiv.org/abs/2412.04431) | | Image |
| [InstantID](https://github.com/InstantID/InstantID) | Zero-shot Identity-Preserving Generation in Seconds. |[arXiv](https://arxiv.org/abs/2401.07519) | | Image |
| [InternLM-XComposer2](https://github.com/InternLM/InternLM-XComposer) | InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension. |[arXiv](https://arxiv.org/abs/2401.16420) | | Image |
| [KOALA](https://youngwanlee.github.io/KOALA/) | Self-Attention Matters in Knowledge Distillation of Latent Diffusion Models for Memory-Efficient and Fast Image Synthesis. | | | Image |
| [Kolors](https://github.com/Kwai-Kolors/Kolors) | Kolors: Effective Training of Diffusion Model for Photorealistic Text-to-Image Synthesis. | | | Image |
| [Komiko](https://komiko.app/) | Komiko is an AI-powered storytelling platform that lets you create original characters, comics, and animations with ease. | | | Comic |
| [KREA](https://www.krea.ai/) | Generate images and videos with a delightful AI-powered design tool. | | | Image |
| [LaVi-Bridge](https://github.com/ShihaoZhaoZSH/LaVi-Bridge) | Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation. |[arXiv](https://arxiv.org/abs/2403.07860) | | Image |
| [LayerDiffusion](https://github.com/layerdiffusion/LayerDiffusion) | Transparent Image Layer Diffusion using Latent Transparency. |[arXiv](https://arxiv.org/abs/2305.18676) | | Image |
| [Lexica](https://lexica.art/) | A Stable Diffusion prompts search engine. | | | Image |
| [LlamaGen](https://github.com/FoundationVision/LlamaGen) | Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation. |[arXiv](https://arxiv.org/abs/2406.06525) | | Image |
| [Lumina-Image 2.0](https://github.com/Alpha-VLLM/Lumina-Image-2.0) | Lumina-Image 2.0 : A Unified and Efficient Image Generative Model. | | | Image |
| [Lumina-mGPT](https://github.com/Alpha-VLLM/Lumina-mGPT) | Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining. |[arXiv](https://arxiv.org/abs/2408.02657) | | Image |
| [MakeAnything](https://github.com/showlab/MakeAnything) | MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation. |[arXiv](https://arxiv.org/abs/2502.01572) | | Image |
| [MetaShoot](https://metashoot.vinzi.xyz/) | MetaShoot is a digital twin of a photo studio, developed as a plugin for Unreal Engine that gives any creator the ability to produce highly realistic renders in the easiest and quickest way. | | Unreal Engine | Image |
| [Midjourney](https://www.midjourney.com/) | Midjourney is an independent research lab exploring new mediums of thought and expanding the imaginative powers of the human species. | | | Image |
| [MIGC](https://github.com/limuloo/MIGC) | MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis. |[arXiv](https://arxiv.org/abs/2402.05408) | | Image |
| [MimicBrush](https://github.com/ali-vilab/MimicBrush) | Zero-shot Image Editing with Reference Imitation. |[arXiv](https://arxiv.org/abs/2406.07547) | | Image |
| [OmniGen](https://github.com/VectorSpaceLab/OmniGen) | OmniGen: Unified Image Generation. |[arXiv](https://arxiv.org/abs/2409.11340) | | Image |
| [Omost](https://github.com/lllyasviel/Omost) | Omost is a project to convert LLM's coding capability to image generation (or more accurately, image composing) capability. | | | Image |
| [Openpose Editor](https://github.com/fkunn1326/openpose-editor) | Openpose Editor for AUTOMATIC1111's stable-diffusion-webui. | | | Image |
| [Outfit Anyone](https://humanaigc.github.io/outfit-anyone/) | Ultra-high quality virtual try-on for Any Clothing and Any Person. | | | Image |
| [PaintsUndo](https://github.com/lllyasviel/Paints-UNDO) | PaintsUndo: A Base Model of Drawing Behaviors in Digital Paintings. | | | Image |
| [PhotoMaker](https://photo-maker.github.io/) | Customizing Realistic Human Photos via Stacked ID Embedding. |[arXiv](https://arxiv.org/abs/2312.04461) | | Image |
| [Photoroom](https://www.photoroom.com/backgrounds) | AI Background Generator. | | | Image |
| [Plask](https://plask.ai/) | AI image generation in the cloud. | | | Image |
| [Prompt.Art](https://prompt.art/) | The Generators Hub. | | | Image |
| [PuLID](https://github.com/ToTheBeginning/PuLID) | Pure and Lightning ID Customization via Contrastive Alignment. |[arXiv](https://arxiv.org/abs/2404.16022) | | Image |
| [Rich-Text-to-Image](https://github.com/SongweiGe/rich-text-to-image) | Expressive Text-to-Image Generation with Rich Text. |[arXiv](https://arxiv.org/abs/2304.06720) | | Image |
| [RPG-DiffusionMaster](https://github.com/YangLing0818/RPG-DiffusionMaster) | Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG). | | | Image |
| [SEED-Story](https://github.com/TencentARC/SEED-Story) | SEED-Story: Multimodal Long Story Generation with Large Language Model. |[arXiv](https://arxiv.org/abs/2407.08683) | | Image |
| [Segment Anything](https://segment-anything.com/) | Segment Anything Model (SAM): a new AI model from Meta AI that can "cut out" any object , in any image , with a single click. |[arXiv](https://arxiv.org/abs/2304.02643) | | Image |
| [Segment Anything Model 2 (SAM 2)](https://github.com/facebookresearch/segment-anything-2) | SAM 2: Segment Anything in Images and Videos. |[arXiv](https://arxiv.org/abs/2408.00714) | | Image |
| [sd-webui-controlnet](https://github.com/Mikubill/sd-webui-controlnet) | WebUI extension for ControlNet. | | | Image |
| [SDXL-Lightning](https://huggingface.co/ByteDance/SDXL-Lightning) | Progressive Adversarial Diffusion Distillation. |[arXiv](https://arxiv.org/abs/2402.13929) | | Image |
| [SDXS](https://github.com/IDKiro/sdxs) | Real-Time One-Step Latent Diffusion Models with Image Conditions. | | | Image |
| [Stable.art](https://github.com/isekaidev/stable.art) | Photoshop plugin for Stable Diffusion with Automatic1111 as backend (locally or with Google Colab). | | | Image |
| [Stable Cascade](https://github.com/Stability-AI/StableCascade) | Stable Cascade consists of three models: Stage A, Stage B and Stage C, rep