https://github.com/ace-step/awesome-ace-step

Last synced: 4 months ago
JSON representation
Host: GitHub
URL: https://github.com/ace-step/awesome-ace-step
Owner: ace-step
License: cc0-1.0
Created: 2026-02-27T09:17:31.000Z (4 months ago)
Default Branch: main
Last Pushed: 2026-02-27T12:18:02.000Z (4 months ago)
Last Synced: 2026-02-27T14:55:43.585Z (4 months ago)
Size: 29.3 KB
Stars: 4
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project

README

          # Awesome ACE-Step [![Awesome](https://awesome.re/badge.svg)](https://awesome.re)

> A curated list of projects, tools, models, UIs, and resources for [ACE-Step](https://github.com/ace-step/ACE-Step) — the open-source music generation foundation model by ACE Studio and StepFun.

ACE-Step is a hybrid architecture combining a Language Model planner with a Diffusion Transformer to generate commercial-grade music from text prompts and lyrics. It runs locally on consumer hardware with as little as 4 GB VRAM, generating a full song in under 2 seconds on A100 or under 10 seconds on RTX 3090.

---

## Contents

- [Official Resources](#official-resources)

- [Models](#models)

- [UIs and Studios](#uis-and-studios)

- [ComfyUI](#comfyui)

- [Training and Fine-tuning](#training-and-fine-tuning)

- [Data Annotation](#data-annotation)

- [Integrations and Extensions](#integrations-and-extensions)

- [Open-Source Music Generation Landscape](#open-source-music-generation-landscape)

- [Tutorials and Guides](#tutorials-and-guides)

- [Papers](#papers)

---

## Official Resources

| Resource | Description |

|----------|-------------|

| [GitHub Repository (v1.5)](https://github.com/ace-step/ACE-Step-1.5) | Latest codebase with Gradio UI, REST API, CLI, LoRA training. Mac, AMD, Intel, CUDA. |

| [GitHub Repository (v1.0)](https://github.com/ace-step/ACE-Step) | Original v1.0 codebase. |

| [Project Page (v1.0)](https://ace-step.github.io/) | Architecture overview, demos, and benchmarks. |

| [Project Page (v1.5)](https://ace-step.github.io/ace-step-v1.5.github.io/) | Hybrid LM + DiT architecture, new capabilities. |

| [HuggingFace Space](https://huggingface.co/spaces/ACE-Step/ACE-Step) | Interactive online demo on HuggingFace Zero GPU. |

| [HuggingFace Models](https://huggingface.co/ACE-Step) | All official model weights, LoRAs, and spaces. |

| [Discord](https://discord.gg/PeWDxrkdj7) | Community chat and support. |

## Models

### DiT Models (Diffusion Transformer)

| Model | Steps | Quality | Speed | Features | Link |

|-------|:-----:|---------|-------|----------|------|

| **acestep-v15-turbo** | 8 | Very High | Very Fast | text2music, cover, repaint | [HF](https://huggingface.co/ACE-Step/acestep-v15-turbo) |

| acestep-v15-turbo-continuous | 8 | Very High | Very Fast | Optimized for streaming | [HF](https://huggingface.co/ACE-Step/acestep-v15-turbo-continuous) |

| acestep-v15-sft | 50 | High | Medium | All features | [HF](https://huggingface.co/ACE-Step/acestep-v15-sft) |

| acestep-v15-base | 50 | Medium | Medium | All features, best for fine-tuning | [HF](https://huggingface.co/ACE-Step/acestep-v15-base) |

### Language Models (Planner)

| Model | Base | VRAM | Capability | Link |

|-------|------|------|------------|------|

| acestep-5Hz-lm-0.6B | Qwen3-0.6B | 6-8 GB | Lightweight | [HF](https://huggingface.co/ACE-Step) |

| **acestep-5Hz-lm-1.7B** | Qwen3-1.7B | 8-16 GB | Default, full features | [HF](https://huggingface.co/ACE-Step) |

| acestep-5Hz-lm-4B | Qwen3-4B | 16+ GB | Best quality, audio understanding | [HF](https://huggingface.co/ACE-Step) |

### LoRA Adapters and Quantized Models

| Model | Type | Description | Link |

|-------|------|-------------|------|

| ACE-Step-v1.5-chinese-new-year-LoRA | LoRA | Chinese folk instruments (dizi, erhu), festive style. Trained on 12 songs | [HF](https://huggingface.co/ACE-Step/ACE-Step-v1.5-chinese-new-year-LoRA) |

| Serveurperso/ACE-Step-1.5-GGUF | GGUF | Full quantization suite (Q4-Q8, BF16) for acestep.cpp | [HF](https://huggingface.co/Serveurperso/ACE-Step-1.5-GGUF) |

## UIs and Studios

| Project | Tech Stack | Highlights | Link |

|---------|-----------|------------|------|

| **ace-step-ui** (fspecii) | Node.js + Python | Spotify-inspired, dark/light modes, audio editor, stem extraction, video gen | [GitHub](https://github.com/fspecii/ace-step-ui) |

| **ace-step-studio** (roblaughter) | React + FastAPI | Suno-style studio, create/library/player workflow, OpenAI-compatible LLM for lyrics, cover art gen | [GitHub](https://github.com/roblaughter/ace-step-studio) |

| **Tadpole Studio** | Next.js + FastAPI | AI DJ, Radio, Library, Playlists, LoRA training, HeartMuLa backend, 11 themes | [GitHub](https://github.com/proximasan/tadpole-studio) |

| **Ace-Step-Wrangler** | Python + HTML/JS | DAW-inspired dark UI for musicians. Friendly sliders (Creativity, Strictly follow lyrics) instead of raw model params | [GitHub](https://github.com/tsondo/Ace-Step-Wrangler) |

| ace-step-ui.pinokio | Pinokio | One-click launcher for ace-step-ui (v1.5), auto backend + frontend | [GitHub](https://github.com/cocktailpeanut/ace-step-ui.pinokio) |

| **ACE-Step-1.5-for-windows** (sdbds) | Python + Windows | 936 Suno style tags with search/select; song parameter history; 4-language UI (EN/ZH/JA/KO); LoRA/LoKR training with GPU memory optimization | [GitHub](https://github.com/sdbds/ACE-Step-1.5-for-windows/tree/qinglong) |

## ComfyUI

| Project | Description | Link |

|---------|-------------|------|

| **ComfyUI Native Support** | ACE-Step 1.5 built into ComfyUI core. AIO and split model workflows | [Docs](https://docs.comfy.org/tutorials/audio/ace-step/ace-step-v1-5) |

| **ComfyUI-AceMusic** | 15-node full-featured integration: generation, cover, repaint, extend, edit, LoRA, HeartMuLa compatible | [GitHub](https://github.com/hiroki-abe-58/ComfyUI-AceMusic) |

| **ComfyUI_RH_ACE-Step** | ComfyUI plugin for ACE-Step 1.5 generation | [GitHub](https://github.com/HM-RunningHub/ComfyUI_RH_ACE-Step) |

| **scromfyUI-AceStep** | 30+ specialized nodes: audio KSamplers with shift control, multi-API lyrics gen (Gemini/Groq/OpenAI/Claude), masking & inpainting | [GitHub](https://github.com/scruffynerf/scromfyUI-AceStep) |

| **ComfyUI-FL-AceStep-Training** | LoRA training pipeline in ComfyUI: auto-label, tiled VAE, real-time loss charts | [GitHub](https://github.com/filliptm/ComfyUI-FL-AceStep-Training) |

| Comfyui_SN_AceStepTrainer | LoRA training nodes for ACE-Step 1.5 inside ComfyUI | [GitHub](https://github.com/Starnodes2024/Comfyui_SN_AceStepTrainer) |

## Training and Fine-tuning

| Project | Description | Link |

|---------|-------------|------|

| **Side-Step** | Standalone LoRA/LoKR toolkit for v1.5. Auto-detects variant, 8 GB VRAM training, interactive wizard + CLI | [GitHub](https://github.com/koda-dernet/Side-Step) |

| **ACE-Step-1.5-for-windows** (sdbds) | LoRA and LoKR training with GPU memory offloading optimizations; integrated Gradio UI with style management and 4-language support | [GitHub](https://github.com/sdbds/ACE-Step-1.5-for-windows/tree/qinglong) |

| **ComfyUI-FL-AceStep-Training** | End-to-end LoRA training inside ComfyUI with auto-labeling and live monitoring | [GitHub](https://github.com/filliptm/ComfyUI-FL-AceStep-Training) |

| Ace-Step-1.5-Dataset-Manager | Desktop tool (Qt/C++) for editing LoRA training datasets: per-track caption, lyrics, BPM, key, audio preview | [GitHub](https://github.com/Neyroslav/Ace-Step-1.5-Dataset-Manager) |

## Data Annotation

| Project | Description | Link |

|---------|-------------|------|

| **acestep-captioner** | 11B music captioning model (Qwen2.5 Omni). 1000+ instruments, timbre, structure analysis. Accuracy surpasses Gemini Pro 2.5 | [HF](https://huggingface.co/ACE-Step/acestep-captioner) |

| **acestep-transcriber** | Qwen2.5 Omni-based music transcription. Structure annotation, lyrics transcription, 50+ languages | [HF](https://huggingface.co/ACE-Step/acestep-transcriber) |

## Integrations and Extensions

| Project | Description | Link |

|---------|-------------|------|

| **acestep.cpp** | Portable C++17 / GGML implementation of ACE-Step 1.5. CPU, CUDA, Metal, Vulkan. Stereo 48 kHz WAV output | [GitHub](https://github.com/ServeurpersoCom/acestep.cpp) |

| **ace-step-1.5 Docker** | Docker image with models pre-baked (~15 GB). REST API server, RunPod template, CLI generation tool | [GitHub](https://github.com/ValyrianTech/ace-step-1.5) |

| **Generative Radio** | Fully local AI radio station. Qwen3 generates prompts, ACE-Step 1.5 generates songs. Multi-listener, Apple Silicon optimized | [GitHub](https://github.com/scramblerlab/generative-radio) |

## Open-Source Music Generation Landscape

A comparison of notable open-source music generation projects alongside ACE-Step.

| Project | Architecture | Capability | License | Link |

|---------|-------------|------------|---------|------|

| **ACE-Step** | LM + DiT | Text/lyrics → full song (vocal + BGM), cover, repaint, LoRA. <4 GB VRAM | Apache-2.0 | [GitHub](https://github.com/ace-step/ACE-Step) |

| **YuE** | LLaMA2 autoregressive | Lyrics → full song, multi-genre, multi-lingual, voice cloning, style transfer | Apache-2.0 | [GitHub](https://github.com/multimodal-art-projection/YuE) |

| **AudioCraft / MusicGen** | Autoregressive transformer | Text → music/audio, melody conditioning, style conditioning (JASCO) | MIT | [GitHub](https://github.com/facebookresearch/audiocraft) |

| **Amphion** | Multiple (SVC, TTS, TTA) | Singing voice conversion, text-to-audio, vocoders, research toolkit | MIT | [GitHub](https://github.com/open-mmlab/Amphion) |

| **Riffusion** | Stable Diffusion (spectrograms) | Real-time text → music via spectrogram diffusion | MIT | [GitHub](https://github.com/riffusion/riffusion) |

| **Stable Audio Tools** | DiT + flow matching | Text → variable-length stereo audio (up to 47 s) | MIT | [GitHub](https://github.com/Stability-AI/stable-audio-tools) |

| **DiffRhythm** | Latent diffusion (DiT + VAE) | Lyrics → full-length song (up to 4 min 45 s) in ~10 s | Apache-2.0 | [GitHub](https://github.com/ASLP-lab/DiffRhythm) |

| **HeartMuLa** | LLM-based codec | Song gen, lyric recognition, audio codec, audio-text alignment | Apache-2.0 | [GitHub](https://github.com/HeartMuLa/heartlib) |

## Tutorials and Guides

| Title | Topic | Link |

|-------|-------|------|

| ACE-Step Prompt Guide | Detailed prompting tips: tags, lyrics structure, genre control | [Ambience AI](https://www.ambienceai.com/tutorials/ace-step-music-prompting-guide) |

| Generate AI Music with ACE-Step 1.5 | Installation, generation, LoRA customization | [DigitalOcean](https://www.digitalocean.com/community/tutorials/ace-step-music-ai) |

| ComfyUI ACE-Step 1.5 Guide | Official ComfyUI v1.5 workflow tutorial | [Comfy.org](https://docs.comfy.org/tutorials/audio/ace-step/ace-step-v1-5) |

| AMD ACE-Step 1.5 Local Guide | Running ACE-Step on AMD GPUs | [PromptGalaxy](https://promptgalaxyai.com/blog/amd-ace-step-local-music-ai) |

| Running ACE-Step 1.5 on M2 Mac | Apple Silicon setup, MPS memory workarounds | [BioErrorLog](https://en.bioerrorlog.work/entry/ace-step-15-local-m2-macbook) |

| Install ACE-Step 1.5 with UV | Git + UV package manager setup | [PandaiTech](https://pandaitech.my/alpha/how-to-install-ace-step-15-using-git-and-the-uv-pa-ef9fe2df) |

| ACE-Step 1.5 DeepWiki | Architecture deep-dive, code walkthrough, Gradio UI internals | [DeepWiki](https://deepwiki.com/ace-step/ACE-Step-1.5) |

| ACE Studio | Professional AI music production suite | [acestudio.ai](https://docs.acestudio.ai/) |

## Papers

| Paper | Version | Key Contribution | Link |

|-------|---------|-----------------|------|

| ACE-Step: A Step Towards Music Generation Foundation Model | v1.0 | DCAE + linear transformer, REPA training | [arXiv](https://arxiv.org/abs/2506.00045) |

| ACE-Step 1.5: Pushing the Boundaries of Open-Source Music Generation | v1.5 | Hybrid LM + DiT, intrinsic RL, comprehensive evaluation | [arXiv](https://arxiv.org/abs/2602.00744) |

---

## Contributing

Contributions welcome! Please read the [contributing guidelines](CONTRIBUTING.md) first.

## License

[![CC0](https://licensebuttons.net/p/zero/1.0/88x31.png)](https://creativecommons.org/publicdomain/zero/1.0/)

To the extent possible under law, the authors have waived all copyright and related or neighboring rights to this work.
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ace-step/awesome-ace-step

Awesome Lists containing this project

README