ACE-Step Music Generation Model in ComfyUI | AI Audio Creation
About this model
Generate studio-quality music 15× faster with breakthrough diffusion technology.
Who it's for: creators who want this pipeline in ComfyUI without assembling nodes from scratch. Not for: one-click results with zero tuning — you still choose inputs, prompts, and settings.
Open preloaded workflow on RunComfy
Open preloaded workflow on RunComfy (browser)
Why RunComfy first
- Fewer missing-node surprises — run the graph in a managed environment before you mirror it locally.
- Quick GPU tryout — useful if your local VRAM or install time is the bottleneck.
- Matches the published JSON — the zip follows the same runnable workflow you can open on RunComfy.
When downloading for local ComfyUI makes sense — you want full control over models on disk, batch scripting, or offline runs.
How to use (local ComfyUI)
1. Load inputs (images/video/audio) in the marked loader nodes.
2. Set prompts, resolution, and seeds; start with a short test run.
3. Export from the Save / Write nodes shown in the graph.
Expectations — First run may pull large weights; cloud runs may require a free RunComfy account.
Overview
ACE-Step is a breakthrough open-source foundation model for music generation that bridges the gap between generation speed and musical quality. By integrating diffusion-based generation with Sana's Deep Compression AutoEncoder and a lightweight linear transformer, it synthesizes up to 4 minutes of high-quality music in just 20 seconds—15× faster than LLM-based alternatives. The model excels at maintaining musical coherence while offering advanced control over lyrics, voice cloning, and remixing capabilities.
Important nodes:
TextEncodeAceStepAudioKSamplerEmptyAceStepLatentAudioSaveAudiolyrics_strengthcontrol_after_generatefilename_prefix
Notes
ACE-Step Music Generation Model in ComfyUI | AI Audio Creation — see RunComfy page for the latest node requirements.
Tags
Related Models
Similar AI models you may like
ON-THE-FLY 实时生成!Wan-AI 万相/ Wan2.1 Video Model (multi-specs) - CausVid&Comfy&Kijai - workflow included
【WAN2.1】IMG to VIDEO
ComfyUI Image Workflows
WAN 2.2 Workflow T2V-I2V-T2I (Kijai Wrapper)
Hand Detailer/Segmentation - ADetailer
Hunyuan 🌻 AllInOne
Moody Simple Zimage Turbo/Distilled Workflow