InfiniteTalk: Audio-Portrait to Lip-Synced Video in ComfyUI
About this model
🚀 Create realistic talking avatar videos from a single portrait and voice input — with accurate lip-sync and identity-stable animation.
▶️ Run Directly in Cloud:
/>
💡 Overview
InfiniteTalk is a ComfyUI workflow that generates lip-synced talking videos from a single image and voice input. Powered by the MultiTalk AI model, it produces fluid, identity-stable portrait clips with natural speech motion and prompt-driven customizable animation.
Ideal for content creators, educators, marketers, and anyone who needs realistic talking avatars without filming.
✨ Key Features
Single Image + Audio: Just provide a portrait and a voice clip — the workflow handles the rest.
Accurate Lip-Sync: Natural mouth movements precisely synchronized to the audio input.
Identity Preservation: Facial structure, expression style, and appearance remain consistent throughout.
Prompt-Driven Customization: Fine-tune animation behavior and visual style with text prompts.
🚀 Getting Started
Upload a portrait image — clear, well-lit, forward-facing works best.
Provide an audio clip — speech or narration you want the avatar to speak.
Generate — the workflow produces a lip-synced video with the original audio muxed in.
Click the "Run Directly" link above to bypass local setup and test this workflow immediately in your browser.
Related Models
Similar AI models you may like
CyberRealistic
RealCartoon3D
ON-THE-FLY 实时生成!Wan-AI 万相/ Wan2.1 Video Model (multi-specs) - CausVid&Comfy&Kijai - workflow included
MIHOYO Collection 米家全家桶 (Honkai Impact 3rd | Honkai Star Rail | Genshin Impact | Zenless Zone Zero)
【WAN2.1】IMG to VIDEO
CyberRealistic Classic
t3