Fish Audio S2 TTS: Emotion, Multi-Speaker & Voice Cloning in ComfyUI

⭐ 0.0

⬇ 210 Downloads

👁 7 Views

🖼 1 Images

About this model

🚀 Turn text into expressive, natural speech — multi-speaker dialogues, emotion tags, and voice cloning from short samples.

▶️ Run Directly in Cloud:
/>

💡 Overview

Fish Audio S2 TTS is a ComfyUI workflow for advanced text-to-speech: expressive voices, emotion and style tagging, multi-speaker scenes, and precise voice cloning from reference clips.

Perfect for narration, character dialogue, and emotionally rich audio without a recording studio.

✨ Key Features

Multi-speaker: Split scripts across different voices in one graph.
Emotion & style tags: Whispers, laughs, and more for lifelike delivery.
Voice cloning: Match a speaker from a short sample clip.
Fast inference: Iterate quickly on tone and pacing.

🚀 Getting Started

Enter your script and assign speakers / voices as needed.
Add emotion or style hints where you want extra expressiveness.
Generate and export audio for your project.

Click the "Run Directly" link above to bypass local setup and test this workflow immediately in your browser.

Model Info

Download Model

Type Workflow

Base SD 1.5

Version v1.0

Creator RunComfy

Rating 0.0

Downloads 210

Gallery 1 Images

Fish Audio S2 TTS: Emotion, Multi-Speaker & Voice Cloning in ComfyUI

About this model

💡 Overview

✨ Key Features

🚀 Getting Started

Tags

Related Models

CyberRealistic

RealCartoon3D

ON-THE-FLY 实时生成！Wan-AI 万相/ Wan2.1 Video Model (multi-specs) - CausVid&Comfy&Kijai - workflow included

MIHOYO Collection 米家全家桶 (Honkai Impact 3rd | Honkai Star Rail | Genshin Impact | Zenless Zone Zero)

【WAN2.1】IMG to VIDEO

CyberRealistic Classic

t3

ComfyUI Image Workflows