Wan2.2_S2V_text to "mouth shape" video - Dual sampler version
About this model
You can click on the link below to try it out directly. If the effect is good, you can deploy it locally
style="color:rgb(230, 73, 128)">Fan benefits,register to get 1000 points,daily login 100 points,play 4090!Experience the super power of 48G.
style="background-color:rgb(14, 17, 23);color:rgb(135, 136, 139);font-family:"Source Han Sans", system-ui;font-size:14px">Based on the testing results, S2V lip syncing videos are only suitable for situations where lip syncing is required and both actions and dialogue are needed. It is not recommended to generate non-human videos, and it is best to use vocal music or pure vocals as the audio. If the first 2 seconds of the 5-second audio are pure vocals and the last 3 seconds are background music, it can easily cause interference.
Models that need to be downloaded for local deployment:
1. Wan2.2 T2V high (file name: wan2.2_t2w_high_noise_14B_fp16. safetensors)
/>Place folder: models \ diffusionmodels
2. Wan2.2 S2V (file name: wan2.2_st2v5_14B-bf16. safetensors)
/>Place folder: models \ diffusionmodels
3.wav2vec2_large_english_fp16
/>Place folder: models \ audio_coders
Notes:
WanSoundImageToVideo error, update plugin version.
AudioSeparation error, delete and reinstall.
Related Models
Similar AI models you may like
ON-THE-FLY 实时生成!Wan-AI 万相/ Wan2.1 Video Model (multi-specs) - CausVid&Comfy&Kijai - workflow included
【WAN2.1】IMG to VIDEO
ComfyUI Image Workflows
WAN 2.2 Workflow T2V-I2V-T2I (Kijai Wrapper)
Instagirl WAN 2.2
Hunyuan 🌻 AllInOne
Moody Simple Zimage Turbo/Distilled Workflow