LTX 2.3 basic GGUF 720p workflow
About this model
This is same as default WF in ComfyUI, but it uses GGUF custom node. Basically, you can insert images, audio, and video into any frame, so anything is possible.
T2V, S2V, V2V, I2V First, last, middle frame.
voice clone: You can input a few seconds of audio, and then crop those same few seconds after the process is complete.
reference image: input a starting image and then instruct it to perform a completely different action. (However, the character descriptions remain the same.) Yes, this is what's called a failed I2V. Again, crop the initial image.
extend video: input the images and audio extracted from the video. It will be extended for the remaining length.
GGUF custom node: update your GGUF node and ComfyUI to the latest versions.)
LTX2.3 and other: GGUF: model: encoder:
gemma3 GGUF: the text encoder-related files here: ComfyUI\models\text_encoders
audio vae is here: ComfyUI\models\checkpoints
upscale model is here: ComfyUI\models\latent_upscale_models
Use the distilled model and distilled-embedding, or use the dev model and dev-embedding with distilled-lora.
T2V: set bypass image on
I2V: set bypass image off
You can bypass upscale node for lowres.
Try starting with a lower length (perhaps 9).
Tags
Related Models
Similar AI models you may like
ON-THE-FLY 实时生成!Wan-AI 万相/ Wan2.1 Video Model (multi-specs) - CausVid&Comfy&Kijai - workflow included
【WAN2.1】IMG to VIDEO
ComfyUI Image Workflows
WAN 2.2 Workflow T2V-I2V-T2I (Kijai Wrapper)
Hunyuan 🌻 AllInOne
Moody Simple Zimage Turbo/Distilled Workflow