Folder Image Captioner with Qwen-VL WF
This ComfyUI workflow batch-captions an entire folder of images in a single run.
It loads images from a selected folder, resizes them if needed, generates detailed captions using Qwen-VL-Mod (Qwen3-VL-8B-Instruct-Abliterated), and saves each image together with a caption file that shares its base filename (e.g., photo.jpg + photo.txt).
Ideal for creating training datasets for LoRAs, character fine-tuning, or any project that requires consistent captions.
Features:
Batch processing directly from folder
Saves image + caption with the same name
High detail and accuracy thanks to Qwen-VL
Captions describe the pose, camera angle, lighting, and location shown in the original image
Required Custom Nodes:
ComfyUI Custom Nodes
Qwen-VL-Mod (or Qwen3-VL-8B-Instruct-Abliterated)
Resize Image v2
Load Image Dataset from Folder
Save Image and Text Dataset to Folder
Created by: bobgus39
Simply select your image folder and run the workflow. The captions describe the original pose, camera angle, lighting, and background/location of each image, which makes them well suited to training consistent characters or scenes.
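Since the workflow pairs each image with a caption file of the same base name (photo.jpg + photo.txt), it can be useful to confirm after a run that no image was left without a caption. The following is a minimal sketch of such a check, not part of the workflow itself. The function name find_unpaired_images and the list of image extensions are assumptions made for illustration; adjust them to match your output folder.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to whatever your dataset contains.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp"}


def find_unpaired_images(folder):
    """Return the sorted names of images in `folder` that have no caption.

    An image counts as captioned when a file with the same base name and a
    .txt extension sits next to it (photo.jpg -> photo.txt).
    """
    folder = Path(folder)
    missing = []
    for path in folder.iterdir():
        # Only look at regular files that have a known image extension.
        if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS:
            # Swap the image extension for .txt to get the expected caption.
            if not path.with_suffix(".txt").is_file():
                missing.append(path.name)
    return sorted(missing)
```

Calling find_unpaired_images("path/to/output_folder") returns an empty list when every image has a caption. The check only looks at the top level of the folder and does not inspect the caption text itself.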