BAGEL Workflow
About this model
Reference Video
π₯― BAGEL Workflow for ComfyUI β All-in-One Image Generation, Editing & Visual Reasoning
This is a complete ComfyUI workflow powered by BAGEL (Blip-Aware Generator Enhanced with Logic), combining text-to-image, image editing (inpainting), and visual question answering (VQA) using BLIP2 and Vicuna. Ideal for advanced AI creators who want generation + reasoning in one streamlined pipeline.
π Key Features:
π· Text-to-Image Generation with language-aware detail
π οΈ Image Editing & Inpainting with precise control
π¬ Visual Question Answering (VQA) via BLIP2 + Vicuna 7B/13B
π Pre-built and optimized ComfyUI workflow β no manual setup needed
π§ VRAM & Hardware Requirements:
β Minimum VRAM: 16GB (BLIP2 + Vicuna are memory-intensive)
π» Recommended: 24GB+ (e.g., RTX 3090/4090 or A6000) for stable performance
β οΈ Not suitable for low-VRAM systems β Vicuna models are large and require significant resources
π§ Optionally runs better with exllama or exllamav2 loaders if using quantized models
Related Models
Similar AI models you may like
γWAN2.1γIMG to VIDEO
ComfyUI Image Workflows
WAN 2.2 Workflow T2V-I2V-T2I (Kijai Wrapper)
Hand Detailer/Segmentation - ADetailer
Hunyuan π» AllInOne
Moody Simple Zimage Turbo/Distilled Workflow