Rebels SenseNova-U1
About this model
SenseNova-U1 is SenseTime's open-source unified multimodal model running natively in ComfyUI through smthemex's custom node pack. One model, five modes, no VAE, no separate text encoder — the entire pipeline collapses into a single GGUF. 8B parameters of MoT (Mixture-of-Transformers) architecture handling text-to-image, image editing, VQA, and interleaved generation from the same checkpoint.
USE 'EDIT' IMG MODE WITH REFERENCE FOR EDITING.
USE 'EDIT' IMG MODE WITH NO REFERENCE FOR TEXT TO IMAGE.
Two versions are available. The 8-step Turbo (distill) is the fast option — runs at 8 steps with cfg 1.0, lower VRAM, quick iteration. The Full Base Model is the quality option — 50 steps with cfg 4.0, more prompt adherence headroom. Do not mix these settings up. Turbo with cfg 4 will blow out, base with cfg 1 will look muddy. Match the settings to the checkpoint you downloaded.
GGUFs come in multiple quant sizes. Q6_K if you have headroom, Q8_0 for max fidelity.
Installation is straightforward. Clone the custom node into your ComfyUI custom_nodes folder:
git clone install requirements using the portable embedded python from your ComfyUI root:
python_embeded\python.exe -m pip install -r ComfyUI\custom_nodes\ComfyUI_SenseNova_U1\requirements.txt
Drop the downloaded GGUF into ComfyUI/models/diffusion_models/ and restart ComfyUI. The SenseNova_SM_Model and SenseNova_SM_Sampler nodes will be available.
Workflow is three nodes: Load Image (optional) → SenseNova_SM_Model → SenseNova_SM_Sampler → Save Image. In the Model node, set diffusion_models to none, pick your GGUF, leave attn_backend on auto. In the Sampler node, set img_mode to the task you want, then dial in steps and cfg per the version you're running.
Settings cheat sheet: Turbo: cfg 1.0, steps 8 Base: cfg 4.0, steps 50 img_cfg 1.0, timestep_shift 3.0, do_sample on
VRAM tip: prefetch_count 1 for low-VRAM cards (less memory, slightly slower). Only set prefetch_count to 0 if you have 32 GB or more VRAM.
Links:
Custom Node — Files (Turbo + Base) — Model by SenseTime ( ComfyUI integration by smthemex, GGUF quantization by smthem on HuggingFace. License: Apache 2.0.
Tags
Related Models
Similar AI models you may like
ON-THE-FLY 实时生成!Wan-AI 万相/ Wan2.1 Video Model (multi-specs) - CausVid&Comfy&Kijai - workflow included
【WAN2.1】IMG to VIDEO
ComfyUI Image Workflows
WAN 2.2 Workflow T2V-I2V-T2I (Kijai Wrapper)
Hand Detailer/Segmentation - ADetailer
Hunyuan 🌻 AllInOne
Moody Simple Zimage Turbo/Distilled Workflow