Ideogram 4 INT8
Checkpoint
Other

tsolful
Creator

About this model

Files gathered from />
This model is double-quantized, which is unavoidable given the state in which Ideogram 4 was released: the available weights are already FP8.

The FP8 weights were dequantized to FP32 using the FP8 scales, downcast to BF16, and then quantized to INT8.
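To make the conversion chain above concrete, here is a minimal pure-Python sketch of the three steps for a single row of weights. It is not the script used to produce this checkpoint. It assumes the FP8 format is E4M3 (the page does not say which FP8 variant was used) and that the INT8 step uses symmetric absmax scaling per row (also not stated). The function names are hypothetical, and NaN/Inf inputs are not handled.

```python
import struct

def fp8_e4m3_to_float(byte: int) -> float:
    """Decode one FP8 E4M3 byte (assumed format: 1 sign, 4 exponent, 3 mantissa bits, bias 7)."""
    sign = -1.0 if byte & 0x80 else 1.0
    exp = (byte >> 3) & 0xF
    man = byte & 0x7
    if exp == 0xF and man == 0x7:
        return float("nan")  # the only NaN pattern in E4M3; there is no infinity
    if exp == 0:
        return sign * man * 2.0 ** -9  # subnormal: (man / 8) * 2**-6
    return sign * (1.0 + man / 8.0) * 2.0 ** (exp - 7)

def to_bf16(x: float) -> float:
    """Round a value to FP32, then to the nearest BF16 value (ties to even)."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # BF16 is the top 16 bits of the FP32 pattern; add a rounding bias before truncating.
    bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (((bits + bias) >> 16) << 16) & 0xFFFFFFFF
    return struct.unpack("<f", struct.pack("<I", bits))[0]

def requantize_row(fp8_bytes, fp8_scale):
    """FP8 -> FP32 (apply scale) -> BF16 -> INT8, returning (int8 values, int8 scale)."""
    # Step 1: dequantize with the stored FP8 scale.
    dequantized = [fp8_e4m3_to_float(b) * fp8_scale for b in fp8_bytes]
    # Step 2: downcast to BF16.
    bf16 = [to_bf16(v) for v in dequantized]
    # Step 3: symmetric absmax INT8 quantization (an assumed scheme).
    absmax = max(abs(v) for v in bf16)
    int8_scale = absmax / 127.0 if absmax > 0 else 1.0
    q = [max(-127, min(127, round(v / int8_scale))) for v in bf16]
    return q, int8_scale

# Example row: FP8 bytes for 1.0, -1.0, 2.0, 1.5, 0.0 with a scale of 0.5.
row = [0x38, 0xB8, 0x40, 0x3C, 0x00]
q, s = requantize_row(row, 0.5)
print(q, s)
```

Each stage can add its own rounding error, which is why the page calls this a double quantization: the INT8 values approximate BF16 values that already approximate the FP8 originals.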

For use in ComfyUI. Without torch compile, it is 1.78x faster (2.03 s/it) than FP8 (3.62 s/it) on a 3090.
4.4-6.2 s/it on my 3060 12GB.

~2x faster with torch compile.
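
As a quick check of the figures above, the 1.78x number is the ratio of the two seconds-per-iteration timings reported for the 3090. The sketch below reproduces it and estimates the time saved over a run. The 30-step count is a hypothetical example, not a value from this page.

```python
def speedup(baseline_s_per_it: float, new_s_per_it: float) -> float:
    """Speedup factor when going from the baseline timing to the new one."""
    return baseline_s_per_it / new_s_per_it

fp8_s_per_it = 3.62   # FP8 on a 3090, without compile (from this page)
int8_s_per_it = 2.03  # INT8 on a 3090, without compile (from this page)

factor = speedup(fp8_s_per_it, int8_s_per_it)
print(f"{factor:.2f}x")  # prints 1.78x

steps = 30  # hypothetical sampler step count
saved = (fp8_s_per_it - int8_s_per_it) * steps
print(f"about {saved:.1f} s saved over {steps} steps")
```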
