Step1X Edit GPT4o Style Image Editing
About this model
Step1X Edit GPT4o Style Image Editing
style="color:rgb(190, 196, 202)">We have released the state-of-the-art image editing model Step1X Edit, whose performance rivals closed-source models such as GPT 4o and Gemini2 Flash. More specifically, we leverage a multimodal LLM to process reference images and user editing instructions. It extracts latent embeddings and integrates them with a diffusion image decoder to obtain the target image. To train the model, we built a data generation pipeline to produce high-quality datasets. For evaluation, we developed GEdit Bench, a novel benchmark rooted in real user instructions. Experimental results on GEdit Bench demonstrate that Step1X Edit significantly outperforms existing open-source baselines and approaches the performance of leading proprietary models, making a major contribution to the field of image editing. For more details, please refer to our technical report.
Tags
Related Models
Similar AI models you may like
CyberRealistic
RealCartoon3D
ON-THE-FLY 实时生成!Wan-AI 万相/ Wan2.1 Video Model (multi-specs) - CausVid&Comfy&Kijai - workflow included
MIHOYO Collection 米家全家桶 (Honkai Impact 3rd | Honkai Star Rail | Genshin Impact | Zenless Zone Zero)
【WAN2.1】IMG to VIDEO
CyberRealistic Classic
t3