We are thrilled to release Qwen-Image, a 20B MMDiT image foundation model that achieves significant advances in complex text rendering and precise ima

Qwen-Image: Crafting with Native Text Rendering

submited by

Style Pass

2025-08-04 16:00:05

We are thrilled to release Qwen-Image, a 20B MMDiT image foundation model that achieves significant advances in complex text rendering and precise image editing. To try the latest model, feel free to visit Qwen Chat and choose “Image Generation”.

We present a comprehensive evaluation of Qwen-Image across multiple public benchmarks, including GenEval, DPG, and OneIG-Bench for general image generation, as well as GEdit, ImgEdit, and GSO for image editing. Qwen-Image achieves state-of-the-art performance on all benchmarks, demonstrating its strong capabilities in both image generation and editing. Furthermore, results on LongText-Bench, ChineseWord, and TextCraft show that it excels in text rendering—particularly in Chinese text generation—outperforming existing state-of-the-art models by a significant margin. This highlights Qwen-Image’s unique position as a leading image generation model that combines broad general capability with exceptional text rendering precision.

One of Qwen-Image’s outstanding capabilities is its ability to achieve high-fidelity text rendering in different scenarios. Let’s take a look at the following Chinese rendering case:

Qwen-Image: Crafting with Native Text Rendering

Leave a Comment

Related Posts

Recent Posts

A n O p e n L e t t e r t o O p e n A I

Modos Paper Monitor | Crowd Supply

A modest proposal for new holidays to manage your digital life

I Asked Four Former Friends Why We Stopped Speaking. Here’s What I Learned

(True) Unidirectional Wifi broadcasting of video data for FPV

Red-teaming a RAG app: What happens?

Diagrammatic algebra: On the road to category theory

Using Git worktrees for development

Search code, repositories, users, issues, pull requests...

Exploring NotebookLM Alternatives

I Found 12 People Who Ditched Their E xpensive Software for AI-built Tools

A Gentle Introduction To Fortran

Search code, repositories, users, issues, pull requests...

Indian Rafale Downed in Largest Air Battle in Decades Due to Intel Failure on PL-15 Range – Reports

A Bit on Bitmaps. We take a look at bitmpas, or bit… | by Tom Herbert | Aug, 2025 | Medium

Why is it worth spending time on type theory?

Modos Developer Kit Now Live on Crowd Supply!

Coming Soon: Women Preferences

Your guide to taking an open model from discovery to a production-ready endpoint on Vertex AI

Perplexity accused of scraping websites that explicitly blocked AI scraping