AuraFlow vs Ideogram V4.0q: Text-to-Image Showdown
Ideogram V4.0q takes the win with more accurate prompt adherence in key tasks.
By RuntimeWire · Published

In our head-to-head comparison, AuraFlow and Ideogram V4.0q Text to Image went toe-to-toe in three distinct text-to-image tasks. While AuraFlow excelled in the 'Rainy transit shelter watch ad' task, accurately capturing the scene's specifics, Ideogram V4.0q dominated the other two tasks. In the 'Melancholy umbrella courier' task, Ideogram V4.0q's output was spot on, with a correct color palette and precise inclusion of prompt-specified elements like the transparent umbrella with gold moths. The 'Cider fair letterpress poster' task was another Ideogram V4.0q stronghold, with accurate text rendering and a centered woodcut-style illustration. Ideogram V4.0q Text to Image is the clear winner here, outdoing AuraFlow in overall prompt adherence and task-specific accuracy.
How they were tested
We ran 3 fresh image tasks, generated on the fly for this matchup so neither model could prepare in advance, and had Llama-4-Maverick-17B-128E-Instruct-FP8 score each one. AuraFlow scored 21.0 to Ideogram V4.0q Text to Image's 25.0.
1. Rainy transit shelter watch ad
Photorealistic commercial scene, 16:9: a brushed titanium diver’s watch with a deep moss-green dial and tiny orange seconds hand resting on the wet acrylic bench of a nearly empty tram shelter at 5:42 a.m., raindrops beading on the crystal and bench, faint fog on the shelter walls, a blurred route map behind it showing colored lines and unreadable station dots, lit by one cool overhead fluorescent strip and a distant amber streetlamp creating mixed color temperature reflections, shot low and close with a 50mm lens look, shallow depth of field, crisp micro-scratches on the metal, premium editorial product photography composition with the watch placed in the lower-right third.


Winner: AuraFlow — Model A's image more closely adheres to the prompt, with a more accurate representation of the specified scene, including the correct placement of the watch, lighting conditions, and background elements. Model B's image, while aesthetically pleasing, has a less accurate representation of the scene, with a different background and less attention to detail.
2. Melancholy umbrella courier
Stylized illustration in scratchboard-and-gouache noir style: a young bicycle courier in a plum raincoat riding uphill through the narrow alley of Saint Veyran, balancing a stack of teal pastry boxes with one hand while holding a transparent umbrella patterned with tiny gold moths, mood wistful and determined, puddles reflecting neon pharmacy light, wind tugging torn festival ribbons between windows, a black cat watching from a bakery sign, dramatic diagonal composition, luminous indigo dusk with selective warm highlights and textured inked shadows, 16:9.


Winner: Ideogram V4.0q Text to Image — Model B's image better adheres to the prompt with a more accurate color palette, correct inclusion of a transparent umbrella with gold moths, and better composition. Model A's image, while visually appealing, deviates from the specified plum raincoat and has a different style that doesn't match the scratchboard-and-gouache noir style as closely.
3. Cider fair letterpress poster
Graphic poster with clean legible text, 16:9, in bold 1970s Swiss-meets-letterpress design: cream paper background, burnt umber and dark olive ink, centered woodcut-style illustration of a striped tent and three pears, strong grid layout, generous margins, sharp high-contrast typography that must read exactly: "BRAMBLE CIDER FAIR" on the top line, "OCT 12" beneath it, and "NORTH QUAY HALL" at the bottom; add small ornamental stars and thin rule lines, warm print-shop lighting, perfectly readable text, no extra words.


Winner: Ideogram V4.0q Text to Image — Model B's output more closely adheres to the prompt, with correct text rendering, a centered woodcut-style illustration, and a strong grid layout. Model A's output has a more playful design but deviates from the specified details, such as the incorrect text and additional elements.
See every prompt and the full side-by-side outputs in the interactive Head-to-Head.