192 evaluations across 6 categories

Pass Rate: 76% vs 73%

Avg Latency: 18.1s vs 11.8s

Text Rendering › Typography Style › Easy bfl/flux-2-pro

Prompt: The word 'CHAPTER ONE' typed on aged paper with a vintage typewriter font, complete with slightly uneven ink

Text Rendering 73% vs 67%

▸ Typography Style 100% vs 100%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

The word 'CHAPTER ONE' typed on aged paper with a vintage typewriter font, complete with slightly uneven ink

Evaluator

The text “CHAPTER ONE” is rendered in a classic typewriter font, with slightly uneven, textured ink strokes that convincingly simulate the look of ink hitting aged paper. The font style, spacing, and character shapes are consistent with vintage typewriter output. The paper has a yellowed, aged appearance, and the visible typewriter carriage and keys reinforce the medium. The spelling is correct. VERDICT: PASS

z-image-local/z-image-turbo PASS

The phrase 'TODAY'S SPECIAL' written in white chalk on a dark green chalkboard with chalk dust

Evaluator

The text “TODAY'S SPECIAL” is rendered in a style that convincingly mimics white chalk on a dark green chalkboard. The lettering shows the characteristic uneven, textured strokes and slight smudges of chalk writing, with visible chalk dust residue around the letters and along the wooden frame. The apostrophe in “TODAY'S” is correctly placed, and the spelling is accurate. The underline beneath “SPECIAL” also matches the chalkboard aesthetic. VERDICT: PASS

z-image-local/z-image-turbo PASS

The word 'OPEN' displayed on a red LED sign with glowing pixel-style letters

Evaluator

The text "OPEN" is spelled correctly. The style of the text matches the described medium: it is rendered in glowing, pixel-style letters composed of individual illuminated dots, consistent with a red LED sign. The red glow and pixelated construction convincingly simulate an LED display. VERDICT: PASS
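Each complete evaluator response above ends with a literal `VERDICT: PASS` or `VERDICT: FAIL` marker. A minimal sketch of how that marker could be pulled out of the free-form text, assuming the marker always has exactly this form. The function name `extract_verdict` is hypothetical and not taken from any evaluation harness named in this report:

```python
import re
from typing import Optional

# Assumed marker format, copied from the evaluator transcripts in this report.
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\b")

def extract_verdict(evaluator_text: str) -> Optional[str]:
    """Return "PASS" or "FAIL" from an evaluator response, or None if absent.

    The last marker wins, so a response that mentions a verdict while
    reasoning and then states its final one is read by its final statement.
    """
    matches = VERDICT_RE.findall(evaluator_text)
    return matches[-1] if matches else None

sample = "The red glow and pixelated construction convincingly simulate an LED display. VERDICT: PASS"
print(extract_verdict(sample))
```

A response that was cut off before its verdict yields `None` here, so a missing marker is reported as missing instead of being guessed.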

▸ Writing accuracy 67% vs 58%

Easy 100% vs 67%

z-image-local/z-image-turbo PASS

A single large word 'HELLO' centered on a white background

Evaluator

The word displayed is "HELLO", which is spelled correctly and fully readable. All five letters are present and clearly formed: H, E, L, L, O. There are no misspellings, missing letters, or illegible parts. The text is centered on a white background as intended. VERDICT: PASS

z-image-local/z-image-turbo PASS

The word 'ADVENTURE' in big bold letters centered on a plain cream background

Evaluator

The word "ADVENTURE" is spelled correctly and is fully readable. All letters are clearly formed and legible against the plain cream background. There are no misspellings, missing words, or illegible parts. VERDICT: PASS

z-image-local/z-image-turbo PASS

The word 'SERENITY' written large and centered on a solid light blue background

Evaluator

The word "SERENITY" is spelled correctly and is fully readable. All letters are clearly visible and properly formed. There are no misspellings, missing words, or illegible parts. VERDICT: PASS

Medium 67% vs 67%

z-image-local/z-image-turbo PASS

The sentence 'The quick brown fox jumps high' in clean black text on a white background

Evaluator

All words are spelled correctly and fully readable: - "The" - "quick" - "brown" - "fox" - "jumps" - "high" There are no misspellings, missing words, or illegible parts. VERDICT: PASS

z-image-local/z-image-turbo PASS

'Every sunset brings a new dawn ahead' written neatly on a pale yellow background

Evaluator

All words in the text are spelled correctly and fully readable. The text reads: "Every sunset brings a new dawn ahead", which matches the intended phrase exactly. There are no misspellings, missing words, or illegible parts. VERDICT: PASS

z-image-local/z-image-turbo FAIL

'Stars shine brightest in the dark night' displayed on a clean off-white background

Evaluator

The intended text is: "Stars shine brightest in the dark night". In the image:
- "brighhest" is misspelled (should be "brightest").
- The third line shows distorted, unreadable characters: "7 n n th ? -" — this is not legible and does not match "the".
- The fourth line "dark night" is correct.

There are misspellings and illegible parts. The text is not fully correct or readable. VERDICT: FAIL

Hard 67% vs 100%

z-image-local/z-image-turbo PASS

A poster with headline 'SUMMER SALE' in large bold text, body text 'Up to 50% off all items this weekend only' in medium size, and fine print 'Terms and conditions apply. See store for details.' at the bottom in small text

Evaluator

The text in the image is: - Headline: "SUMMER SALE" — correctly spelled and fully readable. - Body text: "Up to 50% items this weekend only" — correctly spelled and fully readable. - Fine print: "Terms and conditions apply. See store for details." — correctly spelled and fully readable. All words are spelled correctly and are fully legible. There are no misspellings, missing words, or illegible parts. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A flyer with large title 'GRAND OPENING', subtitle 'Join us for food, music, and fun' in medium text, and small disclaimer 'RSVP required. Limited seating available.' at the bottom

Evaluator

The text in the image is:
- Main title: "GRAND OPENING" — correctly spelled and fully readable.
- Subtitle: "Join us for food, music, and fun" — correctly spelled and fully readable.
- Disclaimer: "RSVP required. Limited seeting available." — contains a misspelling: "seeting" should be "seating".

The word "seeting" is clearly visible and incorrectly spelled. This is a critical error that affects the accuracy of the text. VERDICT: FAIL

z-image-local/z-image-turbo PASS

An event banner with headline 'ANNUAL GALA 2025' at the top in large serif font, description 'An evening of fine dining and live jazz' in medium text, and footnote 'Black tie optional. Doors open at 6 PM.' in small text

Evaluator

All text in the image matches the intended text exactly. - Headline: "ANNUAL GALA 2025" — correctly spelled and fully readable in large serif font. - Description: "An evening of fine dining and live jazz" — correctly spelled and fully readable in medium text. - Footnote: "Black tie optional. Doors open at 6 PM." — correctly spelled and fully readable in small text. No misspellings, missing words, or illegible parts are present. VERDICT: PASS

Extreme 33% vs 0%

z-image-local/z-image-turbo PASS

A vintage movie poster for 'THE LAST VOYAGE' featuring a tagline 'One ship. One chance. No turning back.' with credits block reading 'Directed by James Harlow. Starring Elena Voss, Marcus Chen, and Sofia Reyes. Music by Thomas Grant. A Paramount Pict

Evaluator

All text in the image matches the intended text exactly. - Title: "THE LAST VOYAGE" — correct. - Tagline: "One ship. One chance. No turning back." — correct. - Credits block: - "Directed by James Harlow." — correct. - "Starring Elena Voss, Marcus Chen, and Sofia Reyes." — correct. - "Music by Thomas Grant." — correct. - "A Paramount Pictures Release." — correct. - "Coming Summer 2025." — correct. No misspellings, missing words, or illegible parts are present. The text is fully readab

z-image-local/z-image-turbo FAIL

A magazine cover titled 'NATIONAL EXPLORER' with cover lines 'The Hidden Temples of Cambodia — page 34', 'Interview: Dr. Maria Santos on Climate Action', 'Top 10 Trails You Must Hike This Year', and a barcode area with 'Issue 247 | March 2025 | $6.99

Evaluator

The text on the magazine cover is mostly correct, but there are two critical errors: 1. The phrase “Top 10 Trails You Must Hike This Year” is correctly placed in the upper right, but the intended text “Top 10 Trails You Must Hike This Year” is also present in the lower-left section — however, in that lower-left section, the text reads “Cambodia Ob of This Year”, which is a clear misspelling and truncation. “Ob” is not a word; it should be “Top”. 2. The barcode area at the bottom right reads “I

z-image-local/z-image-turbo FAIL

A product label for 'GOLDEN HARVEST ORGANIC HONEY' with subtitle 'Raw & Unfiltered', weight '12 oz (340g)', description 'Pure wildflower honey harvested from sustainable apiaries in the Pacific Northwest. No additives or preservatives.', and small te

Evaluator

The text on the label is mostly correct but contains one error. - "Pure wildflower harvested from" → Correct. - "form sustainable apiaries" → "form" is a misspelling; it should be "from". - "in the Pacific Northwest." → Correct. - "No additives or preservatives." → Correct. - "Best before: 06/2026. Store in cool, dry place. Product of USA." → Correct. The word "form" is misspelled as "form" instead of "from". VERDICT: FAIL
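The left-hand percentages in this category are consistent with plain pass fractions over the z-image-turbo verdicts listed above, rounded to the nearest percent. A small sketch of that roll-up, assuming every evaluation carries equal weight (the report does not state its aggregation rule). The dictionary layout and the `pass_rate` and `flatten` helpers are illustrative names only:

```python
P, F = "PASS", "FAIL"

# Verdicts transcribed from the z-image-turbo results listed above,
# grouped by subcategory and difficulty tier.
text_rendering = {
    "Typography Style": {"Easy": [P, P, P]},
    "Writing accuracy": {
        "Easy": [P, P, P],
        "Medium": [P, P, F],
        "Hard": [P, F, P],
        "Extreme": [P, F, F],
    },
}

def pass_rate(verdicts):
    """Percentage of PASS verdicts, rounded to the nearest whole percent."""
    return round(100 * verdicts.count(P) / len(verdicts))

def flatten(tiers):
    """Concatenate the verdict lists of all tiers in one subcategory."""
    return [v for tier in tiers.values() for v in tier]

# Subcategory rate: pool all tiers of that subcategory.
subcategory_rates = {
    name: pass_rate(flatten(tiers)) for name, tiers in text_rendering.items()
}

# Category rate: pool every verdict in the category (assumed equal weighting).
category_verdicts = [v for tiers in text_rendering.values() for v in flatten(tiers)]
category_rate = pass_rate(category_verdicts)

print(subcategory_rates, category_rate)
```

With these inputs the sketch gives 100 and 67 for the two subcategories (3 of 3 and 8 of 12) and 73 for the category (11 of 15), which matches the left-hand figures shown for Text Rendering.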

Spatial Reasoning 65% vs 90%

▸ Attributes Binding 89% vs 100%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

A red ball and a blue cube sitting side by side on a white surface

Evaluator

The image shows a red sphere (ball) on the left and a blue cube on the right, sitting side by side on a white background. All attributes (color, shape, arrangement) are correctly bound to the respective objects. VERDICT: PASS

z-image-local/z-image-turbo PASS

A green apple and an orange pumpkin on a wooden cutting board

Evaluator

The image displays a green apple in the foreground and an orange pumpkin in the background, both resting on a light wooden cutting board. The colors and object identities match the prompt perfectly with no attribute swaps. VERDICT: PASS

z-image-local/z-image-turbo PASS

A yellow taxi and a black limousine parked next to each other on a street

Evaluator

The image contains two main vehicles. 1. **Yellow Taxi:** In the foreground, there is a yellow sedan with a taxi roof sign. 2. **Black Limousine:** In the background, there is a long, black vehicle with extended windows, characteristic of a limousine. The attributes are correctly bound: the taxi is yellow, and the limousine is black. They are positioned next to each other on a street. VERDICT: PASS

Medium 100% vs 100%

z-image-local/z-image-turbo PASS

A large red circle, a small blue triangle, and a medium green star arranged on a white canvas

Evaluator

The image contains a large red circle acting as a border, a small blue triangle positioned above a medium green star, all on a white background. All attributes (color, size, shape) are correctly bound to their respective objects as requested. VERDICT: PASS

z-image-local/z-image-turbo PASS

A tall thin white candle, a short fat red candle, and a medium spiral yellow candle on a shelf

Evaluator

The image contains three candles on a shelf matching the description perfectly: 1. **Left:** A short, fat, red pillar candle. 2. **Center:** A tall, thin, white taper candle. 3. **Right:** A medium-height, spiral-shaped yellow candle. All attributes (color, size, shape) are correctly bound to the specific objects with no swaps. VERDICT: PASS

z-image-local/z-image-turbo PASS

A big brown horse, a small white rabbit, and a medium-sized black cat standing in a meadow

Evaluator

The image contains all the requested elements with the correct attributes bound to the correct objects. - **Horse:** Big, brown, standing. (Correct) - **Rabbit:** Small, white. (Correct) - **Cat:** Black, medium-sized (standard cat size). (Correct) - **Setting:** Meadow with grass and flowers. (Correct) VERDICT: PASS

Hard 67% vs 100%

z-image-local/z-image-turbo PASS

A pink fire truck, a blue banana, and a green sun in a surreal landscape

Evaluator

The image contains a pink fire truck in the center, a large blue banana in the foreground on the left, and a stylized green sun in the sky. All requested attributes are correctly bound to their respective objects. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A purple carrot, a red tree with red leaves, and a white watermelon sliced open

Evaluator

The image contains a purple carrot with green leaves and a red tree-like plant with red leaves, which matches the first two parts of the prompt. However, the third object is a standard red/pink watermelon slice, not the requested "white watermelon." The color attribute for the watermelon is incorrect. VERDICT: FAIL

z-image-local/z-image-turbo PASS

An orange elephant, a yellow penguin, and a silver strawberry arranged on a table

Evaluator

The image contains an orange elephant, a yellow and black penguin, and a metallic silver strawberry. All attributes (color, object type) are correctly bound to the respective objects as requested in the prompt. VERDICT: PASS

▸ Compositionality 78% vs 100%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

A tabby cat wearing oversized red sunglasses

Evaluator

- **Tabby cat:** Present. The image clearly shows a cat with distinct tabby markings (stripes on the forehead and body). - **Oversized red sunglasses:** Present. The cat is wearing large sunglasses with bright red frames that dominate the face. - **Coherent combination:** Yes. The glasses sit naturally on the cat's face, and the lighting/texture is consistent. VERDICT: PASS

z-image-local/z-image-turbo PASS

A golden retriever wearing a tiny cowboy hat

Evaluator

The image contains all the requested elements: * **Golden retriever:** Present (the dog is clearly a Golden Retriever). * **Wearing:** Present (the hat is on the dog's head). * **Tiny cowboy hat:** Present (the hat is a cowboy style and sized appropriately for the dog). The elements are coherently combined into a single, realistic-looking scene. VERDICT: PASS

z-image-local/z-image-turbo PASS

A penguin holding a colorful umbrella

Evaluator

1. Penguin: Present (standing on rocky shore). 2. Colorful umbrella: Present (rainbow panels visible). 3. Holding: Present (handle positioned centrally against the chest, visually implying the penguin is supporting it). All described elements are visible and coherently combined into a single scene. The image successfully depicts the whimsical concept requested without major artifacts or missing components. VERDICT: PASS

Medium 100% vs 100%

z-image-local/z-image-turbo PASS

A medieval knight in full armor sitting at a desk typing on a modern laptop

Evaluator

The image successfully depicts all the requested elements: * **Medieval knight:** Present (the figure is clearly a knight). * **Full armor:** Present (helmet, breastplate, gauntlets, chainmail are visible). * **Sitting at a desk:** Present (the figure is seated at a wooden table). * **Typing on a modern laptop:** Present (the gauntlets are positioned on the keyboard of a sleek, modern laptop). The anachronistic combination is executed coherently with realistic lighting and texture. V

z-image-local/z-image-turbo PASS

A Roman gladiator taking a selfie with a smartphone in the Colosseum

Evaluator

- **Roman gladiator:** Present (The man is wearing detailed Roman armor). - **Taking a selfie:** Present (He is holding a phone up to capture his own image). - **Smartphone:** Present (A modern smartphone is clearly visible in his hand). - **Colosseum:** Present (The iconic structure is in the background). All elements are present and combined coherently to fulfill the prompt's unusual concept. VERDICT: PASS

z-image-local/z-image-turbo PASS

An astronaut in a spacesuit riding a bicycle through a park

Evaluator

- **Astronaut:** Present. The figure is clearly dressed as an astronaut. - **Spacesuit:** Present. The white suit, helmet, and life-support backpack are visible. - **Riding a bicycle:** Present. The astronaut is seated on a red bicycle, holding the handlebars and pedaling. - **Through a park:** Present. The background features trees, grass, and a paved path typical of a park setting. All described elements are present and combined coherently. VERDICT: PASS

Hard 33% vs 100%

z-image-local/z-image-turbo FAIL

An octopus wearing a top hat and monocle playing chess on the surface of the moon with Earth visible in the background

Evaluator

The image contains most elements but is missing the monocle.
- **Octopus:** Present.
- **Top hat:** Present.
- **Monocle:** Missing (The octopus has large eyes, but neither is wearing a monocle).
- **Playing chess:** Present.
- **Surface of the moon:** Present.
- **Earth visible:** Present.

VERDICT: FAIL

z-image-local/z-image-turbo PASS

A dinosaur in a business suit giving a PowerPoint presentation in a modern office while drinking coffee

Evaluator

- **Dinosaur:** Present (Green T-Rex character). - **Business suit:** Present (Black suit jacket, white shirt, tie). - **Giving a PowerPoint presentation:** Present (Gesturing at a large screen with slide-like text). - **Modern office:** Present (Office chairs, desk, laptop, large monitor, windows). - **Drinking coffee:** Present (Holding a coffee cup). All elements are clearly visible and combined coherently into a single scene. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A robot dressed as a pirate sailing a paper boat on a lake of lava while juggling three kittens

Evaluator

The image contains a robot pirate in a paper boat on a lava-themed background. However, the prompt specifies "juggling three kittens," while the image clearly shows **four** kittens (two held in the air, two sitting on the boat). Additionally, the action is holding/sitting rather than juggling. * Robot dressed as a pirate: Present * Paper boat: Present * Lake of lava: Present * Juggling three kittens: Fail (There are four kittens, and they are not being juggled). VERDICT: FAIL

▸ Counting 33% vs 56%

Easy 33% vs 67%

z-image-local/z-image-turbo FAIL

Exactly 3 red apples arranged on a wooden table

Evaluator

I see 4 red apples arranged on a wooden table. The prompt requested exactly 3 apples. Since the count is 4 instead of 3, the request is not met. VERDICT: FAIL

z-image-local/z-image-turbo PASS

Exactly 5 yellow rubber ducks sitting in a row on a bathroom shelf

Evaluator

I count exactly 5 yellow rubber ducks arranged in a row on the white shelf. The count and description match the prompt perfectly. VERDICT: PASS

z-image-local/z-image-turbo FAIL

Exactly 4 white candles standing on a dark surface

Evaluator

I see 5 white candles standing on a dark surface. The prompt requested exactly 4 candles. Since the actual count (5) does not match the requested count (4), the image fails the specific constraint. VERDICT: FAIL

Medium 67% vs 67%

z-image-local/z-image-turbo PASS

2 orange cats and 3 black dogs sitting together on a grassy lawn

Evaluator

I see exactly 2 orange cats and 3 black dogs sitting on a grassy lawn. The counts match the requested description perfectly. VERDICT: PASS

z-image-local/z-image-turbo FAIL

4 red roses and 2 white daisies in a glass vase on a table

Evaluator

The image contains 7 red roses and 3 white daisies, not the requested 4 red roses and 2 white daisies. While the vase and table are present, the specific flower counts are incorrect. VERDICT: FAIL

z-image-local/z-image-turbo PASS

1 blue butterfly and 3 yellow butterflies resting on a garden fence

Evaluator

I see exactly 1 blue butterfly positioned at the top center and 3 yellow butterflies positioned around it (left, bottom, and right). All are resting on a wooden garden fence. The counts match the prompt's request perfectly. VERDICT: PASS

Hard 0% vs 33%

z-image-local/z-image-turbo FAIL

Exactly 9 goldfish swimming in a large round fishbowl

Evaluator

I count 8 goldfish in the image.
1. Top left
2. Top center
3. Top right
4. Center
5. Far right
6. Middle left
7. Bottom right
8. Bottom left

The prompt requested exactly 9 goldfish. Since there are only 8, the count does not match. VERDICT: FAIL

z-image-local/z-image-turbo FAIL

A fruit bowl containing exactly 3 bananas, 4 oranges, and 2 apples

Evaluator

The image contains:
- **Bananas:** 3 (visible stems and bodies match).
- **Apples:** 2 (one top right, one bottom right).
- **Oranges:** 5. There is one top-left (behind bananas), one top-right, one center (below top-right), one bottom-left, and a fifth one partially visible behind the top-right orange and the top apple.

The prompt requested exactly 4 oranges, but there are 5. VERDICT: FAIL

z-image-local/z-image-turbo FAIL

Exactly 11 colorful balloons floating against a clear blue sky

Evaluator

I count 9 balloons in the image: one yellow, one dark blue, one pink, one purple, one white, one green, one light blue, one orange, and one pale blue. The prompt requested exactly 11 balloons. Since the actual count (9) does not match the requested count (11), the image fails the specific constraint. VERDICT: FAIL

▸ Negation 44% vs 89%

Easy 67% vs 100%

z-image-local/z-image-turbo PASS

A tropical beach scene with no people

Evaluator

The image depicts a serene tropical beach with white sand, palm trees, and turquoise water. I have scanned the entire scene, including the foreground sand, the water, and the distant shoreline near the vegetation. There are no visible human figures, swimmers, or people relaxing on the beach. The scene is completely empty of people. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A cat lying on a couch without a collar

Evaluator

The image shows a tabby cat lying on a grey couch. Upon close inspection of the cat's neck, there is a visible light-colored band, which is a collar. Since the prompt explicitly requested the cat be "without a collar," the presence of this accessory violates the negative constraint. VERDICT: FAIL

z-image-local/z-image-turbo PASS

A classroom with desks and a chalkboard but no students

Evaluator

The image shows a classroom interior featuring rows of empty wooden desks and chairs facing a large green chalkboard. A teacher's desk is positioned at the front. The room is completely empty of people; there are absolutely no students visible in the scene. The excluded element is successfully absent. VERDICT: PASS

Medium 33% vs 100%

z-image-local/z-image-turbo FAIL

A sandy beach with palm trees, seashells, and a beach umbrella but no water or ocean visible

Evaluator

The image displays a sandy beach with palm trees, seashells in the foreground, and a beach umbrella. However, contrary to the exclusion criteria, there is clearly a blue ocean with visible waves in the background stretching to the horizon. The excluded element (water/ocean) is present. VERDICT: FAIL

z-image-local/z-image-turbo FAIL

An office workspace with a desk, chair, bookshelf, and lamp but no computers or screens

Evaluator

The image shows a home office setup with a desk, chair, bookshelf, and lamp. However, prominently displayed on the desk is a large computer monitor (screen) along with a keyboard and mouse. Since the prompt explicitly excluded "computers or screens," and a screen is clearly present, the image fails the exclusion criteria. VERDICT: FAIL

z-image-local/z-image-turbo PASS

A kitchen with pots, pans, a stove, and a cutting board but no food

Evaluator

The image displays a kitchen scene featuring a stovetop with a frying pan, a saucepan, and two larger pots. There are wooden cutting boards visible in the background and on the counter. All cookware either appears empty (the frying pan) or is covered with lids, obscuring any potential contents. There are no visible ingredients, cooked meals, or food items on the counters or cutting boards. VERDICT: PASS

Hard 33% vs 67%

z-image-local/z-image-turbo FAIL

A pepperoni pizza with tomato sauce, basil, and crust but absolutely no cheese

Evaluator

The image shows a pizza with a crust, tomato sauce, basil leaves, and toppings that resemble pepperoni or tomatoes. However, the pizza is clearly covered in melted white and yellowish cheese, which has a glossy, oily texture typical of melted mozzarella. Since the prompt explicitly requested "absolutely no cheese" and the image contains a significant amount of it, the constraint is violated. VERDICT: FAIL

z-image-local/z-image-turbo PASS

A car body with windows, doors, and headlights but no wheels

Evaluator

The image displays the front view of a silver car body shell. It features the windshield frame, hood, front bumper, and headlight housings. Crucially, looking at the bottom of the vehicle, there are absolutely no wheels or tires visible. The car is presented as a bare body shell without any rolling stock attached. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A human face with eyes, nose, and mouth but no eyebrows

Evaluator

The image displays a close-up of a human face featuring eyes, a nose, and a mouth. However, distinct, dark eyebrows are clearly visible above the eyes. Since the prompt explicitly requested "no eyebrows," the presence of this element violates the exclusion criteria. VERDICT: FAIL

▸ Relative Position 83% vs 100%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

A cat sitting on top of a cardboard box

Evaluator

The image shows a tabby cat positioned directly above a brown cardboard box. The cat's paws are resting on the upper surface of the box, confirming the "sitting on top of" relationship described in the prompt. The spatial arrangement is accurate. VERDICT: PASS

z-image-local/z-image-turbo PASS

A red ball underneath a wooden chair

Evaluator

The image shows a shiny red sphere positioned on the floor directly below the seat of a wooden chair, situated between its legs. This perfectly matches the description of a red ball underneath a wooden chair. VERDICT: PASS

z-image-local/z-image-turbo PASS

A coffee mug next to an open book on a desk

Evaluator

The image displays a white coffee mug filled with coffee positioned in the foreground. To its right and slightly behind, there is an open book lying flat on a light-colored surface (the desk). Another open book is visible in the background to the left. The spatial arrangement matches the description: the mug is next to the open book, and both are resting on the desk surface. VERDICT: PASS

Medium 100% vs 100%

z-image-local/z-image-turbo PASS

A bird perched on a branch, the branch extending from a tree next to a park bench

Evaluator

The image shows a bird perched on a branch that extends from a large tree trunk on the right side. In the lower left, slightly out of focus, is a park bench. The spatial arrangement matches the description: the bird is on the branch, the branch comes from the tree, and the tree is situated next to the bench. VERDICT: PASS

z-image-local/z-image-turbo PASS

A cat sitting on a pillow, the pillow placed on a wooden chair

Evaluator

The image shows a cat sitting directly on top of a beige pillow. This pillow is resting on the seat of a wooden chair. The spatial hierarchy is: Cat (top) -> Pillow (middle) -> Chair (bottom). All described positional relationships are accurate. VERDICT: PASS

z-image-local/z-image-turbo PASS

A candle on top of a stack of books, the books next to a potted plant

Evaluator

The image shows a lit candle sitting directly on top of a stack of three books. To the right of the book stack, there is a potted plant with green leaves. The spatial arrangement perfectly matches the description: the candle is on the books, and the books are next to the plant. VERDICT: PASS

Hard 67% vs 100%

z-image-local/z-image-turbo FAIL

Five chess pieces on a board: a king in the center, a queen to its left, a bishop to its right, a rook above the king, and a knight below the king

Evaluator

The image shows six chess pieces, not five. The central piece is a King. Behind it (above) is another King (identifiable by the cross), not a Rook. To the right foreground is a Knight, not "below" the king. To the left foreground is a Bishop. The prompt's description of a rook above the king and a knight below the king is incorrect based on the visual evidence. VERDICT: FAIL

z-image-local/z-image-turbo PASS

A desk scene: laptop in the center, coffee mug to the left, phone to the right, lamp above, and notebook below

Evaluator

The image shows a laptop centered on a wooden desk. To its left is a white coffee mug. To its right is a smartphone. An open notebook is positioned at the bottom of the frame, below the laptop. A desk lamp is situated in the upper right area, casting light over the scene. All described positional relationships (center, left, right, above/upper area, below) are accurate. VERDICT: PASS

z-image-local/z-image-turbo PASS

Five animals in a field: a horse on the far left, a cow to the right of the horse, a sheep in the center, a goat to the right of the sheep, and a dog on the far right

Evaluator

The image displays five animals standing in a grassy field. From left to right, there is a brown horse, a black and white cow, a white sheep in the middle, a white goat, and a black and white dog on the far right. This arrangement perfectly matches the prompt's description of the spatial relationships. VERDICT: PASS

Extreme 67% vs 100%

z-image-local/z-image-turbo FAIL

A landscape with a red barn in the foreground at large scale, a windmill in the midground at medium scale, and snow-capped mountains in the background at small scale, all with consistent depth perspective

Evaluator

The image shows a large red barn in the foreground and a windmill to its left in the midground. However, the snow-capped mountains in the background are massive and dominate the upper half of the image, contradicting the description "small scale." Additionally, the prompt description does not explicitly use the requested positional prepositions (on, under, left, right) to relate the objects, though the depth arrangement is generally consistent. The scale error regarding the mountains is signif

z-image-local/z-image-turbo PASS

A beach scene with a sandcastle in the foreground, a surfer on the waves in the midground, and a cruise ship on the horizon in the background, sizes decreasing with distance

Evaluator

The image perfectly matches the described spatial arrangement. The sandcastle is clearly in the immediate foreground at the bottom. The surfer is positioned in the middle distance on the breaking waves. The cruise ship is located far in the background near the horizon line. The visual sizes decrease appropriately with distance (sandcastle > surfer > ship appearance), adhering to perspective rules. VERDICT: PASS

z-image-local/z-image-turbo PASS

A city park with a park bench and pigeons in the foreground, a fountain in the midground, and skyscrapers in the background, each layer progressively smaller

Evaluator

The image perfectly matches the spatial description. The park bench and pigeons are clearly in the immediate foreground. The fountain is situated in the middle distance (midground), behind the bench. The skyscrapers form the skyline in the background. The layering follows standard perspective where foreground objects appear largest and background objects appear smaller due to distance. VERDICT: PASS

▸ Scale & Proportions 56% vs 89%

Easy 67% vs 100%

z-image-local/z-image-turbo PASS

A person standing next to a yellow school bus, showing realistic relative scale

Evaluator

The image shows a man standing in front of a yellow school bus. The relative scale is realistic: the bus is significantly larger than the person, with the vehicle's height towering over him and the wheel appearing appropriately large (roughly half his height). The proportions are consistent with a real-world scenario. VERDICT: PASS

z-image-local/z-image-turbo PASS

A ladybug sitting on a large green leaf, shown at realistic proportions

Evaluator

The image depicts a ladybug resting on a green leaf. The size of the ladybug relative to the leaf appears natural and consistent with real-world macro photography. The leaf is large enough to comfortably support the insect, and the details (like leaf hairs and veins) suggest a realistic scale. The proportions are accurate. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A child holding a basketball, both at correct relative sizes

Evaluator

The image shows a child holding what appears to be two basketballs stacked vertically. However, the relative sizes are incorrect. The basketballs are disproportionately large compared to the child's torso and arms. A standard basketball would appear smaller relative to a child's body; here, the balls are nearly as wide as the child's chest, and the hands look too small to grip them naturally. VERDICT: FAIL

Medium 67% vs 100%

z-image-local/z-image-turbo PASS

A tiny person standing on the rim of a coffee cup, looking down into it

Evaluator

The image shows a miniature figurine of a man in a suit standing on the rim of a white coffee cup. The figurine is significantly smaller than the cup, accurately reflecting the "tiny person" description. The proportions are consistent with a macro photograph of a small object placed on a standard-sized cup. The person is positioned on the edge and appears to be looking down into the coffee. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A giant tabby cat walking between city skyscrapers like a kaiju

Evaluator

The image shows a tabby cat in the foreground with skyscrapers in the background. While the cat appears large due to the close-up perspective, the skyscrapers are clearly massive structures towering in the distance. The cat is not depicted as a "giant" or "kaiju" relative to the buildings; it looks like a normal-sized cat on a walkway. The prompt requires the cat to be giant relative to the city, which is not the case here. VERDICT: FAIL

z-image-local/z-image-turbo PASS

A miniature house sitting on the palm of someone's hand

Evaluator

The image shows a small model house resting comfortably in the palm of a human hand. The scale is consistent: the house is small enough to be held (miniature), fitting the description perfectly. The proportions of the house relative to the fingers and palm are realistic for a toy or architectural model. VERDICT: PASS

Hard 33% vs 67%

z-image-local/z-image-turbo FAIL

A group photo of an elephant, a horse, a dog, a cat, and a mouse, all standing in a line with correct real-world proportional sizes

Evaluator

The image fails to meet the prompt's requirements. The prompt explicitly requests a "dog," but there is no dog in the image; instead, there are two cats. Additionally, the prompt asks for singular instances of a horse, cat, and mouse, but the image contains two of each. While the visible animals (elephant, horses, cats, mice) have generally correct relative sizes to one another, the absence of the required dog and the presence of extra animals mean the image does not correctly depict the descr

z-image-local/z-image-turbo PASS

A dinner table scene with a plate, fork, knife, wine glass, salt shaker, and peppercorn, all at realistic relative sizes

Evaluator

The image displays a standard table setting where the proportions are accurate. The fork and knife are appropriately sized relative to the dinner plate (roughly similar in length to the plate's diameter). The wine glass is taller than the plate is wide, and the salt and pepper shakers are significantly smaller than the main plate and cutlery, consistent with real-world objects. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A farm scene with a barn, tractor, farmer, cow, chicken, and a fence post, all at correct proportional scale

Evaluator

The image shows a farm scene, but the proportions are inconsistent with reality. The barn is disproportionately small compared to the other elements. Specifically, the cow is almost as tall as the barn's lower doors (which should be roughly 10-12 feet high), making the cow appear gigantic or the barn miniature. Similarly, the tractor appears nearly as tall as the barn doors. While the farmer, cow, and tractor have somewhat consistent relative sizes to each other, the barn breaks the scale. VE
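The tier and subcategory percentages for Scale & Proportions are consistent with plain pass counts over the verdict badges listed for z-image-local/z-image-turbo. The sketch below reproduces that arithmetic. It assumes each tier holds exactly the three verdicts shown and that rates are rounded to the nearest whole percent; the names `verdicts` and `pass_rate` are illustrative, not taken from the benchmark.

```python
# Verdict badges for z-image-local/z-image-turbo under Scale & Proportions,
# transcribed from the entries listed above.
verdicts = {
    "Easy": ["PASS", "PASS", "FAIL"],
    "Medium": ["PASS", "FAIL", "PASS"],
    "Hard": ["FAIL", "PASS", "FAIL"],
}

def pass_rate(results):
    """Return the share of PASS verdicts as a whole-number percentage."""
    return round(100 * results.count("PASS") / len(results))

tier_rates = {tier: pass_rate(results) for tier, results in verdicts.items()}
all_results = [v for results in verdicts.values() for v in results]
overall = pass_rate(all_results)

print(tier_rates)  # {'Easy': 67, 'Medium': 67, 'Hard': 33}
print(overall)     # 56
```

Five passes out of nine evaluations gives the 56% shown in the subcategory header, and the per-tier figures match the 67%, 67%, and 33% tier headers.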

Human realism 95% vs 95%

▸Faces & Expressions 92% vs 92%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

A close-up portrait of a young woman with a warm natural smile

Evaluator

The face in the image is highly symmetrical with correctly proportioned and natural features. The eyes, eyebrows, nose, and mouth are well-aligned and balanced. The smile is warm, natural, and clearly conveys happiness — the lips curve gently upward, teeth are visible in a relaxed grin, and the cheeks lift slightly, all contributing to an authentic, pleasant expression. There are no noticeable distortions or unnatural artifacts in the facial structure or expression. The lighting and composition

z-image-local/z-image-turbo PASS

A portrait of a middle-aged man with a calm, serious expression

Evaluator

The face in the image is symmetric and features are anatomically correct — eyes, nose, mouth, and ears are proportionally aligned and natural. There are no obvious distortions. The expression is calm and serious, with neutral lips, direct gaze, and minimal facial movement, which clearly conveys the intended emotion. The subject appears to be middle-aged, with visible signs of aging such as wrinkles and some graying hair, consistent with the prompt. VERDICT: PASS

z-image-local/z-image-turbo PASS

A headshot of a teenager with a relaxed, neutral face

Evaluator

The face in the image is symmetric and features are correctly proportioned — eyes, nose, mouth, and ears are well-aligned and natural. The teenager’s expression is relaxed and neutral, with no strong emotion conveyed, matching the prompt. There are no visible distortions; the skin texture, lighting, and facial structure appear realistic and well-rendered. VERDICT: PASS

Medium 100% vs 100%

z-image-local/z-image-turbo PASS

A close-up of a person's face showing clear disgust, nose wrinkled and lip curled

Evaluator

The face in the image is largely symmetric, with both eyes, eyebrows, and lips appearing balanced in structure. The features are anatomically plausible — eyebrows are arched and furrowed, the nose is wrinkled, and the lips are curled downward, all of which strongly convey the emotion of disgust as requested. The expression is highly readable and intense, with visible skin texture and fine wrinkles that enhance the realism of the emotion. There are no significant distortions that compromise the f

z-image-local/z-image-turbo PASS

A portrait capturing genuine surprise — raised eyebrows, wide eyes, slightly open mouth

Evaluator

The face is largely symmetric with correctly rendered features: both eyes are wide and bulging, eyebrows are raised, and the mouth is slightly open — all aligning with the prompt’s request for “genuine surprise.” The facial structure is natural and undistorted, with no obvious asymmetry or anatomical anomalies. The emotion is clearly readable as surprise, conveyed through the exaggerated eye and mouth expression. There are no visible distortions. VERDICT: PASS

z-image-local/z-image-turbo PASS

A face showing quiet melancholy — downcast eyes, slightly drawn mouth, distant gaze

Evaluator

The face is largely symmetric with correctly rendered anatomical features — eyes, nose, and mouth are proportionally accurate and aligned. There are no obvious distortions or unnatural deformities. The expression conveys quiet melancholy: the eyes are downcast, the mouth is slightly drawn (not smiling or frowning, but neutral with a hint of sadness), and the gaze is distant and introspective. Lighting and composition enhance the somber mood without compromising facial integrity. VERDICT: PASS

Hard 100% vs 100%

z-image-local/z-image-turbo PASS

An elderly woman in her 80s with deep wrinkles and white hair, smiling warmly with kind eyes

Evaluator

The face is largely symmetrical with natural, correctly rendered features: the eyes, nose, and mouth are balanced, and the white, slightly wavy hair is appropriately styled. The deep wrinkles around the eyes and mouth are consistent with an elderly woman in her 80s, and the smile appears warm and genuine, with crinkled eyes conveying kindness. There are no obvious distortions—no misaligned features, unnatural proportions, or AI artifacts. The emotion is clearly readable as warm, gentle, and affe

z-image-local/z-image-turbo PASS

A 5-year-old child with round cheeks and large eyes, crying with tears streaming down their face

Evaluator

The face in the image is symmetric and features are anatomically correct for a young child. The child has round cheeks, large eyes (though squinted due to crying), and tears are clearly streaming down both cheeks. The expression is unmistakably one of intense crying — the mouth is wide open, eyebrows are furrowed, and the facial muscles are tensed appropriately for distress. There are no noticeable distortions; the features are natural and well-proportioned. VERDICT: PASS

z-image-local/z-image-turbo PASS

An elderly man in his 70s with age spots and thinning gray hair, looking skeptically over reading glasses

Evaluator

The face in the image is largely symmetric with natural, correctly rendered features. The elderly man appears to be in his 70s, with visible age spots, thinning gray hair, and is wearing reading glasses. His expression is skeptical — the brow is slightly furrowed, the lips are pressed in a neutral-to-disapproving line, and his gaze is directed away, conveying doubt or skepticism. There are no significant distortions; the facial structure, proportions, and details (like skin texture, eye shape, a

Extreme 67% vs 67%

z-image-local/z-image-turbo PASS

A group of three friends at a table: one laughing with head tilted back, one looking shocked with wide eyes, and one rolling their eyes with a smirk

Evaluator

The image shows three individuals seated at a table, each displaying a distinct and clearly readable emotion:
- Left: A woman laughing with her head tilted back, eyes closed, and mouth open — this matches the “laughing with head tilted back” prompt.
- Center: A woman with wide eyes, open mouth, and hands on cheeks — this clearly conveys “shocked” or “surprised” emotion.
- Right: A person with one eye squinted, lips pulled into a smirk, and hands supporting the face — this effectively portrays “

z-image-local/z-image-turbo PASS

A family photo with four people: a father beaming with pride, a mother smiling gently, a teenager looking bored, and a child grinning excitedly

Evaluator

All four faces in the image are symmetric with correct, natural features. No distortions are present.
- The father (top left) is beaming with pride — his wide smile and bright eyes convey genuine joy.
- The mother (top right) is smiling gently — her soft expression and slight upturn of lips match the description.
- The teenager (front left) looks bored — his neutral expression, slightly downturned mouth, and lack of engagement clearly convey disinterest.
- The child (front right) is grinning ex

z-image-local/z-image-turbo FAIL

Three coworkers in a meeting: one looking frustrated with furrowed brows, one nodding with a pleased expression, and one looking confused with a tilted head

Evaluator

The image shows three individuals seated at a table, each with laptops, appearing to be in a meeting. However, their facial expressions do not match the prompt’s requested emotions:
- Left person: Frowning, hands on cheeks — this suggests frustration or stress, which aligns with the “frustrated with furrowed brows” prompt.
- Middle person: Eyes closed, hands on cheeks, slight smile — this appears to be a mix of relief, exhaustion, or resignation, not “pleased expression” as requested.
- Right p

▸Full Body 92% vs 100%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

A woman standing upright in a relaxed pose, arms at her sides, wearing jeans and a white t-shirt

Evaluator

The image shows a full-body view of a woman standing upright with her arms at her sides, wearing a white t-shirt and blue jeans, consistent with the prompt.
- Limb count: Correct — two arms, two legs, visible torso and head.
- Proportions: Anatomically correct — limbs and torso proportions appear natural and balanced.
- Pose: The woman is standing upright in a relaxed pose with arms at her sides — fully matches the prompt.
- Action/pose recognizability: Yes — the pose is clearly identifiable as

z-image-local/z-image-turbo PASS

A man standing straight facing the camera, hands in pockets, normal proportions

Evaluator

The image shows a full-body shot of a man standing straight and facing the camera, with both hands in his pockets. His posture is upright and symmetrical, and his proportions appear natural and anatomically correct — head, torso, arms, and legs are in proportion with no visible distortions. He has two arms, two legs, and all limbs are fully intact and correctly positioned. The pose is clearly recognizable as “standing straight, hands in pockets,” which matches the prompt exactly. There are no no

z-image-local/z-image-turbo PASS

A person standing in a neutral pose in a park, full body visible from head to feet

Evaluator

The image shows a full-body view of a person standing in a park, from head to feet, as requested. The person is in a neutral, upright standing pose with arms hanging loosely at the sides and hands slightly curled. The posture is natural and recognizable as a standard standing pose. Limb count is correct: two arms, two legs, with all major joints (shoulders, elbows, wrists, hips, knees, ankles) appearing anatomically accurate and proportionally aligned. The proportions of the body — head-to-shou

Medium 100% vs 100%

z-image-local/z-image-turbo PASS

A woman mid-stride running through a park, hair and clothes showing motion

Evaluator

The image shows a woman in mid-stride running through a park. Her hair is flying back, and her clothing (white t-shirt, dark shorts, running shoes) shows motion consistent with running.

Anatomical correctness:
- Limb count: Correct — two arms, two legs.
- Proportions: Natural and proportional — no exaggerated limbs or torso.
- Pose: Anatomically correct running pose — one leg forward, the other back, arms bent and swinging, body leaning slightly forward.

The motion blur in hair and clothing en

z-image-local/z-image-turbo PASS

A man captured mid-jump over a puddle, body in the air with bent knees

Evaluator

The image shows a man captured mid-air, seemingly jumping over a puddle. His body is crouched with bent knees, arms slightly extended forward, and feet off the ground — consistent with the described action. Limb count is correct: two arms, two legs, and a torso. Proportions appear anatomically plausible — the torso is aligned with the legs, the arms are positioned naturally, and the head is proportionally sized. The pose is recognizable as a jump, with the body compacted in a crouch, which is

z-image-local/z-image-turbo PASS

A dancer in a dynamic spin, dress flowing outward with one leg extended

Evaluator

The dancer is captured mid-spin, en pointe, with arms gracefully arched above the head and the dress swirling outward — a dynamic, expressive pose that matches the prompt’s description. The dress flows realistically with the motion, and the fabric’s movement is physically plausible. Limb count is correct: two arms, two legs, and the torso are intact. Proportions are anatomically accurate — the dancer’s body is slender but proportionate, with no obvious distortions in limb length or joint place

Hard 67% vs 100%

z-image-local/z-image-turbo PASS

A person in a yoga tree pose — standing on one leg with the other foot pressed to the inner thigh, arms raised overhead

Evaluator

The image shows a person performing the yoga tree pose (Vrksasana). The pose is recognizable: one leg is standing firmly on the floor, the other foot is pressed against the inner thigh of the standing leg, and the arms are raised overhead with palms pressed together. The limb count is correct — two arms, two legs, two feet — and proportions appear anatomically plausible. The pose is executed with good alignment: the torso is upright, the hips are level, and the standing leg is engaged. There are

z-image-local/z-image-turbo PASS

A rock climber clinging to a cliff face with arms and legs extended to different holds

Evaluator

The climber in the image has two arms and two legs — limb count is correct. Proportions appear anatomically plausible: the torso, limbs, and joints are proportioned in a way consistent with a human body in dynamic motion. The pose is recognizable as a rock climbing action — arms extended to grip holds, legs positioned for balance and leverage, body angled against the wall. The climber’s posture is physically plausible for the activity, with no obvious distortions in joint angles or limb placemen

z-image-local/z-image-turbo FAIL

A cellist seated with correct posture, legs around the instrument, bow arm extended

Evaluator

The image shows a cellist seated with the cello positioned between their legs, supported by a stand. The bow is held in the right hand and is extended across the strings, consistent with playing posture. The left hand is positioned on the fingerboard, as expected for playing. The cellist’s legs are not wrapped around the instrument — they are seated with the cello resting between them, which is the standard posture for cello playing. The instrument is properly supported by a stand, not held by t

Extreme 100% vs 100%

z-image-local/z-image-turbo PASS

Two tango dancers in a close embrace, the man dipping the woman backward with intertwined legs

Evaluator

The image shows two tango dancers in a dynamic, close embrace. The woman is arched backward, supported by the man, with her legs extended and intertwined with his — specifically, her right leg is wrapped around his left thigh, and her left leg is extended outward, creating a classic tango dip pose. Both dancers have full, anatomically correct limb counts (two arms, two legs each), and their proportions are realistic and consistent with human anatomy. The pose is clearly recognizable as a tango d

z-image-local/z-image-turbo PASS

Two soccer players mid-tackle, one sliding and the other jumping over with the ball at their feet

Evaluator

The image shows two soccer players in a dynamic tackle: one player is sliding on the ground, and the other is airborne, jumping over him while controlling the ball at his feet. This matches the prompt’s description.

Anatomical analysis:
- Limb count: Both players have all limbs present — arms, legs, hands, feet — no missing or extra limbs.
- Proportions: Body proportions are realistic — torsos, limbs, and heads are proportionate to each other and to the players’ athletic build.
- Pose: The slid

z-image-local/z-image-turbo PASS

Two gymnasts performing a synchronized handstand, side by side with identical body alignment

Evaluator

The image shows two gymnasts performing synchronized handstands side by side. Both have identical body alignment — their legs are straight and parallel, feet pointed, torsos aligned vertically, and hands placed shoulder-width apart on the mat. Limb count is correct: each has two arms, two legs, and the correct number of fingers and toes. Proportions are anatomically accurate — the limbs are proportionate to the torso, and the body is balanced in the handstand position. The pose is clearly recogn

▸Hands 100% vs 92%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

A person standing with both hands visible at their sides, fingers relaxed and naturally spread

Evaluator

Both hands are clearly visible, palms facing forward, fingers relaxed and naturally spread as requested in the prompt.

Counting the fingers on each hand:
- Left hand: Thumb, index, middle, ring, pinky — 5 fingers.
- Right hand: Thumb, index, middle, ring, pinky — 5 fingers.

The anatomy is correct: each hand has five digits (thumb + four fingers), and the posture is natural and relaxed, consistent with a person standing with hands at their sides. No anomalies are present — no extra fingers, mi

z-image-local/z-image-turbo PASS

A woman holding her hands out in front of her palms-up, all ten fingers clearly visible

Evaluator

Both hands are clearly visible, palms up, with all ten fingers distinctly shown. Each hand displays five fingers: thumb, index, middle, ring, and pinky — no fingers are missing or duplicated. The gesture is anatomically correct: the hands are extended forward, palms open, fingers slightly spread, which is a natural and common pose. There are no anomalies such as extra fingers, fused digits, or unnatural deformities. VERDICT: PASS

z-image-local/z-image-turbo PASS

A man with arms crossed, both hands resting on opposite upper arms with fingers visible

Evaluator

Both hands are visible in the image, resting on opposite upper arms as described in the prompt. Each hand clearly shows five fingers: thumb, index, middle, ring, and pinky. The fingers are anatomically positioned and appear natural, with no missing or extra digits. The grip is consistent with a typical crossed-arm posture — hands resting on opposite upper arms, fingers slightly curled and overlapping, which is a common and anatomically plausible gesture. No anomalies are present. VERDICT: PASS

Medium 100% vs 100%

z-image-local/z-image-turbo PASS

A person holding a white ceramic coffee mug by the handle with their right hand

Evaluator

The visible hand in the image is holding the mug by its handle with the thumb on top and the other fingers curled around the back of the handle. All five fingers are clearly visible: thumb, index, middle, ring, and pinky. The grip is anatomically correct — the thumb and fingers are wrapped around the handle in a natural, functional manner typical of holding a mug. There are no anomalies such as extra fingers, missing fingers, or unnatural joint positions. VERDICT: PASS

z-image-local/z-image-turbo PASS

A woman writing in a notebook with a pen gripped correctly in her right hand

Evaluator

The visible hand in the image is holding a pen and writing in a notebook.

Counting the fingers:
- Thumb: visible
- Index finger: wrapped around the pen
- Middle finger: supporting the pen
- Ring finger: curled under the pen
- Little finger: partially visible, curled under

All five fingers are clearly present and anatomically positioned. The grip is a standard tripod grip — index finger, middle finger, and thumb holding the pen — which is a common and anatomically correct way to hold a pen for

z-image-local/z-image-turbo PASS

A person playing an acoustic guitar with left hand on the fretboard and right hand strumming

Evaluator

Left hand (on fretboard): 5 fingers visible — thumb, index, middle, ring, pinky — all positioned correctly to press strings on the fretboard. Anatomically accurate. Right hand (strumming): 5 fingers visible — thumb, index, middle, ring, pinky — all positioned naturally for strumming. Anatomically accurate. No anomalies detected. Fingers count is exactly 5 per hand. Gestures are consistent with standard guitar playing posture. VERDICT: PASS

Hard 100% vs 100%

z-image-local/z-image-turbo PASS

A person making a peace sign with their right hand, clearly showing two fingers up and three curled

Evaluator

The image shows a single hand making a peace sign gesture, with the index and middle fingers extended upward and the other three fingers (ring, pinky, and thumb) curled inward. All five fingers are clearly visible and anatomically present — no fingers are missing or duplicated. The gesture is anatomically correct: the index and middle fingers are extended, while the ring finger, pinky, and thumb are curled into the palm. The thumb is positioned naturally against the side of the hand, which is t

z-image-local/z-image-turbo PASS

A hand giving a thumbs-up with four fingers curled and thumb extended upward

Evaluator

The image shows a single hand performing a thumbs-up gesture. The thumb is extended upward, and the other four fingers (index, middle, ring, and little) are curled into a fist. All five fingers are clearly visible and anatomically distinct.
- Count of fingers per hand: 5 (thumb + four curled fingers).
- Anatomical correctness: The gesture is standard and anatomically accurate. The thumb is extended, and the other fingers are curled naturally, consistent with a typical thumbs-up pose.
- No anoma

z-image-local/z-image-turbo PASS

A person counting to three on their fingers, with index, middle, and ring fingers extended

Evaluator

The image shows a single hand with the index, middle, and ring fingers extended — matching the prompt’s description of “counting to three.” The thumb is not visible, and the pinky finger is also not visible, so only three fingers are extended. The hand appears to be a left hand, palm facing up, with the thumb tucked away or out of frame. Counting the visible fingers: only three are extended (index, middle, ring). The thumb and pinky are not visible, so we cannot confirm they are present or abse

Extreme 100% vs 67%

z-image-local/z-image-turbo PASS

Two people shaking hands in a professional setting, all fingers of both hands anatomically correct

Evaluator

Each visible hand clearly shows five fingers: thumb, index, middle, ring, and pinky. The grip is a standard, professional handshake — palms facing each other, fingers interlocked, thumbs resting on the outer side. The anatomy is consistent with real human hands: knuckles, finger joints, and nail beds are proportionally accurate and natural. No anomalies such as extra fingers, missing fingers, or unnatural joint positions are present. VERDICT: PASS

z-image-local/z-image-turbo PASS

Two people doing a high-five, palms meeting with all fingers extended on both hands

Evaluator

Each visible hand clearly shows five fingers: thumb, index, middle, ring, and pinky. The fingers are fully extended and meet palm-to-palm in a high-five gesture. The anatomy is accurate — no extra or missing fingers, no unnatural deformities, and the positioning is consistent with a natural high-five. The gesture is anatomically correct and matches the prompt. VERDICT: PASS

z-image-local/z-image-turbo PASS

One person handing a small gift box to another person, both sets of hands gripping the box correctly

Evaluator

Examining the hands in the image:
- The hands of the person giving the gift (top pair, wearing a dark blue sweater) are clearly visible. Each hand shows five fingers: thumb, index, middle, ring, and pinky. No fingers are missing or duplicated.
- The hands receiving the gift (bottom pair, wearing a white sleeve) are also visible. Each hand also clearly shows five fingers.
- The grip is anatomically correct: the giver’s hands cradle the box with palms slightly cupped and fingers wrapped around th

▸Multi-Subject 100% vs 100%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

Two women standing side by side: one with short blonde hair wearing a red jacket, the other with long black hair wearing a blue dress

Evaluator

Person 1 (left):
- Hair: Short, blonde, styled in a bob cut.
- Clothing: Wearing a red jacket over a dark-colored top.
- Observed traits: Visually distinct, matches description.

Person 2 (right):
- Hair: Long, straight, black.
- Clothing: Wearing a sleeveless blue dress with a belt.
- Observed traits: Visually distinct, matches description.

All individuals are present, visually distinct, and match their described features. VERDICT: PASS

z-image-local/z-image-turbo PASS

Two men at a cafe: one with a beard and glasses wearing a green sweater, the other clean-shaven in a black hoodie

Evaluator

Person 1 (left):
- Has a full, reddish-brown beard.
- Wears black-rimmed glasses.
- Wears a green ribbed sweater.
- Has brown, styled hair.
- Visible watch on left wrist.

Person 2 (right):
- Clean-shaven (no facial hair).
- Has short, light brown/blond hair.
- Wears a black hoodie with drawstrings.
- Has light-colored eyes.

Both individuals are visually distinct and match the described features exactly. VERDICT: PASS

z-image-local/z-image-turbo PASS

Two friends posing: one tall with curly red hair in denim overalls, the other short with straight brown hair in a yellow sundress

Evaluator

Person 1 (tall):
- Hair: Curly, red (appears as vibrant copper-red curls, matching description)
- Clothing: Denim overalls (blue denim bib overalls over a white t-shirt)
- Height: Clearly taller than the other person, as expected

Person 2 (short):
- Hair: Straight, brown (long, straight brown hair, matching description)
- Clothing: Yellow sundress (bright yellow, sleeveless, gathered-waist dress)
- Height: Clearly shorter than the other person, as expected

Both individuals are visually disti

Medium 100% vs 100%

z-image-local/z-image-turbo PASS

Four people at a dinner table: a bald man in a suit, a woman with silver hair in a red blouse, a young man with dreadlocks in a denim jacket, and a woman with a hijab in a green dress

Evaluator

- Bald man in a suit: Present. Seen from the side, wearing a dark suit with a white shirt. Bald head is clearly visible.
- Woman with silver hair in a red blouse: Present. Seated across from the bald man, has silver-gray wavy hair, wearing a red collared blouse, and gold jewelry.
- Young man with dreadlocks in a denim jacket: Present. Seated next to the woman in red, has dark dreadlocks, wearing a blue denim jacket over a white shirt.
- Woman with a hijab in a green dress: Present. Seated on the

z-image-local/z-image-turbo PASS

Three musicians on stage: a tall woman with short pink hair playing bass, a man with a beard and bandana on drums, and a slim man with glasses playing keyboard

Evaluator

- Tall woman with short pink hair playing bass: Present. She has bright pink short hair, is playing a bass guitar, and is visibly tall in proportion to the others. ✅
- Man with a beard and bandana on drums: Present. He is seated behind a drum kit, has a full beard, and is wearing a black bandana. ✅
- Slim man with glasses playing keyboard: Present. He is on the right, playing a keyboard, has curly brown hair, wears glasses, and appears slim. ✅

All three individuals are visually distinct and mat

z-image-local/z-image-turbo PASS

Three hikers on a trail: a man with a backpack and sunhat, a woman with braided hair and hiking poles, and a teenager with a baseball cap and water bottle

Evaluator

Person 1 (Man with backpack and sunhat):
- Observed: Seen from the back, wearing a beige sunhat and a large dark gray backpack. He is holding a hiking pole.
- Matches description: Yes — backpack and sunhat are present.

Person 2 (Woman with braided hair and hiking poles):
- Observed: Facing forward, smiling, with long braided hair, wearing a beige sunhat, and holding two hiking poles.
- Matches description: Yes — braided hair, hiking poles, and sunhat are present.

Person 3 (Teenager with baseba
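The Human realism figure of 95% for z-image-local/z-image-turbo appears to be a pooled pass rate over all evaluations in the category, not an average of the four subcategory percentages. The sketch below checks both readings against the verdict badges listed above. The pass counts are tallied from those badges, and the names `counts`, `pooled`, and `macro` are illustrative assumptions.

```python
# (passes, total) per Human realism subcategory for z-image-local/z-image-turbo,
# counted from the PASS/FAIL badges listed above.
counts = {
    "Faces & Expressions": (11, 12),
    "Full Body": (11, 12),
    "Hands": (12, 12),
    "Multi-Subject": (6, 6),
}

# Pooled rate: total passes over total evaluations in the category.
passes = sum(p for p, _ in counts.values())
total = sum(t for _, t in counts.values())
pooled = round(100 * passes / total)

# Macro average: mean of the per-subcategory rates, shown for comparison.
macro = round(sum(100 * p / t for p, t in counts.values()) / len(counts))

print(passes, total, pooled)  # 40 42 95
print(macro)                  # 96
```

Pooling gives 40 of 42, which rounds to the 95% in the category header, while averaging the subcategory rates would give 96%. This is only an observation about what the displayed numbers are consistent with; the page itself does not state the aggregation rule.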

Truthfulness 78% vs 0%

▸Photorealism 67% vs 0%

Medium 67% vs 0%

z-image-local/z-image-turbo PASS

A still life on a wooden table featuring a clear glass of water, a polished silver spoon, a folded linen napkin, and a ripe red apple — each material rendered with accurate texture and light behavior

Evaluator

The image accurately renders the requested materials. The glass shows proper refraction and transparency, the spoon exhibits realistic metallic reflections, the apple has a convincing skin texture with water droplets, the napkin displays natural fabric folds, and the wooden table features appropriate grain and lighting. VERDICT: PASS

z-image-local/z-image-turbo FAIL

A workspace desk with a brushed aluminum laptop, a leather-bound notebook, a ceramic coffee mug, and a glass paperweight — all materials visually distinct and realistic

Evaluator

The image successfully renders the brushed aluminum laptop, leather notebook, and ceramic mug with accurate textures and lighting. However, the prompt explicitly requested a "glass paperweight," which is completely absent from the scene. Since a key material requested in the prompt is missing, the image fails to meet the criteria. VERDICT: FAIL

z-image-local/z-image-turbo PASS

A kitchen counter with a stainless steel pot, a wooden cutting board, a glass bottle of olive oil, and a cotton dish towel — each surface reflecting light correctly

Evaluator

The image accurately renders the distinct visual properties of each material. The stainless steel pot shows sharp, metallic reflections and a brushed texture. The glass bottle exhibits translucency and refraction, with the olive oil visible inside. The wooden cutting board displays natural grain and matte texture, while the cotton towel shows soft fabric folds and diffuse light absorption. The induction cooktop surface correctly reflects the pot and surrounding objects. All materials behave phys

▸Physics & Reflections92%vs0%

Easy 100%vs0%

z-image-local/z-image-turboPASS

A red sphere on a white surface with a single shadow cast to the right, consistent with a light source from the upper left

Evaluator

The shadow is cast to the right and slightly downward, which is consistent with a light source from the upper left. The shadow’s soft edges and gradual fade are physically plausible for a diffused light source. The sphere’s highlight on the upper-left side also aligns with this lighting direction. No distortions or color anomalies are present. VERDICT: PASS

z-image-local/z-image-turboPASS

A coffee mug on a sunny windowsill casting a sharp shadow to the left

Evaluator

The shadow is cast to the left, consistent with sunlight coming from the right (window side). The shadow’s shape matches the mug’s silhouette, including the handle, and shows appropriate softness near the base due to light diffusion. Reflections on the mug’s glossy surface are subtle and realistic. No distortions or color anomalies are present. VERDICT: PASS

z-image-local/z-image-turboPASS

A person standing outdoors at midday with a short shadow directly below them

Evaluator

The shadow is cast directly below the subject, which aligns with the “midday” lighting condition described in the prompt. The shape of the shadow accurately mirrors the person’s posture and clothing folds, showing no distortion or misalignment. The color is a natural dark gray, consistent with asphalt under bright sunlight. No reflections are present, as expected for this scene. All shadow attributes are physically plausible. VERDICT: PASS
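The midday check above rests on simple geometry: on flat ground, a shadow's length is the object's height divided by the tangent of the sun's elevation angle, so a high sun gives a short shadow. A minimal Python sketch of that relationship follows; the function name, the 1.7 m height, and the two elevation angles are illustrative assumptions, not values taken from the evaluation.

```python
import math


def shadow_length(height_m: float, sun_elevation_deg: float) -> float:
    """Length of the shadow a vertical object casts on flat ground."""
    if not 0 < sun_elevation_deg <= 90:
        raise ValueError("sun elevation must be in (0, 90] degrees")
    return height_m / math.tan(math.radians(sun_elevation_deg))


# Hypothetical values, for illustration only.
midday = shadow_length(1.7, 80)   # sun high in the sky
evening = shadow_length(1.7, 15)  # sun low on the horizon
print(f"midday: {midday:.2f} m, evening: {evening:.2f} m")
```

With the sun near the zenith the shadow shrinks toward the subject's feet, which is the behavior the prompt asks for.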

Medium 67%vs0%

z-image-local/z-image-turboFAIL

A woman standing in front of a full-length mirror, her reflection accurately showing the back of her outfit and reversed orientation

Evaluator

The reflection is physically implausible. The woman is facing the mirror, so her reflection should show her front. Instead, the reflection shows her back, which contradicts how mirrors work. Additionally, the reflection’s pose and orientation do not match a true mirror image — it appears as if the reflection is a separate person turned around, not a mirrored version of the subject. This violates basic optical principles. VERDICT: FAIL

z-image-local/z-image-turboPASS

A cat sitting on a polished marble floor with its reflection visible below

Evaluator

The reflection is physically plausible. It is correctly oriented (inverted vertically), aligned with the cat’s position, and shows appropriate distortion due to the polished marble surface. The color and brightness match the original subject, and the reflection fades naturally toward the edges, consistent with real-world optics on a glossy floor. No directional or structural inconsistencies are present. VERDICT: PASS

z-image-local/z-image-turboPASS

A vase of flowers on a glass table with a clear reflection visible on the table surface

Evaluator

The reflection on the glass table is physically plausible. It correctly mirrors the vase and stems with appropriate vertical inversion and slight distortion consistent with a reflective surface. The color and brightness of the reflection match the object above, and the direction aligns with the light source. No anomalies in orientation or distortion are present. VERDICT: PASS

Hard 100%vs0%

z-image-local/z-image-turboPASS

A row of trees reflected in a calm lake at sunset, with the reflection slightly darker and gently rippled

Evaluator

The reflections are vertically aligned with the trees, correctly inverted, and exhibit natural ripple distortion consistent with a calm lake surface. The color tone of the reflection is slightly darker than the actual trees, matching the prompt’s description. No directional or orientation errors are present. VERDICT: PASS

z-image-local/z-image-turboPASS

A city skyline reflected in a rain puddle on a street, with mild distortion and color shift

Evaluator

The reflection in the puddle is physically plausible. It correctly mirrors the skyline with appropriate vertical inversion and perspective distortion consistent with a low-angle water surface. The color shift is subtle but realistic, showing slightly muted tones and light scattering typical of reflections on wet asphalt. Direction and orientation align with the scene’s lighting and geometry. VERDICT: PASS

z-image-local/z-image-turboPASS

A mountain range reflected in a still alpine lake, with the reflection inverted and subtly muted in color

Evaluator

The reflection is physically plausible: it is correctly inverted, aligned with the shoreline, and shows appropriate color muting and slight softening consistent with water surface behavior. Shadows on the mountains are consistent with a low-angle light source (likely sunrise/sunset), and their reflections match in direction and intensity. No distortions or misalignments are evident. VERDICT: PASS

Extreme 100%vs0%

z-image-local/z-image-turboPASS

A clear glass sphere sitting on a mirror surface near a pool of water, showing refraction through the sphere, reflection on the mirror, and water reflection simultaneously

Evaluator

The image displays a clear glass sphere on a reflective surface. The reflection is vertically aligned and correctly inverted, matching the sphere’s position. Refraction through the sphere shows a distorted, inverted view of the background, which is physically accurate for a convex lens effect. The contact shadow and water ripple distortion around the base are subtle but plausible. Colors and lighting are consistent with natural daylight. No directional or orientation errors detected. VERDICT: P

z-image-local/z-image-turboPASS

A chrome sculpture in a room with a glass floor and large windows, showing multiple reflections and environmental mapping on every surface

Evaluator

The reflections are physically plausible. The chrome sculpture accurately mirrors the environment, including the window frames and sky, with appropriate distortion based on its curved surface. The glass floor reflects the sculpture and the room’s structure correctly, maintaining proper orientation and perspective. Shadows are minimal but consistent with the bright, diffused lighting from the large windows. VERDICT: PASS

z-image-local/z-image-turboPASS

A crystal wine glass on a polished marble counter next to a window overlooking a lake — showing refraction in the glass, reflection on the marble, and the lake's own reflective surface

Evaluator

The reflections and refractions are physically plausible. The wine glass correctly refracts the background lake, showing an inverted image of the waterline. The marble surface displays a soft, accurate reflection of the glass stem and base. The lighting is consistent, with shadows and highlights aligning with a light source from the window. The lake’s surface also reflects the sky and distant trees naturally. VERDICT: PASS
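The tier and category percentages in this section look consistent with a plain pass count: the Physics & Reflections verdicts listed above give 11 passes out of 12, which rounds to the 92% shown in the category header. The Python sketch below reproduces that arithmetic. It assumes (the page does not state this) that each percentage is passes divided by evaluations, rounded to the nearest whole number; the `pass_rate` helper and the dictionary layout are hypothetical.

```python
def pass_rate(verdicts):
    """Percentage of PASS verdicts, rounded to the nearest integer."""
    if not verdicts:
        raise ValueError("no verdicts to aggregate")
    passes = sum(1 for v in verdicts if v == "PASS")
    return round(100 * passes / len(verdicts))


# Verdict labels as listed for z-image-local/z-image-turbo in this category.
physics = {
    "Easy": ["PASS", "PASS", "PASS"],
    "Medium": ["FAIL", "PASS", "PASS"],
    "Hard": ["PASS", "PASS", "PASS"],
    "Extreme": ["PASS", "PASS", "PASS"],
}

for tier, verdicts in physics.items():
    print(tier, pass_rate(verdicts))

# Category rate is computed over all evaluations, not averaged per tier.
all_verdicts = [v for verdicts in physics.values() for v in verdicts]
print("Physics & Reflections", pass_rate(all_verdicts))
```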

▸World Knowledge67%vs0%

Easy 100%vs0%

z-image-local/z-image-turboPASS

The Eiffel Tower in Paris on a clear day, architecturally accurate with its iron lattice structure and three levels

Evaluator

The image depicts the Eiffel Tower with high architectural accuracy. The iron lattice structure, the three distinct levels (including the first and second platforms and the top section), and the iconic arch at the base are all correctly rendered. The perspective is realistic, and the clear blue sky matches the prompt’s description of a “clear day.” No significant inaccuracies are present. VERDICT: PASS

z-image-local/z-image-turboPASS

The Taj Mahal in Agra seen from the reflecting pool, with accurate dome, minarets, and symmetry

Evaluator

The image depicts the Taj Mahal with high architectural accuracy. The central dome, four minarets, and symmetrical layout are correctly rendered. The reflection in the pool is precise, and the surrounding landscaping (cypress trees, pathways) matches the real-world site. Minor details like the finial on the dome and the arched entrances are also accurate. No significant factual or structural inaccuracies are present. VERDICT: PASS

z-image-local/z-image-turboPASS

The Statue of Liberty on her island pedestal, with correct torch, crown, tablet, and green patina

Evaluator

The subject is clearly recognizable as the Statue of Liberty. The green patina, crown spikes, tablet, and torch are all present and correctly positioned. The pedestal architecture matches the real-world structure with its stone masonry and arched windows. No significant inaccuracies are visible. VERDICT: PASS

Medium 33%vs0%

z-image-local/z-image-turboFAIL

A traditional Japanese shamisen instrument with accurate three-stringed design, wooden body, and bachi plectrum

Evaluator

The instrument shown is a Ruan (a Chinese lute), not a Shamisen. A Shamisen has a long neck, a square body covered in skin, and typically uses a bachi plectrum. This image shows a round wooden body, frets, and four strings (not three). The subject is not factually accurate to the prompt. VERDICT: FAIL

z-image-local/z-image-turboFAIL

A medieval astrolabe with accurate rete, tympan, and engraved degree markings on brass

Evaluator

The image depicts a brass navigational instrument, but it is not a medieval astrolabe. It lacks the characteristic rete (star map) and tympan (geographic plate) of an astrolabe. Instead, it resembles a mariner’s compass or a simplified planisphere with a star-shaped pointer and degree markings. The design is more consistent with a 17th–19th century nautical instrument than a medieval astronomical device. Thus, it fails to meet the prompt’s requirement for a factually accurate medieval astrolabe.

z-image-local/z-image-turboPASS

A Japanese kintsugi bowl — cracked ceramic repaired with visible gold lacquer lines

Evaluator

The image accurately depicts a Japanese kintsugi bowl. The ceramic texture, the specific style of crack repair using gold lacquer (kintsugi), and the overall form are realistic and factually correct. There are no architectural or factual inaccuracies. VERDICT: PASS

Hard 33%vs0%

z-image-local/z-image-turboPASS

A detailed diagram of the water cycle showing evaporation from the ocean, condensation forming clouds, precipitation as rain, and collection in rivers flowing back to the sea

Evaluator

The image accurately depicts the water cycle with clear labels for evaporation, condensation, precipitation, and collection. The arrows correctly show the flow of water from the ocean to clouds, rain falling, and rivers returning to the sea. The visual elements are simple but factually correct and recognizable. VERDICT: PASS

z-image-local/z-image-turboFAIL

A cross-section showing tectonic plates at a convergent boundary with subduction, a trench, volcanic arc, and layered mantle

Evaluator

The image depicts a surface-level rift or graben with exposed sedimentary layers, resembling a canyon or fault scarp. It does not show the required geological features: there is no subduction zone, oceanic trench, volcanic arc, or mantle layering. The scene is a terrestrial desert landscape with stratified rock, not a tectonic cross-section of a convergent boundary. Therefore, it fails to meet the prompt’s factual and architectural requirements. VERDICT: FAIL

z-image-local/z-image-turboFAIL

An anatomical illustration of the human heart showing four chambers, valves, aorta, pulmonary arteries, and veins with correct placement

Evaluator

The image is a stylized 3D rendering rather than a traditional anatomical illustration. While the major vessels (aorta, pulmonary artery) and chambers are present, the anatomical accuracy is poor. The coronary arteries are depicted as large, superficial red tubes that do not follow realistic anatomical paths. The text labels on the atria are gibberish ("ARTILLZ", "ARTILLZ"), which is a common AI artifact. The valves are not clearly or correctly detailed. The overall structure is recognizable as

Extreme 100%vs0%

z-image-local/z-image-turboPASS

A Morpho menelaus butterfly with accurate iridescent blue dorsal wings, brown ventral wing pattern with eyespots, and correct body morphology

Evaluator

The image depicts a butterfly with the characteristic iridescent blue dorsal wings and brown ventral patterns of a Morpho menelaus. The body morphology, including antennae and wing venation, is anatomically correct. The eyespots on the hindwings are present and accurately placed. The subject is clearly recognizable as the requested species with no factual or structural inaccuracies. VERDICT: PASS

z-image-local/z-image-turboPASS

The Hagia Sophia interior showing the massive central dome, pendentives, semi-domes, and Islamic calligraphy medallions alongside Byzantine mosaics

Evaluator

The image accurately depicts the interior of the Hagia Sophia, showcasing its massive central dome, pendentives, semi-domes, and Islamic calligraphy medallions alongside Byzantine mosaics. The architectural details are factually correct and recognizable. VERDICT: PASS

z-image-local/z-image-turboPASS

A specimen of bismuth crystal showing accurate iridescent oxide layers, geometric hopper crystal formation, and characteristic staircase structure

Evaluator

The image depicts a bismuth crystal with the characteristic hopper structure and iridescent oxide layers. The geometric staircase formation is accurately represented, and the colors are consistent with real-world specimens. The subject is recognizable and factually accurate. VERDICT: PASS

Professional Studio 85%vs85%

▸Camera & Lighting83%vs92%

Easy 100%vs100%

z-image-local/z-image-turboPASS

A wide-angle shot of a modern living room interior showing expansive space with slight barrel distortion at the edges

Evaluator

The image depicts a modern living room with a wide-angle perspective.

1. **Perspective/Wide-angle:** The shot captures a large area of the room, including the sofa, coffee table, and the large window area, consistent with a wide-angle lens.
2. **Barrel Distortion:** There is a subtle hint of barrel distortion visible at the edges (specifically in the way the ceiling lines and the vertical edges of the window frame slightly curve), which matches the prompt's requirement.
3. **Depth of Field:**

z-image-local/z-image-turboPASS

A close-up macro shot of a single dewdrop on a rose petal, shallow depth of field

Evaluator

The image matches the prompt's requirements:

1. **Perspective/Shot Type:** The image is a close-up macro shot of a single dewdrop on a rose petal, as requested.
2. **Depth of Field:** There is a clear shallow depth of field. The dewdrop and the immediate area of the petal are in focus, while the center of the rose and the background are softly blurred.
3. **Lighting:** The lighting is soft and natural, consistent with a macro photography setting.

The image accurately reflects the specified

z-image-local/z-image-turboPASS

An aerial bird's-eye view looking straight down at a traffic roundabout with cars and trees

Evaluator

The image shows an aerial bird's-eye view looking straight down at a traffic roundabout. The perspective is a direct top-down (nadir) view, which matches the prompt. The lighting is even and diffuse, consistent with an overhead daylight shot, and the depth of field is deep, keeping both the center of the roundabout and the surrounding streets in focus, which is typical for high-altitude aerial photography. VERDICT: PASS

Medium 33%vs100%

z-image-local/z-image-turboFAIL

A portrait lit with classic Rembrandt lighting — triangle of light on the shadowed cheek, moody atmosphere

Evaluator

The image depicts a portrait of a man with long hair. 1. **Lighting Pattern**: The prompt specifies "classic Rembrandt lighting," which is characterized by a distinct triangle of light on the shadowed cheek. In this image, the light is coming from the side, but there is no visible light triangle on the shadowed side of the face. The shadow on the cheek is quite broad and lacks the specific geometric light pattern required for a true Rembrandt lighting style. 2. **Perspective and Depth of Field

z-image-local/z-image-turboFAIL

A landscape photograph bathed in golden hour light — long warm shadows, orange sky, soft glow

Evaluator

The image depicts a landscape of sand dunes under a sky that is a solid, uniform orange/yellow color. 1. **Lighting Pattern:** The prompt specifies "golden hour light — long warm shadows, orange sky, soft glow." While the sky is orange, the lighting on the dunes is very flat. There are no "long warm shadows" or a "soft glow" emanating from a light source; the dunes appear illuminated by a flat, ambient light rather than a directional sun. There is no visible sun or light source to create the s

z-image-local/z-image-turboPASS

A high-key portrait of a person against a bright white background, minimal shadows, airy and luminous

Evaluator

The image features a close-up portrait of a person against a bright, white, and overexposed background. 1. **Lighting Pattern:** The lighting is high-key, bright, and airy. There are minimal shadows on the face, creating a luminous and soft appearance that matches the prompt's description of "high-key" and "minimal shadows." 2. **Perspective:** The perspective is a close-up portrait, focusing on the face and upper neck, which is consistent with a high-key portrait style. 3. **Depth of Field

Hard 100%vs67%

z-image-local/z-image-turboPASS

A portrait shot at 85mm f/1.4 with creamy bokeh background, backlit with warm rim light separating the subject from a blurred garden

Evaluator

The image matches the prompt's specifications: 1. **Perspective/Lens (85mm):** The image shows a classic portrait composition with a tight framing on the subject, consistent with an 85mm focal length. 2. **Depth of Field (f/1.4 with creamy bokeh):** The background is heavily blurred with soft, circular bokeh, indicating a shallow depth of field characteristic of an f/1.4 aperture. The subject is sharp, while the garden background is completely out of focus. 3. **Lighting (Backlit with warm r

z-image-local/z-image-turboPASS

A street photograph with shallow depth of field as if shot at 50mm f/1.8, subject sharp with soft city lights in the background

Evaluator

The image matches the prompt's requirements: 1. **Depth of Field:** The image exhibits a shallow depth of field characteristic of a wide aperture (like f/1.8). The subject (the woman) is sharp and in focus, while the background elements (city lights, cars, and pedestrians) are heavily blurred with smooth bokeh. 2. **Perspective:** The shot is a medium close-up street photograph, consistent with a 50mm lens perspective, which provides a natural field of view without significant distortion. 3.

z-image-local/z-image-turboPASS

A food photograph shot from above at 35mm f/2.8 with natural window light from the left, slight vignette, warm color temperature

Evaluator

To evaluate the image against the prompt: 1. **Perspective (Shot from above):** The image is shot from a high angle, looking down at the pizza, which matches the "shot from above" requirement. 2. **Depth of Field (35mm f/2.8):** The image shows a shallow depth of field. The center of the pizza is in sharp focus, while the edges of the pizza and the background (the window sill and wooden surface) are softly blurred. This is consistent with an f/2.8 aperture setting. 3. **Lighting (Natural win

Extreme 100%vs100%

z-image-local/z-image-turboPASS

A tilt-shift photograph of a city street making real-life look like a miniature model — selective focus strip, saturated colors

Evaluator

The image successfully implements the tilt-shift effect described in the prompt. 1. **Perspective/Tilt-Shift Effect:** The image uses a very shallow depth of field with a distinct "selective focus strip." The top and bottom of the image are heavily blurred (bokeh), which creates the optical illusion that the scene is a miniature model rather than a full-scale city street. 2. **Depth of Field:** The focus is sharply constrained to the middle section of the street (where the white car and black

z-image-local/z-image-turboPASS

A cinematic frame with blue and orange anamorphic lens flare streaking horizontally across a dark moody scene

Evaluator

The image shows a dark, moody scene dominated by a single, prominent horizontal light streak. 1. **Lighting Pattern:** The prompt requested "blue and orange anamorphic lens flare streaking horizontally". The image features a bright horizontal streak that transitions from a cyan/blue hue on the left to an orange/amber hue on the right. This matches the color and directionality described. 2. **Perspective/Depth of Field:** The image is an abstract representation of light, so traditional perspec

z-image-local/z-image-turboPASS

A portrait with dramatic split lighting — half the face brightly lit, the other half in deep shadow, high contrast chiaroscuro

Evaluator

The image shows a portrait of a man with a clear lighting pattern. 1. **Lighting Pattern:** The prompt specifies "dramatic split lighting — half the face brightly lit, the other half in deep shadow, high contrast chiaroscuro." In the image, the light source is positioned to the side, creating a distinct division. The left side of the subject's face (from the viewer's perspective) is brightly illuminated, while the right side is cast in deep shadow. This creates a high-contrast chiaroscuro effe

▸Color Precision100%vs100%

Easy 100%vs100%

z-image-local/z-image-turboPASS

A bright red sports car parked on a gray asphalt road — the red is vivid and unambiguous

Evaluator

The image shows a sports car that is a bright, vivid, and unambiguous red. The color is consistent across the body of the car and matches the description of "bright red" perfectly. The car is parked on a gray asphalt road, which also matches the prompt. VERDICT: PASS

z-image-local/z-image-turboPASS

A woman wearing a royal blue evening dress against a neutral background

Evaluator

The image shows a woman wearing a sleeveless evening dress. The color of the dress is a deep, vibrant blue, which matches the description of "royal blue." The background is a solid, light gray, which qualifies as a "neutral background." The colors in the image perfectly match the prompt's specifications. VERDICT: PASS

z-image-local/z-image-turboPASS

A field of pure sunflower yellow flowers under a clear blue sky

Evaluator

The image features a large, prominent sunflower in the foreground and several other sunflowers in the background. The petals of the sunflowers are a vibrant, pure sunflower yellow. The background sky is a clear, solid blue. The colors in the image perfectly match the colors specified in the prompt. VERDICT: PASS

Medium 100%vs100%

z-image-local/z-image-turboPASS

A bedroom wall painted in #FF6B35 (burnt orange) with white furniture and natural daylight

Evaluator

The image shows a bedroom with a wall painted in a vibrant, saturated orange color. The hex code #FF6B35 represents a bright, slightly reddish-orange (often described as a vivid burnt orange or coral-orange). The color in the image is a very close match to this description and hex value. The furniture (the sideboard/cabinet) is white, and the lighting appears to be natural daylight coming from the window on the right. VERDICT: PASS
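To make the evaluator's reading of the hex code concrete, the value can be decoded into RGB and then into hue, saturation, and value with Python's standard `colorsys` module. This is only a sketch of how one might describe a hex color numerically; `hex_to_rgb` and `describe_hex` are hypothetical helper names, and nothing here reflects how the evaluator actually judged the image.

```python
import colorsys


def hex_to_rgb(code):
    """Convert '#RRGGBB' into an (r, g, b) tuple of 0-255 integers."""
    code = code.lstrip("#")
    if len(code) != 6:
        raise ValueError("expected a 6-digit hex color")
    return tuple(int(code[i:i + 2], 16) for i in (0, 2, 4))


def describe_hex(code):
    """Return RGB plus hue (degrees), saturation, and value for a hex color."""
    r, g, b = hex_to_rgb(code)
    h, s, v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
    return {
        "rgb": (r, g, b),
        "hue_deg": round(h * 360, 1),
        "saturation": round(s, 2),
        "value": round(v, 2),
    }


print(describe_hex("#FF6B35"))
```

For #FF6B35 the hue comes out at roughly 16 degrees, on the red side of orange, at full brightness, which agrees with the description of a bright, slightly reddish orange.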

z-image-local/z-image-turboPASS

A forest scene where the foliage is precisely #2D5F2D (dark muted green) with brown tree trunks

Evaluator

The image shows a dense forest scene with a focus on green foliage and brown tree trunks.

1. **Foliage Color:** The leaves are a dark, muted green. The color appears to be a deep, desaturated forest green, which aligns well with the description of `#2D5F2D` (a dark, muted green).
2. **Trunk Color:** The tree trunks are clearly brown, as specified.

The colors in the image accurately reflect the specific hex-based color description provided in the prompt. VERDICT: PASS

z-image-local/z-image-turboPASS

A room with walls in #4A90D9 (medium blue) with warm wooden floor and white trim

Evaluator

The image shows a room with walls that are a medium blue color. The hex code #4A90D9 represents a medium-light shade of blue, which matches the visual appearance of the walls in the image. The floor is a warm wooden color, and the trim is white, which also aligns with the prompt's description. VERDICT: PASS

Hard 100%vs100%

z-image-local/z-image-turboPASS

A minimalist room with walls in #F5E6CC (warm cream), a sofa in #2C3E50 (dark navy), and accent pillows in #E74C3C (bright red)

Evaluator

The image matches the color specifications provided in the prompt:

1. **Walls (#F5E6CC - warm cream):** The walls are a light, warm cream/off-white color, consistent with the hex code.
2. **Sofa (#2C3E50 - dark navy):** The sofa is a deep, dark navy blue, matching the specified color.
3. **Accent Pillows (#E74C3C - bright red):** The pillows are a vibrant, bright red, consistent with the hex code.

All three primary elements match the requested colors and hex values. VERDICT: PASS

z-image-local/z-image-turboPASS

A product flat-lay on a #1A1A2E (deep dark blue) background, with a #F0C27F (soft gold) watch and #FFFFFF (pure white) earbuds

Evaluator

The image contains the following colors: 1. **Background:** The background is a deep, saturated blue. While it is a dark blue, it appears slightly more vibrant/lighter than the specific deep dark blue `#1A1A2E` (which is a very dark, almost navy/black-leaning blue). However, it is a close match to the description. 2. **Watch:** The watch strap and casing are a soft gold/pale peach color, which aligns well with the `#F0C27F` (soft gold) description. 3. **Earbuds:** The earbuds are a pure white

z-image-local/z-image-turboPASS

A graphic design composition with background #FAFAFA (near-white), main shape #FF4500 (orange-red), and accent lines #333333 (dark charcoal)

Evaluator

The image contains a large, stylized letter "B" with a specific color scheme. 1. **Background:** The background is white. The prompt specified `#FAFAFA` (a very light off-white/near-white). The background in the image appears to be pure white or very close to it, which aligns with the intent of `#FAFAFA`. 2. **Main Shape:** The main shape is the letter "B", which is a vibrant orange-red. The prompt specified `#FF4500`. `#FF4500` is a bright orange-red (Orange Red), which matches the color of

Extreme 100%vs100%

z-image-local/z-image-turboPASS

A smooth gradient on a wall going from #FF0000 (red) on the left to #0000FF (blue) on the right, transitioning evenly through purple

Evaluator

The image shows a smooth gradient transitioning from a bright red on the left to a bright blue on the right. The midpoint of the gradient is a purple/magenta hue, which aligns with the transition from #FF0000 (red) to #0000FF (blue). The colors are consistent with the hex values and the description provided in the prompt. VERDICT: PASS
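Whether an even red-to-blue blend really passes through purple can be checked with straight per-channel interpolation. The sketch below assumes a linear blend of 8-bit sRGB values, which is one common way to build such a gradient but not necessarily what the image generator or the evaluator used; `lerp_hex` is a hypothetical helper.

```python
def lerp_hex(start, end, t):
    """Linearly blend two '#RRGGBB' colors; t=0 gives start, t=1 gives end."""
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must be between 0 and 1")
    a = [int(start[i:i + 2], 16) for i in (1, 3, 5)]
    b = [int(end[i:i + 2], 16) for i in (1, 3, 5)]
    # Adding 0.5 before truncating rounds each channel half-up.
    mixed = [int(x + (y - x) * t + 0.5) for x, y in zip(a, b)]
    return "#{:02X}{:02X}{:02X}".format(*mixed)


# Sample the gradient at five evenly spaced positions, left to right.
for t in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(t, lerp_hex("#FF0000", "#0000FF", t))
```

Under this assumption the midpoint is #800080, the CSS named color `purple`, so a purple/magenta band in the middle of the image is what an even transition would produce.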

z-image-local/z-image-turboPASS

A sunset sky gradient on a flat surface from #FF6B35 (orange) at the bottom to #1B1464 (deep indigo) at the top

Evaluator

The image shows a sunset sky with a gradient. 1. **Bottom Color:** The bottom of the image features a bright orange/yellow glow near the horizon. The color is a vibrant orange, which aligns well with the requested `#FF6B35` (a bright orange). 2. **Top Color:** The top of the image is a deep blue. While it is a dark blue, it appears more like a standard sky blue/deep blue rather than the specific deep indigo/navy shade of `#1B1464`. However, the general transition from a bright orange at the bo

z-image-local/z-image-turboPASS

A horizontal banner with a gradient from #00C9FF (cyan) on the left to #92FE9D (mint green) on the right

Evaluator

The image shows a horizontal banner with a smooth gradient.

1. **Left side color:** The color on the far left is a bright cyan/light blue. This aligns well with the hex code **#00C9FF** (a vibrant cyan).
2. **Right side color:** The color on the far right is a light, pale green. This aligns well with the hex code **#92FE9D** (a light mint green).
3. **Gradient transition:** The transition from the cyan on the left to the mint green on the right is smooth and follows the specified direction.

▸Photorealism33%vs0%

Easy 33%vs0%

z-image-local/z-image-turboFAIL

A single red sneaker on a clean white background with soft studio lighting and gentle shadow — product photography style

Evaluator

The image depicts a single red sneaker on a white background.

**Analysis:**

1. **Subject Matter:** The sneaker is a solid red color with a suede-like texture.
2. **Lighting and Shadow:** The lighting is soft, but the shadow underneath the shoe is extremely minimal and lacks the natural-looking diffusion or contact shadows expected in a high-quality studio product shot. The shadow looks somewhat "pasted on" or overly simplified.
3. **AI Artifacts:**
   * **Laces/Eyelets:** The eyelets an

z-image-local/z-image-turboFAIL

A clear glass perfume bottle on a reflective black surface with studio lighting, no background distractions

Evaluator

The image depicts a clear glass perfume bottle on a reflective black surface against a dark, neutral background. **Analysis:** 1. **Subject and Composition:** The bottle is centered and follows the prompt's instructions for a studio-quality shot. 2. **Reflections and Transparency:** While the glass and liquid look relatively realistic, there is a significant AI artifact regarding the internal structure. The "straw" or dip tube inside the bottle is extremely thin and appears to bend or termin

z-image-local/z-image-turboPASS

A ripe green apple on a seamless white background with soft side lighting and subtle shadow — catalog style

Evaluator

The image depicts a single green apple on a seamless white background. **Analysis:** 1. **Subject & Lighting:** The apple is a vibrant green with realistic water droplets on the skin. The lighting is soft and consistent with a studio product shot, creating a gentle shadow at the base. 2. **Realism:** At first glance, the image is highly convincing. The texture of the skin, the translucency of the water droplets, and the way the light interacts with the surface appear very natural. 3. **AI A

Graphical design 54%vs71%

▸Data Visualisation0%vs0%

Easy 0%vs0%

z-image-local/z-image-turboFAIL

A bar chart showing monthly sales for January through June with values 12K, 18K, 15K, 22K, 19K, 25K — labeled axes, color-coded bars, and a title 'Monthly Sales 2025'

Evaluator

The chart is missing the required monthly labels for January through June. Only “June” is labeled on the x-axis, and the other bars are labeled with values (12K, 18K, 15K, 19K) which are not months. The y-axis is labeled with “50” multiple times, which is incorrect — it should show a consistent scale (e.g., 0 to 50K or 0 to 25K). The values on the bars (18K, 15K, 22K, 19K, 25K) do not match the requested sequence (12K, 18K, 15K, 22K, 19K, 25K) — the first bar is labeled 18K instead of 12K. The c

z-image-local/z-image-turboFAIL

A pie chart showing market share: Company A 35%, Company B 25%, Company C 20%, Others 20% — each slice a different color with labels and percentages

Evaluator

The chart is a pie chart as requested, with four distinct slices, each a different color. The percentages and labels are as follows:
- Company A: 35% (yellow slice, labeled correctly)
- Company B: 25% (green slice, labeled correctly)
- Others: 20% (blue slice, labeled correctly)
- The remaining 20% (orange slice) is not labeled with a company name, but since the prompt specified “Others 20%”, and the orange slice is the only one not explicitly labeled as Company A, B, or Others, it is implied t

z-image-local/z-image-turboFAIL

A simple line graph showing daily website visitors over one week (Mon-Sun): 1200, 1500, 1100, 1800, 2000, 1700, 1400 — with grid lines, axis labels, and title 'Weekly Traffic'

Evaluator

The chart is a line graph titled “Weekly Traffic,” which matches the requested chart type. However, the axis labels and data values are severely misaligned and incorrect.
- The x-axis is labeled “Mon-Sun,” which is correct for the days of the week, but the values (1200, 1100, 2000, 1700, 1400) are placed on the x-axis, which should represent days, not visitor counts.
- The y-axis is labeled “Mon-Sun,” which is incorrect; it should represent visitor counts (numerical values), not days.
- The nu

▸ Layout & Design 33% vs 78%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

A movie night poster with a large popcorn image in the center, the title 'MOVIE NIGHT' in bold at the top, and the tagline 'Every Friday at 8 PM' at the bottom

Evaluator

The image includes: - A large popcorn bucket in the center — ✅ present. - The title "MOVIE NIGHT" in bold at the top — ✅ present and correctly placed. - The tagline "Every Friday at 8 PM" at the bottom — ✅ present and correctly placed. - The visual hierarchy is clear: title at top, popcorn in center, tagline at bottom — ✅ correct. - All text is readable — ✅ yes. However, the tagline text reads “Every Friday at 8 PM” — which matches the prompt — ✅ correct. VERDICT: PASS

z-image-local/z-image-turbo PASS

An event poster featuring a guitar illustration, the headline 'LIVE MUSIC FEST' in large text, and 'Tickets available now' as a tagline below

Evaluator

The poster includes: - A guitar illustration (integrated into the lettering, forming the neck and body of the guitar with the letters “LIVE MUSIC FEST”). - The headline “LIVE MUSIC FEST” in large, bold text — correctly placed and visually dominant. - The tagline “Tickets available now” below the main headline — readable and correctly positioned. The visual hierarchy is clear: the headline is largest and most prominent, the tagline is smaller and below, and the guitar illustration is integrated

z-image-local/z-image-turbo PASS

A book club poster with an open book illustration, the title 'JOIN THE BOOK CLUB' prominently displayed, and 'Meets every Wednesday' as the tagline

Evaluator

All described layout zones/elements are present:
- An open book illustration is clearly visible as the central graphic.
- The title “JOIN THE BOOK CLUB” is prominently displayed in large, bold, centered text across the center of the open book.
- The tagline “Meets every Wednesday” is correctly placed below the title, in smaller font size, maintaining proper visual hierarchy.
All text is readable and correctly positioned according to the prompt. VERDICT: PASS

Medium 0% vs 67%

z-image-local/z-image-turbo FAIL

A landing page design with a hero header image at the top, a headline and subtitle in the middle, a 'Sign Up' call-to-action button, and a footer with three social media icons

Evaluator

The image displays a landing page layout that includes:
- A hero header image at the top (mountain landscape with a circular profile photo overlay).
- A headline and subtitle in the middle (though the text is gibberish and unreadable).
- A “Sign Up” call-to-action button centered below the text.
- A footer at the bottom with three social media icons (envelope, play, magnifying glass).
All described layout zones/elements are visually present. The visual hierarchy is correct: hero image → headli

z-image-local/z-image-turbo FAIL

A newsletter layout with a banner image header, a two-column body section with text on the left and an image on the right, a highlighted quote block, and a footer with unsubscribe link

Evaluator

The image shows a newsletter layout that attempts to follow the described structure, but fails on multiple critical points: 1. **Banner Image Header**: Present — a header with the title "Nawselter" over a background image. 2. **Two-Column Body Section**: Present — text on the left and an image of a woman on the right. 3. **Highlighted Quote Block**: Present — a beige box with a quote, though the text is gibberish. 4. **Footer with Unsubscribe Link**: Present — a black bar at the bottom with

z-image-local/z-image-turbo FAIL

A restaurant menu layout with a logo header, three category sections (appetizers, mains, desserts) each with items and prices, and a footer with address and hours

Evaluator

The image shows a menu layout with a logo header (“MÁNU”), three category sections (Appetizers, mains, Mains, and Desseries), and a footer with text (“Restelade cetralienc” and “$0 - 200”). However, the category names are inconsistent — “mains” is repeated twice (once in lowercase, once in uppercase), and “Desseries” is misspelled. The text in the footer is gibberish and does not resemble a real address or hours. Additionally, many item names are nonsensical or made-up, and prices are either $0

Hard 0% vs 67%

z-image-local/z-image-turbo FAIL

A magazine double-page spread with a left page containing a large hero photo, and a right page with a two-column article, a pull quote, a sidebar with author bio, and a page number

Evaluator

The image shows a magazine double-page spread. The left page contains a large hero photo of a woman, which matches the prompt. The right page contains a two-column article layout, a pull quote (the text “A Doolr hetere Hoting cocour” is prominently displayed as a headline, which functions as a pull quote), and a sidebar with a photo and author bio (“Ooodr Jarsgehe” with a subtitle). The page numbers (139 and 140) are visible at the bottom of the respective pages. However, the text in the articl

z-image-local/z-image-turbo FAIL

A mobile app UI mockup showing a dashboard with a top navigation bar, a search field, a 2x2 grid of feature cards with icons, a list of recent activity items, and a bottom tab bar

Evaluator

The image shows a mobile app UI mockup that includes: - A top navigation bar with status indicators (time, signal, battery) — present. - A search field with placeholder text “Reoure” — present. - A 2x2 grid of feature cards with icons — present (green bookmark, red play, yellow heart, blue chat). - A list of recent activity items — present (with profile pictures and placeholder text). - A bottom tab bar with 5 icons and Chinese labels — present. However, the text in the search field and recent

z-image-local/z-image-turbo FAIL

A fashion magazine spread with a full-bleed left-page photo, right page with three-column layout, headline, body copy, a smaller inset image, and styled page numbers

Evaluator

The image shows a fashion magazine spread with a clear left-page full-bleed photo and a right-page three-column layout, including a headline, body copy, a smaller inset image, and styled page numbers (19 and 20). The visual hierarchy is generally correct: the large photo dominates the left page, while the right page uses columnar text blocks with a prominent headline and a smaller inset image. The page numbers are styled and placed appropriately at the bottom corners. However, the text is not re
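
The Layout & Design figure for z-image-local/z-image-turbo can be reproduced from the nine verdicts listed above. The following is a minimal Python sketch, assuming every prompt is weighted equally and percentages are rounded to the nearest whole number; the dashboard's actual aggregation is not shown, and `pass_rate` is a hypothetical helper introduced only for illustration.

```python
# Verdicts for z-image-local/z-image-turbo in Layout & Design, copied from
# the entries above (three prompts per difficulty tier).
verdicts = {
    "Easy": ["PASS", "PASS", "PASS"],
    "Medium": ["FAIL", "FAIL", "FAIL"],
    "Hard": ["FAIL", "FAIL", "FAIL"],
}


def pass_rate(results):
    """Share of PASS verdicts as a whole-number percentage (assumed rounding)."""
    return round(100 * sum(r == "PASS" for r in results) / len(results))


# Per-tier rates, then the category rate over all nine prompts pooled together.
tier_rates = {tier: pass_rate(results) for tier, results in verdicts.items()}
category_rate = pass_rate([r for results in verdicts.values() for r in results])

print(tier_rates)
print(category_rate)
```

Under these assumptions the tier rates come out as 100, 0, and 0, and the pooled rate as 33, which agrees with the tier and category headings shown for this model.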

▸ Style Diversity 83% vs 83%

Easy 100% vs 100%

z-image-local/z-image-turbo PASS

A golden retriever sitting in a garden, painted in the style of an oil painting with visible brushstrokes and rich texture

Evaluator

The image depicts a golden retriever sitting in a garden setting. The style is clearly that of an oil painting, with visible, textured brushstrokes applied in a manner consistent with traditional oil painting techniques. The fur of the dog and the foliage in the background are rendered with thick, expressive strokes, and the color palette is rich and layered, contributing to a textured, painterly effect. The composition and rendering align well with the requested style. VERDICT: PASS

z-image-local/z-image-turbo PASS

A golden retriever sitting in a garden, rendered in Japanese anime style with flat colors and large expressive eyes

Evaluator

The image depicts a golden retriever sitting in a garden, which matches the subject matter of the prompt. The art style is clearly Japanese anime-inspired: the dog has large, expressive, round eyes, simplified facial features, and a soft, stylized rendering. The color palette is flat with minimal shading, consistent with the requested “flat colors” aesthetic. The background foliage is also rendered in a similarly simplified, illustrative style typical of anime. However, there is a notable absen

z-image-local/z-image-turbo PASS

A golden retriever sitting in a garden, rendered in pixel art style with a 32x32 grid and limited color palette

Evaluator

The image depicts a golden retriever sitting in a garden, rendered in a pixel art style. The visual characteristics of pixel art are clearly present: the image is composed of distinct, blocky pixels, and the forms are defined by color blocks rather than smooth gradients. The scene is simplified and stylized, consistent with low-resolution pixel art. While the exact 32x32 grid size cannot be verified visually without metadata, the overall aesthetic and blocky construction strongly suggest it adhe

Medium 100% vs 100%

z-image-local/z-image-turbo PASS

A city street scene in the style of Art Deco — geometric shapes, gold and black palette, ornamental symmetry

Evaluator

The image displays a building facade that strongly embodies the Art Deco style as requested. Key characteristics are clearly present: - **Geometric shapes**: The facade is dominated by sharp lines, stepped forms, and symmetrical patterns — including zigzags, chevrons, and vertical and horizontal bands — all hallmarks of Art Deco. - **Gold and black palette**: The design uses a striking contrast between polished gold-colored metalwork and dark black marble, fulfilling the color requirement preci

z-image-local/z-image-turbo PASS

A woman with a parasol in the style of Impressionism — soft brushstrokes, dappled light, pastel palette

Evaluator

The image clearly depicts a woman holding a parasol, set in an outdoor, garden-like environment. The visual style is unmistakably Impressionist: - Soft, visible brushstrokes are evident throughout, especially in the foliage and background. - Dappled light is present, with patches of color suggesting sunlight filtering through leaves. - The palette is predominantly pastel — soft pinks, blues, greens, and yellows — with gentle transitions and no harsh lines. - The focus is on capturing the fleet

z-image-local/z-image-turbo PASS

A portrait of a man in Pop Art style — bold outlines, Ben-Day dots, flat vivid primary colors

Evaluator

The image clearly embodies the requested Pop Art style. It features: - Bold, high-contrast outlines defining facial features and hair. - Ben-Day dots (halftone patterns) visible in the skin tones, hair, and background, especially in shaded areas. - Flat, vivid primary colors: the background is a saturated red, the skin is rendered in yellow, and shadows use deep blue/black — classic Pop Art color choices. All key visual characteristics from the prompt are present and unmistakable. VERDICT: PA

Hard 67% vs 33%

z-image-local/z-image-turbo PASS

A forest rendered in photorealistic watercolor — the detail and lighting of a photo combined with visible watercolor bleeds and paper texture

Evaluator

The image depicts a forest scene rendered in a watercolor style, with visible paper texture and watercolor bleeds — particularly in the soft, diffused edges of foliage and the way colors bleed into one another. The lighting is atmospheric and somewhat photorealistic in its depth and shadowing, especially on the tree trunks and undergrowth, giving a sense of volume and natural illumination. However, the overall aesthetic is painterly and impressionistic rather than photorealistic in the strictest

z-image-local/z-image-turbo PASS

A woman in a futuristic city rendered in cyberpunk Art Nouveau — neon colors and tech elements combined with flowing organic lines and floral borders

Evaluator

The image successfully embodies the requested “cyberpunk Art Nouveau” style. Key visual characteristics are clearly present: - **Neon colors**: The scene is saturated with vibrant neon hues — pinks, teals, purples, and golds — especially in the cityscape and the woman’s iridescent outfit. - **Tech elements**: The futuristic city features glowing skyscrapers, circuit-like lines, and a flying saucer, establishing a cyberpunk tech aesthetic. - **Flowing organic lines**: The Art Nouveau influence i

z-image-local/z-image-turbo FAIL

A medieval castle depicted in Japanese ukiyo-e woodblock print style combined with photorealistic sky and lighting

Evaluator

The image depicts a Japanese castle (specifically, a traditional Japanese castle like Osaka Castle or a similar structure), which is architecturally accurate for Japan, not a medieval European castle. The prompt requested a “medieval castle” depicted in ukiyo-e style — a mismatch in subject matter. The ukiyo-e style is characterized by flat planes, bold outlines, stylized perspective, and often woodblock print textures — none of which are present here. Instead, the image is rendered in photoreal

Extreme 67% vs 100%

z-image-local/z-image-turbo PASS

An interior scene rendered in soft brutalism — raw concrete textures combined with rounded pastel furniture and warm diffused light

Evaluator

The image presents an interior scene dominated by raw, unpolished concrete surfaces — including walls, ceiling, and floor — which strongly evokes the aesthetic of brutalism. The furniture, a large curved sectional sofa and a matching armchair, are upholstered in a soft, rounded, pastel pink fabric, aligning with the requested “rounded pastel furniture.” The lighting is warm and diffused, with natural light entering from a window and a soft pendant lamp casting gentle illumination, contributing t

z-image-local/z-image-turbo FAIL

A still life painted in anxious minimalism — sparse composition with uncomfortable negative space, muted colors, and subtle visual tension

Evaluator

The requested style — “anxious minimalism” with sparse composition, uncomfortable negative space, muted colors, and subtle visual tension — is not clearly recognizable in this image. The image is a nearly uniform gray field with no discernible objects, composition, or visual tension. While it does feature muted color (gray), it lacks the “sparse composition” and “uncomfortable negative space” because there is no subject or arrangement to create spatial dynamics. The absence of any visual element

z-image-local/z-image-turbo PASS

A cityscape rendered in nostalgic futurism — retro-futuristic 1960s space-age aesthetics with a melancholic warm-toned patina

Evaluator

The image successfully evokes the requested “nostalgic futurism — retro-futuristic 1960s space-age aesthetics with a melancholic warm-toned patina.” Key visual characteristics are present: - **Retro-futurism**: The architecture features sleek, stylized skyscrapers with sharp lines and a sense of optimism about the future — reminiscent of 1960s sci-fi visions (e.g., the pointed spire building). The cars are classic 1960s American models with chrome details and rounded, optimistic forms. - **Warm