vs
192 evaluations across 6 categories

Prompt: The word 'CHAPTER ONE' typed on aged paper with a vintage typewriter font, complete with slightly uneven ink
Text Rendering 73%vs67%
▸Typography Style100%vs100%
z-image-local/z-image-turboPASSEvaluator
The text “CHAPTER ONE” is rendered in a classic typewriter font, with slightly uneven, textured ink strokes that convincingly simulate the look of ink hitting aged paper. The font style, spacing, and character shapes are consistent with vintage typewriter output. The paper has a yellowed, aged appearance, and the visible typewriter carriage and keys reinforce the medium. The spelling is correct. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The text “TODAY'S SPECIAL” is rendered in a style that convincingly mimics white chalk on a dark green chalkboard. The lettering shows the characteristic uneven, textured strokes and slight smudges of chalk writing, with visible chalk dust residue around the letters and along the wooden frame. The apostrophe in “TODAY'S” is correctly placed, and the spelling is accurate. The underline beneath “SPECIAL” also matches the chalkboard aesthetic. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The text "OPEN" is spelled correctly. The style of the text matches the described medium: it is rendered in glowing, pixel-style letters composed of individual illuminated dots, consistent with a red LED sign. The red glow and pixelated construction convincingly simulate an LED display. VERDICT: PASS
▸Writing accuracy67%vs58%
z-image-local/z-image-turboPASSEvaluator
The word displayed is "HELLO", which is spelled correctly and fully readable. All five letters are present and clearly formed: H, E, L, L, O. There are no misspellings, missing letters, or illegible parts. The text is centered on a white background as intended. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The word "ADVENTURE" is spelled correctly and is fully readable. All letters are clearly formed and legible against the plain cream background. There are no misspellings, missing words, or illegible parts. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The word "SERENITY" is spelled correctly and is fully readable. All letters are clearly visible and properly formed. There are no misspellings, missing words, or illegible parts. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
All words are spelled correctly and fully readable: - "The" - "quick" - "brown" - "fox" - "jumps" - "high" There are no misspellings, missing words, or illegible parts. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
All words in the text are spelled correctly and fully readable. The text reads: "Every sunset brings a new dawn ahead", which matches the intended phrase exactly. There are no misspellings, missing words, or illegible parts. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The intended text is: "Stars shine brightest in the dark night". In the image: - "brighhest" is misspelled (should be "brightest"). - The third line shows distorted, unreadable characters: "7 n n th ? -" — this is not legible and does not match "the". - The fourth line "dark night" is correct. There are misspellings and illegible parts. The text is not fully correct or readable. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The text in the image is: - Headline: "SUMMER SALE" — correctly spelled and fully readable. - Body text: "Up to 50% items this weekend only" — correctly spelled and fully readable. - Fine print: "Terms and conditions apply. See store for details." — correctly spelled and fully readable. All words are spelled correctly and are fully legible. There are no misspellings, missing words, or illegible parts. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The text in the image is: - Main title: "GRAND OPENING" — correctly spelled and fully readable. - Subtitle: "Join us for food, music, and fun" — correctly spelled and fully readable. - Disclaimer: "RSVP required. Limited seeting available." — contains a misspelling: "seeting" should be "seating". The word "seeting" is clearly visible and incorrectly spelled. This is a critical error that affects the accuracy of the text. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
All text in the image matches the intended text exactly. - Headline: "ANNUAL GALA 2025" — correctly spelled and fully readable in large serif font. - Description: "An evening of fine dining and live jazz" — correctly spelled and fully readable in medium text. - Footnote: "Black tie optional. Doors open at 6 PM." — correctly spelled and fully readable in small text. No misspellings, missing words, or illegible parts are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
All text in the image matches the intended text exactly. - Title: "THE LAST VOYAGE" — correct. - Tagline: "One ship. One chance. No turning back." — correct. - Credits block: - "Directed by James Harlow." — correct. - "Starring Elena Voss, Marcus Chen, and Sofia Reyes." — correct. - "Music by Thomas Grant." — correct. - "A Paramount Pictures Release." — correct. - "Coming Summer 2025." — correct. No misspellings, missing words, or illegible parts are present. The text is fully readab
z-image-local/z-image-turboFAILEvaluator
The text on the magazine cover is mostly correct, but there are two critical errors: 1. The phrase “Top 10 Trails You Must Hike This Year” is correctly placed in the upper right, but the intended text “Top 10 Trails You Must Hike This Year” is also present in the lower-left section — however, in that lower-left section, the text reads “Cambodia Ob of This Year”, which is a clear misspelling and truncation. “Ob” is not a word; it should be “Top”. 2. The barcode area at the bottom right reads “I
z-image-local/z-image-turboFAILEvaluator
The text on the label is mostly correct but contains one error. - "Pure wildflower harvested from" → Correct. - "form sustainable apiaries" → "form" is a misspelling; it should be "from". - "in the Pacific Northwest." → Correct. - "No additives or preservatives." → Correct. - "Best before: 06/2026. Store in cool, dry place. Product of USA." → Correct. The word "form" is misspelled as "form" instead of "from". VERDICT: FAIL
Spatial Reasoning 65%vs90%
▸Attributes Binding89%vs100%
z-image-local/z-image-turboPASSEvaluator
The image shows a red sphere (ball) on the left and a blue cube on the right, sitting side by side on a white background. All attributes (color, shape, arrangement) are correctly bound to the respective objects. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image displays a green apple in the foreground and an orange pumpkin in the background, both resting on a light wooden cutting board. The colors and object identities match the prompt perfectly with no attribute swaps. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image contains two main vehicles. 1. **Yellow Taxi:** In the foreground, there is a yellow sedan with a taxi roof sign. 2. **Black Limousine:** In the background, there is a long, black vehicle with extended windows, characteristic of a limousine. The attributes are correctly bound: the taxi is yellow, and the limousine is black. They are positioned next to each other on a street. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image contains a large red circle acting as a border, a small blue triangle positioned above a medium green star, all on a white background. All attributes (color, size, shape) are correctly bound to their respective objects as requested. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image contains three candles on a shelf matching the description perfectly: 1. **Left:** A short, fat, red pillar candle. 2. **Center:** A tall, thin, white taper candle. 3. **Right:** A medium-height, spiral-shaped yellow candle. All attributes (color, size, shape) are correctly bound to the specific objects with no swaps. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image contains all the requested elements with the correct attributes bound to the correct objects. - **Horse:** Big, brown, standing. (Correct) - **Rabbit:** Small, white. (Correct) - **Cat:** Black, medium-sized (standard cat size). (Correct) - **Setting:** Meadow with grass and flowers. (Correct) VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image contains a pink fire truck in the center, a large blue banana in the foreground on the left, and a stylized green sun in the sky. All requested attributes are correctly bound to their respective objects. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image contains a purple carrot with green leaves and a red tree-like plant with red leaves, which matches the first two parts of the prompt. However, the third object is a standard red/pink watermelon slice, not the requested "white watermelon." The color attribute for the watermelon is incorrect. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The image contains an orange elephant, a yellow and black penguin, and a metallic silver strawberry. All attributes (color, object type) are correctly bound to the respective objects as requested in the prompt. VERDICT: PASS
▸Compositionality78%vs100%
z-image-local/z-image-turboPASSEvaluator
- **Tabby cat:** Present. The image clearly shows a cat with distinct tabby markings (stripes on the forehead and body). - **Oversized red sunglasses:** Present. The cat is wearing large sunglasses with bright red frames that dominate the face. - **Coherent combination:** Yes. The glasses sit naturally on the cat's face, and the lighting/texture is consistent. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image contains all the requested elements: * **Golden retriever:** Present (the dog is clearly a Golden Retriever). * **Wearing:** Present (the hat is on the dog's head). * **Tiny cowboy hat:** Present (the hat is a cowboy style and sized appropriately for the dog). The elements are coherently combined into a single, realistic-looking scene. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
1. Penguin: Present (standing on rocky shore). 2. Colorful umbrella: Present (rainbow panels visible). 3. Holding: Present (handle positioned centrally against the chest, visually implying the penguin is supporting it). All described elements are visible and coherently combined into a single scene. The image successfully depicts the whimsical concept requested without major artifacts or missing components. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image successfully depicts all the requested elements: * **Medieval knight:** Present (the figure is clearly a knight). * **Full armor:** Present (helmet, breastplate, gauntlets, chainmail are visible). * **Sitting at a desk:** Present (the figure is seated at a wooden table). * **Typing on a modern laptop:** Present (the gauntlets are positioned on the keyboard of a sleek, modern laptop). The anachronistic combination is executed coherently with realistic lighting and texture. V
z-image-local/z-image-turboPASSEvaluator
- **Roman gladiator:** Present (The man is wearing detailed Roman armor). - **Taking a selfie:** Present (He is holding a phone up to capture his own image). - **Smartphone:** Present (A modern smartphone is clearly visible in his hand). - **Colosseum:** Present (The iconic structure is in the background). All elements are present and combined coherently to fulfill the prompt's unusual concept. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
- **Astronaut:** Present. The figure is clearly dressed as an astronaut. - **Spacesuit:** Present. The white suit, helmet, and life-support backpack are visible. - **Riding a bicycle:** Present. The astronaut is seated on a red bicycle, holding the handlebars and pedaling. - **Through a park:** Present. The background features trees, grass, and a paved path typical of a park setting. All described elements are present and combined coherently. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image contains most elements but is missing the monocle. - **Octopus:** Present. - **Top hat:** Present. - **Monocle:** Missing (The octopus has large eyes, but neither is wearing a monocle). - **Playing chess:** Present. - **Surface of the moon:** Present. - **Earth visible:** Present. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
- **Dinosaur:** Present (Green T-Rex character). - **Business suit:** Present (Black suit jacket, white shirt, tie). - **Giving a PowerPoint presentation:** Present (Gesturing at a large screen with slide-like text). - **Modern office:** Present (Office chairs, desk, laptop, large monitor, windows). - **Drinking coffee:** Present (Holding a coffee cup). All elements are clearly visible and combined coherently into a single scene. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image contains a robot pirate in a paper boat on a lava-themed background. However, the prompt specifies "juggling three kittens," while the image clearly shows **four** kittens (two held in the air, two sitting on the boat). Additionally, the action is holding/sitting rather than juggling. * Robot dressed as a pirate: Present * Paper boat: Present * Lake of lava: Present * Juggling three kittens: Fail (There are four kittens, and they are not being juggled). VERDICT: FAIL
▸Counting33%vs56%
z-image-local/z-image-turboFAILEvaluator
I see 4 red apples arranged on a wooden table. The prompt requested exactly 3 apples. Since the count is 4 instead of 3, the request is not met. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
I count exactly 5 yellow rubber ducks arranged in a row on the white shelf. The count and description match the prompt perfectly. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
I see 5 white candles standing on a dark surface. The prompt requested exactly 4 candles. Since the actual count (5) does not match the requested count (4), the image fails the specific constraint. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
I see exactly 2 orange cats and 3 black dogs sitting on a grassy lawn. The counts match the requested description perfectly. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image contains 7 red roses and 3 white daisies, not the requested 4 red roses and 2 white daisies. While the vase and table are present, the specific flower counts are incorrect. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
I see exactly 1 blue butterfly positioned at the top center and 3 yellow butterflies positioned around it (left, bottom, and right). All are resting on a wooden garden fence. The counts match the prompt's request perfectly. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
I count 8 goldfish in the image. 1. Top left 2. Top center 3. Top right 4. Center 5. Far right 6. Middle left 7. Bottom right 8. Bottom left The prompt requested exactly 9 goldfish. Since there are only 8, the count does not match. VERDICT: FAIL
z-image-local/z-image-turboFAILEvaluator
The image contains: - **Bananas:** 3 (visible stems and bodies match). - **Apples:** 2 (one top right, one bottom right). - **Oranges:** 5. There is one top-left (behind bananas), one top-right, one center (below top-right), one bottom-left, and a fifth one partially visible behind the top-right orange and the top apple. The prompt requested exactly 4 oranges, but there are 5. VERDICT: FAIL
z-image-local/z-image-turboFAILEvaluator
I count 9 balloons in the image: one yellow, one dark blue, one pink, one purple, one white, one green, one light blue, one orange, and one pale blue. The prompt requested exactly 11 balloons. Since the actual count (9) does not match the requested count (11), the image fails the specific constraint. VERDICT: FAIL
▸Negation44%vs89%
z-image-local/z-image-turboPASSEvaluator
The image depicts a serene tropical beach with white sand, palm trees, and turquoise water. I have scanned the entire scene, including the foreground sand, the water, and the distant shoreline near the vegetation. There are no visible human figures, swimmers, or people relaxing on the beach. The scene is completely empty of people. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image shows a tabby cat lying on a grey couch. Upon close inspection of the cat's neck, there is a visible light-colored band, which is a collar. Since the prompt explicitly requested the cat be "without a collar," the presence of this accessory violates the negative constraint. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The image shows a classroom interior featuring rows of empty wooden desks and chairs facing a large green chalkboard. A teacher's desk is positioned at the front. The room is completely empty of people; there are absolutely no students visible in the scene. The excluded element is successfully absent. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image displays a sandy beach with palm trees, seashells in the foreground, and a beach umbrella. However, contrary to the exclusion criteria, there is clearly a blue ocean with visible waves in the background stretching to the horizon. The excluded element (water/ocean) is present. VERDICT: FAIL
z-image-local/z-image-turboFAILEvaluator
The image shows a home office setup with a desk, chair, bookshelf, and lamp. However, prominently displayed on the desk is a large computer monitor (screen) along with a keyboard and mouse. Since the prompt explicitly excluded "computers or screens," and a screen is clearly present, the image fails the exclusion criteria. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The image displays a kitchen scene featuring a stovetop with a frying pan, a saucepan, and two larger pots. There are wooden cutting boards visible in the background and on the counter. All cookware either appears empty (the frying pan) or is covered with lids, obscuring any potential contents. There are no visible ingredients, cooked meals, or food items on the counters or cutting boards. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image shows a pizza with a crust, tomato sauce, basil leaves, and toppings that resemble pepperoni or tomatoes. However, the pizza is clearly covered in melted white and yellowish cheese, which has a glossy, oily texture typical of melted mozzarella. Since the prompt explicitly requested "absolutely no cheese" and the image contains a significant amount of it, the constraint is violated. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The image displays the front view of a silver car body shell. It features the windshield frame, hood, front bumper, and headlight housings. Crucially, looking at the bottom of the vehicle, there are absolutely no wheels or tires visible. The car is presented as a bare body shell without any rolling stock attached. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image displays a close-up of a human face featuring eyes, a nose, and a mouth. However, distinct, dark eyebrows are clearly visible above the eyes. Since the prompt explicitly requested "no eyebrows," the presence of this element violates the exclusion criteria. VERDICT: FAIL
▸Relative Position83%vs100%
z-image-local/z-image-turboPASSEvaluator
The image shows a tabby cat positioned directly above a brown cardboard box. The cat's paws are resting on the upper surface of the box, confirming the "sitting on top of" relationship described in the prompt. The spatial arrangement is accurate. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a shiny red sphere positioned on the floor directly below the seat of a wooden chair, situated between its legs. This perfectly matches the description of a red ball underneath a wooden chair. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image displays a white coffee mug filled with coffee positioned in the foreground. To its right and slightly behind, there is an open book lying flat on a light-colored surface (the desk). Another open book is visible in the background to the left. The spatial arrangement matches the description: the mug is next to the open book, and both are resting on the desk surface. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a bird perched on a branch that extends from a large tree trunk on the right side. In the lower left, slightly out of focus, is a park bench. The spatial arrangement matches the description: the bird is on the branch, the branch comes from the tree, and the tree is situated next to the bench. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a cat sitting directly on top of a beige pillow. This pillow is resting on the seat of a wooden chair. The spatial hierarchy is: Cat (top) -> Pillow (middle) -> Chair (bottom). All described positional relationships are accurate. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a lit candle sitting directly on top of a stack of three books. To the right of the book stack, there is a potted plant with green leaves. The spatial arrangement perfectly matches the description: the candle is on the books, and the books are next to the plant. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image shows six chess pieces, not five. The central piece is a King. Behind it (above) is another King (identifiable by the cross), not a Rook. To the right foreground is a Knight, not "below" the king. To the left foreground is a Bishop. The prompt's description of a rook above the king and a knight below the king is incorrect based on the visual evidence. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The image shows a laptop centered on a wooden desk. To its left is a white coffee mug. To its right is a smartphone. An open notebook is positioned at the bottom of the frame, below the laptop. A desk lamp is situated in the upper right area, casting light over the scene. All described positional relationships (center, left, right, above/upper area, below) are accurate. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image displays five animals standing in a grassy field. From left to right, there is a brown horse, a black and white cow, a white sheep in the middle, a white goat, and a black and white dog on the far right. This arrangement perfectly matches the prompt's description of the spatial relationships. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image shows a large red barn in the foreground and a windmill to its left in the midground. However, the snow-capped mountains in the background are massive and dominate the upper half of the image, contradicting the description "small scale." Additionally, the prompt description does not explicitly use the requested positional prepositions (on, under, left, right) to relate the objects, though the depth arrangement is generally consistent. The scale error regarding the mountains is signif
z-image-local/z-image-turboPASSEvaluator
The image perfectly matches the described spatial arrangement. The sandcastle is clearly in the immediate foreground at the bottom. The surfer is positioned in the middle distance on the breaking waves. The cruise ship is located far in the background near the horizon line. The visual sizes decrease appropriately with distance (sandcastle > surfer > ship appearance), adhering to perspective rules. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image perfectly matches the spatial description. The park bench and pigeons are clearly in the immediate foreground. The fountain is situated in the middle distance (midground), behind the bench. The skyscrapers form the skyline in the background. The layering follows standard perspective where foreground objects appear largest and background objects appear smaller due to distance. VERDICT: PASS
▸Scale & Proportions56%vs89%
z-image-local/z-image-turboPASSEvaluator
The image shows a man standing in front of a yellow school bus. The relative scale is realistic: the bus is significantly larger than the person, with the vehicle's height towering over him and the wheel appearing appropriately large (roughly half his height). The proportions are consistent with a real-world scenario. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image depicts a ladybug resting on a green leaf. The size of the ladybug relative to the leaf appears natural and consistent with real-world macro photography. The leaf is large enough to comfortably support the insect, and the details (like leaf hairs and veins) suggest a realistic scale. The proportions are accurate. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image shows a child holding what appears to be two basketballs stacked vertically. However, the relative sizes are incorrect. The basketballs are disproportionately large compared to the child's torso and arms. A standard basketball would appear smaller relative to a child's body; here, the balls are nearly as wide as the child's chest, and the hands look too small to grip them naturally. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The image shows a miniature figurine of a man in a suit standing on the rim of a white coffee cup. The figurine is significantly smaller than the cup, accurately reflecting the "tiny person" description. The proportions are consistent with a macro photograph of a small object placed on a standard-sized cup. The person is positioned on the edge and appears to be looking down into the coffee. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image shows a tabby cat in the foreground with skyscrapers in the background. While the cat appears large due to the close-up perspective, the skyscrapers are clearly massive structures towering in the distance. The cat is not depicted as a "giant" or "kaiju" relative to the buildings; it looks like a normal-sized cat on a walkway. The prompt requires the cat to be giant relative to the city, which is not the case here. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The image shows a small model house resting comfortably in the palm of a human hand. The scale is consistent: the house is small enough to be held (miniature), fitting the description perfectly. The proportions of the house relative to the fingers and palm are realistic for a toy or architectural model. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image fails to meet the prompt's requirements. The prompt explicitly requests a "dog," but there is no dog in the image; instead, there are two cats. Additionally, the prompt asks for singular instances of a horse, cat, and mouse, but the image contains two of each. While the visible animals (elephant, horses, cats, mice) have generally correct relative sizes to one another, the absence of the required dog and the presence of extra animals mean the image does not correctly depict the descr
z-image-local/z-image-turboPASSEvaluator
The image displays a standard table setting where the proportions are accurate. The fork and knife are appropriately sized relative to the dinner plate (roughly similar in length to the plate's diameter). The wine glass is taller than the plate is wide, and the salt and pepper shakers are significantly smaller than the main plate and cutlery, consistent with real-world objects. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image shows a farm scene, but the proportions are inconsistent with reality. The barn is disproportionately small compared to the other elements. Specifically, the cow is almost as tall as the barn's lower doors (which should be roughly 10-12 feet high), making the cow appear gigantic or the barn miniature. Similarly, the tractor appears nearly as tall as the barn doors. While the farmer, cow, and tractor have somewhat consistent relative sizes to each other, the barn breaks the scale. VE
Human realism 95%vs95%
▸Faces & Expressions92%vs92%
z-image-local/z-image-turboPASSEvaluator
The face in the image is highly symmetrical with correctly proportioned and natural features. The eyes, eyebrows, nose, and mouth are well-aligned and balanced. The smile is warm, natural, and clearly conveys happiness — the lips curve gently upward, teeth are visible in a relaxed grin, and the cheeks lift slightly, all contributing to an authentic, pleasant expression. There are no noticeable distortions or unnatural artifacts in the facial structure or expression. The lighting and composition
z-image-local/z-image-turboPASSEvaluator
The face in the image is symmetric and features are anatomically correct — eyes, nose, mouth, and ears are proportionally aligned and natural. There are no obvious distortions. The expression is calm and serious, with neutral lips, direct gaze, and minimal facial movement, which clearly conveys the intended emotion. The subject appears to be middle-aged, with visible signs of aging such as wrinkles and some graying hair, consistent with the prompt. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The face in the image is symmetric and features are correctly proportioned — eyes, nose, mouth, and ears are well-aligned and natural. The teenager’s expression is relaxed and neutral, with no strong emotion conveyed, matching the prompt. There are no visible distortions; the skin texture, lighting, and facial structure appear realistic and well-rendered. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The face in the image is largely symmetric, with both eyes, eyebrows, and lips appearing balanced in structure. The features are anatomically plausible — eyebrows are arched and furrowed, the nose is wrinkled, and the lips are curled downward, all of which strongly convey the emotion of disgust as requested. The expression is highly readable and intense, with visible skin texture and fine wrinkles that enhance the realism of the emotion. There are no significant distortions that compromise the f
z-image-local/z-image-turboPASSEvaluator
The face is largely symmetric with correctly rendered features: both eyes are wide and bulging, eyebrows are raised, and the mouth is slightly open — all aligning with the prompt’s request for “genuine surprise.” The facial structure is natural and undistorted, with no obvious asymmetry or anatomical anomalies. The emotion is clearly readable as surprise, conveyed through the exaggerated eye and mouth expression. There are no visible distortions. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The face is largely symmetric with correctly rendered anatomical features — eyes, nose, and mouth are proportionally accurate and aligned. There are no obvious distortions or unnatural deformities. The expression conveys quiet melancholy: the eyes are downcast, the mouth is slightly drawn (not smiling or frowning, but neutral with a hint of sadness), and the gaze is distant and introspective. Lighting and composition enhance the somber mood without compromising facial integrity. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The face is largely symmetrical with natural, correctly rendered features: the eyes, nose, and mouth are balanced, and the white, slightly wavy hair is appropriately styled. The deep wrinkles around the eyes and mouth are consistent with an elderly woman in her 80s, and the smile appears warm and genuine, with crinkled eyes conveying kindness. There are no obvious distortions—no misaligned features, unnatural proportions, or AI artifacts. The emotion is clearly readable as warm, gentle, and affe
z-image-local/z-image-turboPASSEvaluator
The face in the image is symmetric and features are anatomically correct for a young child. The child has round cheeks, large eyes (though squinted due to crying), and tears are clearly streaming down both cheeks. The expression is unmistakably one of intense crying — the mouth is wide open, eyebrows are furrowed, and the facial muscles are tensed appropriately for distress. There are no noticeable distortions; the features are natural and well-proportioned. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The face in the image is largely symmetric with natural, correctly rendered features. The elderly man appears to be in his 70s, with visible age spots, thinning gray hair, and is wearing reading glasses. His expression is skeptical — the brow is slightly furrowed, the lips are pressed in a neutral-to-disapproving line, and his gaze is directed away, conveying doubt or skepticism. There are no significant distortions; the facial structure, proportions, and details (like skin texture, eye shape, a
z-image-local/z-image-turboPASSEvaluator
The image shows three individuals seated at a table, each displaying a distinct and clearly readable emotion: - Left: A woman laughing with her head tilted back, eyes closed, and mouth open — this matches the “laughing with head tilted back” prompt. - Center: A woman with wide eyes, open mouth, and hands on cheeks — this clearly conveys “shocked” or “surprised” emotion. - Right: A person with one eye squinted, lips pulled into a smirk, and hands supporting the face — this effectively portrays “
z-image-local/z-image-turboPASSEvaluator
All four faces in the image are symmetric with correct, natural features. No distortions are present. - The father (top left) is beaming with pride — his wide smile and bright eyes convey genuine joy. - The mother (top right) is smiling gently — her soft expression and slight upturn of lips match the description. - The teenager (front left) looks bored — his neutral expression, slightly downturned mouth, and lack of engagement clearly convey disinterest. - The child (front right) is grinning ex
z-image-local/z-image-turboFAILEvaluator
The image shows three individuals seated at a table, each with laptops, appearing to be in a meeting. However, their facial expressions do not match the prompt’s requested emotions: - Left person: Frowning, hands on cheeks — this suggests frustration or stress, which aligns with the “frustrated with furrowed brows” prompt. - Middle person: Eyes closed, hands on cheeks, slight smile — this appears to be a mix of relief, exhaustion, or resignation, not “pleased expression” as requested. - Right p
▸Full Body92%vs100%
z-image-local/z-image-turboPASSEvaluator
The image shows a full-body view of a woman standing upright with her arms at her sides, wearing a white t-shirt and blue jeans, consistent with the prompt. - Limb count: Correct — two arms, two legs, visible torso and head. - Proportions: Anatomically correct — limbs and torso proportions appear natural and balanced. - Pose: The woman is standing upright in a relaxed pose with arms at her sides — fully matches the prompt. - Action/pose recognizability: Yes — the pose is clearly identifiable as
z-image-local/z-image-turboPASSEvaluator
The image shows a full-body shot of a man standing straight and facing the camera, with both hands in his pockets. His posture is upright and symmetrical, and his proportions appear natural and anatomically correct — head, torso, arms, and legs are in proportion with no visible distortions. He has two arms, two legs, and all limbs are fully intact and correctly positioned. The pose is clearly recognizable as “standing straight, hands in pockets,” which matches the prompt exactly. There are no no
z-image-local/z-image-turboPASSEvaluator
The image shows a full-body view of a person standing in a park, from head to feet, as requested. The person is in a neutral, upright standing pose with arms hanging loosely at the sides and hands slightly curled. The posture is natural and recognizable as a standard standing pose. Limb count is correct: two arms, two legs, with all major joints (shoulders, elbows, wrists, hips, knees, ankles) appearing anatomically accurate and proportionally aligned. The proportions of the body — head-to-shou
z-image-local/z-image-turboPASSEvaluator
The image shows a woman in mid-stride running through a park. Her hair is flying back, and her clothing (white t-shirt, dark shorts, running shoes) shows motion consistent with running. Anatomical correctness: - Limb count: Correct — two arms, two legs. - Proportions: Natural and proportional — no exaggerated limbs or torso. - Pose: Anatomically correct running pose — one leg forward, the other back, arms bent and swinging, body leaning slightly forward. The motion blur in hair and clothing en
z-image-local/z-image-turboPASSEvaluator
The image shows a man captured mid-air, seemingly jumping over a puddle. His body is crouched with bent knees, arms slightly extended forward, and feet off the ground — consistent with the described action. Limb count is correct: two arms, two legs, and a torso. Proportions appear anatomically plausible — the torso is aligned with the legs, the arms are positioned naturally, and the head is proportionally sized. The pose is recognizable as a jump, with the body compacted in a crouch, which is
z-image-local/z-image-turboPASSEvaluator
The dancer is captured mid-spin, en pointe, with arms gracefully arched above the head and the dress swirling outward — a dynamic, expressive pose that matches the prompt’s description. The dress flows realistically with the motion, and the fabric’s movement is physically plausible. Limb count is correct: two arms, two legs, and the torso are intact. Proportions are anatomically accurate — the dancer’s body is slender but proportionate, with no obvious distortions in limb length or joint place
z-image-local/z-image-turboPASSEvaluator
The image shows a person performing the yoga tree pose (Vrksasana). The pose is recognizable: one leg is standing firmly on the floor, the other foot is pressed against the inner thigh of the standing leg, and the arms are raised overhead with palms pressed together. The limb count is correct — two arms, two legs, two feet — and proportions appear anatomically plausible. The pose is executed with good alignment: the torso is upright, the hips are level, and the standing leg is engaged. There are
z-image-local/z-image-turboPASSEvaluator
The climber in the image has two arms and two legs — limb count is correct. Proportions appear anatomically plausible: the torso, limbs, and joints are proportioned in a way consistent with a human body in dynamic motion. The pose is recognizable as a rock climbing action — arms extended to grip holds, legs positioned for balance and leverage, body angled against the wall. The climber’s posture is physically plausible for the activity, with no obvious distortions in joint angles or limb placemen
z-image-local/z-image-turboFAILEvaluator
The image shows a cellist seated with the cello positioned between their legs, supported by a stand. The bow is held in the right hand and is extended across the strings, consistent with playing posture. The left hand is positioned on the fingerboard, as expected for playing. The cellist’s legs are not wrapped around the instrument — they are seated with the cello resting between them, which is the standard posture for cello playing. The instrument is properly supported by a stand, not held by t
z-image-local/z-image-turboPASSEvaluator
The image shows two tango dancers in a dynamic, close embrace. The woman is arched backward, supported by the man, with her legs extended and intertwined with his — specifically, her right leg is wrapped around his left thigh, and her left leg is extended outward, creating a classic tango dip pose. Both dancers have full, anatomically correct limb counts (two arms, two legs each), and their proportions are realistic and consistent with human anatomy. The pose is clearly recognizable as a tango d
z-image-local/z-image-turboPASSEvaluator
The image shows two soccer players in a dynamic tackle: one player is sliding on the ground, and the other is airborne, jumping over him while controlling the ball at his feet. This matches the prompt’s description. Anatomical analysis: - Limb count: Both players have all limbs present — arms, legs, hands, feet — no missing or extra limbs. - Proportions: Body proportions are realistic — torsos, limbs, and heads are proportionate to each other and to the players’ athletic build. - Pose: The slid
z-image-local/z-image-turboPASSEvaluator
The image shows two gymnasts performing synchronized handstands side by side. Both have identical body alignment — their legs are straight and parallel, feet pointed, torsos aligned vertically, and hands placed shoulder-width apart on the mat. Limb count is correct: each has two arms, two legs, and the correct number of fingers and toes. Proportions are anatomically accurate — the limbs are proportionate to the torso, and the body is balanced in the handstand position. The pose is clearly recogn
▸Hands100%vs92%
z-image-local/z-image-turboPASSEvaluator
Both hands are clearly visible, palms facing forward, fingers relaxed and naturally spread as requested in the prompt. Counting the fingers on each hand: - Left hand: Thumb, index, middle, ring, pinky — 5 fingers. - Right hand: Thumb, index, middle, ring, pinky — 5 fingers. The anatomy is correct: each hand has five digits (thumb + four fingers), and the posture is natural and relaxed, consistent with a person standing with hands at their sides. No anomalies are present — no extra fingers, mi
z-image-local/z-image-turboPASSEvaluator
Both hands are clearly visible, palms up, with all ten fingers distinctly shown. Each hand displays five fingers: thumb, index, middle, ring, and pinky — no fingers are missing or duplicated. The gesture is anatomically correct: the hands are extended forward, palms open, fingers slightly spread, which is a natural and common pose. There are no anomalies such as extra fingers, fused digits, or unnatural deformities. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
Both hands are visible in the image, resting on opposite upper arms as described in the prompt. Each hand clearly shows five fingers: thumb, index, middle, ring, and pinky. The fingers are anatomically positioned and appear natural, with no missing or extra digits. The grip is consistent with a typical crossed-arm posture — hands resting on opposite upper arms, fingers slightly curled and overlapping, which is a common and anatomically plausible gesture. No anomalies are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The visible hand in the image is holding the mug by its handle with the thumb on top and the other fingers curled around the back of the handle. All five fingers are clearly visible: thumb, index, middle, ring, and pinky. The grip is anatomically correct — the thumb and fingers are wrapped around the handle in a natural, functional manner typical of holding a mug. There are no anomalies such as extra fingers, missing fingers, or unnatural joint positions. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The visible hand in the image is holding a pen and writing in a notebook. Counting the fingers: - Thumb: visible - Index finger: wrapped around the pen - Middle finger: supporting the pen - Ring finger: curled under the pen - Little finger: partially visible, curled under All five fingers are clearly present and anatomically positioned. The grip is a standard tripod grip — index finger, middle finger, and thumb holding the pen — which is a common and anatomically correct way to hold a pen for
z-image-local/z-image-turboPASSEvaluator
Left hand (on fretboard): 5 fingers visible — thumb, index, middle, ring, pinky — all positioned correctly to press strings on the fretboard. Anatomically accurate. Right hand (strumming): 5 fingers visible — thumb, index, middle, ring, pinky — all positioned naturally for strumming. Anatomically accurate. No anomalies detected. Fingers count is exactly 5 per hand. Gestures are consistent with standard guitar playing posture. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a single hand making a peace sign gesture, with the index and middle fingers extended upward and the other three fingers (ring, pinky, and thumb) curled inward. All five fingers are clearly visible and anatomically present — no fingers are missing or duplicated. The gesture is anatomically correct: the index and middle fingers are extended, while the ring finger, pinky, and thumb are curled into the palm. The thumb is positioned naturally against the side of the hand, which is t
z-image-local/z-image-turboPASSEvaluator
The image shows a single hand performing a thumbs-up gesture. The thumb is extended upward, and the other four fingers (index, middle, ring, and little) are curled into a fist. All five fingers are clearly visible and anatomically distinct. - Count of fingers per hand: 5 (thumb + four curled fingers). - Anatomical correctness: The gesture is standard and anatomically accurate. The thumb is extended, and the other fingers are curled naturally, consistent with a typical thumbs-up pose. - No anoma
z-image-local/z-image-turboPASSEvaluator
The image shows a single hand with the index, middle, and ring fingers extended — matching the prompt’s description of “counting to three.” The thumb is not visible, and the pinky finger is also not visible, so only three fingers are extended. The hand appears to be a left hand, palm facing up, with the thumb tucked away or out of frame. Counting the visible fingers: only three are extended (index, middle, ring). The thumb and pinky are not visible, so we cannot confirm they are present or abse
z-image-local/z-image-turboPASSEvaluator
Each visible hand clearly shows five fingers: thumb, index, middle, ring, and pinky. The grip is a standard, professional handshake — palms facing each other, fingers interlocked, thumbs resting on the outer side. The anatomy is consistent with real human hands: knuckles, finger joints, and nail beds are proportionally accurate and natural. No anomalies such as extra fingers, missing fingers, or unnatural joint positions are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
Each visible hand clearly shows five fingers: thumb, index, middle, ring, and pinky. The fingers are fully extended and meet palm-to-palm in a high-five gesture. The anatomy is accurate — no extra or missing fingers, no unnatural deformities, and the positioning is consistent with a natural high-five. The gesture is anatomically correct and matches the prompt. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
Examining the hands in the image: - The hands of the person giving the gift (top pair, wearing a dark blue sweater) are clearly visible. Each hand shows five fingers: thumb, index, middle, ring, and pinky. No fingers are missing or duplicated. - The hands receiving the gift (bottom pair, wearing a white sleeve) are also visible. Each hand also clearly shows five fingers. - The grip is anatomically correct: the giver’s hands cradle the box with palms slightly cupped and fingers wrapped around th
▸Multi-Subject100%vs100%
z-image-local/z-image-turboPASSEvaluator
Person 1 (left): - Hair: Short, blonde, styled in a bob cut. - Clothing: Wearing a red jacket over a dark-colored top. - Observed traits: Visually distinct, matches description. Person 2 (right): - Hair: Long, straight, black. - Clothing: Wearing a sleeveless blue dress with a belt. - Observed traits: Visually distinct, matches description. All individuals are present, visually distinct, and match their described features. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
Person 1 (left): - Has a full, reddish-brown beard. - Wears black-rimmed glasses. - Wears a green ribbed sweater. - Has brown, styled hair. - Visible watch on left wrist. Person 2 (right): - Clean-shaven (no facial hair). - Has short, light brown/blond hair. - Wears a black hoodie with drawstrings. - Has light-colored eyes. Both individuals are visually distinct and match the described features exactly. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
Person 1 (tall): - Hair: Curly, red (appears as vibrant copper-red curls, matching description) - Clothing: Denim overalls (blue denim bib overalls over a white t-shirt) - Height: Clearly taller than the other person, as expected Person 2 (short): - Hair: Straight, brown (long, straight brown hair, matching description) - Clothing: Yellow sundress (bright yellow, sleeveless, gathered-waist dress) - Height: Clearly shorter than the other person, as expected Both individuals are visually disti
z-image-local/z-image-turboPASSEvaluator
- Bald man in a suit: Present. Seen from the side, wearing a dark suit with a white shirt. Bald head is clearly visible. - Woman with silver hair in a red blouse: Present. Seated across from the bald man, has silver-gray wavy hair, wearing a red collared blouse, and gold jewelry. - Young man with dreadlocks in a denim jacket: Present. Seated next to the woman in red, has dark dreadlocks, wearing a blue denim jacket over a white shirt. - Woman with a hijab in a green dress: Present. Seated on the
z-image-local/z-image-turboPASSEvaluator
- Tall woman with short pink hair playing bass: Present. She has bright pink short hair, is playing a bass guitar, and is visibly tall in proportion to the others. ✅ - Man with a beard and bandana on drums: Present. He is seated behind a drum kit, has a full beard, and is wearing a black bandana. ✅ - Slim man with glasses playing keyboard: Present. He is on the right, playing a keyboard, has curly brown hair, wears glasses, and appears slim. ✅ All three individuals are visually distinct and mat
z-image-local/z-image-turboPASSEvaluator
Person 1 (Man with backpack and sunhat): - Observed: Seen from the back, wearing a beige sunhat and a large dark gray backpack. He is holding a hiking pole. - Matches description: Yes — backpack and sunhat are present. Person 2 (Woman with braided hair and hiking poles): - Observed: Facing forward, smiling, with long braided hair, wearing a beige sunhat, and holding two hiking poles. - Matches description: Yes — braided hair, hiking poles, and sunhat are present. Person 3 (Teenager with baseba
Truthfulness 78%vs0%
▸Photorealism67%vs0%
z-image-local/z-image-turboPASSEvaluator
The image accurately renders the requested materials. The glass shows proper refraction and transparency, the spoon exhibits realistic metallic reflections, the apple has a convincing skin texture with water droplets, the napkin displays natural fabric folds, and the wooden table features appropriate grain and lighting. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image successfully renders the brushed aluminum laptop, leather notebook, and ceramic mug with accurate textures and lighting. However, the prompt explicitly requested a "glass paperweight," which is completely absent from the scene. Since a key material requested in the prompt is missing, the image fails to meet the criteria. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The image accurately renders the distinct visual properties of each material. The stainless steel pot shows sharp, metallic reflections and a brushed texture. The glass bottle exhibits translucency and refraction, with the olive oil visible inside. The wooden cutting board displays natural grain and matte texture, while the cotton towel shows soft fabric folds and diffuse light absorption. The induction cooktop surface correctly reflects the pot and surrounding objects. All materials behave phys
▸Physics & Reflections92%vs0%
z-image-local/z-image-turboPASSEvaluator
The shadow is cast to the right and slightly downward, which is consistent with a light source from the upper left. The shadow’s soft edges and gradual fade are physically plausible for a diffused light source. The sphere’s highlight on the upper-left side also aligns with this lighting direction. No distortions or color anomalies are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The shadow is cast to the left, consistent with sunlight coming from the right (window side). The shadow’s shape matches the mug’s silhouette, including the handle, and shows appropriate softness near the base due to light diffusion. Reflections on the mug’s glossy surface are subtle and realistic. No distortions or color anomalies are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The shadow is cast directly below the subject, which aligns with the “midday” lighting condition described in the prompt. The shape of the shadow accurately mirrors the person’s posture and clothing folds, showing no distortion or misalignment. The color is a natural dark gray, consistent with asphalt under bright sunlight. No reflections are present, as expected for this scene. All shadow attributes are physically plausible. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The reflection is physically implausible. The woman is facing the mirror, so her reflection should show her front. Instead, the reflection shows her back, which contradicts how mirrors work. Additionally, the reflection’s pose and orientation do not match a true mirror image — it appears as if the reflection is a separate person turned around, not a mirrored version of the subject. This violates basic optical principles. VERDICT: FAIL
z-image-local/z-image-turboPASSEvaluator
The reflection is physically plausible. It is correctly oriented (inverted vertically), aligned with the cat’s position, and shows appropriate distortion due to the polished marble surface. The color and brightness match the original subject, and the reflection fades naturally toward the edges, consistent with real-world optics on a glossy floor. No directional or structural inconsistencies are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The reflection on the glass table is physically plausible. It correctly mirrors the vase and stems with appropriate vertical inversion and slight distortion consistent with a reflective surface. The color and brightness of the reflection match the object above, and the direction aligns with the light source. No anomalies in orientation or distortion are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The reflections are vertically aligned with the trees, correctly inverted, and exhibit natural ripple distortion consistent with a calm lake surface. The color tone of the reflection is slightly darker than the actual trees, matching the prompt’s description. No directional or orientation errors are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The reflection in the puddle is physically plausible. It correctly mirrors the skyline with appropriate vertical inversion and perspective distortion consistent with a low-angle water surface. The color shift is subtle but realistic, showing slightly muted tones and light scattering typical of reflections on wet asphalt. Direction and orientation align with the scene’s lighting and geometry. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The reflection is physically plausible: it is correctly inverted, aligned with the shoreline, and shows appropriate color muting and slight softening consistent with water surface behavior. Shadows on the mountains are consistent with a low-angle light source (likely sunrise/sunset), and their reflections match in direction and intensity. No distortions or misalignments are evident. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image displays a clear glass sphere on a reflective surface. The reflection is vertically aligned and correctly inverted, matching the sphere’s position. Refraction through the sphere shows a distorted, inverted view of the background, which is physically accurate for a convex lens effect. The contact shadow and water ripple distortion around the base are subtle but plausible. Colors and lighting are consistent with natural daylight. No directional or orientation errors detected. VERDICT: P
z-image-local/z-image-turboPASSEvaluator
The reflections are physically plausible. The chrome sculpture accurately mirrors the environment, including the window frames and sky, with appropriate distortion based on its curved surface. The glass floor reflects the sculpture and the room’s structure correctly, maintaining proper orientation and perspective. Shadows are minimal but consistent with the bright, diffused lighting from the large windows. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The reflections and refractions are physically plausible. The wine glass correctly refracts the background lake, showing an inverted image of the waterline. The marble surface displays a soft, accurate reflection of the glass stem and base. The lighting is consistent, with shadows and highlights aligning with a light source from the window. The lake’s surface also reflects the sky and distant trees naturally. VERDICT: PASS
▸World Knowledge67%vs0%
z-image-local/z-image-turboPASSEvaluator
The image depicts the Eiffel Tower with high architectural accuracy. The iron lattice structure, the three distinct levels (including the first and second platforms and the top section), and the iconic arch at the base are all correctly rendered. The perspective is realistic, and the clear blue sky matches the prompt’s description of a “clear day.” No significant inaccuracies are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image depicts the Taj Mahal with high architectural accuracy. The central dome, four minarets, and symmetrical layout are correctly rendered. The reflection in the pool is precise, and the surrounding landscaping (cypress trees, pathways) matches the real-world site. Minor details like the finial on the dome and the arched entrances are also accurate. No significant factual or structural inaccuracies are present. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The subject is clearly recognizable as the Statue of Liberty. The green patina, crown spikes, tablet, and torch are all present and correctly positioned. The pedestal architecture matches the real-world structure with its stone masonry and arched windows. No significant inaccuracies are visible. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The instrument shown is a Ruan (a Chinese lute), not a Shamisen. A Shamisen has a long neck, a square body covered in skin, and typically uses a bachi plectrum. This image shows a round wooden body, frets, and four strings (not three). The subject is not factually accurate to the prompt. VERDICT: FAIL
z-image-local/z-image-turboFAILEvaluator
The image depicts a brass navigational instrument, but it is not a medieval astrolabe. It lacks the characteristic rete (star map) and tympan (geographic plate) of an astrolabe. Instead, it resembles a mariner’s compass or a simplified planisphere with a star-shaped pointer and degree markings. The design is more consistent with a 17th–19th century nautical instrument than a medieval astronomical device. Thus, it fails to meet the prompt’s requirement for a factually accurate medieval astrolabe.
z-image-local/z-image-turboPASSEvaluator
The image accurately depicts a Japanese kintsugi bowl. The ceramic texture, the specific style of crack repair using gold lacquer (kintsugi), and the overall form are realistic and factually correct. There are no architectural or factual inaccuracies. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image accurately depicts the water cycle with clear labels for evaporation, condensation, precipitation, and collection. The arrows correctly show the flow of water from the ocean to clouds, rain falling, and rivers returning to the sea. The visual elements are simple but factually correct and recognizable. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image depicts a surface-level rift or graben with exposed sedimentary layers, resembling a canyon or fault scarp. It does not show the required geological features: there is no subduction zone, oceanic trench, volcanic arc, or mantle layering. The scene is a terrestrial desert landscape with stratified rock, not a tectonic cross-section of a convergent boundary. Therefore, it fails to meet the prompt’s factual and architectural requirements. VERDICT: FAIL
z-image-local/z-image-turboFAILEvaluator
The image is a stylized 3D rendering rather than a traditional anatomical illustration. While the major vessels (aorta, pulmonary artery) and chambers are present, the anatomical accuracy is poor. The coronary arteries are depicted as large, superficial red tubes that do not follow realistic anatomical paths. The text labels on the atria are gibberish ("ARTILLZ", "ARTILLZ"), which is a common AI artifact. The valves are not clearly or correctly detailed. The overall structure is recognizable as
z-image-local/z-image-turboPASSEvaluator
The image depicts a butterfly with the characteristic iridescent blue dorsal wings and brown ventral patterns of a Morpho menelaus. The body morphology, including antennae and wing venation, is anatomically correct. The eyespots on the hindwings are present and accurately placed. The subject is clearly recognizable as the requested species with no factual or structural inaccuracies. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image accurately depicts the interior of the Hagia Sophia, showcasing its massive central dome, pendentives, semi-domes, and Islamic calligraphy medallions alongside Byzantine mosaics. The architectural details are factually correct and recognizable. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image depicts a bismuth crystal with the characteristic hopper structure and iridescent oxide layers. The geometric staircase formation is accurately represented, and the colors are consistent with real-world specimens. The subject is recognizable and factually accurate. VERDICT: PASS
Professional Studio 85%vs85%
▸Camera & Lighting83%vs92%
z-image-local/z-image-turboPASSEvaluator
The image depicts a modern living room with a wide-angle perspective. 1. **Perspective/Wide-angle:** The shot captures a large area of the room, including the sofa, coffee table, and the large window area, consistent with a wide-angle lens. 2. **Barrel Distortion:** There is a subtle hint of barrel distortion visible at the edges (specifically in the way the ceiling lines and the vertical edges of the window frame slightly curve), which matches the prompt's requirement. 3. **Depth of Field:**
z-image-local/z-image-turboPASSEvaluator
The image matches the prompt's requirements: 1. **Perspective/Shot Type:** The image is a close-up macro shot of a single dewdrop on a rose petal, as requested. 2. **Depth of Field:** There is a clear shallow depth of field. The dewdrop and the immediate area of the petal are in focus, while the center of the rose and the background are softly blurred. 3. **Lighting:** The lighting is soft and natural, consistent with a macro photography setting. The image accurately reflects the specified
z-image-local/z-image-turboPASSEvaluator
The image shows an aerial bird's-eye view looking straight down at a traffic roundabout. The perspective is a direct top-down (nadir) view, which matches the prompt. The lighting is even and diffuse, consistent with an overhead daylight shot, and the depth of field is deep, keeping both the center of the roundabout and the surrounding streets in focus, which is typical for high-altitude aerial photography. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image depicts a portrait of a man with long hair. 1. **Lighting Pattern**: The prompt specifies "classic Rembrandt lighting," which is characterized by a distinct triangle of light on the shadowed cheek. In this image, the light is coming from the side, but there is no visible light triangle on the shadowed side of the face. The shadow on the cheek is quite broad and lacks the specific geometric light pattern required for a true Rembrandt lighting style. 2. **Perspective and Depth of Field
z-image-local/z-image-turboFAILEvaluator
The image depicts a landscape of sand dunes under a sky that is a solid, uniform orange/yellow color. 1. **Lighting Pattern:** The prompt specifies "golden hour light — long warm shadows, orange sky, soft glow." While the sky is orange, the lighting on the dunes is very flat. There are no "long warm shadows" or a "soft glow" emanating from a light source; the dunes appear illuminated by a flat, ambient light rather than a directional sun. There is no visible sun or light source to create the s
z-image-local/z-image-turboPASSEvaluator
The image features a close-up portrait of a person against a bright, white, and overexposed background. 1. **Lighting Pattern:** The lighting is high-key, bright, and airy. There are minimal shadows on the face, creating a luminous and soft appearance that matches the prompt's description of "high-key" and "minimal shadows." 2. **Perspective:** The perspective is a close-up portrait, focusing on the face and upper neck, which is consistent with a high-key portrait style. 3. **Depth of Field
z-image-local/z-image-turboPASSEvaluator
The image matches the prompt's specifications: 1. **Perspective/Lens (85mm):** The image shows a classic portrait composition with a tight framing on the subject, consistent with an 85mm focal length. 2. **Depth of Field (f/1.4 with creamy bokeh):** The background is heavily blurred with soft, circular bokeh, indicating a shallow depth of field characteristic of an f/1.4 aperture. The subject is sharp, while the garden background is completely out of focus. 3. **Lighting (Backlit with warm r
z-image-local/z-image-turboPASSEvaluator
The image matches the prompt's requirements: 1. **Depth of Field:** The image exhibits a shallow depth of field characteristic of a wide aperture (like f/1.8). The subject (the woman) is sharp and in focus, while the background elements (city lights, cars, and pedestrians) are heavily blurred with smooth bokeh. 2. **Perspective:** The shot is a medium close-up street photograph, consistent with a 50mm lens perspective, which provides a natural field of view without significant distortion. 3.
z-image-local/z-image-turboPASSEvaluator
To evaluate the image against the prompt: 1. **Perspective (Shot from above):** The image is shot from a high angle, looking down at the pizza, which matches the "shot from above" requirement. 2. **Depth of Field (35mm f/2.8):** The image shows a shallow depth of field. The center of the pizza is in sharp focus, while the edges of the pizza and the background (the window sill and wooden surface) are softly blurred. This is consistent with an f/2.8 aperture setting. 3. **Lighting (Natural win
z-image-local/z-image-turboPASSEvaluator
The image successfully implements the tilt-shift effect described in the prompt. 1. **Perspective/Tilt-Shift Effect:** The image uses a very shallow depth of field with a distinct "selective focus strip." The top and bottom of the image are heavily blurred (bokeh), which creates the optical illusion that the scene is a miniature model rather than a full-scale city street. 2. **Depth of Field:** The focus is sharply constrained to the middle section of the street (where the white car and black
z-image-local/z-image-turboPASSEvaluator
The image shows a dark, moody scene dominated by a single, prominent horizontal light streak. 1. **Lighting Pattern:** The prompt requested "blue and orange anamorphic lens flare streaking horizontally". The image features a bright horizontal streak that transitions from a cyan/blue hue on the left to an orange/amber hue on the right. This matches the color and directionality described. 2. **Perspective/Depth of Field:** The image is an abstract representation of light, so traditional perspec
z-image-local/z-image-turboPASSEvaluator
The image shows a portrait of a man with a clear lighting pattern. 1. **Lighting Pattern:** The prompt specifies "dramatic split lighting — half the face brightly lit, the other half in deep shadow, high contrast chiaroscuro." In the image, the light source is positioned to the side, creating a distinct division. The left side of the subject's face (from the viewer's perspective) is brightly illuminated, while the right side is cast in deep shadow. This creates a high-contrast chiaroscuro effe
▸Color Precision100%vs100%
z-image-local/z-image-turboPASSEvaluator
The image shows a sports car that is a bright, vivid, and unambiguous red. The color is consistent across the body of the car and matches the description of "bright red" perfectly. The car is parked on a gray asphalt road, which also matches the prompt. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a woman wearing a sleeveless evening dress. The color of the dress is a deep, vibrant blue, which matches the description of "royal blue." The background is a solid, light gray, which qualifies as a "neutral background." The colors in the image perfectly match the prompt's specifications. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image features a large, prominent sunflower in the foreground and several other sunflowers in the background. The petals of the sunflowers are a vibrant, pure sunflower yellow. The background sky is a clear, solid blue. The colors in the image perfectly match the colors specified in the prompt. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a bedroom with a wall painted in a vibrant, saturated orange color. The hex code #FF6B35 represents a bright, slightly reddish-orange (often described as a vivid burnt orange or coral-orange). The color in the image is a very close match to this description and hex value. The furniture (the sideboard/cabinet) is white, and the lighting appears to be natural daylight coming from the window on the right. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a dense forest scene with a focus on green foliage and brown tree trunks. 1. **Foliage Color:** The leaves are a dark, muted green. The color appears to be a deep, desaturated forest green, which aligns well with the description of `#2D5F2D` (a dark, muted green). 2. **Trunk Color:** The tree trunks are clearly brown, as specified. The colors in the image accurately reflect the specific hex-based color description provided in the prompt. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a room with walls that are a medium blue color. The hex code #4A90D9 represents a medium-light shade of blue, which matches the visual appearance of the walls in the image. The floor is a warm wooden color, and the trim is white, which also aligns with the prompt's description. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image matches the color specifications provided in the prompt: 1. **Walls (#F5E6CC - warm cream):** The walls are a light, warm cream/off-white color, consistent with the hex code. 2. **Sofa (#2C3E50 - dark navy):** The sofa is a deep, dark navy blue, matching the specified color. 3. **Accent Pillows (#E74C3C - bright red):** The pillows are a vibrant, bright red, consistent with the hex code. All three primary elements match the requested colors and hex values. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image contains the following colors: 1. **Background:** The background is a deep, saturated blue. While it is a dark blue, it appears slightly more vibrant/lighter than the specific deep dark blue `#1A1A2E` (which is a very dark, almost navy/black-leaning blue). However, it is a close match to the description. 2. **Watch:** The watch strap and casing are a soft gold/pale peach color, which aligns well with the `#F0C27F` (soft gold) description. 3. **Earbuds:** The earbuds are a pure white
z-image-local/z-image-turboPASSEvaluator
The image contains a large, stylized letter "B" with a specific color scheme. 1. **Background:** The background is white. The prompt specified `#FAFAFA` (a very light off-white/near-white). The background in the image appears to be pure white or very close to it, which aligns with the intent of `#FAFAFA`. 2. **Main Shape:** The main shape is the letter "B", which is a vibrant orange-red. The prompt specified `#FF4500`. `#FF4500` is a bright orange-red (Orange Red), which matches the color of
z-image-local/z-image-turboPASSEvaluator
The image shows a smooth gradient transitioning from a bright red on the left to a bright blue on the right. The midpoint of the gradient is a purple/magenta hue, which aligns with the transition from #FF0000 (red) to #0000FF (blue). The colors are consistent with the hex values and the description provided in the prompt. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image shows a sunset sky with a gradient. 1. **Bottom Color:** The bottom of the image features a bright orange/yellow glow near the horizon. The color is a vibrant orange, which aligns well with the requested `#FF6B35` (a bright orange). 2. **Top Color:** The top of the image is a deep blue. While it is a dark blue, it appears more like a standard sky blue/deep blue rather than the specific deep indigo/navy shade of `#1B1464`. However, the general transition from a bright orange at the bo
z-image-local/z-image-turboPASSEvaluator
The image shows a horizontal banner with a smooth gradient. 1. **Left side color:** The color on the far left is a bright cyan/light blue. This aligns well with the hex code **#00C9FF** (a vibrant cyan). 2. **Right side color:** The color on the far right is a light, pale green. This aligns well with the hex code **#92FE9D** (a light mint green). 3. **Gradient transition:** The transition from the cyan on the left to the mint green on the right is smooth and follows the specified direction.
▸Photorealism33%vs0%
z-image-local/z-image-turboFAILEvaluator
The image depicts a single red sneaker on a white background. **Analysis:** 1. **Subject Matter:** The sneaker is a solid red color with a suede-like texture. 2. **Lighting and Shadow:** The lighting is soft, but the shadow underneath the shoe is extremely minimal and lacks the natural-looking diffusion or contact shadows expected in a high-quality studio product shot. The shadow looks somewhat "pasted on" or overly simplified. 3. **AI Artifacts:** * **Laces/Eyelets:** The eyelets an
z-image-local/z-image-turboFAILEvaluator
The image depicts a clear glass perfume bottle on a reflective black surface against a dark, neutral background. **Analysis:** 1. **Subject and Composition:** The bottle is centered and follows the prompt's instructions for a studio-quality shot. 2. **Reflections and Transparency:** While the glass and liquid look relatively realistic, there is a significant AI artifact regarding the internal structure. The "straw" or dip tube inside the bottle is extremely thin and appears to bend or termin
z-image-local/z-image-turboPASSEvaluator
The image depicts a single green apple on a seamless white background. **Analysis:** 1. **Subject & Lighting:** The apple is a vibrant green with realistic water droplets on the skin. The lighting is soft and consistent with a studio product shot, creating a gentle shadow at the base. 2. **Realism:** At first glance, the image is highly convincing. The texture of the skin, the translucency of the water droplets, and the way the light interacts with the surface appear very natural. 3. **AI A
Graphical design 54%vs71%
▸Data Visualisation0%vs0%
z-image-local/z-image-turboFAILEvaluator
The chart is missing the required monthly labels for January through June. Only “June” is labeled on the x-axis, and the other bars are labeled with values (12K, 18K, 15K, 19K) which are not months. The y-axis is labeled with “50” multiple times, which is incorrect — it should show a consistent scale (e.g., 0 to 50K or 0 to 25K). The values on the bars (18K, 15K, 22K, 19K, 25K) do not match the requested sequence (12K, 18K, 15K, 22K, 19K, 25K) — the first bar is labeled 18K instead of 12K. The c
z-image-local/z-image-turboFAILEvaluator
The chart is a pie chart as requested, with four distinct slices, each a different color. The percentages and labels are as follows: - Company A: 35% (yellow slice, labeled correctly) - Company B: 25% (green slice, labeled correctly) - Others: 20% (blue slice, labeled correctly) - The remaining 20% (orange slice) is not labeled with a company name, but since the prompt specified “Others 20%”, and the orange slice is the only one not explicitly labeled as Company A, B, or Others, it is implied t
z-image-local/z-image-turboFAILEvaluator
The chart is a line graph titled “Weekly Traffic,” which matches the requested chart type. However, the axis labels and data values are severely misaligned and incorrect. - The x-axis is labeled “Mon-Sun,” which is correct for the days of the week, but the values (1200, 1100, 2000, 1700, 1400) are placed on the x-axis, which should represent days, not visitor counts. - The y-axis is labeled “Mon-Sun,” which is incorrect — it should represent visitor counts (numerical values), not days. - The nu
▸Layout & Design33%vs78%
z-image-local/z-image-turboPASSEvaluator
The image includes: - A large popcorn bucket in the center — ✅ present. - The title "MOVIE NIGHT" in bold at the top — ✅ present and correctly placed. - The tagline "Every Friday at 8 PM" at the bottom — ✅ present and correctly placed. - The visual hierarchy is clear: title at top, popcorn in center, tagline at bottom — ✅ correct. - All text is readable — ✅ yes. However, the tagline text reads “Every Friday at 8 PM” — which matches the prompt — ✅ correct. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The poster includes: - A guitar illustration (integrated into the lettering, forming the neck and body of the guitar with the letters “LIVE MUSIC FEST”). - The headline “LIVE MUSIC FEST” in large, bold text — correctly placed and visually dominant. - The tagline “Tickets available now” below the main headline — readable and correctly positioned. The visual hierarchy is clear: the headline is largest and most prominent, the tagline is smaller and below, and the guitar illustration is integrated
z-image-local/z-image-turboPASSEvaluator
All described layout zones/elements are present: - An open book illustration is clearly visible as the central graphic. - The title “JOIN THE BOOK CLUB” is prominently displayed in large, bold, centered text across the center of the open book. - The tagline “Meets every Wednesday” is correctly placed below the title, in smaller font size, maintaining proper visual hierarchy. All text is readable and correctly positioned according to the prompt. VERDICT: PASS
z-image-local/z-image-turboFAILEvaluator
The image displays a landing page layout that includes: - A hero header image at the top (mountain landscape with a circular profile photo overlay). - A headline and subtitle in the middle (though the text is gibberish and unreadable). - A “Sign Up” call-to-action button centered below the text. - A footer at the bottom with three social media icons (envelope, play, magnifying glass). All described layout zones/elements are visually present. The visual hierarchy is correct: hero image → headli
z-image-local/z-image-turboFAILEvaluator
The image shows a newsletter layout that attempts to follow the described structure, but fails on multiple critical points: 1. **Banner Image Header**: Present — a header with the title "Nawselter" over a background image. 2. **Two-Column Body Section**: Present — text on the left and an image of a woman on the right. 3. **Highlighted Quote Block**: Present — a beige box with a quote, though the text is gibberish. 4. **Footer with Unsubscribe Link**: Present — a black bar at the bottom with
z-image-local/z-image-turboFAILEvaluator
The image shows a menu layout with a logo header (“MÁNU”), three category sections (Appetizers, mains, Mains, and Desseries), and a footer with text (“Restelade cetralienc” and “$0 - 200”). However, the category names are inconsistent — “mains” is repeated twice (once in lowercase, once in uppercase), and “Desseries” is misspelled. The text in the footer is gibberish and does not resemble a real address or hours. Additionally, many item names are nonsensical or made-up, and prices are either $0
z-image-local/z-image-turboFAILEvaluator
The image shows a magazine double-page spread. The left page contains a large hero photo of a woman, which matches the prompt. The right page contains a two-column article layout, a pull quote (the text “A Doolr hetere Hoting cocour” is prominently displayed as a headline, which functions as a pull quote), and a sidebar with a photo and author bio (“Ooodr Jarsgehe” with a subtitle). The page numbers (139 and 140) are visible at the bottom of the respective pages. However, the text in the articl
z-image-local/z-image-turboFAILEvaluator
The image shows a mobile app UI mockup that includes: - A top navigation bar with status indicators (time, signal, battery) — present. - A search field with placeholder text “Reoure” — present. - A 2x2 grid of feature cards with icons — present (green bookmark, red play, yellow heart, blue chat). - A list of recent activity items — present (with profile pictures and placeholder text). - A bottom tab bar with 5 icons and Chinese labels — present. However, the text in the search field and recent
z-image-local/z-image-turboFAILEvaluator
The image shows a fashion magazine spread with a clear left-page full-bleed photo and a right-page three-column layout, including a headline, body copy, a smaller inset image, and styled page numbers (19 and 20). The visual hierarchy is generally correct: the large photo dominates the left page, while the right page uses columnar text blocks with a prominent headline and a smaller inset image. The page numbers are styled and placed appropriately at the bottom corners. However, the text is not re
▸Style Diversity83%vs83%
z-image-local/z-image-turboPASSEvaluator
The image depicts a golden retriever sitting in a garden setting. The style is clearly that of an oil painting, with visible, textured brushstrokes applied in a manner consistent with traditional oil painting techniques. The fur of the dog and the foliage in the background are rendered with thick, expressive strokes, and the color palette is rich and layered, contributing to a textured, painterly effect. The composition and rendering align well with the requested style. VERDICT: PASS
z-image-local/z-image-turboPASSEvaluator
The image depicts a golden retriever sitting in a garden, which matches the subject matter of the prompt. The art style is clearly Japanese anime-inspired: the dog has large, expressive, round eyes, simplified facial features, and a soft, stylized rendering. The color palette is flat with minimal shading, consistent with the requested “flat colors” aesthetic. The background foliage is also rendered in a similarly simplified, illustrative style typical of anime. However, there is a notable absen
z-image-local/z-image-turboPASSEvaluator
The image depicts a golden retriever sitting in a garden, rendered in a pixel art style. The visual characteristics of pixel art are clearly present: the image is composed of distinct, blocky pixels, and the forms are defined by color blocks rather than smooth gradients. The scene is simplified and stylized, consistent with low-resolution pixel art. While the exact 32x32 grid size cannot be verified visually without metadata, the overall aesthetic and blocky construction strongly suggest it adhe
z-image-local/z-image-turboPASSEvaluator
The image displays a building facade that strongly embodies the Art Deco style as requested. Key characteristics are clearly present: - **Geometric shapes**: The facade is dominated by sharp lines, stepped forms, and symmetrical patterns — including zigzags, chevrons, and vertical and horizontal bands — all hallmarks of Art Deco. - **Gold and black palette**: The design uses a striking contrast between polished gold-colored metalwork and dark black marble, fulfilling the color requirement preci
z-image-local/z-image-turboPASSEvaluator
The image clearly depicts a woman holding a parasol, set in an outdoor, garden-like environment. The visual style is unmistakably Impressionist: - Soft, visible brushstrokes are evident throughout, especially in the foliage and background. - Dappled light is present, with patches of color suggesting sunlight filtering through leaves. - The palette is predominantly pastel — soft pinks, blues, greens, and yellows — with gentle transitions and no harsh lines. - The focus is on capturing the fleet
z-image-local/z-image-turboPASSEvaluator
The image clearly embodies the requested Pop Art style. It features: - Bold, high-contrast outlines defining facial features and hair. - Ben-Day dots (halftone patterns) visible in the skin tones, hair, and background, especially in shaded areas. - Flat, vivid primary colors: the background is a saturated red, the skin is rendered in yellow, and shadows use deep blue/black — classic Pop Art color choices. All key visual characteristics from the prompt are present and unmistakable. VERDICT: PA
z-image-local/z-image-turboPASSEvaluator
The image depicts a forest scene rendered in a watercolor style, with visible paper texture and watercolor bleeds — particularly in the soft, diffused edges of foliage and the way colors bleed into one another. The lighting is atmospheric and somewhat photorealistic in its depth and shadowing, especially on the tree trunks and undergrowth, giving a sense of volume and natural illumination. However, the overall aesthetic is painterly and impressionistic rather than photorealistic in the strictest
z-image-local/z-image-turboPASSEvaluator
The image successfully embodies the requested “cyberpunk Art Nouveau” style. Key visual characteristics are clearly present: - **Neon colors**: The scene is saturated with vibrant neon hues — pinks, teals, purples, and golds — especially in the cityscape and the woman’s iridescent outfit. - **Tech elements**: The futuristic city features glowing skyscrapers, circuit-like lines, and a flying saucer, establishing a cyberpunk tech aesthetic. - **Flowing organic lines**: The Art Nouveau influence i
z-image-local/z-image-turboFAILEvaluator
The image depicts a Japanese castle (specifically, a traditional Japanese castle like Osaka Castle or a similar structure), which is architecturally accurate for Japan, not a medieval European castle. The prompt requested a “medieval castle” depicted in ukiyo-e style — a mismatch in subject matter. The ukiyo-e style is characterized by flat planes, bold outlines, stylized perspective, and often woodblock print textures — none of which are present here. Instead, the image is rendered in photoreal
z-image-local/z-image-turboPASSEvaluator
The image presents an interior scene dominated by raw, unpolished concrete surfaces — including walls, ceiling, and floor — which strongly evokes the aesthetic of brutalism. The furniture, a large curved sectional sofa and a matching armchair, are upholstered in a soft, rounded, pastel pink fabric, aligning with the requested “rounded pastel furniture.” The lighting is warm and diffused, with natural light entering from a window and a soft pendant lamp casting gentle illumination, contributing t
z-image-local/z-image-turboFAILEvaluator
The requested style — “anxious minimalism” with sparse composition, uncomfortable negative space, muted colors, and subtle visual tension — is not clearly recognizable in this image. The image is a nearly uniform gray field with no discernible objects, composition, or visual tension. While it does feature muted color (gray), it lacks the “sparse composition” and “uncomfortable negative space” because there is no subject or arrangement to create spatial dynamics. The absence of any visual element
z-image-local/z-image-turboPASSEvaluator
The image successfully evokes the requested “nostalgic futurism — retro-futuristic 1960s space-age aesthetics with a melancholic warm-toned patina.” Key visual characteristics are present: - **Retro-futurism**: The architecture features sleek, stylized skyscrapers with sharp lines and a sense of optimism about the future — reminiscent of 1960s sci-fi visions (e.g., the pointed spire building). The cars are classic 1960s American models with chrome details and rounded, optimistic forms. - **Warm