ImageBench

ImageBench V1

192 evaluations across 6 categories

Benchmark V1 verdicts are produced by VLM judges and can contain mistakes. Treat PASS/FAIL labels as machine-assisted assessments, and inspect the images yourself.
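The evaluator notes on this page end with a label of the form "VERDICT: PASS" or "VERDICT: FAIL". The following is a minimal sketch of how such a label might be pulled out of a judge's free-text reply, assuming that format holds. The function name `extract_verdict` is hypothetical and this is not the benchmark's actual pipeline.

```python
import re
from typing import Optional

# Assumption: the judge closes its reply with "VERDICT: PASS" or "VERDICT: FAIL".
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\b")


def extract_verdict(judge_text: str) -> Optional[str]:
    """Return the last PASS/FAIL label in the judge's reply, or None if absent."""
    matches = VERDICT_RE.findall(judge_text)
    return matches[-1] if matches else None


print(extract_verdict("All core criteria met. VERDICT: PASS"))
print(extract_verdict("The count does not match. VERDICT: FAIL"))
```

A reply that was cut off before the label yields `None`, which a caller could route to manual review rather than count as a pass or a fail.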

Pass: 147
Fail: 45
Pass Rate: 77%
Avg Latency: 9.9s
Min Latency: 9.6s
Max Latency: 10.8s
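The headline pass rate follows from the pass and fail counts in the summary. A small sketch of that arithmetic, using only the figures shown on this page (the helper name `pass_rate` is illustrative):

```python
def pass_rate(passed: int, failed: int) -> float:
    """Return the fraction of evaluations that passed."""
    total = passed + failed
    if total == 0:
        raise ValueError("no evaluations recorded")
    return passed / total


# Figures taken from the summary on this page.
rate = pass_rate(147, 45)
print(f"{147 + 45} evaluations, pass rate {rate:.1%}")
```

With 147 passes out of 192 evaluations the unrounded rate is about 76.6%, which the page displays as 77%.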
Text Rendering 80%
Spatial Reasoning 74%
Human realism 86%
Truthfulness 70%
Professional Studio 93%
Graphical design 54%
Latency 33%

Text Rendering 80%

Typography Style 100%

Typography Style measures whether local/boogu-image-turbo preserves requested lettering styles, font mood, and text texture such as typewriter ink or neon signage.

Easy 100%
[PASS] The word 'CHAPTER ONE' typed on aged paper with a vintage typewriter font, complete with slightly uneven ink

Evaluator (Qwen 3 VL)

Text reads “CHAPTER ONE” correctly. Font mimics vintage typewriter style with uneven, slightly smudged ink and paper texture. Aged paper and typewriter context are convincingly rendered. Minor artifacts (ink specks) enhance realism. All core criteria met. VERDICT: PASS

[PASS] The phrase 'TODAY'S SPECIAL' written in white chalk on a dark green chalkboard with chalk dust

Evaluator (Qwen 3 VL)

Text reads "TODAY'S SPECIAL" correctly. Letters show chalk texture, smudges, and dust on board and ledge, matching chalkboard medium. Chalk pieces and eraser are present. Style convincingly mimics white chalk on dark green surface. VERDICT: PASS

[PASS] The word 'OPEN' displayed on a red LED sign with glowing pixel-style letters

Evaluator (Qwen 3 VL)

The text "OPEN" is rendered in glowing red LED pixels, matching the described medium. Letters are clearly formed with a pixelated grid, and spelling is correct. The sign’s red casing and wiring further reinforce the LED sign aesthetic. All core criteria are met. VERDICT: PASS

Writing accuracy 75%

Writing accuracy measures whether local/boogu-image-turbo renders the exact requested words, numbers, labels, and short phrases without spelling or layout errors.

Easy 100%
[PASS] A single large word 'HELLO' centered on a white background

Evaluator (Qwen 3 VL)

The word "HELLO" is spelled correctly and fully readable, despite stylized, textured letterforms. Each letter is distinct and legible against the white background. No misspellings or missing characters are present. The artistic design does not obscure any letters. VERDICT: PASS

[PASS] The word 'ADVENTURE' in big bold letters centered on a plain cream background

Evaluator (Qwen 3 VL)

The word "ADVENTURE" is spelled correctly and fully readable, though stylized with cracks and stars. No missing or illegible parts. Background is cream-colored, though not plain due to texture and compass graphic. Text is centered. All letters are intact and discernible. VERDICT: PASS

[PASS] The word 'SERENITY' written large and centered on a solid light blue background

Evaluator (Qwen 3 VL)

The word "SERENITY" is fully spelled correctly and clearly readable. Each letter is distinct and legible against the solid light blue background. No misspellings, missing letters, or illegible parts are present. The text is centered as intended. VERDICT: PASS

Medium 100%
[PASS] The sentence 'The quick brown fox jumps high' in clean black text on a white background

Evaluator (Qwen 3 VL)

All words are spelled correctly and fully readable: "The", "quick", "brown", "fox", "jumps", "high". Text is clean black on white background. No misspellings or illegible parts. VERDICT: PASS

[PASS] 'Every sunset brings a new dawn ahead' written neatly on a pale yellow background

Evaluator (Qwen 3 VL)

The text "Every sunset brings a new dawn ahead" is fully legible and spelled correctly. All words match the intended phrase exactly. No misspellings, missing words, or illegible parts are present. The script is clear and readable against the pale yellow background. VERDICT: PASS

[PASS] 'Stars shine brightest in the dark night' displayed on a clean off-white background

Evaluator (Qwen 3 VL)

The text "Stars shine brightest in the dark night" is fully legible and spelled correctly. All words match the intended phrase exactly. The background is clean off-white with star motifs, matching the description. No misspellings or missing words detected. VERDICT: PASS

Hard 100%
[PASS] A poster with headline 'SUMMER SALE' in large bold text, body text 'Up to 50% off all items this weekend only' in medium size, and fine print 'Terms and conditions apply. See store for details.' at the bottom in small text

Evaluator (Qwen 3 VL)

All text matches the intended content exactly: "SUMMER SALE", "Up to 50% off all items this weekend only", and "Terms and conditions apply. See store for details." No misspellings, missing words, or illegible parts. Text is fully readable and correctly rendered. VERDICT: PASS

[PASS] A flyer with large title 'GRAND OPENING', subtitle 'Join us for food, music, and fun' in medium text, and small disclaimer 'RSVP required. Limited seating available.' at the bottom

Evaluator (Qwen 3 VL)

All text matches the intended message exactly: “GRAND OPENING”, “Join us for food, music, and fun”, and “RSVP required. Limited seating available.”. Every word is spelled correctly and fully readable, with no missing or illegible parts. Font sizes and layout align with the description. VERDICT: PASS

[PASS] An event banner with headline 'ANNUAL GALA 2025' at the top in large serif font, description 'An evening of fine dining and live jazz' in medium text, and footnote 'Black tie optional. Doors open at 6 PM.' in small text

Evaluator (Qwen 3 VL)

All text matches the intended content exactly: "ANNUAL GALA 2025", "An evening of fine dining and live jazz", and "Black tie optional. Doors open at 6 PM." No misspellings, missing words, or illegible parts. Font sizes and layout align with description. Text is fully readable. VERDICT: PASS

Extreme 0%
[FAIL] A vintage movie poster for 'THE LAST VOYAGE' featuring a tagline 'One ship. One chance. No turning back.' with credits block reading 'Directed by James Harlow. Starring Elena Voss, Marcus Chen, and Sofia Reyes. Music by Thomas Grant. A Paramount Pict

Evaluator (Qwen 3 VL)

The tagline “ONE SHIP. ONE HE CHANCE. CRNANC LO)TUICE. NO TURNING BACK.” contains misspellings: “HE” instead of “A”, and “CRNANC LO)TUICE” is garbled and unreadable. Credits block is mostly correct but “Starring” is misaligned and “Stars Voss” is illegible on the ship. Core text errors prevent full readability. VERDICT: FAIL

[FAIL] A magazine cover titled 'NATIONAL EXPLORER' with cover lines 'The Hidden Temples of Cambodia — page 34', 'Interview: Dr. Maria Santos on Climate Action', 'Top 10 Trails You Must Hike This Year', and a barcode area with 'Issue 247 | March 2025 | $6.99

Evaluator (Qwen 3 VL)

“Top 1ew” is misspelled (should be “Top 10”). “March 2025” in barcode reads “March 2023”. “Climate Action” appears twice incorrectly. “Dr. Maria Santos” is repeated in wrong context. Multiple text elements are misaligned or garbled. Core text is not fully accurate or readable. VERDICT: FAIL

[FAIL] A product label for 'GOLDEN HARVEST ORGANIC HONEY' with subtitle 'Raw & Unfiltered', weight '12 oz (340g)', description 'Pure wildflower honey harvested from sustainable apiaries in the Pacific Northwest. No additives or preservatives.', and small te

Evaluator (Qwen 3 VL)

“Northwest” is misspelled as “Northweest.” All other text matches the intended label exactly, including weight, description, and small print. The misspelling is clear and fully readable. VERDICT: FAIL
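The tier percentages in this category are consistent with simple pooled counts: three prompts per tier, with Writing accuracy at 9 of 12 (75%) and Text Rendering overall at 12 of 15 (80%). The sketch below reproduces those figures from the per-tier results listed above. Whether the site actually aggregates this way is an assumption, and `pooled_rate` is a hypothetical helper.

```python
from typing import Dict, Tuple

# (passed, total) per difficulty tier, read off the entries above.
Tiers = Dict[str, Tuple[int, int]]

typography_style: Tiers = {"Easy": (3, 3)}
writing_accuracy: Tiers = {
    "Easy": (3, 3),
    "Medium": (3, 3),
    "Hard": (3, 3),
    "Extreme": (0, 3),
}


def pooled_rate(*groups: Tiers) -> float:
    """Pool pass counts across every tier of every group (assumed aggregation)."""
    passed = sum(p for group in groups for p, _ in group.values())
    total = sum(t for group in groups for _, t in group.values())
    return passed / total


print(f"Writing accuracy: {pooled_rate(writing_accuracy):.0%}")
print(f"Text Rendering: {pooled_rate(typography_style, writing_accuracy):.0%}")
```

Pooling weights every prompt equally, so a subcategory with more tiers contributes more to the category figure than one with a single tier.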

Spatial Reasoning 74%

Attributes Binding 100%

Attributes Binding measures whether local/boogu-image-turbo attaches the right colors, materials, and properties to the correct requested objects.

Easy 100%
[PASS] A red ball and a blue cube sitting side by side on a white surface

Evaluator (Qwen 3.5 122B)

The image shows two objects on a white surface near a window. On the left is a red spherical object with visible seams and wear — consistent with a ball. On the right is a blue cubic object with chipped paint and edges — consistent with a cube. Colors, shapes, and positions match the prompt: red ball on left, blue cube on right. No attribute swaps or misbindings observed. VERDICT: PASS

[PASS] A green apple and an orange pumpkin on a wooden cutting board

Evaluator (Qwen 3.5 122B)

The image shows a green apple and an orange pumpkin on a wooden cutting board. The apple is correctly green, round, and appropriately sized relative to the pumpkin. The pumpkin is correctly orange, round, and larger than the apple. Both objects are placed on a rustic wooden cutting board with visible grain and knife marks. No attribute swaps or misidentifications are present. VERDICT: PASS

[PASS] A yellow taxi and a black limousine parked next to each other on a street

Evaluator (Qwen 3.5 122B)

The image shows a yellow taxi and a black limousine parked next to each other on a street. The taxi is correctly yellow and has the typical taxi shape with a roof sign. The limousine is correctly black and has the elongated shape of a limousine. All attributes (color, size, shape) are correctly bound to the right objects with no swaps. VERDICT: PASS

Medium 100%
[PASS] A large red circle, a small blue triangle, and a medium green star arranged on a white canvas

Evaluator (Qwen 3.5 122B)

The image shows a white canvas with three painted shapes: - A large red circle at the top — matches prompt. - A medium-sized blue triangle at bottom left — prompt says “small,” but visually it’s comparable in scale to the star; however, color and shape are correct. - A medium green star at bottom right — matches prompt. All colors and shapes are correctly assigned. Size descriptors (“large,” “small,” “medium”) are subjective but reasonably aligned. No swaps or misattributions. VERDICT: PASS

[PASS] A tall thin white candle, a short fat red candle, and a medium spiral yellow candle on a shelf

Evaluator (Qwen 3.5 122B)

The image shows three candles on a wooden shelf: 1. Tall thin white candle – correctly matches description (tall, thin, white). 2. Short fat red candle – correctly matches description (short, fat, red). 3. Medium spiral yellow candle – correctly matches description (medium height, spiral shape, yellow). All attributes are correctly bound to the right objects with no swaps. VERDICT: PASS

[PASS] A big brown horse, a small white rabbit, and a medium-sized black cat standing in a meadow

Evaluator (Qwen 3.5 122B)

The image shows a large brown horse with a black mane and tail, standing in a meadow. A small white rabbit is sitting near the horse’s front legs. A medium-sized black cat stands nearby, facing forward. All attributes match the prompt: horse is big and brown, rabbit is small and white, cat is medium and black. No swaps or misattributions observed. VERDICT: PASS

Hard 100%
[PASS] A pink fire truck, a blue banana, and a green sun in a surreal landscape

Evaluator (Qwen 3.5 122B)

The image contains a pink fire truck with correct features (ladder, lights, wheels). A large blue banana is present in the foreground. A green sun with a smiling face is visible in the sky. All requested objects are present with their specified colors and attributes correctly bound. No swaps or missing elements detected. VERDICT: PASS

[PASS] A purple carrot, a red tree with red leaves, and a white watermelon sliced open

Evaluator (Qwen 3.5 122B)

The image contains: 1. Purple carrot: Correctly colored purple with green leafy top, elongated shape. 2. Red tree: Small potted tree with red leaves and reddish trunk. 3. White watermelon: Sliced open, showing pink/red interior with black seeds, white rind. All attributes are correctly bound to their respective objects with no swaps. VERDICT: PASS

[PASS] An orange elephant, a yellow penguin, and a silver strawberry arranged on a table

Evaluator (Qwen 3.5 122B)

The image shows three objects on a wooden table: 1. Elephant: Orange-brown ceramic, correct color and shape. 2. Penguin: Yellow and white plush, correct color and shape. 3. Strawberry: Silver metallic with green leaves, correct color and shape. All attributes are correctly bound to the right objects with no swaps. VERDICT: PASS

Compositionality 89%

Compositionality measures whether local/boogu-image-turbo combines subjects, actions, outfits, props, and scene details into one coherent requested image.

Easy 100%
[PASS] A tabby cat wearing oversized red sunglasses

Evaluator (Qwen 3.5 122B)

The image contains: - Tabby cat: Present (clearly visible with tabby markings) - Oversized red sunglasses: Present (large red frames on cat's face) - Coherent combination: Present (sunglasses fit naturally on cat's face) All described elements are present and coherently combined. The cat is wearing the red sunglasses in a believable manner, and the scene is well-composed with appropriate lighting and background elements. VERDICT: PASS

[PASS] A golden retriever wearing a tiny cowboy hat

Evaluator (Qwen 3.5 122B)

The image contains: - Golden retriever: Present and clearly visible - Cowboy hat: Present on the dog's head - Tiny: The hat appears appropriately sized for the dog All described elements are present and coherently combined. The golden retriever is wearing a cowboy hat that fits its head proportionally, creating a unified and logical composition. VERDICT: PASS

[PASS] A penguin holding a colorful umbrella

Evaluator (Qwen 3.5 122B)

The image shows a penguin in the foreground holding a colorful umbrella. The umbrella has multiple bright colors including red, yellow, blue, and orange. The penguin is standing on icy ground with other penguins in the background. All described elements are present and coherently combined - the penguin is clearly holding the umbrella with its flipper/wing, and the umbrella is open above it protecting from rain. VERDICT: PASS

Medium 100%
[PASS] A medieval knight in full armor sitting at a desk typing on a modern laptop

Evaluator (Qwen 3.5 122B)

The image contains all required elements: 1. Medieval knight in full armor - PRESENT (full plate armor with helmet) 2. Sitting at a desk - PRESENT (wooden table/desk) 3. Typing on a modern laptop - PRESENT (silver laptop with visible screen and keyboard) The concepts are coherently combined with the knight's armored hands positioned on the laptop keyboard, creating a believable anachronistic scene. The medieval setting (stone walls, candles, scrolls) contrasts appropriately with the modern tech

[PASS] A Roman gladiator taking a selfie with a smartphone in the Colosseum

Evaluator (Qwen 3.5 122B)

The image contains all required elements: 1. Roman gladiator - Present (wearing helmet, armor, red tunic) 2. Selfie action - Present (holding phone up, smiling at camera) 3. Smartphone - Present (modern device showing his reflection) 4. Colosseum - Present (ancient Roman amphitheater in background) All elements are coherently combined in a single scene with proper spatial relationships and lighting consistency. VERDICT: PASS

[PASS] An astronaut in a spacesuit riding a bicycle through a park

Evaluator (Qwen 3.5 122B)

The image shows: - Astronaut in a spacesuit: Present (white suit with American flag patches, helmet with reflective visor) - Bicycle: Present (blue frame, standard bike design) - Park setting: Present (trees, grass, benches, fountain, people in background) - Coherent combination: Yes (astronaut is actively riding the bike on a park path) All elements are present and logically combined. VERDICT: PASS

Hard 67%
[PASS] An octopus wearing a top hat and monocle playing chess on the surface of the moon with Earth visible in the background

Evaluator (Qwen 3.5 122B)

The image contains all described elements: 1. Octopus - Present (central figure) 2. Top hat - Present (black, on octopus head) 3. Monocle - Present (gold-rimmed, on octopus eye) 4. Chess game - Present (board with pieces on lunar surface) 5. Moon surface - Present (cratered terrain) 6. Earth in background - Present (visible in space) All elements are coherently combined in a surreal but visually consistent composition. VERDICT: PASS

[PASS] A dinosaur in a business suit giving a PowerPoint presentation in a modern office while drinking coffee

Evaluator (Qwen 3.5 122B)

The image shows: - Dinosaur: Present (T-Rex head/body) - Business suit: Present (dark suit, tie, pocket square) - PowerPoint presentation: Present (on screen with charts/maps) - Modern office: Present (glass walls, conference table, laptops) - Drinking coffee: Present (holding white mug) All elements are coherently combined in a surreal but visually consistent scene. The dinosaur's anatomy blends with human posture and clothing naturally. VERDICT: PASS

[FAIL] A robot dressed as a pirate sailing a paper boat on a lake of lava while juggling three kittens

Evaluator (Qwen 3.5 122B)

The image contains: - Robot dressed as a pirate: YES (wearing pirate hat, coat, belt) - Paper boat: YES (white origami-style boat) - Lake of lava: YES (glowing molten surface with volcanoes) - Juggling three kittens: PARTIALLY — there are FIVE kittens floating around, not three, and they’re not being juggled — they’re suspended mid-air in a surreal arrangement Core concept is present but “juggling three kittens” is inaccurate — both number and action mismatch. VERDICT: FAIL

Counting 44%

Counting measures whether local/boogu-image-turbo creates the exact number of requested objects without adding or missing items.

Easy 67%
[PASS] Exactly 3 red apples arranged on a wooden table

Evaluator (Qwen 3.5 122B)

The image shows exactly three red apples placed on a rustic wooden table. The apples are arranged in a loose triangular formation near the center-left of the frame. No other apples are visible. The count matches the prompt’s requirement precisely. VERDICT: PASS

[FAIL] Exactly 5 yellow rubber ducks sitting in a row on a bathroom shelf

Evaluator (Qwen 3.5 122B)

The image shows four yellow rubber ducks arranged in a row on a wooden bathroom shelf. The prompt requested exactly five ducks. Since only four are visible, the count does not match the requirement. VERDICT: FAIL

[PASS] Exactly 4 white candles standing on a dark surface

Evaluator (Qwen 3.5 122B)

The image displays four white pillar candles, all lit, standing upright on a dark wooden surface. The count of candles is exactly four, and they are white and standing on a dark surface as requested. No other candle-like objects are present. The core criteria — “Exactly 4 white candles standing on a dark surface” — is fully met. VERDICT: PASS

Medium 33%
[FAIL] 2 orange cats and 3 black dogs sitting together on a grassy lawn

Evaluator (Qwen 3.5 122B)

The image shows 1 orange cat and 2 black dogs sitting on a grassy lawn. The prompt requested 2 orange cats and 3 black dogs. The counts do not match — there is one fewer cat and one fewer dog than specified. Therefore, the image fails to meet the exact object count requirement. VERDICT: FAIL

[FAIL] 4 red roses and 2 white daisies in a glass vase on a table

Evaluator (Qwen 3.5 122B)

The image shows 3 red roses and 3 white daisies in a glass vase on a wooden table. The prompt requested 4 red roses and 2 white daisies. The counts do not match — there is one fewer rose and one extra daisy than specified. Therefore, the core requirement of exact object count is not met. VERDICT: FAIL

[PASS] 1 blue butterfly and 3 yellow butterflies resting on a garden fence

Evaluator (Qwen 3.5 122B)

The image shows 1 blue butterfly and 3 yellow butterflies resting on a wooden garden fence, exactly matching the requested count. All butterflies are clearly visible and positioned as described. No extra or missing butterflies are present. The scene accurately reflects the prompt’s specifications. VERDICT: PASS

Hard 33%
[FAIL] Exactly 9 goldfish swimming in a large round fishbowl

Evaluator (Qwen 3.5 122B)

I count 10 goldfish in the image: one near the top right, two in the upper middle, one on the left middle, one center, one right middle, three along the bottom, and one partially visible behind the glass on the far left. The prompt requested exactly 9 goldfish. Since there are 10, the count does not match. VERDICT: FAIL

[PASS] A fruit bowl containing exactly 3 bananas, 4 oranges, and 2 apples

Evaluator (Qwen 3.5 122B)

The image shows a fruit bowl with: - 3 bananas (correct) - 4 oranges (correct) - 2 apples (correct) All counts match the prompt exactly. The composition is accurate and visually coherent. VERDICT: PASS

[FAIL] Exactly 11 colorful balloons floating against a clear blue sky

Evaluator (Qwen 3.5 122B)

The image shows a cluster of colorful balloons against a clear blue sky. Counting them reveals approximately 18–20 balloons, not 11. The prompt requested exactly 11, so the count does not match. While the scene is visually accurate (colorful balloons, blue sky), the numerical requirement is violated. VERDICT: FAIL
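The Counting verdicts above come down to comparing the requested number of each object with the number the judge reports seeing. As an illustration only, the comparison can be sketched as follows. The counts are the ones quoted in the evaluator notes, the benchmark itself relies on a VLM judge rather than code like this, and `count_mismatches` is a hypothetical helper.

```python
from typing import Dict, Tuple


def count_mismatches(
    requested: Dict[str, int], observed: Dict[str, int]
) -> Dict[str, Tuple[int, int]]:
    """Map each object whose counts differ to (requested, observed)."""
    mismatches = {}
    for name in sorted(set(requested) | set(observed)):
        want = requested.get(name, 0)
        got = observed.get(name, 0)
        if want != got:
            mismatches[name] = (want, got)
    return mismatches


# Counts as reported by the judge for two of the Medium prompts above.
cats_and_dogs = count_mismatches(
    {"orange cat": 2, "black dog": 3},
    {"orange cat": 1, "black dog": 2},
)
butterflies = count_mismatches(
    {"blue butterfly": 1, "yellow butterfly": 3},
    {"blue butterfly": 1, "yellow butterfly": 3},
)
print(cats_and_dogs)  # non-empty, so the prompt would fail
print(butterflies)    # empty, so the prompt would pass
```

An empty result corresponds to a PASS under an exact-count rule. Any missing or extra object produces an entry, which matches how the notes above treat both undercounts and overcounts as failures.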

Negation 33%

Negation measures whether local/boogu-image-turbo respects what should be absent from the image, not only what should be present.

Easy 67%
[PASS] A tropical beach scene with no people

Evaluator (Qwen 3.5 122B)

The image depicts a serene tropical beach scene with palm trees, thatched-roof huts, lounge chairs, and sailboats on calm turquoise water. The sand is pristine with gentle waves lapping the shore. There are no visible people anywhere in the frame — not on the beach, under the huts, or in the boats. The exclusion criterion (“no people”) is fully satisfied. VERDICT: PASS

[FAIL] A cat lying on a couch without a collar

Evaluator (Qwen 3.5 122B)

The image shows an orange and white cat lying on a patterned couch. The cat is wearing a dark collar with a small metal ring or tag attached. Since the prompt explicitly excluded “a cat lying on a couch without a collar,” and the cat in the image clearly has a collar, the excluded element is present. VERDICT: FAIL

[PASS] A classroom with desks and a chalkboard but no students

Evaluator (Qwen 3.5 122B)

The image shows an empty classroom with wooden desks arranged in rows and a large chalkboard on the wall with writing on it. There are no students visible in the room. The scene is quiet and unoccupied, matching the requirement of having no students present. VERDICT: PASS

Medium 0%
[FAIL] A sandy beach with palm trees, seashells, and a beach umbrella but no water or ocean visible

Evaluator (Qwen 3.5 122B)

The image clearly shows a sandy beach with palm trees, seashells, and a beach umbrella — as requested. However, the prompt explicitly excluded “water or ocean visible,” and in this image, the ocean is prominently visible in the background, stretching across the horizon with waves breaking on shore. This violates the exclusion criterion. VERDICT: FAIL

[FAIL] An office workspace with a desk, chair, bookshelf, and lamp but no computers or screens

Evaluator (Qwen 3.5 122B)

The image clearly shows a computer monitor (an Apple iMac) on the desk, displaying a document or webpage. The prompt explicitly excluded “computers or screens,” yet a screen is prominently present. This violates the exclusion criterion. VERDICT: FAIL

[FAIL] A kitchen with pots, pans, a stove, and a cutting board but no food

Evaluator (Qwen 3.5 122B)

The image shows a kitchen with hanging copper and cast iron pots, a gas stove, and a wooden cutting board on an island. However, the cutting board clearly has chopped food items on it — diced potatoes, a small tomato, and a piece of cucumber. Since the prompt explicitly excluded “no food,” and food is visibly present, the exclusion criterion is violated. VERDICT: FAIL

Hard 33%
[FAIL] A pepperoni pizza with tomato sauce, basil, and crust but absolutely no cheese

Evaluator (Qwen 3.5 122B)

The image shows a pepperoni pizza with tomato sauce, basil leaves, and a crust. However, there is clearly melted white cheese (likely mozzarella) distributed across the pizza surface, beneath and around the pepperoni slices. The prompt explicitly excluded cheese, but cheese is visibly present in the image. VERDICT: FAIL

[PASS] A car body with windows, doors, and headlights but no wheels

Evaluator (Qwen 3.5 122B)

The image shows a vintage car body with windows, doors, and headlights as described. Critically, the car has no wheels — the wheel wells are empty, exposing suspension components or just darkness where tires should be. This matches the exclusion criteria: “no wheels” is accurately represented. The car is clearly in a state of disassembly or restoration, parked on a roadside with tools nearby. No wheels are present anywhere on the vehicle. VERDICT: PASS

[FAIL] A human face with eyes, nose, and mouth but no eyebrows

Evaluator (Qwen 3.5 122B)

The image shows a human face with eyes, nose, and mouth. However, the excluded element — eyebrows — are clearly present. Both eyebrows are visible above the eyes, with natural hair texture and shape. Since the prompt explicitly required no eyebrows, their presence violates the constraint. VERDICT: FAIL

Relative Position 83%

Relative Position measures whether local/boogu-image-turbo places objects in the requested spatial relationship, such as above, below, inside, or next to another object.

Easy 100%
[PASS] A cat sitting on top of a cardboard box

Evaluator (Qwen 3.5 122B)

The image shows a ginger and white cat sitting directly on top of an open cardboard box. The box is positioned on the floor in a living room setting. The spatial relationship described in the prompt — “a cat sitting on top of a cardboard box” — is accurately represented. All positional relationships are correct. VERDICT: PASS

[PASS] A red ball underneath a wooden chair

Evaluator (Qwen 3.5 122B)

The image shows a red ball positioned directly underneath the seat of a wooden chair, resting on the floor between the chair’s legs. The spatial relationship “underneath” is accurately depicted — the ball is below the chair’s seat and enclosed within the leg frame. All described positional relationships are correct. VERDICT: PASS

[PASS] A coffee mug next to an open book on a desk

Evaluator (Qwen 3.5 122B)

The image shows a wooden desk with two coffee mugs and an open book. One mug is positioned to the left of the open book, and another mug is behind it, near a pair of glasses. The spatial relationship described in the prompt — "a coffee mug next to an open book on a desk" — is accurately represented. All positional relationships are correct. VERDICT: PASS

Medium 100%
[PASS] A bird perched on a branch, the branch extending from a tree next to a park bench

Evaluator (Qwen 3.5 122B)

The image shows a bird perched on a mossy branch that extends from a large tree trunk. The branch curves downward and is positioned above a wooden park bench, which sits on the ground beneath it. The spatial relationships described — bird “on” branch, branch “from” tree, tree “next to” bench — are all accurately represented. The bench is below and slightly to the right of the branch’s end, consistent with “next to.” All positional cues match the prompt. VERDICT: PASS

[PASS] A cat sitting on a pillow, the pillow placed on a wooden chair

Evaluator (Qwen 3.5 122B)

The image shows an orange cat lying curled up on a floral-patterned pillow. The pillow is placed directly on the seat of a rustic wooden chair. The spatial relationships described — “a cat sitting on a pillow, the pillow placed on a wooden chair” — are accurately represented. All positional relationships (on, under) are correct as per the prompt. VERDICT: PASS

[PASS] A candle on top of a stack of books, the books next to a potted plant

Evaluator (Qwen 3.5 122B)

The image shows a candle placed on top of a stack of books, which matches the prompt. The stack of books is positioned to the right of a potted plant, also matching the prompt’s spatial description. All specified positional relationships — “on top of” and “next to” — are accurately represented in the image. VERDICT: PASS

Hard 67%
[FAIL] Five chess pieces on a board: a king in the center, a queen to its left, a bishop to its right, a rook above the king, and a knight below the king

Evaluator (Qwen 3.5 122B)

The image shows five prominent chess pieces arranged on a board. There is a white king in the center, a black queen to its left, and a white pawn (not a bishop) to its right. A black king is behind (above) the white king, and a brown knight is in front (below) the white king. The prompt incorrectly identifies the piece to the right as a bishop and the piece above as a rook. Therefore, not all positional relationships and piece identifications are correct. VERDICT: FAIL

PASS
A desk scene: laptop in the center, coffee mug to the left, phone to the right, lamp above, and notebook below

Evaluator (Qwen 3.5 122B)

The image shows a desk scene with a laptop centered, a coffee mug to its left, a phone to its right, a lamp positioned above and to the right of the laptop, and an open notebook below the laptop. All specified spatial relationships — left, right, above, below — are accurately represented in the image. The arrangement matches the prompt’s description precisely. VERDICT: PASS

PASS
Five animals in a field: a horse on the far left, a cow to the right of the horse, a sheep in the center, a goat to the right of the sheep, and a dog on the far right

Evaluator (Qwen 3.5 122B)

The image shows five animals arranged left to right: a brown horse grazing on the far left, a black-and-white cow to its right, a white sheep in the center, a brown-and-white goat to the sheep’s right, and a golden dog sitting on the far right. All specified spatial relationships match the prompt exactly — each animal is positioned as described relative to its neighbor. No positional errors are present. VERDICT: PASS

Extreme 67%
PASS
A landscape with a red barn in the foreground at large scale, a windmill in the midground at medium scale, and snow-capped mountains in the background at small scale, all with consistent depth perspective

Evaluator (Qwen 3.5 122B)

The image shows a red barn in the foreground, large and detailed, with a windmill positioned to its right in the midground at a smaller scale. Snow-capped mountains appear in the background, smaller still, creating consistent depth perspective. All spatial relationships described — foreground/midground/background scaling and relative positioning — are accurately represented. VERDICT: PASS

PASS
A beach scene with a sandcastle in the foreground, a surfer on the waves in the midground, and a cruise ship on the horizon in the background, sizes decreasing with distance

Evaluator (Qwen 3.5 122B)

The image shows a beach scene with a sandcastle in the foreground, a surfer riding a wave in the midground, and a cruise ship on the horizon in the background. The sizes of the objects decrease with distance as specified. All described positional relationships are correct. VERDICT: PASS

FAIL
A city park with a park bench and pigeons in the foreground, a fountain in the midground, and skyscrapers in the background, each layer progressively smaller

Evaluator (Qwen 3.5 122B)

The image shows a park bench and pigeons in the foreground, a fountain in the midground, and skyscrapers in the background — matching the prompt’s layered spatial description. However, the prompt specifies “each layer progressively smaller,” which is not visually accurate: the fountain appears larger than the distant buildings due to perspective, but the skyscrapers are not scaled down relative to the fountain in a way that reflects true proportional diminution across layers. The spatial relatio

Scale & Proportions 89%

Scale & Proportions measures whether local/boogu-image-turbo keeps people, vehicles, buildings, and everyday objects at believable relative sizes.

Easy 100%
PASS
A person standing next to a yellow school bus, showing realistic relative scale

Evaluator (Qwen 3.5 122B)

The person’s height relative to the bus appears realistic — roughly 1/3 to 1/2 the height of the bus’s side panel, which is consistent with real-world proportions. The bus’s features (windows, door, mirrors) are scaled appropriately to human size. No obvious distortions or scale mismatches are visible. VERDICT: PASS

PASS
A ladybug sitting on a large green leaf, shown at realistic proportions

Evaluator (Qwen 3.5 122B)

The ladybug appears appropriately sized relative to the leaf, consistent with real-world proportions. The leaf’s veins, water droplets, and texture are rendered at a scale that supports the insect’s realistic size. No obvious distortions or scaling errors are present. The composition maintains natural spatial relationships between the bug and its environment. VERDICT: PASS

PASS
A child holding a basketball, both at correct relative sizes

Evaluator (Qwen 3.5 122B)

The child’s hands are holding the basketball, and the ball appears appropriately sized relative to the child’s torso and arms — consistent with a real-world child holding a standard basketball. No anatomical or proportional distortions are evident. The background elements (gym floor, bleachers, hoop) also scale logically. VERDICT: PASS

Medium 100%
PASS
A tiny person standing on the rim of a coffee cup, looking down into it

Evaluator (Qwen 3.5 122B)

The image shows a miniature figure standing on the rim of a coffee cup, which aligns with the prompt's description of "a tiny person standing on the rim of a coffee cup." The proportions are consistent with the intended size relationship, as the person is appropriately scaled to appear tiny relative to the cup. The scene is coherent and matches the described scenario. VERDICT: PASS

PASS
A giant tabby cat walking between city skyscrapers like a kaiju

Evaluator (Qwen 3.5 122B)

The image depicts a giant tabby cat towering over city skyscrapers, cars, and pedestrians. The cat’s paws are larger than the vehicles, and its body height exceeds the surrounding buildings, consistent with the “kaiju” description. However, the Empire State Building (visible in background) appears smaller than the cat — which contradicts real-world scale unless the cat is truly colossal. Since the prompt explicitly asks for “giant cat walking between skyscrapers like a kaiju,” the exaggerated pr

PASS
A miniature house sitting on the palm of someone's hand

Evaluator (Qwen 3.5 122B)

The image shows a miniature house resting on an open human palm. The house is appropriately sized to fit comfortably within the hand, with details like windows, doors, and moss visible at a scale consistent with being held. The surrounding tools and workshop environment also appear proportionally correct relative to the hand and house. The size relationship between the house and the hand is accurate and consistent with the prompt. VERDICT: PASS

Hard 67%
PASS
A group photo of an elephant, a horse, a dog, a cat, and a mouse, all standing in a line with correct real-world proportional sizes

Evaluator (Qwen 3.5 122B)

The image shows five animals in a line: elephant, horse, dog, cat, and mouse. Their relative sizes are correctly proportioned according to real-world biology — the elephant is largest, followed by the horse, then dog, cat, and mouse smallest. All are standing on the same ground plane with consistent perspective. No anatomical or scaling errors are visible. VERDICT: PASS

PASS
A dinner table scene with a plate, fork, knife, wine glass, salt shaker, and peppercorn, all at realistic relative sizes

Evaluator (Qwen 3.5 122B)

The image displays a dinner table setting with a plate, fork, knife, wine glass, salt shaker, and peppercorns. The relative sizes appear realistic: the plate is appropriately sized for a dinner setting, the cutlery is proportionate to the plate, the wine glass is standard size relative to the other items, and the salt shaker and scattered peppercorns are correctly scaled. No obvious distortions or size inconsistencies are present. VERDICT: PASS

FAIL
A farm scene with a barn, tractor, farmer, cow, chicken, and a fence post, all at correct proportional scale

Evaluator (Qwen 3.5 122B)

The image displays a farm scene with a barn, tractor, farmer, cow, chicken, and fence post. The relative sizes appear mostly consistent: the barn is largest, followed by the tractor and cow, then the farmer, with the chicken being smallest. However, the farmer appears slightly too small relative to the cow and tractor, and the chicken seems disproportionately large compared to typical farm chickens. These minor inconsistencies affect overall proportional accuracy. VERDICT: FAIL
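
The tier and subcategory percentages for Scale & Proportions are consistent with a plain pass count. The page does not state how ImageBench aggregates scores, so the pooled passes-over-total formula below is an assumption, and the names `verdicts` and `pass_rate` are made up for illustration. A minimal Python sketch that reproduces the figures from the nine verdicts listed in this subcategory:

```python
# Verdicts copied from the Scale & Proportions entries, grouped by difficulty tier.
verdicts = {
    "Easy": ["PASS", "PASS", "PASS"],
    "Medium": ["PASS", "PASS", "PASS"],
    "Hard": ["PASS", "PASS", "FAIL"],
}


def pass_rate(labels):
    """Return the share of PASS labels as a whole-number percentage.

    Assumption: the page rounds passes / total to the nearest integer.
    """
    passes = sum(label == "PASS" for label in labels)
    return round(100 * passes / len(labels))


# Per-tier rates, then one pooled rate over every verdict in the subcategory.
tier_rates = {tier: pass_rate(labels) for tier, labels in verdicts.items()}
overall = pass_rate([label for labels in verdicts.values() for label in labels])

print(tier_rates)  # {'Easy': 100, 'Medium': 100, 'Hard': 67}
print(overall)     # 89
```

Because every tier here holds three prompts, averaging the tier rates gives the same 89%, so these numbers alone cannot show which of the two aggregations the benchmark uses.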

Human realism 86%

Faces & Expressions 100%

Faces & Expressions measures whether local/boogu-image-turbo creates realistic faces with the requested age, emotion, expression, and identity cues.

Easy 100%
PASS
A close-up portrait of a young woman with a warm natural smile

Evaluator (Qwen 3 VL)

Face is symmetric with natural features. Smile appears warm and genuine, with crinkled eyes and relaxed lips conveying authentic joy. No significant distortions; lighting and texture look realistic. Slight hair strands across face add to naturalism. VERDICT: PASS

PASS
A portrait of a middle-aged man with a calm, serious expression

Evaluator (Qwen 3 VL)

Face is symmetric with natural, correctly proportioned features. Expression is calm and serious, with subtle brow lines and neutral mouth conveying intent. No visible distortions. Lighting and texture are realistic. Meets all criteria for the prompt. VERDICT: PASS

PASS
A headshot of a teenager with a relaxed, neutral face

Evaluator (Qwen 3 VL)

The face is symmetric with correctly proportioned features. The teenager’s expression is relaxed and neutral, with no strong emotion. No noticeable distortions or artifacts are present. Lighting and detail are naturalistic, matching a realistic headshot. VERDICT: PASS

Medium 100%
PASS
A close-up of a person's face showing clear disgust, nose wrinkled and lip curled

Evaluator (Qwen 3 VL)

Face is symmetric with natural features. Disgust is clearly conveyed: nose wrinkled, lips curled, brows furrowed. No significant distortions; expression is authentic and readable. Skin texture and lighting are realistic. VERDICT: PASS

PASS
A portrait capturing genuine surprise — raised eyebrows, wide eyes, slightly open mouth

Evaluator (Qwen 3 VL)

Face is symmetric with natural features. Eyes are wide, eyebrows raised, mouth slightly open — clearly conveying genuine surprise. No distortions or unnatural artifacts. Lighting and detail are realistic. Emotion is unmistakable and well-executed. VERDICT: PASS

PASS
A face showing quiet melancholy — downcast eyes, slightly drawn mouth, distant gaze

Evaluator (Qwen 3 VL)

Face is symmetric with natural, correctly rendered features. Eyes are downcast, mouth slightly drawn, gaze distant — all aligning with the prompt’s melancholy description. No visible distortions. Lighting and texture enhance the emotional realism. VERDICT: PASS

Hard 100%
PASS
An elderly woman in her 80s with deep wrinkles and white hair, smiling warmly with kind eyes

Evaluator (Qwen 3 VL)

The face is symmetric with natural, aged features: deep wrinkles, white hair, and a warm, genuine smile. Eyes crinkle naturally, conveying kindness. No significant distortions; lighting and texture are realistic. The emotion is clearly readable as joyful and warm. VERDICT: PASS

PASS
A 5-year-old child with round cheeks and large eyes, crying with tears streaming down their face

Evaluator (Qwen 3 VL)

The child’s face is symmetric with natural features: round cheeks, large eyes, and tears streaming down. The crying expression is clearly readable, with furrowed brows, an open mouth, and visible tears. No significant distortions are present. The image accurately fulfills the prompt’s requirements. VERDICT: PASS

PASS
An elderly man in his 70s with age spots and thinning gray hair, looking skeptically over reading glasses

Evaluator (Qwen 3 VL)

Face is symmetric with realistic elderly features: age spots, thinning gray hair, and reading glasses. Skeptical expression is clearly readable via furrowed brow and downturned mouth. No significant distortions. Lighting and texture are natural. All prompt elements are accurately rendered. VERDICT: PASS

Extreme 100%
PASS
A group of three friends at a table: one laughing with head tilted back, one looking shocked with wide eyes, and one rolling their eyes with a smirk

Evaluator (Qwen 3 VL)

Faces are symmetric with correct anatomical features. Emotions are clearly readable: left person laughing (head back), center person shocked (wide eyes, open mouth), right person smirking with rolled eyes. No distortions or unnatural artifacts. All three expressions match the prompt precisely. VERDICT: PASS

PASS
A family photo with four people: a father beaming with pride, a mother smiling gently, a teenager looking bored, and a child grinning excitedly

Evaluator (Qwen 3 VL)

Faces are symmetric with correct anatomical features. Emotions are clearly readable: father beams, mother smiles gently, teenager looks bored, child grins excitedly. No distortions or unnatural artifacts. All four individuals match the prompt’s emotional descriptions accurately. VERDICT: PASS

PASS
Three coworkers in a meeting: one looking frustrated with furrowed brows, one nodding with a pleased expression, and one looking confused with a tilted head

Evaluator (Qwen 3 VL)

Faces are symmetric with correct features. Emotions are clearly readable: frustrated (left, furrowed brows), pleased (center, smiling, nodding), confused (right, tilted head, raised brows). No distortions. All three expressions match the prompt accurately. VERDICT: PASS

Full Body 100%

Full Body measures whether local/boogu-image-turbo renders full-body people with natural anatomy, clothing, pose, and proportions.

Easy 100%
PASS
A woman standing upright in a relaxed pose, arms at her sides, wearing jeans and a white t-shirt

Evaluator (Qwen 3 VL)

The woman stands upright with arms at her sides, wearing a white t-shirt and jeans, matching the prompt. Limb count and proportions are anatomically correct. Pose is relaxed and recognizable. No significant distortions observed. VERDICT: PASS

PASS
A man standing straight facing the camera, hands in pockets, normal proportions

Evaluator (Qwen 3 VL)

The man stands straight, facing the camera, with hands in pockets. Limb count and proportions are anatomically correct. The pose is clearly recognizable as described. No distortions are visible. All elements match the prompt accurately. VERDICT: PASS

PASS
A person standing in a neutral pose in a park, full body visible from head to feet

Evaluator (Qwen 3 VL)

The person’s limbs are correctly counted and proportioned. The neutral standing pose is anatomically accurate and recognizable. No significant distortions are visible. The full body from head to feet is clearly depicted in a park setting as requested. VERDICT: PASS

Medium 100%
PASS
A woman mid-stride running through a park, hair and clothes showing motion

Evaluator (Qwen 3 VL)

The woman’s limbs are correctly proportioned and anatomically accurate. Her running pose is dynamic and recognizable, with one leg forward, arms bent mid-stride, and hair flowing backward. Motion is convincingly depicted in her clothing and hair. No distortions or anatomical errors are visible. VERDICT: PASS

PASS
A man captured mid-jump over a puddle, body in the air with bent knees

Evaluator (Qwen 3 VL)

The man’s limbs are correctly counted and proportioned. His pose—bent knees, mid-air jump over a puddle—is anatomically plausible and matches the prompt. Water splashes realistically. No distortions in body structure or proportions are visible. The action is clearly recognizable as jumping over water. VERDICT: PASS

PASS
A dancer in a dynamic spin, dress flowing outward with one leg extended

Evaluator (Qwen 3 VL)

The dancer’s limbs are correctly proportioned and count is accurate. The pose—dynamic spin with one leg extended and dress flowing—is anatomically plausible and recognizable as a ballet turn. No distortions in limbs or body structure are visible. The motion and fabric dynamics are convincingly rendered. VERDICT: PASS

Hard 100%
PASS
A person in a yoga tree pose — standing on one leg with the other foot pressed to the inner thigh, arms raised overhead

Evaluator (Qwen 3 VL)

The subject stands on one leg, with the other foot pressed against the inner thigh, arms raised overhead — accurately depicting the tree pose. Limb count and proportions are correct. No anatomical distortions are visible. The pose is clearly recognizable and executed with proper form. VERDICT: PASS

PASS
A rock climber clinging to a cliff face with arms and legs extended to different holds

Evaluator (Qwen 3 VL)

The climber has correct limb count and proportions. The pose is anatomically plausible for rock climbing, with arms and legs extended to holds. Musculature and joint angles appear natural. No obvious distortions. The action is clearly recognizable as dynamic rock climbing. VERDICT: PASS

PASS
A cellist seated with correct posture, legs around the instrument, bow arm extended

Evaluator (Qwen 3 VL)

The cellist has correct limb count and proportions. Her posture is anatomically sound: seated, legs around the cello, bow arm extended. The action is clearly recognizable as playing the cello. No distortions are visible. All elements of the prompt are accurately depicted. VERDICT: PASS

Extreme 100%
PASS
Two tango dancers in a close embrace, the man dipping the woman backward with intertwined legs

Evaluator (Qwen 3 VL)

The dancers’ limbs are correctly counted and proportioned. The woman’s body is arched backward with intertwined legs, matching the tango dip pose. Anatomical structure is accurate, with no visible distortions. The action is clearly recognizable as a classic tango embrace and dip. VERDICT: PASS

PASS
Two soccer players mid-tackle, one sliding and the other jumping over with the ball at their feet

Evaluator (Qwen 3 VL)

Both players show correct limb count and proportions. The sliding player’s body posture and the jumping player’s mid-air pose are anatomically plausible and match the described action. The ball is near the jumping player’s foot, capturing the tackle moment accurately. No significant distortions are visible. VERDICT: PASS

PASS
Two gymnasts performing a synchronized handstand, side by side with identical body alignment

Evaluator (Qwen 3 VL)

Both gymnasts perform synchronized handstands with identical body alignment. Limb count and proportions are anatomically correct. The pose is clearly recognizable as a handstand on the floor. No distortions are visible. VERDICT: PASS

Hands 50%

Hands measures whether local/boogu-image-turbo draws anatomically plausible hands, fingers, and gestures.

Easy 33%
FAIL
A person standing with both hands visible at their sides, fingers relaxed and naturally spread

Evaluator (Qwen 3.5 122B)

The left hand (viewer’s right) shows 5 fingers, all anatomically plausible. The right hand (viewer’s left) also has 5 fingers, but the thumb appears abnormally short and stubby, and the index finger is slightly malformed or fused at the tip. While finger count is correct, anatomical correctness is compromised. VERDICT: FAIL

PASS
A woman holding her hands out in front of her palms-up, all ten fingers clearly visible

Evaluator (Qwen 3.5 122B)

The woman’s hands are held palms-up with fingers spread. Left hand: 5 fingers visible (thumb, index, middle, ring, pinky). Right hand: also 5 fingers clearly shown. All digits appear anatomically correct — no extra, missing, or fused fingers. Gesture is natural and matches the prompt. No anomalies detected. VERDICT: PASS

FAIL
A man with arms crossed, both hands resting on opposite upper arms with fingers visible

Evaluator (Qwen 3.5 122B)

The left hand (on the right side of image) shows 4 fingers clearly, but the thumb is missing or fused — only 4 digits visible. The right hand (on left side of image) also appears to have only 4 fingers; the pinky is indistinct or merged. Neither hand has exactly 5 fingers. Grip is anatomically plausible but digit count is incorrect. Anomalies: missing/fused fingers on both hands. VERDICT: FAIL

Medium 33%
PASS
A person holding a white ceramic coffee mug by the handle with their right hand

Evaluator (Qwen 3.5 122B)

The hand holding the mug is anatomically correct with 5 fingers visible: thumb, index, middle, ring, and pinky. The grip on the handle is natural and realistic. No anomalies detected in finger count or positioning. The sweater cuff and skin texture also appear consistent with a real human hand. VERDICT: PASS

FAIL
A woman writing in a notebook with a pen gripped correctly in her right hand

Evaluator (Qwen 3.5 122B)

The right hand holding the pen shows 5 fingers, but the grip is slightly unnatural — the thumb and index finger appear fused or overly close, and the middle finger seems to merge with the ring finger. The left hand resting on the notebook also has 5 fingers, but they are unnaturally elongated and lack clear knuckle definition. Both hands show subtle anatomical distortions typical of AI generation. VERDICT: FAIL

FAIL
A person playing an acoustic guitar with left hand on the fretboard and right hand strumming

Evaluator (Qwen 3.5 122B)

The right hand (strumming) shows a thumb and three fingers clearly, but the pinky is either missing or fused — only 4 distinct digits visible. The left hand (fretting) has 5 fingers, but the index finger appears to merge with the thumb or fretboard in an unnatural way, and the ring/pinky positioning is awkward. Anatomical correctness fails on both hands due to digit count and gesture anomalies. VERDICT: FAIL

Hard 33%
FAIL
A person making a peace sign with their right hand, clearly showing two fingers up and three curled

Evaluator (Qwen 3.5 122B)

The hand making the peace sign has five fingers total, but the anatomy is distorted: the thumb appears fused or malformed, and the curled fingers are unnaturally shaped and positioned. The gesture is recognizable but not anatomically correct — fingers lack proper joint definition and spacing. This violates the requirement for anatomical correctness. VERDICT: FAIL

PASS
A hand giving a thumbs-up with four fingers curled and thumb extended upward

Evaluator (Qwen 3.5 122B)

The image shows one hand giving a thumbs-up gesture. The thumb is extended upward, and the other four fingers are curled into a fist — anatomically correct for this gesture. No anomalies in finger count or structure are visible on the single hand shown. Since only one hand is present and it matches the prompt’s description (4 fingers curled, 1 thumb up), the core criteria is met. VERDICT: PASS

FAIL
A person counting to three on their fingers, with index, middle, and ring fingers extended

Evaluator (Qwen 3.5 122B)

The visible hand shows three extended fingers (index, middle, ring) and two curled fingers (pinky, thumb), totaling five — anatomically correct for a “count to three” gesture. However, the thumb is partially obscured and appears fused or malformed at the base, lacking clear separation from the palm. The wristband also looks unnaturally textured and attached. These anomalies suggest AI generation artifacts. VERDICT: FAIL

Extreme 100%
PASS
Two people shaking hands in a professional setting, all fingers of both hands anatomically correct

Evaluator (Qwen 3.5 122B)

Upon close examination of the handshake:
- Left hand (from viewer’s perspective): 5 fingers visible — thumb, index, middle, ring, pinky — all appear anatomically correct and properly positioned in the grip.
- Right hand: 5 fingers also visible — thumb wrapped over, four fingers curled underneath — no extra or missing digits; grip is natural and realistic.
No anomalies detected. Both hands show exactly 5 fingers with correct anatomy and gesture. VERDICT: PASS

PASS
Two people doing a high-five, palms meeting with all fingers extended on both hands

Evaluator (Qwen 3.5 122B)

The image shows two people high-fiving. The man’s hand (left) has 5 fingers clearly visible and extended. The woman’s hand (right) also shows 5 fingers, but the thumb is partially obscured by the man’s hand — however, it’s still anatomically plausible as part of a natural high-five grip. No extra or missing fingers; gesture is correct. Minor occlusion doesn’t violate anatomical correctness. VERDICT: PASS

PASS
One person handing a small gift box to another person, both sets of hands gripping the box correctly

Evaluator (Qwen 3.5 122B)

Upon close examination, the hands appear anatomically correct with five fingers per hand. The grip on the gift box is natural and consistent with the action of handing over a present. No visible anomalies such as extra fingers, missing digits, or distorted joints are present. The lighting and focus support clear visibility of hand structure. VERDICT: PASS
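
Each complete evaluator note on this page closes with a `VERDICT: PASS` or `VERDICT: FAIL` token, while a few notes are cut off before reaching it. The page does not describe how labels are derived from the notes, so the following is only a sketch of one way to read the token back out when inspecting results by hand. The function name `extract_verdict` and the regular expression are illustrative assumptions, not part of ImageBench.

```python
import re

# Match a trailing "VERDICT: PASS" or "VERDICT: FAIL" at the end of a note.
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\s*$")


def extract_verdict(evaluator_text):
    """Return "PASS" or "FAIL", or None when the note has no closing verdict
    (for example, because the text was truncated)."""
    match = VERDICT_RE.search(evaluator_text.strip())
    return match.group(1) if match else None


# Short excerpts from evaluator notes on this page.
complete_pass = "No distortions are visible. VERDICT: PASS"
complete_fail = "Anomalies: missing/fused fingers on both hands. VERDICT: FAIL"
truncated = "The spatial relatio"

print(extract_verdict(complete_pass))  # PASS
print(extract_verdict(complete_fail))  # FAIL
print(extract_verdict(truncated))      # None
```

Returning `None` for a truncated note keeps a missing verdict distinct from a FAIL, which matters when cross-checking the labels against the images.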

Multi-Subject 100%

Multi-Subject measures whether local/boogu-image-turbo keeps multiple people distinct while preserving their requested attributes, outfits, and positions.

Easy 100%
PASS
Two women standing side by side: one with short blonde hair wearing a red jacket, the other with long black hair wearing a blue dress

Evaluator (Qwen 3 VL)

Two women stand side by side. Left: short blonde hair, red jacket over striped shirt. Right: long black hair, blue dress. Both are visually distinct and match the prompt exactly. Features are accurate and well-defined. VERDICT: PASS

PASS
Two men at a cafe: one with a beard and glasses wearing a green sweater, the other clean-shaven in a black hoodie

Evaluator (Qwen 3 VL)

The two men match the description: one has a beard and glasses, wearing a green patterned sweater; the other is clean-shaven, in a black hoodie. Both are visually distinct and clearly identifiable. Background figures are unrelated. All prompt criteria are satisfied. VERDICT: PASS

PASS
Two friends posing: one tall with curly red hair in denim overalls, the other short with straight brown hair in a yellow sundress

Evaluator (Qwen 3 VL)

Tall person: curly red hair, denim overalls over striped shirt. Short person: straight brown hair, yellow floral sundress. Both visually distinct and match descriptions precisely. VERDICT: PASS

Medium 100%
PASS
Four people at a dinner table: a bald man in a suit, a woman with silver hair in a red blouse, a young man with dreadlocks in a denim jacket, and a woman with a hijab in a green dress

Evaluator (Qwen 3 VL)

Bald man in suit: present, smiling, left side. Silver-haired woman in red blouse: present, center-left. Young man with dreadlocks in denim jacket: present, center-right. Woman with hijab in green dress: present, right side. All visually distinct and match descriptions precisely. VERDICT: PASS

PASS
Three musicians on stage: a tall woman with short pink hair playing bass, a man with a beard and bandana on drums, and a slim man with glasses playing keyboard

Evaluator (Qwen 3 VL)

Tall woman: short pink hair, playing bass — matches. Man with beard and bandana: on drums — matches. Slim man with glasses: playing keyboard — matches. All three are visually distinct and correctly positioned. Features align precisely with prompt. VERDICT: PASS

PASS
Three hikers on a trail: a man with a backpack and sunhat, a woman with braided hair and hiking poles, and a teenager with a baseball cap and water bottle

Evaluator (Qwen 3 VL)

Man: wears sunhat and backpack. Woman: has braided hair, holds hiking poles, wears backpack. Teenager: wears baseball cap, holds water bottle, wears backpack. All three are visually distinct and match described features exactly. VERDICT: PASS

Truthfulness 70%

Photorealism 100%

Photorealism measures whether local/boogu-image-turbo makes materials, surfaces, and real-world objects look visually plausible and distinct.

Medium 100%
PASS
A still life on a wooden table featuring a clear glass of water, a polished silver spoon, a folded linen napkin, and a ripe red apple — each material rendered with accurate texture and light behavior

Evaluator (Qwen 3.6 27B)

The image successfully renders the requested materials with high fidelity. The glass shows accurate refraction and transparency, the spoon exhibits sharp metallic reflections, the apple has a realistic waxy sheen, and the linen napkin displays proper fabric texture and soft folds. The wooden table features convincing grain and wear. Lighting is consistent across all objects, creating natural shadows and highlights that enhance the realism of each material’s physical properties. VERDICT: PASS

PASS
A workspace desk with a brushed aluminum laptop, a leather-bound notebook, a ceramic coffee mug, and a glass paperweight — all materials visually distinct and realistic

Evaluator (Qwen 3.6 27B)

The image successfully renders the requested materials with high fidelity. The brushed aluminum laptop shows appropriate metallic sheen and texture. The leather notebook displays realistic grain and thickness. The ceramic mug has a convincing matte, glazed finish. The glass paperweight exhibits correct refraction and reflection of the surroundings. The wooden desk surface shows natural grain and lighting interaction. All materials are visually distinct and realistic. VERDICT: PASS

PASS
A kitchen counter with a stainless steel pot, a wooden cutting board, a glass bottle of olive oil, and a cotton dish towel — each surface reflecting light correctly

Evaluator (Qwen 3.6 27B)

The image successfully renders the distinct material properties requested. The stainless steel pot and countertop exhibit sharp, accurate reflections. The wooden cutting board shows natural grain and a wet sheen where the oil has been poured. The glass bottle demonstrates correct translucency and refraction of the oil inside. The cotton towel displays appropriate fabric texture and soft folds. All surfaces interact with light realistically. VERDICT: PASS

Physics & Reflections 75%

Physics & Reflections measures whether local/boogu-image-turbo follows basic physical cues such as shadows, reflections, gravity, transparency, and contact points.

Easy 33%
PASS
A red sphere on a white surface with a single shadow cast to the right, consistent with a light source from the upper left

Evaluator (Qwen 3.6 27B)

The red sphere casts a distinct shadow to the right and slightly downward, which is consistent with a light source positioned in the upper left. The shadow’s shape and softness are physically plausible for a diffuse light source. The sphere’s surface shows appropriate highlights and shading, enhancing its three-dimensional appearance. No distortions or inconsistencies in color or orientation are observed. VERDICT: PASS

FAIL
A coffee mug on a sunny windowsill casting a sharp shadow to the left

Evaluator (Qwen 3.6 27B)

The light source is clearly coming from the window on the left side of the frame. Consequently, the mug casts a shadow to the right, not the left as specified in the prompt. While the shadow's sharpness and color are physically plausible for the lighting conditions, the directional error makes the image fail the specific criteria of the prompt. VERDICT: FAIL

FAIL
A person standing outdoors at midday with a short shadow directly below them

Evaluator (Qwen 3.6 27B)

The shadow is cast directly beneath the subject, consistent with midday lighting. The direction and orientation align with the sun’s position overhead. However, the shadow appears slightly distorted—particularly around the feet and hat brim—with unnatural softness and irregular edges that don’t match the sharpness of surrounding elements. Color is appropriately dark gray/black, but the shape lacks physical precision. VERDICT: FAIL
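
The three Easy prompts above all turn on the same piece of geometry: on flat ground, a shadow falls on the side opposite the light, and it gets shorter as the light climbs higher. The sketch below only illustrates that rule under a simplified distant-light model. `shadow_tip` and its parameters are hypothetical names, not part of ImageBench, and the judges on this page assess images visually rather than with code like this.

```python
import math


def shadow_tip(height, toward_light_xy, elevation_deg):
    """Return the ground-plane (x, y) offset of the shadow tip from the object's base.

    height          -- object height (any unit; the result uses the same unit)
    toward_light_xy -- ground-plane direction pointing from the object toward the light
    elevation_deg   -- angle of the light above the horizon, in degrees (0 < angle <= 90)
    """
    if not 0 < elevation_deg <= 90:
        raise ValueError("elevation must be in (0, 90] degrees")
    lx, ly = toward_light_xy
    norm = math.hypot(lx, ly)
    if norm == 0:
        raise ValueError("toward_light_xy must be a non-zero direction")
    # Shadow length on flat ground: height / tan(elevation).
    length = height / math.tan(math.radians(elevation_deg))
    # The shadow points away from the light, so the direction is negated.
    return (-lx / norm * length, -ly / norm * length)


# Light on the left (negative x): the shadow tip lands at positive x, i.e. to the right.
mug_shadow = shadow_tip(1.0, (-1.0, 0.0), 45.0)

# Near-overhead midday sun (80 degrees, an illustrative value): a short shadow.
midday_shadow = shadow_tip(1.8, (0.0, 1.0), 80.0)
```

Under this model, a light on the left can only produce a shadow to the right, which is the inconsistency the judge flagged in the coffee mug image.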

Medium 67%
FAIL
A woman standing in front of a full-length mirror, her reflection accurately showing the back of her outfit and reversed orientation

Evaluator (Qwen 3.6 27B)

The reflection shows the woman’s front view, which is physically correct for a mirror facing her. However, the prompt specifically requested the reflection to show the “back of her outfit,” which is physically impossible in a standard mirror setup. Since the image contradicts the specific prompt requirement regarding the reflection's content, it fails the evaluation criteria. VERDICT: FAIL

PASS
A cat sitting on a polished marble floor with its reflection visible below

Evaluator (Qwen 3.6 27B)

The reflection is physically plausible. It correctly mirrors the cat’s orientation and position, with appropriate distortion due to the floor’s curvature and lighting. The color and brightness match the real cat, slightly dimmed as expected in a reflection. Shadows align with the light source from the left. No obvious artifacts or inconsistencies are present. VERDICT: PASS

PASS
A vase of flowers on a glass table with a clear reflection visible on the table surface

Evaluator (Qwen 3.6 27B)

The reflection on the glass table is physically plausible. It correctly mirrors the vase and flowers with appropriate vertical inversion and slight distortion due to the glass thickness and viewing angle. The lighting direction from the window casts consistent shadows and highlights on the flowers, vase, and surrounding furniture. The reflection’s brightness and color match the real objects, and the transparency of the glass allows both the reflection and the table’s wooden legs to be visible si

Hard 100%
PASS
A row of trees reflected in a calm lake at sunset, with the reflection slightly darker and gently rippled

Evaluator (Qwen 3.6 27B)

The reflections are vertically inverted, matching the trees' positions and shapes accurately. The water surface shows gentle ripples that distort the reflection slightly, consistent with a calm lake. The reflected colors are darker than the actual trees, which is physically plausible due to light absorption and scattering in water. The sunset glow is also mirrored appropriately. No directional or orientation errors are present. VERDICT: PASS

PASS
A city skyline reflected in a rain puddle on a street, with mild distortion and color shift

Evaluator (Qwen 3.6 27B)

The reflections in the puddle are physically plausible. They exhibit a correct vertical inversion of the skyline and streetlights. The mild distortion and blurring are consistent with the surface tension and ripples of a rain puddle. The color shift towards warmer tones in the reflection compared to the cooler sky is also realistic, mimicking how water surfaces reflect ambient light differently. VERDICT: PASS

PASS
A mountain range reflected in a still alpine lake, with the reflection inverted and subtly muted in color

Evaluator (Qwen 3.6 27B)

The reflection is physically plausible. It is correctly inverted vertically, matching the mountain range's orientation. The colors are subtly muted and slightly darker than the actual mountains, consistent with light absorption in water. The surface is calm, allowing for a clear mirror image with minimal distortion, which aligns with the "still alpine lake" description. The lighting on the peaks corresponds accurately to their reflections. VERDICT: PASS

Extreme 100%
PASS
A clear glass sphere sitting on a mirror surface near a pool of water, showing refraction through the sphere, reflection on the mirror, and water reflection simultaneously

Evaluator (Qwen 3.6 27B)

The image displays a glass sphere on a reflective surface near a pool. The reflection of the sphere on the mirror is correctly oriented and positioned directly beneath it. The refraction within the sphere shows an inverted, wide-angle view of the surroundings, which is physically accurate for a glass sphere. The lighting and shadows are consistent with a bright outdoor setting. The water ripples and their interaction with the light appear natural. The colors and distortions align with optical ph

PASS
A chrome sculpture in a room with a glass floor and large windows, showing multiple reflections and environmental mapping on every surface

Evaluator (Qwen 3.6 27B)

The reflections on the chrome sculpture and glass floor are physically plausible, showing accurate environmental mapping with distorted views of the room and windows. The rainbow prismatic effects on the walls and floor are consistent with light refraction through glass or prisms. Shadows align with the light source direction from the windows. The orientation and distortion of reflections match the curved surfaces realistically. VERDICT: PASS

PASS
A crystal wine glass on a polished marble counter next to a window overlooking a lake — showing refraction in the glass, reflection on the marble, and the lake's own reflective surface

Evaluator (Qwen 3.6 27B)

The reflections and refractions are physically plausible. The wine glass shows realistic distortion of the background through its curved, faceted surface. The marble countertop displays a clear, slightly blurred reflection of the glass stem and base, consistent with a polished surface. The lake outside exhibits a mirror-like reflection of the distant hills, appropriate for calm water. Lighting direction is consistent across all elements, with highlights and shadows aligning logically. VERDICT:
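
The 75% figure for Physics & Reflections can be cross-checked against the verdicts listed in this section. The sketch below assumes every prompt counts equally and that percentages are rounded to the nearest whole number; the page does not state its aggregation method, so treat this as a consistency check rather than the benchmark's own code. `pass_rate` is a hypothetical helper name.

```python
from fractions import Fraction

# PASS/FAIL verdicts as listed for each difficulty tier in this section.
verdicts = {
    "Easy": ["PASS", "FAIL", "FAIL"],
    "Medium": ["FAIL", "PASS", "PASS"],
    "Hard": ["PASS", "PASS", "PASS"],
    "Extreme": ["PASS", "PASS", "PASS"],
}


def pass_rate(labels):
    """Return the share of PASS labels as a whole-number percentage."""
    return round(100 * Fraction(labels.count("PASS"), len(labels)))


# Per-tier rates, then the unweighted rate over all twelve prompts.
per_tier = {tier: pass_rate(labels) for tier, labels in verdicts.items()}
overall = pass_rate([label for labels in verdicts.values() for label in labels])

print(per_tier)  # {'Easy': 33, 'Medium': 67, 'Hard': 100, 'Extreme': 100}
print(overall)   # 75
```

Nine passes out of twelve prompts gives 75%, matching the category figure, and the per-tier values match the tier headings.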

World Knowledge 58%

World Knowledge measures whether local/boogu-image-turbo depicts recognizable places, landmarks, flags, and known objects with accurate real-world details.

Easy 100%
PASS
The Eiffel Tower in Paris on a clear day, architecturally accurate with its iron lattice structure and three levels

Evaluator (Qwen 3.6 27B)

The Eiffel Tower is clearly recognizable and depicted on a clear day. The iron lattice structure is detailed, and the three levels are present. However, there are minor architectural inaccuracies: the top spire/antenna is missing, and the base arches appear slightly wider than in reality. Despite these small deviations, the overall representation is factually sound and architecturally coherent for a general depiction. VERDICT: PASS

PASS
The Taj Mahal in Agra seen from the reflecting pool, with accurate dome, minarets, and symmetry

Evaluator (Qwen 3.6 27B)

The image depicts the Taj Mahal with high fidelity. The central dome, four minarets, and arched entrances are architecturally accurate and symmetrically aligned. The reflection in the pool is clear and correctly mirrors the structure. The surrounding gardens and cypress trees match the real-world layout. No significant distortions or inaccuracies are present. The lighting and perspective enhance realism without compromising factual representation. VERDICT: PASS

PASS
The Statue of Liberty on her island pedestal, with correct torch, crown, tablet, and green patina

Evaluator (Qwen 3.6 27B)

The Statue of Liberty is clearly recognizable and accurately depicted with correct green patina, torch, crown, tablet, and pedestal. Minor background elements (e.g., distant buildings, flag) are plausible. No major architectural or factual inaccuracies detected. VERDICT: PASS

Medium 33%
FAIL
A traditional Japanese shamisen instrument with accurate three-stringed design, wooden body, and bachi plectrum

Evaluator (Qwen 3.6 27B)

The image depicts a shamisen, but it contains significant factual inaccuracies. The bridge is structurally nonsensical, appearing as a floating wooden piece rather than resting on the skin. The tuning pegs are oversized and oddly shaped, lacking the traditional mechanical design. The bachi (plectrum) is incorrectly positioned on the body instead of being held or placed separately. These structural errors make the instrument factually inaccurate. VERDICT: FAIL

FAIL
A medieval astrolabe with accurate rete, tympan, and engraved degree markings on brass

Evaluator (Qwen 3.6 27B)

The image depicts a complex brass instrument that resembles an astrolabe but contains significant factual inaccuracies. The internal mechanism features large, exposed clockwork gears and a central hand, which are not components of a traditional medieval astrolabe (which relies on a flat rete and tympan). The degree markings and symbols appear nonsensical and gibberish-like rather than accurate astronomical data. While visually detailed, it fails the criteria for factual accuracy. VERDICT: FAIL

PASS
A Japanese kintsugi bowl — cracked ceramic repaired with visible gold lacquer lines

Evaluator (Qwen 3.6 27B)

The image accurately depicts a Japanese kintsugi bowl with visible gold lacquer repairs on cracked ceramic. The subject is clearly recognizable, and the artistic technique is correctly represented. The surrounding elements (chopsticks, tea utensils) enhance the cultural context without detracting from the main subject. No factual or architectural inaccuracies are present. VERDICT: PASS

Hard 0%
FAIL
A detailed diagram of the water cycle showing evaporation from the ocean, condensation forming clouds, precipitation as rain, and collection in rivers flowing back to the sea

Evaluator (Qwen 3.6 27B)

The image depicts the water cycle with recognizable elements like the sun, ocean, clouds, and river. However, it contains significant text errors typical of AI generation, such as "ITHEI" instead of "THE," "FORE. CLOUDS," and "RAINN." These typos undermine the factual accuracy and clarity required for a detailed diagram. While the visual flow is correct, the textual inaccuracies make it fail the criteria for a high-quality, accurate educational diagram. VERDICT: FAIL

FAIL
A cross-section showing tectonic plates at a convergent boundary with subduction, a trench, volcanic arc, and layered mantle

Evaluator (Qwen 3.6 27B)

The image depicts the correct geological concept of a subduction zone, showing the oceanic plate diving beneath the continental plate, the resulting trench, and the volcanic arc. However, the text labels are nonsensical gibberish (e.g., "TONICGENT REOT SODIFFNT" instead of "Convergent Boundary"), which is a significant failure in rendering text. Despite the visual accuracy of the geological structures, the illegible and incorrect text makes the image factually flawed as an educational diagram.

FAIL
An anatomical illustration of the human heart showing four chambers, valves, aorta, pulmonary arteries, and veins with correct placement

Evaluator (Qwen 3.6 27B)

The image is not factually accurate. The labels are nonsensical gibberish (e.g., "Oorta", "Corta", and illegible Chinese characters), rendering the diagram useless for educational purposes. Furthermore, the anatomy is incorrect: the aorta is depicted as a blue vessel (indicating deoxygenated blood), while the pulmonary artery is red, reversing the standard color coding and physiological flow. The vessel branching and chamber proportions are also distorted. VERDICT: FAIL

Extreme 100%
PASS
A Morpho menelaus butterfly with accurate iridescent blue dorsal wings, brown ventral wing pattern with eyespots, and correct body morphology

Evaluator (Qwen 3.6 27B)

The image depicts a butterfly that closely resembles a Morpho species, featuring the characteristic iridescent blue dorsal wings and brown ventral patterns with eyespots. The body morphology appears correct for a butterfly in this genus. The details such as wing venation, coloration, and overall structure are accurate and recognizable. There are no significant inaccuracies noted. VERDICT: PASS

PASS
The Hagia Sophia interior showing the massive central dome, pendentives, semi-domes, and Islamic calligraphy medallions alongside Byzantine mosaics

Evaluator (Qwen 3.6 27B)

The image depicts the interior of the Hagia Sophia with high architectural accuracy. The massive central dome, pendentives, semi-domes, and the arrangement of columns and arches are correctly represented. The Islamic calligraphy medallions (featuring names of Allah, Muhammad, etc.) and the Byzantine mosaics (such as the seraphim on the pendentives) are present and appropriately placed. The lighting and perspective convey the grandeur of the space. No significant inaccuracies are observed. VERDI

PASS
A specimen of bismuth crystal showing accurate iridescent oxide layers, geometric hopper crystal formation, and characteristic staircase structure

Evaluator (Qwen 3.6 27B)

The image depicts a bismuth crystal with high fidelity. It accurately portrays the characteristic geometric hopper crystal formation, featuring distinct staircase-like structures and sharp, angular facets. The iridescent oxide layers are rendered with realistic coloration—showing vibrant greens, blues, purples, and golds—that align with real-world bismuth specimens. Minor surface imperfections and natural variations in the oxide film enhance authenticity. The subject is clearly recognizable and

Professional Studio 93%

Camera & Lighting 92%

Camera & Lighting measures whether local/boogu-image-turbo follows specific lens, angle, focus, exposure, and lighting instructions.

Easy 100%
PASS
A wide-angle shot of a modern living room interior showing expansive space with slight barrel distortion at the edges

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. It features a wide-angle shot of a modern, expansive living room with high ceilings. Most importantly, it exhibits clear barrel distortion at the edges (visible in the curved lines of the ceiling and floor), which was a specific requirement. The lighting is bright and natural, coming from the large floor-to-ceiling windows, creating an airy and open atmosphere. VERDICT: PASS

PASS
A close-up macro shot of a single dewdrop on a rose petal, shallow depth of field

Evaluator (Gemma 4 26B)

The image features a close-up macro shot of a single prominent dewdrop on a pink rose petal. The perspective is a macro view, and the depth of field is shallow, with the background and the edges of the petal appearing softly blurred. The lighting is bright and natural, highlighting the transparency and reflections within the main droplet. All specified camera and lighting elements are accurately represented. VERDICT: PASS

PASS
An aerial bird's-eye view looking straight down at a traffic roundabout with cars and trees

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. It features a true aerial bird's-eye view, looking directly down (nadir perspective) at a traffic roundabout. The perspective is flat and perpendicular to the ground, with a deep depth of field that keeps the entire scene—from the central greenery to the surrounding roads and trees—in sharp focus. The lighting is even and natural, consistent with a high-altitude aerial shot. VERDICT: PASS

Medium 100%
PASS
A portrait lit with classic Rembrandt lighting — triangle of light on the shadowed cheek, moody atmosphere

Evaluator (Gemma 4 26B)

The image successfully implements Rembrandt lighting. There is a distinct, characteristic triangle of light on the shadowed cheek (the subject's left cheek), created by a light source positioned at an angle. The lighting creates a high-contrast, moody atmosphere with deep shadows, and the shallow depth of field focuses attention on the facial features, meeting all the specific requirements of the prompt. VERDICT: PASS

PASS
A landscape photograph bathed in golden hour light — long warm shadows, orange sky, soft glow

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt's requirements. The lighting is a distinct golden hour, characterized by a vibrant orange sky and a warm, soft glow emanating from the sun. There are long, dark shadows cast across the foreground and midground, and the low-angle perspective creates a strong sense of depth. The light flare and warm tones effectively capture the requested atmosphere. VERDICT: PASS

PASS
A high-key portrait of a person against a bright white background, minimal shadows, airy and luminous

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. It features a high-key portrait with a bright, white, overexposed background. The lighting is airy, luminous, and soft, with almost no visible shadows on the subject's face. The perspective is a close-up portrait, and the lighting pattern creates the desired ethereal, high-key aesthetic. VERDICT: PASS

Hard 67%
PASS
A portrait shot at 85mm f/1.4 with creamy bokeh background, backlit with warm rim light separating the subject from a blurred garden

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. The perspective is a classic portrait shot. The depth of field is shallow, creating a creamy bokeh background with soft, circular light orbs. The lighting features a distinct warm rim light (backlighting) that illuminates the subject's hair and shoulders, effectively separating her from the blurred garden background. VERDICT: PASS

FAIL
A street photograph with shallow depth of field as if shot at 50mm f/1.8, subject sharp with soft city lights in the background

Evaluator (Gemma 4 26B)

The image fails to meet the prompt's core requirement. The prompt specifies a "subject sharp" with a shallow depth of field. However, in this image, the subjects in the foreground are completely out of focus and blurry. Instead of a sharp subject with a blurred background, the entire scene—including the subjects—is heavily blurred, failing the instruction for a sharp subject. VERDICT: FAIL

PASS
A food photograph shot from above at 35mm f/2.8 with natural window light from the left, slight vignette, warm color temperature

Evaluator (Gemma 4 26B)

The image matches all specified criteria. The perspective is a high-angle shot (from above) looking down at the plate. The depth of field is shallow, consistent with an f/2.8 aperture, as the foreground and background (window and glasses) are softly blurred. The lighting is natural light coming from the window on the left, creating soft shadows. The color temperature is warm, and there is a subtle vignette around the edges. VERDICT: PASS
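
The Hard prompts above name specific focal lengths and apertures. As a rough guide to how much blur each setting implies, the sketch below uses the standard thin-lens depth-of-field approximation. The 0.03 mm circle of confusion (a common full-frame convention) and the 2 m subject distance are assumptions chosen for illustration. Neither appears in the prompts, and the judges here evaluate blur by eye, not by calculation. `depth_of_field_mm` is a hypothetical helper name.

```python
def depth_of_field_mm(focal_mm, f_number, subject_mm, coc_mm=0.03):
    """Approximate total depth of field in millimetres for a thin lens.

    coc_mm is the circle of confusion; 0.03 mm is an assumed full-frame value.
    """
    # Hyperfocal distance: focusing here keeps everything to infinity acceptably sharp.
    hyperfocal = focal_mm ** 2 / (f_number * coc_mm) + focal_mm
    if subject_mm >= hyperfocal:
        return float("inf")
    # Near and far limits of acceptable sharpness around the subject distance.
    near = subject_mm * (hyperfocal - focal_mm) / (hyperfocal + subject_mm - 2 * focal_mm)
    far = subject_mm * (hyperfocal - focal_mm) / (hyperfocal - subject_mm)
    return far - near


SUBJECT_MM = 2000.0  # assumed subject distance of 2 m, for comparison only

# Lens settings taken from the three Hard prompts.
settings = {
    "85mm f/1.4": (85, 1.4),
    "50mm f/1.8": (50, 1.8),
    "35mm f/2.8": (35, 2.8),
}
zones = {name: depth_of_field_mm(f, n, SUBJECT_MM) for name, (f, n) in settings.items()}

# List the settings from the thinnest in-focus zone to the deepest.
for name, zone in sorted(zones.items(), key=lambda item: item[1]):
    print(f"{name}: about {zone:.0f} mm in focus")
```

At the same subject distance, 85mm f/1.4 gives the thinnest in-focus zone of the three and 35mm f/2.8 the deepest, which is why the portrait prompt expects the strongest background blur.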

Extreme 100%
PASS
A tilt-shift photograph of a city street making real-life look like a miniature model — selective focus strip, saturated colors

Evaluator (Gemma 4 26B)

The image perfectly executes the tilt-shift effect. It features a high-angle perspective looking down a city street, creating a miniature model appearance. The depth of field is highly selective, with a sharp focus strip in the middle ground and heavy blur (bokeh) in both the foreground and background. The colors are highly saturated and vibrant, matching the prompt's requirements for a miniature aesthetic. VERDICT: PASS

PASS
A cinematic frame with blue and orange anamorphic lens flare streaking horizontally across a dark moody scene

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. It features a dark, moody scene with a shallow depth of field focusing on the woman's face. Most importantly, it displays prominent blue and orange anamorphic lens flares that streak horizontally across the frame, exactly as described. The lighting and cinematic perspective are highly consistent with the requested aesthetic. VERDICT: PASS

PASS
A portrait with dramatic split lighting — half the face brightly lit, the other half in deep shadow, high contrast chiaroscuro

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt's requirements. It features a close-up portrait with dramatic split lighting. One side of the face is brightly illuminated, while the other side is cast in deep, heavy shadow, creating a high-contrast chiaroscuro effect. The lighting pattern is sharp and distinct, successfully dividing the face into light and dark halves as requested. VERDICT: PASS

Color Precision 100%

Color Precision measures whether local/boogu-image-turbo renders requested colors clearly and consistently across subjects, products, and backgrounds.

Easy 100%
PASS
A bright red sports car parked on a gray asphalt road — the red is vivid and unambiguous

Evaluator (Gemma 4 26B)

The image features a sports car in a bright, vivid, and unambiguous red color. The car is parked on a gray asphalt road, which matches the prompt's description perfectly. The colors are highly saturated and clearly defined as requested. VERDICT: PASS

PASS
A woman wearing a royal blue evening dress against a neutral background

Evaluator (Gemma 4 26B)

The woman is wearing a vibrant, deep blue evening dress that perfectly matches the description of "royal blue." The background is a solid, muted beige/grey, which qualifies as a "neutral background." All color requirements specified in the prompt are accurately represented in the image. VERDICT: PASS

PASS
A field of pure sunflower yellow flowers under a clear blue sky

Evaluator (Gemma 4 26B)

The image perfectly matches the color descriptions provided in the prompt. The sunflowers are a vibrant, pure sunflower yellow, and the sky above is a clear, bright blue. There are no deviations in color or hue from the requested "pure sunflower yellow" and "clear blue sky." VERDICT: PASS

Medium 100%
PASS
A bedroom wall painted in #FF6B35 (burnt orange) with white furniture and natural daylight

Evaluator (Gemma 4 26B)

The wall is a vibrant burnt orange, which aligns with the hex code #FF6B35. The furniture, including the bed frame and nightstand, is white. The room is filled with bright, natural daylight from the window. The colors and elements perfectly match the prompt's specific requirements. VERDICT: PASS

PASS
A forest scene where the foliage is precisely #2D5F2D (dark muted green) with brown tree trunks

Evaluator (Gemma 4 26B)

The image features foliage in various shades of dark, muted green and brown tree trunks. The primary green used in the canopy and the forest floor closely aligns with the requested dark muted green tone (#2D5F2D). The trunks are a consistent medium brown. The color palette accurately follows the specific color requirements provided in the prompt. VERDICT: PASS

PASS
A room with walls in #4A90D9 (medium blue) with warm wooden floor and white trim

Evaluator (Gemma 4 26B)

The image accurately reflects the prompt's requirements. The walls are a medium blue color consistent with #4A90D9. The floor is a warm wooden texture, and the window frame and baseboards are white trim. All color descriptions and the specific hex-coded color are correctly represented in the scene. VERDICT: PASS

Hard 100%
PASS
A minimalist room with walls in #F5E6CC (warm cream), a sofa in #2C3E50 (dark navy), and accent pillows in #E74C3C (bright red)

Evaluator (Gemma 4 26B)

The image accurately reflects the requested colors. The walls are a warm cream/off-white, consistent with #F5E6CC. The sofa is a deep, dark navy blue, matching #2C3E50. The accent pillows are a vibrant, bright red, consistent with #E74C3C. All color requirements from the prompt are met. VERDICT: PASS

PASS
A product flat-lay on a #1A1A2E (deep dark blue) background, with a #F0C27F (soft gold) watch and #FFFFFF (pure white) earbuds

Evaluator (Gemma 4 26B)

The image accurately reflects the requested color palette. The background is a deep, dark navy blue consistent with #1A1A2E. The watch features a soft gold/champagne tone matching #F0C27F. The earbuds are a clean, pure white consistent with #FFFFFF. All three primary color requirements are met precisely. VERDICT: PASS

PASS
A graphic design composition with background #FAFAFA (near-white), main shape #FF4500 (orange-red), and accent lines #333333 (dark charcoal)

Evaluator (Gemma 4 26B)

The image successfully follows the color specifications. The background is a near-white color consistent with #FAFAFA. The main organic shape is a vibrant orange-red, matching the #FF4500 requirement. The dark accent lines and the central text are a dark charcoal/black, consistent with the #333333 specification. All three requested colors are clearly present and accurately represented. VERDICT: PASS

Extreme 100%
PASS
A smooth gradient on a wall going from #FF0000 (red) on the left to #0000FF (blue) on the right, transitioning evenly through purple

Evaluator (Gemma 4 26B)

The image shows a smooth gradient on a wall. On the far left, the color is a vibrant red, which matches #FF0000. On the far right, the color is a bright blue, which matches #0000FF. The transition through the middle features a clear purple/magenta hue, fulfilling the requirement of transitioning evenly through purple. The colors and the gradient direction perfectly match the prompt. VERDICT: PASS

PASS
A sunset sky gradient on a flat surface from #FF6B35 (orange) at the bottom to #1B1464 (deep indigo) at the top

Evaluator (Gemma 4 26B)

The image displays a gradient that matches the requested colors. The bottom of the sky features a vibrant orange, consistent with #FF6B35. As the gradient moves upward, it transitions through a reddish-purple hue into a very dark, deep indigo at the top, which aligns with #1B1464. The color transition and the specific hex values requested are accurately represented in the image. VERDICT: PASS

PASS
A horizontal banner with a gradient from #00C9FF (cyan) on the left to #92FE9D (mint green) on the right

Evaluator (Gemma 4 26B)

The image features a horizontal banner with a gradient. On the left, the color is a bright cyan/azure, which aligns well with #00C9FF. On the right, the color transitions into a light mint green, which matches the description of #92FE9D. The gradient transition is smooth and follows the specified color direction accurately. VERDICT: PASS
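
The Extreme prompts above specify gradient endpoints as hex codes. One way to see what an even transition between two endpoints would look like is to interpolate the channel values linearly. The sketch below does that directly on the 8-bit sRGB values, which is an assumption: the page does not say how gradients are generated or measured, and its judges compare colors visually. `hex_to_rgb`, `rgb_to_hex`, and `lerp_color` are hypothetical helper names.

```python
def hex_to_rgb(code):
    """Convert '#RRGGBB' to an (r, g, b) tuple of integers in 0..255."""
    code = code.lstrip("#")
    return tuple(int(code[i:i + 2], 16) for i in (0, 2, 4))


def rgb_to_hex(rgb):
    """Convert an (r, g, b) tuple of integers in 0..255 to '#RRGGBB'."""
    return "#{:02X}{:02X}{:02X}".format(*rgb)


def lerp_color(start_hex, end_hex, t):
    """Linearly interpolate each channel; t=0 gives the start, t=1 the end."""
    start, end = hex_to_rgb(start_hex), hex_to_rgb(end_hex)
    return rgb_to_hex(tuple(round(a + (b - a) * t) for a, b in zip(start, end)))


# Midpoint of the red-to-blue wall gradient from the first Extreme prompt.
print(lerp_color("#FF0000", "#0000FF", 0.5))  # -> #800080

# Five evenly spaced stops for the cyan-to-mint banner.
banner_stops = [lerp_color("#00C9FF", "#92FE9D", i / 4) for i in range(5)]
```

The red-to-blue midpoint `#800080` is a purple, consistent with the prompt's requirement that the gradient pass evenly through purple.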

Photorealism 67%

Photorealism measures whether local/boogu-image-turbo produces polished product and studio imagery with realistic materials, lighting, shadows, and finish.

Easy 67%
PASS
A single red sneaker on a clean white background with soft studio lighting and gentle shadow — product photography style

Evaluator (Gemma 4 26B)

The image successfully meets the prompt's requirements. The lighting is soft and professional, creating a natural shadow on the white background. The texture of the red fabric and the white rubber sole appears highly realistic and consistent. There are no obvious AI artifacts, such as warped laces, nonsensical stitching, or distorted geometry. It looks like a high-quality, professional studio product photograph. VERDICT: PASS

FAIL
A clear glass perfume bottle on a reflective black surface with studio lighting, no background distractions

Evaluator (Gemma 4 26B)

The image fails to meet the prompt's requirements. The prompt specifies a "reflective black surface," but the image shows a light gray/white background with a bright light source (likely a softbox) visible in the upper left. Furthermore, the "reflective black surface" is actually a light-colored surface reflecting a dark object, which contradicts the prompt. The glass texture and the straw inside the bottle also exhibit unnatural, slightly warped AI-generated distortions. VERDICT: FAIL

PASS
A ripe green apple on a seamless white background with soft side lighting and subtle shadow — catalog style

Evaluator (Gemma 4 26B)

The image successfully meets the criteria for a studio-quality product shot. The green apple features realistic skin texture, including subtle lenticels and a natural sheen. The lighting is soft and directional, creating a realistic highlight on the left and a soft, natural shadow on the right. The seamless white background is clean and professional. There are no obvious AI artifacts or distortions; the stem and fruit shape appear highly convincing and photorealistic. VERDICT: PASS

Graphical design 54%

Data Visualisation 0%

Data Visualisation measures whether local/boogu-image-turbo produces charts with correct labels, proportions, colors, and readable values.

Easy 0%
FAIL
A bar chart showing monthly sales for January through June with values 12K, 18K, 15K, 22K, 19K, 25K — labeled axes, color-coded bars, and a title 'Monthly Sales 2025'

Evaluator (Qwen 3 VL)

The chart shows "Monthly Sales 2025" with labeled axes and color-coded bars. Values are 15K (Jan), 15K (Apr), 22K (May), 19K (Jul), 25K (June) — but missing February and March. X-axis labels are incorrect (e.g., "AGR" for Feb, "MA" for Mar, "JUL" for July). The requested months (Jan-Jun) are not correctly labeled. VERDICT: FAIL

FAIL
A pie chart showing market share: Company A 35%, Company B 25%, Company C 20%, Others 20% — each slice a different color with labels and percentages

Evaluator (Qwen 3 VL)

The pie chart shows incorrect percentages: “Others” is labeled 20% but displays 14%, “Company B” shows 25% but 24%, and “Company A” shows 35% but 4%. Labels are present but misaligned and inconsistent. The chart type is correct, but data accuracy is fatally flawed. VERDICT: FAIL

FAIL
A simple line graph showing daily website visitors over one week (Mon-Sun): 1200, 1500, 1100, 1800, 2000, 1700, 1400 — with grid lines, axis labels, and title 'Weekly Traffic'

Evaluator (Qwen 3 VL)

The chart shows a line graph titled "Weekly Traffic" with axis labels, grid lines, and data points matching the requested values. However, the x-axis labels are illegible (e.g., "TuuV", "Anil", "Inpk-bndry"), and the y-axis scale is inconsistent (e.g., labels like "90" and "55" appear misaligned with the grid). The chart type is correct, but critical labeling errors render it unusable. VERDICT: FAIL

Layout & Design 22%

Layout & Design measures whether local/boogu-image-turbo organizes posters, menus, covers, and other designed compositions with the requested hierarchy and elements.

Easy 67%
PASS
A movie night poster with a large popcorn image in the center, the title 'MOVIE NIGHT' in bold at the top, and the tagline 'Every Friday at 8 PM' at the bottom

Evaluator (Qwen 3 VL)

The image features a large popcorn bucket centered, with "MOVIE NIGHT" in bold at the top and "Every Friday at 8 PM" at the bottom. Text is readable and correctly placed. The layout matches the prompt’s core elements and visual hierarchy. All required zones are present and correctly positioned. VERDICT: PASS

FAIL
An event poster featuring a guitar illustration, the headline 'LIVE MUSIC FEST' in large text, and 'Tickets available now' as a tagline below

Evaluator (Qwen 3 VL)

The image features a guitar illustration and prominent text, but the headline reads "MUSIC FEST" split across lines with "USIC" and "FST" below, not "LIVE MUSIC FEST" as specified. The tagline "Tickets available now" is misrendered as "Tiickets avitb adbel now." Text hierarchy is present but the exact prompt elements are incorrect and unreadable. VERDICT: FAIL

PASS
A book club poster with an open book illustration, the title 'JOIN THE BOOK CLUB' prominently displayed, and 'Meets every Wednesday' as the tagline

Evaluator (Qwen 3 VL)

The image features an open book illustration centered below the prominent title "JOIN THE BOOK CLUB". The tagline "Meets every Wednesday" is clearly visible in a banner at the bottom. Text is readable and correctly placed with proper visual hierarchy. All required layout elements are present and accurately rendered. VERDICT: PASS

Medium 0%
FAIL
A landing page design with a hero header image at the top, a headline and subtitle in the middle, a 'Sign Up' call-to-action button, and a footer with three social media icons

Evaluator (Qwen 3 VL)

The image shows a landing page with a hero header image, a headline/subtitle, and a “Sign Up” button — all correctly placed. Social media icons are visible in the footer. However, the headline and subtitle text is garbled and unreadable, violating the requirement for readable text. Visual hierarchy is intact, but illegible text undermines functionality. VERDICT: FAIL

FAIL
A newsletter layout with a banner image header, a two-column body section with text on the left and an image on the right, a highlighted quote block, and a footer with unsubscribe link

Evaluator (Qwen 3 VL)

The image shows a newsletter layout with a banner header, two-column body (text left, image right), a highlighted quote block, and a footer with an unsubscribe link. However, all text is gibberish and unreadable, violating the requirement for readable, correctly placed text. Layout structure is present but content fails core usability. VERDICT: FAIL

FAIL
A restaurant menu layout with a logo header, three category sections (appetizers, mains, desserts) each with items and prices, and a footer with address and hours

Evaluator (Qwen 3 VL)

The menu has a logo header, three category sections (Appetizers, Mains, Desserts) with items and prices, and a footer with address/hours. However, all text is gibberish or nonsensical, making it unreadable and violating the prompt’s requirement for correct, readable content. Layout structure is present but text fails functional purpose. VERDICT: FAIL

Hard 0%
FAIL
A magazine double-page spread with a left page containing a large hero photo, and a right page with a two-column article, a pull quote, a sidebar with author bio, and a page number

Evaluator (Qwen 3 VL)

The layout matches the prompt: left page has a large hero photo; right page has two columns of text, a pull quote, a sidebar with author bio, and page number. Visual hierarchy is clear. However, all text is gibberish placeholder, not readable. The core requirement of readable text is unmet. VERDICT: FAIL

FAIL
A mobile app UI mockup showing a dashboard with a top navigation bar, a search field, a 2x2 grid of feature cards with icons, a list of recent activity items, and a bottom tab bar

Evaluator (Qwen 3 VL)

The image shows a mobile app UI with a top navigation bar, search field, 2x2 grid of feature cards with icons, “RECENTS” list, and bottom tab bar — all present. However, text is garbled and unreadable, violating correct placement and legibility. Visual hierarchy is mostly intact, but illegible text undermines usability. VERDICT: FAIL

FAIL
A fashion magazine spread with a full-bleed left-page photo, right page with three-column layout, headline, body copy, a smaller inset image, and styled page numbers

Evaluator (Qwen 3 VL)

The left page features a full-bleed photo as required. The right page has a three-column layout with a headline, body copy, and a smaller inset image. Page numbers are present. However, all text is gibberish and unreadable, violating the requirement for readable, correctly placed text. Layout structure is otherwise accurate. VERDICT: FAIL
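The Layout & Design percentages above (Easy 67%, Medium 0%, Hard 0%, overall 22%) are consistent with a simple pass-rate calculation over the nine listed verdicts. The sketch below reproduces that arithmetic. It assumes the overall figure pools all entries and rounds to the nearest whole percent; the page does not state its exact method, and `pass_rate` is a hypothetical helper.

```python
# Verdicts transcribed from the Layout & Design entries above, grouped by difficulty.
verdicts = {
    "Easy": ["PASS", "FAIL", "PASS"],
    "Medium": ["FAIL", "FAIL", "FAIL"],
    "Hard": ["FAIL", "FAIL", "FAIL"],
}


def pass_rate(labels):
    """Percentage of PASS labels, rounded to the nearest whole percent."""
    return round(100 * labels.count("PASS") / len(labels))


# Pass rate for each difficulty tier.
per_tier = {tier: pass_rate(labels) for tier, labels in verdicts.items()}

# Assumed pooling: every entry counts equally toward the subcategory figure.
overall = pass_rate([v for labels in verdicts.values() for v in labels])

print(per_tier)  # {'Easy': 67, 'Medium': 0, 'Hard': 0}
print(overall)   # 22
```

Because two of nine entries pass, pooling gives 22%, which matches the figure shown for this subcategory.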

Style Diversity 92%

Style Diversity measures whether local/boogu-image-turbo can shift between visual styles like oil painting, pixel art, watercolor, and technical illustration.

Easy 100%
PASS
A golden retriever sitting in a garden, painted in the style of an oil painting with visible brushstrokes and rich texture

Evaluator (Qwen 3.5 122B)

The image clearly exhibits an oil painting style with visible, textured brushstrokes throughout. The golden retriever is depicted sitting in a garden setting with flowers and greenery. The rich texture and painterly quality are evident in the fur, foliage, and background elements. The style matches the requested "oil painting with visible brushstrokes and rich texture" description accurately. VERDICT: PASS

PASS
A golden retriever sitting in a garden, rendered in Japanese anime style with flat colors and large expressive eyes

Evaluator (Qwen 3.5 122B)

The image clearly exhibits the requested Japanese anime style. Key characteristics are present: the golden retriever has large, expressive eyes with prominent highlights, simplified facial features, and clean line art. The colors are vibrant yet relatively flat with minimal shading gradients, consistent with anime aesthetics. The overall composition and character design align well with the prompt’s stylistic requirements. VERDICT: PASS

PASS
A golden retriever sitting in a garden, rendered in pixel art style with a 32x32 grid and limited color palette

Evaluator (Qwen 3.5 122B)

The image clearly depicts a golden retriever in a garden setting rendered in pixel art style. The 32x32 grid is visibly overlaid on the image, and the color palette is intentionally limited with distinct pixel blocks. Key characteristics of pixel art — blocky edges, low resolution, and grid-based composition — are all present and clearly recognizable. VERDICT: PASS

Medium 100%
PASS
A city street scene in the style of Art Deco — geometric shapes, gold and black palette, ornamental symmetry

Evaluator (Qwen 3.5 122B)

The image clearly exhibits the requested Art Deco style. Key characteristics are present: strong geometric shapes in the buildings and street layout, a distinct gold and black color palette, and ornamental symmetry centered around the main building. The typography on the signs and the design of the cars also fit the era. The visual language is consistent with the prompt's requirements. VERDICT: PASS

PASS
A woman with a parasol in the style of Impressionism — soft brushstrokes, dappled light, pastel palette

Evaluator (Qwen 3.5 122B)

The image clearly exhibits Impressionist characteristics: soft, visible brushstrokes are evident throughout, especially in the foliage and water reflections. Dappled light filters through trees, creating a play of light and shadow on the woman and ground. The palette is dominated by pastels — pale blues, pinks, greens, and creams — consistent with the style. The overall effect is atmospheric and painterly, matching the requested aesthetic. VERDICT: PASS

PASS
A portrait of a man in Pop Art style — bold outlines, Ben-Day dots, flat vivid primary colors

Evaluator (Qwen 3.5 122B)

The image clearly exhibits Pop Art characteristics: bold black outlines define the figure, vivid primary and secondary colors are used in flat blocks, and Ben-Day dots are prominently featured throughout the hair, face, and background. The stylized, graphic quality matches the requested aesthetic precisely. All key visual elements of the specified art style are present and recognizable. VERDICT: PASS

Hard 67%
PASS
A forest rendered in photorealistic watercolor — the detail and lighting of a photo combined with visible watercolor bleeds and paper texture

Evaluator (Qwen 3.5 122B)

The image clearly exhibits watercolor characteristics: visible paper texture, pigment bleeds, splatters, and organic edge diffusion. The lighting mimics photorealism with sunbeams filtering through trees, while the medium remains unmistakably watercolor. Key requested traits — photo-like detail in foliage and light, combined with traditional watercolor artifacts — are all present and well-executed. VERDICT: PASS

PASS
A woman in a futuristic city rendered in cyberpunk Art Nouveau — neon colors and tech elements combined with flowing organic lines and floral borders

Evaluator (Qwen 3.5 122B)

The image successfully combines cyberpunk and Art Nouveau elements. The character features neon accents, gears, and a futuristic city background (cyberpunk), while the composition is framed by ornate, flowing floral borders and organic lines (Art Nouveau). The color palette utilizes vibrant neons against darker tones. All requested stylistic characteristics are clearly present and well-integrated. VERDICT: PASS

FAIL
A medieval castle depicted in Japanese ukiyo-e woodblock print style combined with photorealistic sky and lighting

Evaluator (Qwen 3.5 122B)

The image successfully combines medieval castle architecture with Japanese ukiyo-e woodblock print aesthetics. Key characteristics include the stylized waves in the foreground, flat color planes, visible wood grain texture, and traditional Japanese text seals. However, the sky and lighting are not photorealistic — they retain the illustrative, stylized quality of ukiyo-e, with painted clouds and sunbeams that match the overall artistic style rather than mimicking real-world photography. Thus, th
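Most evaluator outputs on this page end with a `VERDICT: PASS` or `VERDICT: FAIL` marker, but the text for this entry is cut off before the marker appears. The following is a minimal sketch of how a reader might pull the verdict out of an evaluator string and detect a cut-off one. The function name and regular expression are illustrative assumptions, not the benchmark's own parser.

```python
import re

# Matches a trailing "VERDICT: PASS" or "VERDICT: FAIL" marker.
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\s*$")


def extract_verdict(evaluator_text):
    """Return 'PASS' or 'FAIL' if the text ends with a verdict marker, else None."""
    match = VERDICT_RE.search(evaluator_text.strip())
    return match.group(1) if match else None


# A complete output yields its verdict; a truncated one yields None.
complete = "Layout structure is otherwise accurate. VERDICT: FAIL"
truncated = "rather than mimicking real-world photography. Thus, th"

print(extract_verdict(complete))   # FAIL
print(extract_verdict(truncated))  # None
```

When the function returns `None`, the verdict has to come from the label shown next to the prompt rather than from the evaluator text.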

Extreme 100%
PASS
An interior scene rendered in soft brutalism — raw concrete textures combined with rounded pastel furniture and warm diffused light

Evaluator (Qwen 3.5 122B)

The image clearly exhibits soft brutalism: raw concrete walls, ceiling, and floor are prominent. Rounded pastel furniture (pink sofa, yellow and mint chairs) is present. Warm diffused light streams through sheer curtains, creating a gentle glow. All key visual characteristics of the requested style are accurately represented. VERDICT: PASS

PASS
A still life painted in anxious minimalism — sparse composition with uncomfortable negative space, muted colors, and subtle visual tension

Evaluator (Qwen 3.5 122B)

The image successfully embodies the requested "anxious minimalism" style. The composition is sparse, featuring a simple table with a bowl, brushes, and cloth against a largely empty wall. The color palette is muted, dominated by grays and browns. The negative space is prominent, particularly in the upper right where faint, sketch-like lines create subtle visual tension and unease. The overall mood is quiet and slightly unsettling, matching the prompt's requirements. VERDICT: PASS

PASS
A cityscape rendered in nostalgic futurism — retro-futuristic 1960s space-age aesthetics with a melancholic warm-toned patina

Evaluator (Qwen 3.5 122B)

The image successfully embodies the requested "nostalgic futurism" and "retro-futuristic 1960s space-age aesthetics." Key characteristics are clearly present: the architecture features classic mid-century modern elements like domes, spires, and sleek, rounded forms reminiscent of "The Jetsons" or 1960s World's Fair designs. The color palette is a distinct warm, sepia-toned monochrome that creates the requested "melancholic warm-toned patina," mimicking aged photography. The flying vehicles and o