ImageBench V1 —
192 evaluations across 6 categories
Benchmark V1 verdicts are produced by VLM judges and can contain mistakes. Treat PASS/FAIL labels as machine-assisted assessments, and inspect the images yourself. Learn more about the methodology.

Prompt: The word 'CHAPTER ONE' typed on aged paper with a vintage typewriter font, complete with slightly uneven ink
Text Rendering 80%
Typography StyleClick to expand100%
Typography Style measures whether local/boogu-image-turbo preserves requested lettering styles, font mood, and text texture such as typewriter ink or neon signage.
PASSEvaluator (Qwen 3 VL)
Text reads “CHAPTER ONE” correctly. Font mimics vintage typewriter style with uneven, slightly smudged ink and paper texture. Aged paper and typewriter context are convincingly rendered. Minor artifacts (ink specks) enhance realism. All core criteria met. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Text reads "TODAY'S SPECIAL" correctly. Letters show chalk texture, smudges, and dust on board and ledge, matching chalkboard medium. Chalk pieces and eraser are present. Style convincingly mimics white chalk on dark green surface. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The text "OPEN" is rendered in glowing red LED pixels, matching the described medium. Letters are clearly formed with a pixelated grid, and spelling is correct. The sign’s red casing and wiring further reinforce the LED sign aesthetic. All core criteria are met. VERDICT: PASS
Writing accuracyClick to expand75%
Writing accuracy measures whether local/boogu-image-turbo renders the exact requested words, numbers, labels, and short phrases without spelling or layout errors.
PASSEvaluator (Qwen 3 VL)
The word "HELLO" is spelled correctly and fully readable, despite stylized, textured letterforms. Each letter is distinct and legible against the white background. No misspellings or missing characters are present. The artistic design does not obscure any letters. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The word "ADVENTURE" is spelled correctly and fully readable, though stylized with cracks and stars. No missing or illegible parts. Background is cream-colored, though not plain due to texture and compass graphic. Text is centered. All letters are intact and discernible. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The word "SERENITY" is fully spelled correctly and clearly readable. Each letter is distinct and legible against the solid light blue background. No misspellings, missing letters, or illegible parts are present. The text is centered as intended. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
All words are spelled correctly and fully readable: "The", "quick", "brown", "fox", "jumps", "high". Text is clean black on white background. No misspellings or illegible parts. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The text "Every sunset brings a new dawn ahead" is fully legible and spelled correctly. All words match the intended phrase exactly. No misspellings, missing words, or illegible parts are present. The script is clear and readable against the pale yellow background. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The text "Stars shine brightest in the dark night" is fully legible and spelled correctly. All words match the intended phrase exactly. The background is clean off-white with star motifs, matching the description. No misspellings or missing words detected. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
All text matches the intended content exactly: "SUMMER SALE", "Up to 50% off all items this weekend only", and "Terms and conditions apply. See store for details." No misspellings, missing words, or illegible parts. Text is fully readable and correctly rendered. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
All text matches the intended message exactly: “GRAND OPENING”, “Join us for food, music, and fun”, and “RSVP required. Limited seating available.”. Every word is spelled correctly and fully readable, with no missing or illegible parts. Font sizes and layout align with the description. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
All text matches the intended content exactly: "ANNUAL GALA 2025", "An evening of fine dining and live jazz", and "Black tie optional. Doors open at 6 PM." No misspellings, missing words, or illegible parts. Font sizes and layout align with description. Text is fully readable. VERDICT: PASS
FAILEvaluator (Qwen 3 VL)
The tagline “ONE SHIP. ONE HE CHANCE. CRNANC LO)TUICE. NO TURNING BACK.” contains misspellings: “HE” instead of “A”, and “CRNANC LO)TUICE” is garbled and unreadable. Credits block is mostly correct but “Starring” is misaligned and “Stars Voss” is illegible on the ship. Core text errors prevent full readability. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
“Top 1ew” is misspelled (should be “Top 10”). “March 2025” in barcode reads “March 2023”. “Climate Action” appears twice incorrectly. “Dr. Maria Santos” is repeated in wrong context. Multiple text elements are misaligned or garbled. Core text is not fully accurate or readable. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
“Northwest” is misspelled as “Northweest.” All other text matches the intended label exactly, including weight, description, and small print. The misspelling is clear and fully readable. VERDICT: FAIL
Spatial Reasoning 74%
Attributes BindingClick to expand100%
Attributes Binding measures whether local/boogu-image-turbo attaches the right colors, materials, and properties to the correct requested objects.
PASSEvaluator (Qwen 3.5 122B)
The image shows two objects on a white surface near a window. On the left is a red spherical object with visible seams and wear — consistent with a ball. On the right is a blue cubic object with chipped paint and edges — consistent with a cube. Colors, shapes, and positions match the prompt: red ball on left, blue cube on right. No attribute swaps or misbindings observed. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a green apple and an orange pumpkin on a wooden cutting board. The apple is correctly green, round, and appropriately sized relative to the pumpkin. The pumpkin is correctly orange, round, and larger than the apple. Both objects are placed on a rustic wooden cutting board with visible grain and knife marks. No attribute swaps or misidentifications are present. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a yellow taxi and a black limousine parked next to each other on a street. The taxi is correctly yellow and has the typical taxi shape with a roof sign. The limousine is correctly black and has the elongated shape of a limousine. All attributes (color, size, shape) are correctly bound to the right objects with no swaps. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a white canvas with three painted shapes: - A large red circle at the top — matches prompt. - A medium-sized blue triangle at bottom left — prompt says “small,” but visually it’s comparable in scale to the star; however, color and shape are correct. - A medium green star at bottom right — matches prompt. All colors and shapes are correctly assigned. Size descriptors (“large,” “small,” “medium”) are subjective but reasonably aligned. No swaps or misattributions. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows three candles on a wooden shelf: 1. Tall thin white candle – correctly matches description (tall, thin, white). 2. Short fat red candle – correctly matches description (short, fat, red). 3. Medium spiral yellow candle – correctly matches description (medium height, spiral shape, yellow). All attributes are correctly bound to the right objects with no swaps. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a large brown horse with a black mane and tail, standing in a meadow. A small white rabbit is sitting near the horse’s front legs. A medium-sized black cat stands nearby, facing forward. All attributes match the prompt: horse is big and brown, rabbit is small and white, cat is medium and black. No swaps or misattributions observed. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image contains a pink fire truck with correct features (ladder, lights, wheels). A large blue banana is present in the foreground. A green sun with a smiling face is visible in the sky. All requested objects are present with their specified colors and attributes correctly bound. No swaps or missing elements detected. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image contains: 1. Purple carrot: Correctly colored purple with green leafy top, elongated shape. 2. Red tree: Small potted tree with red leaves and reddish trunk. 3. White watermelon: Sliced open, showing pink/red interior with black seeds, white rind. All attributes are correctly bound to their respective objects with no swaps. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows three objects on a wooden table: 1. Elephant: Orange-brown ceramic, correct color and shape. 2. Penguin: Yellow and white plush, correct color and shape. 3. Strawberry: Silver metallic with green leaves, correct color and shape. All attributes are correctly bound to the right objects with no swaps. VERDICT: PASS
CompositionalityClick to expand89%
Compositionality measures whether local/boogu-image-turbo combines subjects, actions, outfits, props, and scene details into one coherent requested image.
PASSEvaluator (Qwen 3.5 122B)
The image contains: - Tabby cat: Present (clearly visible with tabby markings) - Oversized red sunglasses: Present (large red frames on cat's face) - Coherent combination: Present (sunglasses fit naturally on cat's face) All described elements are present and coherently combined. The cat is wearing the red sunglasses in a believable manner, and the scene is well-composed with appropriate lighting and background elements. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image contains: - Golden retriever: Present and clearly visible - Cowboy hat: Present on the dog's head - Tiny: The hat appears appropriately sized for the dog All described elements are present and coherently combined. The golden retriever is wearing a cowboy hat that fits its head proportionally, creating a unified and logical composition. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a penguin in the foreground holding a colorful umbrella. The umbrella has multiple bright colors including red, yellow, blue, and orange. The penguin is standing on icy ground with other penguins in the background. All described elements are present and coherently combined - the penguin is clearly holding the umbrella with its flipper/wing, and the umbrella is open above it protecting from rain. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image contains all required elements: 1. Medieval knight in full armor - PRESENT (full plate armor with helmet) 2. Sitting at a desk - PRESENT (wooden table/desk) 3. Typing on a modern laptop - PRESENT (silver laptop with visible screen and keyboard) The concepts are coherently combined with the knight's armored hands positioned on the laptop keyboard, creating a believable anachronistic scene. The medieval setting (stone walls, candles, scrolls) contrasts appropriately with the modern tech
PASSEvaluator (Qwen 3.5 122B)
The image contains all required elements: 1. Roman gladiator - Present (wearing helmet, armor, red tunic) 2. Selfie action - Present (holding phone up, smiling at camera) 3. Smartphone - Present (modern device showing his reflection) 4. Colosseum - Present (ancient Roman amphitheater in background) All elements are coherently combined in a single scene with proper spatial relationships and lighting consistency. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows: - Astronaut in a spacesuit: Present (white suit with American flag patches, helmet with reflective visor) - Bicycle: Present (blue frame, standard bike design) - Park setting: Present (trees, grass, benches, fountain, people in background) - Coherent combination: Yes (astronaut is actively riding the bike on a park path) All elements are present and logically combined. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image contains all described elements: 1. Octopus - Present (central figure) 2. Top hat - Present (black, on octopus head) 3. Monocle - Present (gold-rimmed, on octopus eye) 4. Chess game - Present (board with pieces on lunar surface) 5. Moon surface - Present (cratered terrain) 6. Earth in background - Present (visible in space) All elements are coherently combined in a surreal but visually consistent composition. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows: - Dinosaur: Present (T-Rex head/body) - Business suit: Present (dark suit, tie, pocket square) - PowerPoint presentation: Present (on screen with charts/maps) - Modern office: Present (glass walls, conference table, laptops) - Drinking coffee: Present (holding white mug) All elements are coherently combined in a surreal but visually consistent scene. The dinosaur's anatomy blends with human posture and clothing naturally. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image contains: - Robot dressed as a pirate: YES (wearing pirate hat, coat, belt) - Paper boat: YES (white origami-style boat) - Lake of lava: YES (glowing molten surface with volcanoes) - Juggling three kittens: PARTIALLY — there are FIVE kittens floating around, not three, and they’re not being juggled — they’re suspended mid-air in a surreal arrangement Core concept is present but “juggling three kittens” is inaccurate — both number and action mismatch. VERDICT: FAIL
CountingClick to expand44%
Counting measures whether local/boogu-image-turbo creates the exact number of requested objects without adding or missing items.
PASSEvaluator (Qwen 3.5 122B)
The image shows exactly three red apples placed on a rustic wooden table. The apples are arranged in a loose triangular formation near the center-left of the frame. No other apples are visible. The count matches the prompt’s requirement precisely. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image shows four yellow rubber ducks arranged in a row on a wooden bathroom shelf. The prompt requested exactly five ducks. Since only four are visible, the count does not match the requirement. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The image displays four white pillar candles, all lit, standing upright on a dark wooden surface. The count of candles is exactly four, and they are white and standing on a dark surface as requested. No other candle-like objects are present. The core criteria — “Exactly 4 white candles standing on a dark surface” — is fully met. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image shows 1 orange cat and 2 black dogs sitting on a grassy lawn. The prompt requested 2 orange cats and 3 black dogs. The counts do not match — there is one fewer cat and one fewer dog than specified. Therefore, the image fails to meet the exact object count requirement. VERDICT: FAIL
FAILEvaluator (Qwen 3.5 122B)
The image shows 3 red roses and 3 white daisies in a glass vase on a wooden table. The prompt requested 4 red roses and 2 white daisies. The counts do not match — there is one fewer rose and one extra daisy than specified. Therefore, the core requirement of exact object count is not met. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The image shows 1 blue butterfly and 3 yellow butterflies resting on a wooden garden fence, exactly matching the requested count. All butterflies are clearly visible and positioned as described. No extra or missing butterflies are present. The scene accurately reflects the prompt’s specifications. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
I count 10 goldfish in the image: one near the top right, two in the upper middle, one on the left middle, one center, one right middle, three along the bottom, and one partially visible behind the glass on the far left. The prompt requested exactly 9 goldfish. Since there are 10, the count does not match. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The image shows a fruit bowl with: - 3 bananas (correct) - 4 oranges (correct) - 2 apples (correct) All counts match the prompt exactly. The composition is accurate and visually coherent. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image shows a cluster of colorful balloons against a clear blue sky. Counting them reveals approximately 18–20 balloons, not 11. The prompt requested exactly 11, so the count does not match. While the scene is visually accurate (colorful balloons, blue sky), the numerical requirement is violated. VERDICT: FAIL
NegationClick to expand33%
Negation measures whether local/boogu-image-turbo respects what should be absent from the image, not only what should be present.
PASSEvaluator (Qwen 3.5 122B)
The image depicts a serene tropical beach scene with palm trees, thatched-roof huts, lounge chairs, and sailboats on calm turquoise water. The sand is pristine with gentle waves lapping the shore. There are no visible people anywhere in the frame — not on the beach, under the huts, or in the boats. The exclusion criterion (“no people”) is fully satisfied. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image shows an orange and white cat lying on a patterned couch. The cat is wearing a dark collar with a small metal ring or tag attached. Since the prompt explicitly excluded “a cat lying on a couch without a collar,” and the cat in the image clearly has a collar, the excluded element is present. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The image shows an empty classroom with wooden desks arranged in rows and a large chalkboard on the wall with writing on it. There are no students visible in the room. The scene is quiet and unoccupied, matching the requirement of having no students present. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image clearly shows a sandy beach with palm trees, seashells, and a beach umbrella — as requested. However, the prompt explicitly excluded “water or ocean visible,” and in this image, the ocean is prominently visible in the background, stretching across the horizon with waves breaking on shore. This violates the exclusion criterion. VERDICT: FAIL
FAILEvaluator (Qwen 3.5 122B)
The image clearly shows a computer monitor (an Apple iMac) on the desk, displaying a document or webpage. The prompt explicitly excluded “computers or screens,” yet a screen is prominently present. This violates the exclusion criterion. VERDICT: FAIL
FAILEvaluator (Qwen 3.5 122B)
The image shows a kitchen with hanging copper and cast iron pots, a gas stove, and a wooden cutting board on an island. However, the cutting board clearly has chopped food items on it — diced potatoes, a small tomato, and a piece of cucumber. Since the prompt explicitly excluded “no food,” and food is visibly present, the exclusion criterion is violated. VERDICT: FAIL
FAILEvaluator (Qwen 3.5 122B)
The image shows a pepperoni pizza with tomato sauce, basil leaves, and a crust. However, there is clearly melted white cheese (likely mozzarella) distributed across the pizza surface, beneath and around the pepperoni slices. The prompt explicitly excluded cheese, but cheese is visibly present in the image. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The image shows a vintage car body with windows, doors, and headlights as described. Critically, the car has no wheels — the wheel wells are empty, exposing suspension components or just darkness where tires should be. This matches the exclusion criteria: “no wheels” is accurately represented. The car is clearly in a state of disassembly or restoration, parked on a roadside with tools nearby. No wheels are present anywhere on the vehicle. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image shows a human face with eyes, nose, and mouth. However, the excluded element — eyebrows — are clearly present. Both eyebrows are visible above the eyes, with natural hair texture and shape. Since the prompt explicitly required no eyebrows, their presence violates the constraint. VERDICT: FAIL
Relative PositionClick to expand83%
Relative Position measures whether local/boogu-image-turbo places objects in the requested spatial relationship, such as above, below, inside, or next to another object.
PASSEvaluator (Qwen 3.5 122B)
The image shows a ginger and white cat sitting directly on top of an open cardboard box. The box is positioned on the floor in a living room setting. The spatial relationship described in the prompt — “a cat sitting on top of a cardboard box” — is accurately represented. All positional relationships are correct. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a red ball positioned directly underneath the seat of a wooden chair, resting on the floor between the chair’s legs. The spatial relationship “underneath” is accurately depicted — the ball is below the chair’s seat and enclosed within the leg frame. All described positional relationships are correct. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a wooden desk with two coffee mugs and an open book. One mug is positioned to the left of the open book, and another mug is behind it, near a pair of glasses. The spatial relationship described in the prompt — "a coffee mug next to an open book on a desk" — is accurately represented. All positional relationships are correct. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a bird perched on a mossy branch that extends from a large tree trunk. The branch curves downward and is positioned above a wooden park bench, which sits on the ground beneath it. The spatial relationships described — bird “on” branch, branch “from” tree, tree “next to” bench — are all accurately represented. The bench is below and slightly to the right of the branch’s end, consistent with “next to.” All positional cues match the prompt. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows an orange cat lying curled up on a floral-patterned pillow. The pillow is placed directly on the seat of a rustic wooden chair. The spatial relationships described — “a cat sitting on a pillow, the pillow placed on a wooden chair” — are accurately represented. All positional relationships (on, under) are correct as per the prompt. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a candle placed on top of a stack of books, which matches the prompt. The stack of books is positioned to the right of a potted plant, also matching the prompt’s spatial description. All specified positional relationships — “on top of” and “next to” — are accurately represented in the image. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image shows five prominent chess pieces arranged on a board. There is a white king in the center, a black queen to its left, and a white pawn (not a bishop) to its right. A black king is behind (above) the white king, and a brown knight is in front (below) the white king. The prompt incorrectly identifies the piece to the right as a bishop and the piece above as a rook. Therefore, not all positional relationships and piece identifications are correct. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The image shows a desk scene with a laptop centered, a coffee mug to its left, a phone to its right, a lamp positioned above and to the right of the laptop, and an open notebook below the laptop. All specified spatial relationships — left, right, above, below — are accurately represented in the image. The arrangement matches the prompt’s description precisely. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows five animals arranged left to right: a brown horse grazing on the far left, a black-and-white cow to its right, a white sheep in the center, a brown-and-white goat to the sheep’s right, and a golden dog sitting on the far right. All specified spatial relationships match the prompt exactly — each animal is positioned as described relative to its neighbor. No positional errors are present. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a red barn in the foreground, large and detailed, with a windmill positioned to its right in the midground at a smaller scale. Snow-capped mountains appear in the background, smaller still, creating consistent depth perspective. All spatial relationships described — foreground/midground/background scaling and relative positioning — are accurately represented. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a beach scene with a sandcastle in the foreground, a surfer riding a wave in the midground, and a cruise ship on the horizon in the background. The sizes of the objects decrease with distance as specified. All described positional relationships are correct. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image shows a park bench and pigeons in the foreground, a fountain in the midground, and skyscrapers in the background — matching the prompt’s layered spatial description. However, the prompt specifies “each layer progressively smaller,” which is not visually accurate: the fountain appears larger than the distant buildings due to perspective, but the skyscrapers are not scaled down relative to the fountain in a way that reflects true proportional diminution across layers. The spatial relatio
Scale & ProportionsClick to expand89%
Scale & Proportions measures whether local/boogu-image-turbo keeps people, vehicles, buildings, and everyday objects at believable relative sizes.
PASSEvaluator (Qwen 3.5 122B)
The person’s height relative to the bus appears realistic — roughly 1/3 to 1/2 the height of the bus’s side panel, which is consistent with real-world proportions. The bus’s features (windows, door, mirrors) are scaled appropriately to human size. No obvious distortions or scale mismatches are visible. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The ladybug appears appropriately sized relative to the leaf, consistent with real-world proportions. The leaf’s veins, water droplets, and texture are rendered at a scale that supports the insect’s realistic size. No obvious distortions or scaling errors are present. The composition maintains natural spatial relationships between the bug and its environment. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The child’s hands are holding the basketball, and the ball appears appropriately sized relative to the child’s torso and arms — consistent with a real-world child holding a standard basketball. No anatomical or proportional distortions are evident. The background elements (gym floor, bleachers, hoop) also scale logically. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows a miniature figure standing on the rim of a coffee cup, which aligns with the prompt's description of "a tiny person standing on the rim of a coffee cup." The proportions are consistent with the intended size relationship, as the person is appropriately scaled to appear tiny relative to the cup. The scene is coherent and matches the described scenario. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image depicts a giant tabby cat towering over city skyscrapers, cars, and pedestrians. The cat’s paws are larger than the vehicles, and its body height exceeds the surrounding buildings, consistent with the “kaiju” description. However, the Empire State Building (visible in background) appears smaller than the cat — which contradicts real-world scale unless the cat is truly colossal. Since the prompt explicitly asks for “giant cat walking between skyscrapers like a kaiju,” the exaggerated pr
PASSEvaluator (Qwen 3.5 122B)
The image shows a miniature house resting on an open human palm. The house is appropriately sized to fit comfortably within the hand, with details like windows, doors, and moss visible at a scale consistent with being held. The surrounding tools and workshop environment also appear proportionally correct relative to the hand and house. The size relationship between the house and the hand is accurate and consistent with the prompt. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows five animals in a line: elephant, horse, dog, cat, and mouse. Their relative sizes are correctly proportioned according to real-world biology — the elephant is largest, followed by the horse, then dog, cat, and mouse smallest. All are standing on the same ground plane with consistent perspective. No anatomical or scaling errors are visible. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image displays a dinner table setting with a plate, fork, knife, wine glass, salt shaker, and peppercorns. The relative sizes appear realistic: the plate is appropriately sized for a dinner setting, the cutlery is proportionate to the plate, the wine glass is standard size relative to the other items, and the salt shaker and scattered peppercorns are correctly scaled. No obvious distortions or size inconsistencies are present. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image displays a farm scene with a barn, tractor, farmer, cow, chicken, and fence post. The relative sizes appear mostly consistent: the barn is largest, followed by the tractor and cow, then the farmer, with the chicken being smallest. However, the farmer appears slightly too small relative to the cow and tractor, and the chicken seems disproportionately large compared to typical farm chickens. These minor inconsistencies affect overall proportional accuracy. VERDICT: FAIL
Human realism 86%
Faces & ExpressionsClick to expand100%
Faces & Expressions measures whether local/boogu-image-turbo creates realistic faces with the requested age, emotion, expression, and identity cues.
PASSEvaluator (Qwen 3 VL)
Face is symmetric with natural features. Smile appears warm and genuine, with crinkled eyes and relaxed lips conveying authentic joy. No significant distortions; lighting and texture look realistic. Slight hair strands across face add to naturalism. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Face is symmetric with natural, correctly proportioned features. Expression is calm and serious, with subtle brow lines and neutral mouth conveying intent. No visible distortions. Lighting and texture are realistic. Meets all criteria for the prompt. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The face is symmetric with correctly proportioned features. The teenager’s expression is relaxed and neutral, with no strong emotion. No noticeable distortions or artifacts are present. Lighting and detail are naturalistic, matching a realistic headshot. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Face is symmetric with natural features. Disgust is clearly conveyed: nose wrinkled, lips curled, brows furrowed. No significant distortions; expression is authentic and readable. Skin texture and lighting are realistic. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Face is symmetric with natural features. Eyes are wide, eyebrows raised, mouth slightly open — clearly conveying genuine surprise. No distortions or unnatural artifacts. Lighting and detail are realistic. Emotion is unmistakable and well-executed. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Face is symmetric with natural, correctly rendered features. Eyes are downcast, mouth slightly drawn, gaze distant — all aligning with the prompt’s melancholy description. No visible distortions. Lighting and texture enhance the emotional realism. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The face is symmetric with natural, aged features: deep wrinkles, white hair, and a warm, genuine smile. Eyes crinkle naturally, conveying kindness. No significant distortions; lighting and texture are realistic. The emotion is clearly readable as joyful and warm. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The child’s face is symmetric with natural features: round cheeks, large eyes, and tears streaming down. The crying expression is clearly readable, with furrowed brows, an open mouth, and visible tears. No significant distortions are present. The image accurately fulfills the prompt’s requirements. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Face is symmetric with realistic elderly features: age spots, thinning gray hair, and reading glasses. Skeptical expression is clearly readable via furrowed brow and downturned mouth. No significant distortions. Lighting and texture are natural. All prompt elements are accurately rendered. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Faces are symmetric with correct anatomical features. Emotions are clearly readable: left person laughing (head back), center person shocked (wide eyes, open mouth), right person smirking with rolled eyes. No distortions or unnatural artifacts. All three expressions match the prompt precisely. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Faces are symmetric with correct anatomical features. Emotions are clearly readable: father beams, mother smiles gently, teenager looks bored, child grins excitedly. No distortions or unnatural artifacts. All four individuals match the prompt’s emotional descriptions accurately. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Faces are symmetric with correct features. Emotions are clearly readable: frustrated (left, furrowed brows), pleased (center, smiling, nodding), confused (right, tilted head, raised brows). No distortions. All three expressions match the prompt accurately. VERDICT: PASS
Full BodyClick to expand100%
Full Body measures whether local/boogu-image-turbo renders full-body people with natural anatomy, clothing, pose, and proportions.
PASSEvaluator (Qwen 3 VL)
The woman stands upright with arms at her sides, wearing a white t-shirt and jeans, matching the prompt. Limb count and proportions are anatomically correct. Pose is relaxed and recognizable. No significant distortions observed. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The man stands straight, facing the camera, with hands in pockets. Limb count and proportions are anatomically correct. The pose is clearly recognizable as described. No distortions are visible. All elements match the prompt accurately. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The person’s limbs are correctly counted and proportioned. The neutral standing pose is anatomically accurate and recognizable. No significant distortions are visible. The full body from head to feet is clearly depicted in a park setting as requested. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The woman’s limbs are correctly proportioned and anatomically accurate. Her running pose is dynamic and recognizable, with one leg forward, arms bent mid-stride, and hair flowing backward. Motion is convincingly depicted in her clothing and hair. No distortions or anatomical errors are visible. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The man’s limbs are correctly counted and proportioned. His pose—bent knees, mid-air jump over a puddle—is anatomically plausible and matches the prompt. Water splashes realistically. No distortions in body structure or proportions are visible. The action is clearly recognizable as jumping over water. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The dancer’s limbs are correctly proportioned and count is accurate. The pose—dynamic spin with one leg extended and dress flowing—is anatomically plausible and recognizable as a ballet turn. No distortions in limbs or body structure are visible. The motion and fabric dynamics are convincingly rendered. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The subject stands on one leg, with the other foot pressed against the inner thigh, arms raised overhead — accurately depicting the tree pose. Limb count and proportions are correct. No anatomical distortions are visible. The pose is clearly recognizable and executed with proper form. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The climber has correct limb count and proportions. The pose is anatomically plausible for rock climbing, with arms and legs extended to holds. Musculature and joint angles appear natural. No obvious distortions. The action is clearly recognizable as dynamic rock climbing. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The cellist has correct limb count and proportions. Her posture is anatomically sound: seated, legs around the cello, bow arm extended. The action is clearly recognizable as playing the cello. No distortions are visible. All elements of the prompt are accurately depicted. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The dancers’ limbs are correctly counted and proportioned. The woman’s body is arched backward with intertwined legs, matching the tango dip pose. Anatomical structure is accurate, with no visible distortions. The action is clearly recognizable as a classic tango embrace and dip. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Both players show correct limb count and proportions. The sliding player’s body posture and the jumping player’s mid-air pose are anatomically plausible and match the described action. The ball is near the jumping player’s foot, capturing the tackle moment accurately. No significant distortions are visible. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Both gymnasts perform synchronized handstands with identical body alignment. Limb count and proportions are anatomically correct. The pose is clearly recognizable as a handstand on the floor. No distortions are visible. VERDICT: PASS
HandsClick to expand50%
Hands measures whether local/boogu-image-turbo draws anatomically plausible hands, fingers, and gestures.
FAILEvaluator (Qwen 3.5 122B)
The left hand (viewer’s right) shows 5 fingers, all anatomically plausible. The right hand (viewer’s left) also has 5 fingers, but the thumb appears abnormally short and stubby, and the index finger is slightly malformed or fused at the tip. While finger count is correct, anatomical correctness is compromised. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The woman’s hands are held palms-up with fingers spread. Left hand: 5 fingers visible (thumb, index, middle, ring, pinky). Right hand: also 5 fingers clearly shown. All digits appear anatomically correct — no extra, missing, or fused fingers. Gesture is natural and matches the prompt. No anomalies detected. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The left hand (on the right side of image) shows 4 fingers clearly, but the thumb is missing or fused — only 4 digits visible. The right hand (on left side of image) also appears to have only 4 fingers; the pinky is indistinct or merged. Neither hand has exactly 5 fingers. Grip is anatomically plausible but digit count is incorrect. Anomalies: missing/fused fingers on both hands. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The hand holding the mug is anatomically correct with 5 fingers visible: thumb, index, middle, ring, and pinky. The grip on the handle is natural and realistic. No anomalies detected in finger count or positioning. The sweater cuff and skin texture also appear consistent with a real human hand. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The right hand holding the pen shows 5 fingers, but the grip is slightly unnatural — the thumb and index finger appear fused or overly close, and the middle finger seems to merge with the ring finger. The left hand resting on the notebook also has 5 fingers, but they are unnaturally elongated and lack clear knuckle definition. Both hands show subtle anatomical distortions typical of AI generation. VERDICT: FAIL
FAILEvaluator (Qwen 3.5 122B)
The right hand (strumming) shows a thumb and three fingers clearly, but the pinky is either missing or fused — only 4 distinct digits visible. The left hand (fretting) has 5 fingers, but the index finger appears to merge with the thumb or fretboard in an unnatural way, and the ring/pinky positioning is awkward. Anatomical correctness fails on both hands due to digit count and gesture anomalies. VERDICT: FAIL
FAILEvaluator (Qwen 3.5 122B)
The hand making the peace sign has five fingers total, but the anatomy is distorted: the thumb appears fused or malformed, and the curled fingers are unnaturally shaped and positioned. The gesture is recognizable but not anatomically correct — fingers lack proper joint definition and spacing. This violates the requirement for anatomical correctness. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
The image shows one hand giving a thumbs-up gesture. The thumb is extended upward, and the other four fingers are curled into a fist — anatomically correct for this gesture. No anomalies in finger count or structure are visible on the single hand shown. Since only one hand is present and it matches the prompt’s description (4 fingers curled, 1 thumb up), the core criteria is met. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The visible hand shows three extended fingers (index, middle, ring) and two curled fingers (pinky, thumb), totaling five — anatomically correct for a “count to three” gesture. However, the thumb is partially obscured and appears fused or malformed at the base, lacking clear separation from the palm. The wristband also looks unnaturally textured and attached. These anomalies suggest AI generation artifacts. VERDICT: FAIL
PASSEvaluator (Qwen 3.5 122B)
Upon close examination of the handshake: - Left hand (from viewer’s perspective): 5 fingers visible — thumb, index, middle, ring, pinky — all appear anatomically correct and properly positioned in the grip. - Right hand: 5 fingers also visible — thumb wrapped over, four fingers curled underneath — no extra or missing digits; grip is natural and realistic. No anomalies detected. Both hands show exactly 5 fingers with correct anatomy and gesture. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image shows two people high-fiving. The man’s hand (left) has 5 fingers clearly visible and extended. The woman’s hand (right) also shows 5 fingers, but the thumb is partially obscured by the man’s hand — however, it’s still anatomically plausible as part of a natural high-five grip. No extra or missing fingers; gesture is correct. Minor occlusion doesn’t violate anatomical correctness. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
Upon close examination, the hands appear anatomically correct with five fingers per hand. The grip on the gift box is natural and consistent with the action of handing over a present. No visible anomalies such as extra fingers, missing digits, or distorted joints are present. The lighting and focus support clear visibility of hand structure. VERDICT: PASS
Multi-SubjectClick to expand100%
Multi-Subject measures whether local/boogu-image-turbo keeps multiple people distinct while preserving their requested attributes, outfits, and positions.
PASSEvaluator (Qwen 3 VL)
Two women stand side by side. Left: short blonde hair, red jacket over striped shirt. Right: long black hair, blue dress. Both are visually distinct and match the prompt exactly. Features are accurate and well-defined. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
The two men match the description: one has a beard and glasses, wearing a green patterned sweater; the other is clean-shaven, in a black hoodie. Both are visually distinct and clearly identifiable. Background figures are unrelated. All prompt criteria are satisfied. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Tall person: curly red hair, denim overalls over striped shirt. Short person: straight brown hair, yellow floral sundress. Both visually distinct and match descriptions precisely. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Bald man in suit: present, smiling, left side. Silver-haired woman in red blouse: present, center-left. Young man with dreadlocks in denim jacket: present, center-right. Woman with hijab in green dress: present, right side. All visually distinct and match descriptions precisely. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Tall woman: short pink hair, playing bass — matches. Man with beard and bandana: on drums — matches. Slim man with glasses: playing keyboard — matches. All three are visually distinct and correctly positioned. Features align precisely with prompt. VERDICT: PASS
PASSEvaluator (Qwen 3 VL)
Man: wears sunhat and backpack. Woman: has braided hair, holds hiking poles, wears backpack. Teenager: wears baseball cap, holds water bottle, wears backpack. All three are visually distinct and match described features exactly. VERDICT: PASS
Truthfulness 70%
PhotorealismClick to expand100%
Photorealism measures whether local/boogu-image-turbo makes materials, surfaces, and real-world objects look visually plausible and distinct.
PASSEvaluator (Qwen 3.6 27B)
The image successfully renders the requested materials with high fidelity. The glass shows accurate refraction and transparency, the spoon exhibits sharp metallic reflections, the apple has a realistic waxy sheen, and the linen napkin displays proper fabric texture and soft folds. The wooden table features convincing grain and wear. Lighting is consistent across all objects, creating natural shadows and highlights that enhance the realism of each material’s physical properties. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The image successfully renders the requested materials with high fidelity. The brushed aluminum laptop shows appropriate metallic sheen and texture. The leather notebook displays realistic grain and thickness. The ceramic mug has a convincing matte, glazed finish. The glass paperweight exhibits correct refraction and reflection of the surroundings. The wooden desk surface shows natural grain and lighting interaction. All materials are visually distinct and realistic. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The image successfully renders the distinct material properties requested. The stainless steel pot and countertop exhibit sharp, accurate reflections. The wooden cutting board shows natural grain and a wet sheen where the oil has been poured. The glass bottle demonstrates correct translucency and refraction of the oil inside. The cotton towel displays appropriate fabric texture and soft folds. All surfaces interact with light realistically. VERDICT: PASS
Physics & ReflectionsClick to expand75%
Physics & Reflections measures whether local/boogu-image-turbo follows basic physical cues such as shadows, reflections, gravity, transparency, and contact points.
PASSEvaluator (Qwen 3.6 27B)
The red sphere casts a distinct shadow to the right and slightly downward, which is consistent with a light source positioned in the upper left. The shadow’s shape and softness are physically plausible for a diffuse light source. The sphere’s surface shows appropriate highlights and shading, enhancing its three-dimensional appearance. No distortions or inconsistencies in color or orientation are observed. VERDICT: PASS
FAILEvaluator (Qwen 3.6 27B)
The light source is clearly coming from the window on the left side of the frame. Consequently, the mug casts a shadow to the right, not the left as specified in the prompt. While the shadow's sharpness and color are physically plausible for the lighting conditions, the directional error makes the image fail the specific criteria of the prompt. VERDICT: FAIL
FAILEvaluator (Qwen 3.6 27B)
The shadow is cast directly beneath the subject, consistent with midday lighting. The direction and orientation align with the sun’s position overhead. However, the shadow appears slightly distorted—particularly around the feet and hat brim—with unnatural softness and irregular edges that don’t match the sharpness of surrounding elements. Color is appropriately dark gray/black, but the shape lacks physical precision. VERDICT: FAIL
FAILEvaluator (Qwen 3.6 27B)
The reflection shows the woman’s front view, which is physically correct for a mirror facing her. However, the prompt specifically requested the reflection to show the “back of her outfit,” which is physically impossible in a standard mirror setup. Since the image contradicts the specific prompt requirement regarding the reflection's content, it fails the evaluation criteria. VERDICT: FAIL
PASSEvaluator (Qwen 3.6 27B)
The reflection is physically plausible. It correctly mirrors the cat’s orientation and position, with appropriate distortion due to the floor’s curvature and lighting. The color and brightness match the real cat, slightly dimmed as expected in a reflection. Shadows align with the light source from the left. No obvious artifacts or inconsistencies are present. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The reflection on the glass table is physically plausible. It correctly mirrors the vase and flowers with appropriate vertical inversion and slight distortion due to the glass thickness and viewing angle. The lighting direction from the window casts consistent shadows and highlights on the flowers, vase, and surrounding furniture. The reflection’s brightness and color match the real objects, and the transparency of the glass allows both the reflection and the table’s wooden legs to be visible si
PASSEvaluator (Qwen 3.6 27B)
The reflections are vertically inverted, matching the trees' positions and shapes accurately. The water surface shows gentle ripples that distort the reflection slightly, consistent with a calm lake. The reflected colors are darker than the actual trees, which is physically plausible due to light absorption and scattering in water. The sunset glow is also mirrored appropriately. No directional or orientation errors are present. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The reflections in the puddle are physically plausible. They exhibit a correct vertical inversion of the skyline and streetlights. The mild distortion and blurring are consistent with the surface tension and ripples of a rain puddle. The color shift towards warmer tones in the reflection compared to the cooler sky is also realistic, mimicking how water surfaces reflect ambient light differently. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The reflection is physically plausible. It is correctly inverted vertically, matching the mountain range's orientation. The colors are subtly muted and slightly darker than the actual mountains, consistent with light absorption in water. The surface is calm, allowing for a clear mirror image with minimal distortion, which aligns with the "still alpine lake" description. The lighting on the peaks corresponds accurately to their reflections. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The image displays a glass sphere on a reflective surface near a pool. The reflection of the sphere on the mirror is correctly oriented and positioned directly beneath it. The refraction within the sphere shows an inverted, wide-angle view of the surroundings, which is physically accurate for a glass sphere. The lighting and shadows are consistent with a bright outdoor setting. The water ripples and their interaction with the light appear natural. The colors and distortions align with optical ph
PASSEvaluator (Qwen 3.6 27B)
The reflections on the chrome sculpture and glass floor are physically plausible, showing accurate environmental mapping with distorted views of the room and windows. The rainbow prismatic effects on the walls and floor are consistent with light refraction through glass or prisms. Shadows align with the light source direction from the windows. The orientation and distortion of reflections match the curved surfaces realistically. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The reflections and refractions are physically plausible. The wine glass shows realistic distortion of the background through its curved, faceted surface. The marble countertop displays a clear, slightly blurred reflection of the glass stem and base, consistent with a polished surface. The lake outside exhibits a mirror-like reflection of the distant hills, appropriate for calm water. Lighting direction is consistent across all elements, with highlights and shadows aligning logically. VERDICT:
World KnowledgeClick to expand58%
World Knowledge measures whether local/boogu-image-turbo depicts recognizable places, landmarks, flags, and known objects with accurate real-world details.
PASSEvaluator (Qwen 3.6 27B)
The Eiffel Tower is clearly recognizable and depicted on a clear day. The iron lattice structure is detailed, and the three levels are present. However, there are minor architectural inaccuracies: the top spire/antenna is missing, and the base arches appear slightly wider than in reality. Despite these small deviations, the overall representation is factually sound and architecturally coherent for a general depiction. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The image depicts the Taj Mahal with high fidelity. The central dome, four minarets, and arched entrances are architecturally accurate and symmetrically aligned. The reflection in the pool is clear and correctly mirrors the structure. The surrounding gardens and cypress trees match the real-world layout. No significant distortions or inaccuracies are present. The lighting and perspective enhance realism without compromising factual representation. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The Statue of Liberty is clearly recognizable and accurately depicted with correct green patina, torch, crown, tablet, and pedestal. Minor background elements (e.g., distant buildings, flag) are plausible. No major architectural or factual inaccuracies detected. VERDICT: PASS
FAILEvaluator (Qwen 3.6 27B)
The image depicts a shamisen, but it contains significant factual inaccuracies. The bridge is structurally nonsensical, appearing as a floating wooden piece rather than resting on the skin. The tuning pegs are oversized and oddly shaped, lacking the traditional mechanical design. The bachi (plectrum) is incorrectly positioned on the body instead of being held or placed separately. These structural errors make the instrument factually inaccurate. VERDICT: FAIL
FAILEvaluator (Qwen 3.6 27B)
The image depicts a complex brass instrument that resembles an astrolabe but contains significant factual inaccuracies. The internal mechanism features large, exposed clockwork gears and a central hand, which are not components of a traditional medieval astrolabe (which relies on a flat rete and tympan). The degree markings and symbols appear nonsensical and gibberish-like rather than accurate astronomical data. While visually detailed, it fails the criteria for factual accuracy. VERDICT: FAIL
PASSEvaluator (Qwen 3.6 27B)
The image accurately depicts a Japanese kintsugi bowl with visible gold lacquer repairs on cracked ceramic. The subject is clearly recognizable, and the artistic technique is correctly represented. The surrounding elements (chopsticks, tea utensils) enhance the cultural context without detracting from the main subject. No factual or architectural inaccuracies are present. VERDICT: PASS
FAILEvaluator (Qwen 3.6 27B)
The image depicts the water cycle with recognizable elements like the sun, ocean, clouds, and river. However, it contains significant text errors typical of AI generation, such as "ITHEI" instead of "THE," "FORE. CLOUDS," and "RAINN." These typos undermine the factual accuracy and clarity required for a detailed diagram. While the visual flow is correct, the textual inaccuracies make it fail the criteria for a high-quality, accurate educational diagram. VERDICT: FAIL
FAILEvaluator (Qwen 3.6 27B)
The image depicts the correct geological concept of a subduction zone, showing the oceanic plate diving beneath the continental plate, the resulting trench, and the volcanic arc. However, the text labels are nonsensical gibberish (e.g., "TONICGENT REOT SODIFFNT" instead of "Convergent Boundary"), which is a significant failure in rendering text. Despite the visual accuracy of the geological structures, the illegible and incorrect text makes the image factually flawed as an educational diagram.
FAILEvaluator (Qwen 3.6 27B)
The image is not factually accurate. The labels are nonsensical gibberish (e.g., "Oorta", "Corta", and illegible Chinese characters), rendering the diagram useless for educational purposes. Furthermore, the anatomy is incorrect: the aorta is depicted as a blue vessel (indicating deoxygenated blood), while the pulmonary artery is red, reversing the standard color coding and physiological flow. The vessel branching and chamber proportions are also distorted. VERDICT: FAIL
PASSEvaluator (Qwen 3.6 27B)
The image depicts a butterfly that closely resembles a Morpho species, featuring the characteristic iridescent blue dorsal wings and brown ventral patterns with eyespots. The body morphology appears correct for a butterfly in this genus. The details such as wing venation, coloration, and overall structure are accurate and recognizable. There are no significant inaccuracies noted. VERDICT: PASS
PASSEvaluator (Qwen 3.6 27B)
The image depicts the interior of the Hagia Sophia with high architectural accuracy. The massive central dome, pendentives, semi-domes, and the arrangement of columns and arches are correctly represented. The Islamic calligraphy medallions (featuring names of Allah, Muhammad, etc.) and the Byzantine mosaics (such as the seraphim on the pendentives) are present and appropriately placed. The lighting and perspective convey the grandeur of the space. No significant inaccuracies are observed. VERDI
PASSEvaluator (Qwen 3.6 27B)
The image depicts a bismuth crystal with high fidelity. It accurately portrays the characteristic geometric hopper crystal formation, featuring distinct staircase-like structures and sharp, angular facets. The iridescent oxide layers are rendered with realistic coloration—showing vibrant greens, blues, purples, and golds—that align with real-world bismuth specimens. Minor surface imperfections and natural variations in the oxide film enhance authenticity. The subject is clearly recognizable and
Professional Studio 93%
Camera & LightingClick to expand92%
Camera & Lighting measures whether local/boogu-image-turbo follows specific lens, angle, focus, exposure, and lighting instructions.
PASSEvaluator (Gemma 4 26B)
The image perfectly matches the prompt. It features a wide-angle shot of a modern, expansive living room with high ceilings. Most importantly, it exhibits clear barrel distortion at the edges (visible in the curved lines of the ceiling and floor), which was a specific requirement. The lighting is bright and natural, coming from the large floor-to-ceiling windows, creating an airy and open atmosphere. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image features a close-up macro shot of a single prominent dewdrop on a pink rose petal. The perspective is a macro view, and the depth of field is shallow, with the background and the edges of the petal appearing softly blurred. The lighting is bright and natural, highlighting the transparency and reflections within the main droplet. All specified camera and lighting elements are accurately represented. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image perfectly matches the prompt. It features a true aerial bird's-eye view, looking directly down (nadir perspective) at a traffic roundabout. The perspective is flat and perpendicular to the ground, with a deep depth of field that keeps the entire scene—from the central greenery to the surrounding roads and trees—in sharp focus. The lighting is even and natural, consistent with a high-altitude aerial shot. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image successfully implements Rembrandt lighting. There is a distinct, characteristic triangle of light on the shadowed cheek (the subject's left cheek), created by a light source positioned at an angle. The lighting creates a high-contrast, moody atmosphere with deep shadows, and the shallow depth of field focuses attention on the facial features, meeting all the specific requirements of the prompt. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image perfectly matches the prompt's requirements. The lighting is a distinct golden hour, characterized by a vibrant orange sky and a warm, soft glow emanating from the sun. There are long, dark shadows cast across the foreground and midground, and the low-angle perspective creates a strong sense of depth. The light flare and warm tones effectively capture the requested atmosphere. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image perfectly matches the prompt. It features a high-key portrait with a bright, white, overexposed background. The lighting is airy, luminous, and soft, with almost no visible shadows on the subject's face. The perspective is a close-up portrait, and the lighting pattern creates the desired ethereal, high-key aesthetic. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image perfectly matches the prompt. The perspective is a classic portrait shot. The depth of field is shallow, creating a creamy bokeh background with soft, circular light orbs. The lighting features a distinct warm rim light (backlighting) that illuminates the subject's hair and shoulders, effectively separating her from the blurred garden background. VERDICT: PASS
FAILEvaluator (Gemma 4 26B)
The image fails to meet the prompt's core requirement. The prompt specifies a "subject sharp" with a shallow depth of field. However, in this image, the subjects in the foreground are completely out of focus and blurry. Instead of a sharp subject with a blurred background, the entire scene—including the subjects—is heavily blurred, failing the instruction for a sharp subject. VERDICT: FAIL
PASSEvaluator (Gemma 4 26B)
The image matches all specified criteria. The perspective is a high-angle shot (from above) looking down at the plate. The depth of field is shallow, consistent with an f/2.8 aperture, as the foreground and background (window and glasses) are softly blurred. The lighting is natural light coming from the window on the left, creating soft shadows. The color temperature is warm, and there is a subtle vignette around the edges. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image perfectly executes the tilt-shift effect. It features a high-angle perspective looking down a city street, creating a miniature model appearance. The depth of field is highly selective, with a sharp focus strip in the middle ground and heavy blur (bokeh) in both the foreground and background. The colors are highly saturated and vibrant, matching the prompt's requirements for a miniature aesthetic. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image perfectly matches the prompt. It features a dark, moody scene with a shallow depth of field focusing on the woman's face. Most importantly, it displays prominent blue and orange anamorphic lens flares that streak horizontally across the frame, exactly as described. The lighting and cinematic perspective are highly consistent with the requested aesthetic. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image perfectly matches the prompt's requirements. It features a close-up portrait with dramatic split lighting. One side of the face is brightly illuminated, while the other side is cast in deep, heavy shadow, creating a high-contrast chiaroscuro effect. The lighting pattern is sharp and distinct, successfully dividing the face into light and dark halves as requested. VERDICT: PASS
Color PrecisionClick to expand100%
Color Precision measures whether local/boogu-image-turbo renders requested colors clearly and consistently across subjects, products, and backgrounds.
PASSEvaluator (Gemma 4 26B)
The image features a sports car in a bright, vivid, and unambiguous red color. The car is parked on a gray asphalt road, which matches the prompt's description perfectly. The colors are highly saturated and clearly defined as requested. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The woman is wearing a vibrant, deep blue evening dress that perfectly matches the description of "royal blue." The background is a solid, muted beige/grey, which qualifies as a "neutral background." All color requirements specified in the prompt are accurately represented in the image. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image perfectly matches the color descriptions provided in the prompt. The sunflowers are a vibrant, pure sunflower yellow, and the sky above is a clear, bright blue. There are no deviations in color or hue from the requested "pure sunflower yellow" and "clear blue sky." VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The wall is a vibrant burnt orange, which aligns with the hex code #FF6B35. The furniture, including the bed frame and nightstand, is white. The room is filled with bright, natural daylight from the window. The colors and elements perfectly match the prompt's specific requirements. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image features foliage in various shades of dark, muted green and brown tree trunks. The primary green used in the canopy and the forest floor closely aligns with the requested dark muted green tone (#2D5F2D). The trunks are a consistent medium brown. The color palette accurately follows the specific color requirements provided in the prompt. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image accurately reflects the prompt's requirements. The walls are a medium blue color consistent with #4A90D9. The floor is a warm wooden texture, and the window frame and baseboards are white trim. All color descriptions and the specific hex-coded color are correctly represented in the scene. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image accurately reflects the requested colors. The walls are a warm cream/off-white, consistent with #F5E6CC. The sofa is a deep, dark navy blue, matching #2C3E50. The accent pillows are a vibrant, bright red, consistent with #E74C3C. All color requirements from the prompt are met. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image accurately reflects the requested color palette. The background is a deep, dark navy blue consistent with #1A1A2E. The watch features a soft gold/champagne tone matching #F0C27F. The earbuds are a clean, pure white consistent with #FFFFFF. All three primary color requirements are met precisely. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image successfully follows the color specifications. The background is a near-white color consistent with #FAFAFA. The main organic shape is a vibrant orange-red, matching the #FF4500 requirement. The dark accent lines and the central text are a dark charcoal/black, consistent with the #333333 specification. All three requested colors are clearly present and accurately represented. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image shows a smooth gradient on a wall. On the far left, the color is a vibrant red, which matches #FF0000. On the far right, the color is a bright blue, which matches #0000FF. The transition through the middle features a clear purple/magenta hue, fulfilling the requirement of transitioning evenly through purple. The colors and the gradient direction perfectly match the prompt. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image displays a gradient that matches the requested colors. The bottom of the sky features a vibrant orange, consistent with #FF6B35. As the gradient moves upward, it transitions through a reddish-purple hue into a very dark, deep indigo at the top, which aligns with #1B1464. The color transition and the specific hex values requested are accurately represented in the image. VERDICT: PASS
PASSEvaluator (Gemma 4 26B)
The image features a horizontal banner with a gradient. On the left, the color is a bright cyan/azure, which aligns well with #00C9FF. On the right, the color transitions into a light mint green, which matches the description of #92FE9D. The gradient transition is smooth and follows the specified color direction accurately. VERDICT: PASS
PhotorealismClick to expand67%
Photorealism measures whether local/boogu-image-turbo produces polished product and studio imagery with realistic materials, lighting, shadows, and finish.
PASSEvaluator (Gemma 4 26B)
The image successfully meets the prompt's requirements. The lighting is soft and professional, creating a natural shadow on the white background. The texture of the red fabric and the white rubber sole appears highly realistic and consistent. There are no obvious AI artifacts, such as warped laces, nonsensical stitching, or distorted geometry. It looks like a high-quality, professional studio product photograph. VERDICT: PASS
FAILEvaluator (Gemma 4 26B)
The image fails to meet the prompt's requirements. The prompt specifies a "reflective black surface," but the image shows a light gray/white background with a bright light source (likely a softbox) visible in the upper left. Furthermore, the "reflective black surface" is actually a light-colored surface reflecting a dark object, which contradicts the prompt. The glass texture and the straw inside the bottle also exhibit unnatural, slightly warped AI-generated distortions. VERDICT: FAIL
PASSEvaluator (Gemma 4 26B)
The image successfully meets the criteria for a studio-quality product shot. The green apple features realistic skin texture, including subtle lenticels and a natural sheen. The lighting is soft and directional, creating a realistic highlight on the left and a soft, natural shadow on the right. The seamless white background is clean and professional. There are no obvious AI artifacts or distortions; the stem and fruit shape appear highly convincing and photorealistic. VERDICT: PASS
Graphical design 54%
Data VisualisationClick to expand0%
Data Visualisation measures whether local/boogu-image-turbo produces charts with correct labels, proportions, colors, and readable values.
FAILEvaluator (Qwen 3 VL)
The chart shows "Monthly Sales 2025" with labeled axes and color-coded bars. Values are 15K (Jan), 15K (Apr), 22K (May), 19K (Jul), 25K (June) — but missing February and March. X-axis labels are incorrect (e.g., "AGR" for Feb, "MA" for Mar, "JUL" for July). The requested months (Jan-Jun) are not correctly labeled. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
The pie chart shows incorrect percentages: “Others” is labeled 20% but displays 14%, “Company B” shows 25% but 24%, and “Company A” shows 35% but 4%. Labels are present but misaligned and inconsistent. The chart type is correct, but data accuracy is fatally flawed. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
The chart shows a line graph titled "Weekly Traffic" with axis labels, grid lines, and data points matching the requested values. However, the x-axis labels are illegible (e.g., "TuuV", "Anil", "Inpk-bndry"), and the y-axis scale is inconsistent (e.g., labels like "90" and "55" appear misaligned with the grid). The chart type is correct, but critical labeling errors render it unusable. VERDICT: FAIL
Layout & DesignClick to expand22%
Layout & Design measures whether local/boogu-image-turbo organizes posters, menus, covers, and other designed compositions with the requested hierarchy and elements.
PASSEvaluator (Qwen 3 VL)
The image features a large popcorn bucket centered, with "MOVIE NIGHT" in bold at the top and "Every Friday at 8 PM" at the bottom. Text is readable and correctly placed. The layout matches the prompt’s core elements and visual hierarchy. All required zones are present and correctly positioned. VERDICT: PASS
FAILEvaluator (Qwen 3 VL)
The image features a guitar illustration and prominent text, but the headline reads "MUSIC FEST" split across lines with "USIC" and "FST" below, not "LIVE MUSIC FEST" as specified. The tagline "Tickets available now" is misrendered as "Tiickets avitb adbel now." Text hierarchy is present but the exact prompt elements are incorrect and unreadable. VERDICT: FAIL
PASSEvaluator (Qwen 3 VL)
The image features an open book illustration centered below the prominent title "JOIN THE BOOK CLUB". The tagline "Meets every Wednesday" is clearly visible in a banner at the bottom. Text is readable and correctly placed with proper visual hierarchy. All required layout elements are present and accurately rendered. VERDICT: PASS
FAILEvaluator (Qwen 3 VL)
The image shows a landing page with a hero header image, a headline/subtitle, and a “Sign Up” button — all correctly placed. Social media icons are visible in the footer. However, the headline and subtitle text is garbled and unreadable, violating the requirement for readable text. Visual hierarchy is intact, but illegible text undermines functionality. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
The image shows a newsletter layout with a banner header, two-column body (text left, image right), a highlighted quote block, and a footer with an unsubscribe link. However, all text is gibberish and unreadable, violating the requirement for readable, correctly placed text. Layout structure is present but content fails core usability. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
The menu has a logo header, three category sections (Appetizers, Mains, Desserts) with items and prices, and a footer with address/hours. However, all text is gibberish or nonsensical, making it unreadable and violating the prompt’s requirement for correct, readable content. Layout structure is present but text fails functional purpose. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
The layout matches the prompt: left page has a large hero photo; right page has two columns of text, a pull quote, a sidebar with author bio, and page number. Visual hierarchy is clear. However, all text is gibberish placeholder, not readable. The core requirement of readable text is unmet. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
The image shows a mobile app UI with a top navigation bar, search field, 2x2 grid of feature cards with icons, “RECENTS” list, and bottom tab bar — all present. However, text is garbled and unreadable, violating correct placement and legibility. Visual hierarchy is mostly intact, but illegible text undermines usability. VERDICT: FAIL
FAILEvaluator (Qwen 3 VL)
The left page features a full-bleed photo as required. The right page has a three-column layout with a headline, body copy, and a smaller inset image. Page numbers are present. However, all text is gibberish and unreadable, violating the requirement for readable, correctly placed text. Layout structure is otherwise accurate. VERDICT: FAIL
Style DiversityClick to expand92%
Style Diversity measures whether local/boogu-image-turbo can shift between visual styles like oil painting, pixel art, watercolor, and technical illustration.
PASSEvaluator (Qwen 3.5 122B)
The image clearly exhibits an oil painting style with visible, textured brushstrokes throughout. The golden retriever is depicted sitting in a garden setting with flowers and greenery. The rich texture and painterly quality are evident in the fur, foliage, and background elements. The style matches the requested "oil painting with visible brushstrokes and rich texture" description accurately. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image clearly exhibits the requested Japanese anime style. Key characteristics are present: the golden retriever has large, expressive eyes with prominent highlights, simplified facial features, and clean line art. The colors are vibrant yet relatively flat with minimal shading gradients, consistent with anime aesthetics. The overall composition and character design align well with the prompt’s stylistic requirements. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image clearly depicts a golden retriever in a garden setting rendered in pixel art style. The 32x32 grid is visibly overlaid on the image, and the color palette is intentionally limited with distinct pixel blocks. Key characteristics of pixel art — blocky edges, low resolution, and grid-based composition — are all present and clearly recognizable. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image clearly exhibits the requested Art Deco style. Key characteristics are present: strong geometric shapes in the buildings and street layout, a distinct gold and black color palette, and ornamental symmetry centered around the main building. The typography on the signs and the design of the cars also fit the era. The visual language is consistent with the prompt's requirements. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image clearly exhibits Impressionist characteristics: soft, visible brushstrokes are evident throughout, especially in the foliage and water reflections. Dappled light filters through trees, creating a play of light and shadow on the woman and ground. The palette is dominated by pastels — pale blues, pinks, greens, and creams — consistent with the style. The overall effect is atmospheric and painterly, matching the requested aesthetic. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image clearly exhibits Pop Art characteristics: bold black outlines define the figure, vivid primary and secondary colors are used in flat blocks, and Ben-Day dots are prominently featured throughout the hair, face, and background. The stylized, graphic quality matches the requested aesthetic precisely. All key visual elements of the specified art style are present and recognizable. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image clearly exhibits watercolor characteristics: visible paper texture, pigment bleeds, splatters, and organic edge diffusion. The lighting mimics photorealism with sunbeams filtering through trees, while the medium remains unmistakably watercolor. Key requested traits — photo-like detail in foliage and light, combined with traditional watercolor artifacts — are all present and well-executed. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image successfully combines cyberpunk and Art Nouveau elements. The character features neon accents, gears, and a futuristic city background (cyberpunk), while the composition is framed by ornate, flowing floral borders and organic lines (Art Nouveau). The color palette utilizes vibrant neons against darker tones. All requested stylistic characteristics are clearly present and well-integrated. VERDICT: PASS
FAILEvaluator (Qwen 3.5 122B)
The image successfully combines medieval castle architecture with Japanese ukiyo-e woodblock print aesthetics. Key characteristics include the stylized waves in the foreground, flat color planes, visible wood grain texture, and traditional Japanese text seals. However, the sky and lighting are not photorealistic — they retain the illustrative, stylized quality of ukiyo-e, with painted clouds and sunbeams that match the overall artistic style rather than mimicking real-world photography. Thus, th
PASSEvaluator (Qwen 3.5 122B)
The image clearly exhibits soft brutalism: raw concrete walls, ceiling, and floor are prominent. Rounded pastel furniture (pink sofa, yellow and mint chairs) is present. Warm diffused light streams through sheer curtains, creating a gentle glow. All key visual characteristics of the requested style are accurately represented. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image successfully embodies the requested "anxious minimalism" style. The composition is sparse, featuring a simple table with a bowl, brushes, and cloth against a largely empty wall. The color palette is muted, dominated by grays and browns. The negative space is prominent, particularly in the upper right where faint, sketch-like lines create subtle visual tension and unease. The overall mood is quiet and slightly unsettling, matching the prompt's requirements. VERDICT: PASS
PASSEvaluator (Qwen 3.5 122B)
The image successfully embodies the requested "nostalgic futurism" and "retro-futuristic 1960s space-age aesthetics." Key characteristics are clearly present: the architecture features classic mid-century modern elements like domes, spires, and sleek, rounded forms reminiscent of "The Jetsons" or 1960s World's Fair designs. The color palette is a distinct warm, sepia-toned monochrome that creates the requested "melancholic warm-toned patina," mimicking aged photography. The flying vehicles and o