fal/krea/v2-medium-turbo vs fal/bria/fast

192 evaluations across 6 categories

Benchmark V1 verdicts are produced by VLM judges and can contain mistakes. Treat PASS/FAIL labels as machine-assisted assessments, and inspect the images yourself. Learn more about the methodology.

Pass Rate: 80% vs 72%

Avg Latency: 15.5s vs 12.4s
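The two headline figures can be read as a trade-off. The sketch below computes the pass-rate gap in percentage points and the latency ratio. It assumes the first value in each pair belongs to fal/krea/v2-medium-turbo (consistent with the per-prompt verdicts on this page); the function and key names are hypothetical.

```python
def compare_headline(pass_a: float, pass_b: float, lat_a: float, lat_b: float) -> dict:
    """Compare two models' headline numbers (illustrative helper, not part of the benchmark)."""
    return {
        # Difference in pass rate, in percentage points
        "pass_rate_gap_pts": round(pass_a - pass_b, 1),
        # How many times slower (or faster) model A is than model B
        "latency_ratio": round(lat_a / lat_b, 2),
    }

# Values taken from the summary above: 80% vs 72%, 15.5s vs 12.4s
summary = compare_headline(80.0, 72.0, 15.5, 12.4)
print(summary)
```

Under that reading, the first model passes 8 percentage points more often while taking roughly a quarter longer per image on average.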

Text Rendering 67% vs 47%

Typography Style 67% vs 100%

Typography Style compares whether fal/krea/v2-medium-turbo or fal/bria/fast better preserves requested lettering styles, font mood, and text texture.

Easy 67% vs 100%

fal/krea/v2-medium-turbo: PASS

The word 'CHAPTER ONE' typed on aged paper with a vintage typewriter font, complete with slightly uneven ink

Evaluator (Qwen 3 VL)

Text reads “CHAPTER ONE” correctly. Font mimics vintage typewriter style with blocky, uniform letters. Background shows aged paper with stains and texture. Ink appears slightly uneven and smudged, matching “slightly uneven ink” description. All elements convincingly simulate the requested medium. VERDICT: PASS
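Each evaluator response on this page ends with a `VERDICT: PASS` or `VERDICT: FAIL` marker. The sketch below shows one way such a label could be pulled out of the free-text response. The regular expression and the `extract_verdict` name are assumptions for illustration; the benchmark's actual parsing is not described here.

```python
import re

# Assumed convention: the verdict is the last thing in the evaluator's response.
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\s*$")

def extract_verdict(evaluator_text: str):
    """Return 'PASS' or 'FAIL', or None when no trailing verdict is present."""
    match = VERDICT_RE.search(evaluator_text.strip())
    return match.group(1) if match else None

complete = "Spelling and style align with prompt. VERDICT: PASS"
truncated = "All elements are coherently combined in a sur"

print(extract_verdict(complete))
print(extract_verdict(truncated))
```

Returning `None` rather than guessing matters because some evaluator responses on this page are cut off before the verdict, and those cases are better inspected by hand.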

fal/krea/v2-medium-turbo: PASS

The phrase 'TODAY'S SPECIAL' written in white chalk on a dark green chalkboard with chalk dust

Evaluator (Qwen 3 VL)

Text reads "TODAY'S SPECIAL" correctly. Letters show convincing chalk texture and uneven edges on dark green surface, matching chalkboard medium. Chalk dust effect is subtly implied. Spelling and style align with prompt. VERDICT: PASS

fal/krea/v2-medium-turbo: FAIL

The word 'OPEN' displayed on a red LED sign with glowing pixel-style letters

Evaluator (Qwen 3 VL)

The text "OPEN" is rendered in a glowing red pixel-style LED matrix, matching the described medium. However, the letters are misshapen and distorted, failing to clearly spell "OPEN" — the 'O' and 'N' are particularly illegible. The core requirement of correct spelling is not met. VERDICT: FAIL

Writing accuracy 67% vs 33%

Writing accuracy compares whether fal/krea/v2-medium-turbo or fal/bria/fast more reliably renders the exact requested words, numbers, labels, and short phrases.

Easy 67% vs 100%

fal/krea/v2-medium-turbo: PASS

A single large word 'HELLO' centered on a white background

Evaluator (Qwen 3 VL)

The word "HELLO" is fully spelled correctly and is clearly readable. All letters are intact, properly formed, and centered on the white background. No misspellings, missing letters, or illegible parts are present. The text matches the intended specification exactly. VERDICT: PASS

fal/krea/v2-medium-turbo: FAIL

The word 'ADVENTURE' in big bold letters centered on a plain cream background

Evaluator (Qwen 3 VL)

The word "ADVENTURE" is spelled correctly but is fragmented into three lines: "ADV", "EN", "TURE". The letters are fully readable, but the intended single word is visually broken. This violates the requirement for the word to be presented as one continuous unit. VERDICT: FAIL

fal/krea/v2-medium-turbo: PASS

The word 'SERENITY' written large and centered on a solid light blue background

Evaluator (Qwen 3 VL)

The word "SERENITY" is spelled correctly and fully readable, though rendered in a stylized, brush-like font. All letters are present and distinct against the solid light blue background. No misspellings or illegible parts are observed. The text is centered as intended. VERDICT: PASS

Medium 67% vs 33%

fal/krea/v2-medium-turbo: PASS

The sentence 'The quick brown fox jumps high' in clean black text on a white background

Evaluator (Qwen 3 VL)

Text reads: "The quick brown fox jumps high." All words are spelled correctly and fully legible. Black text on white background matches intent. No misspellings or missing words. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

'Every sunset brings a new dawn ahead' written neatly on a pale yellow background

Evaluator (Qwen 3 VL)

Text reads: "Every sunset brings a new dawn ahead". All words are spelled correctly and fully readable against the pale yellow background. No misspellings or illegible parts. The phrase matches the intended text exactly. VERDICT: PASS

fal/krea/v2-medium-turbo: FAIL

'Stars shine brightest in the dark night' displayed on a clean off-white background

Evaluator (Qwen 3 VL)

Text reads: "Stars shine brightest in the the dark night". Contains a duplicated "the" and missing space. "the the" is a clear error. Text is readable but inaccurate. VERDICT: FAIL

Hard 100% vs 0%

fal/krea/v2-medium-turbo: PASS

A poster with headline 'SUMMER SALE' in large bold text, body text 'Up to 50% off all items this weekend only' in medium size, and fine print 'Terms and conditions apply. See store for details.' at the bottom in small text

Evaluator (Qwen 3 VL)

All text matches the intended message exactly. "SUMMER SALE" is prominent. Body text reads "Up to 50% off all items this weekend only". Fine print: "Terms and conditions apply. See store for details." All words are spelled correctly and fully readable against the blue background. No errors detected. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A flyer with large title 'GRAND OPENING', subtitle 'Join us for food, music, and fun' in medium text, and small disclaimer 'RSVP required. Limited seating available.' at the bottom

Evaluator (Qwen 3 VL)

All text matches the intended message exactly. "GRAND OPENING" is correct. Subtitle "Join us for food, music, and fun" is accurate. Disclaimer "RSVP required. Limited seating available." is fully readable and correct. No misspellings or illegible parts detected. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

An event banner with headline 'ANNUAL GALA 2025' at the top in large serif font, description 'An evening of fine dining and live jazz' in medium text, and footnote 'Black tie optional. Doors open at 6 PM.' in small text

Evaluator (Qwen 3 VL)

All text matches the intended content exactly: "ANNUAL GALA 2025", "An evening of fine dining and live jazz.", and "Black tie optional. Doors open at 6 PM." No misspellings, missing words, or illegible parts. Font styles and sizes are appropriate. All text is fully readable and accurate. VERDICT: PASS

Extreme 33% vs 0%

fal/krea/v2-medium-turbo: PASS

A vintage movie poster for 'THE LAST VOYAGE' featuring a tagline 'One ship. One chance. No turning back.' with credits block reading 'Directed by James Harlow. Starring Elena Voss, Marcus Chen, and Sofia Reyes. Music by Thomas Grant. A Paramount Pict

Evaluator (Qwen 3 VL)

All text matches the intended content exactly: title, tagline, credits, and release date. No misspellings, missing words, or illegible parts. Font, layout, and vintage styling are consistent with the prompt. Text is fully readable and accurate. VERDICT: PASS

fal/krea/v2-medium-turbo: FAIL

A magazine cover titled 'NATIONAL EXPLORER' with cover lines 'The Hidden Temples of Cambodia — page 34', 'Interview: Dr. Maria Santos on Climate Action', 'Top 10 Trails You Must Hike This Year', and a barcode area with 'Issue 247 | March 2025 | $6.99

Evaluator (Qwen 3 VL)

“Cambida” is misspelled (should be “Cambodia”). All other text matches the intended content exactly. The barcode and issue details are readable. Minor blurring affects legibility slightly but doesn’t obscure critical text. Core misspelling prevents full accuracy. VERDICT: FAIL

fal/krea/v2-medium-turbo: FAIL

A product label for 'GOLDEN HARVEST ORGANIC HONEY' with subtitle 'Raw & Unfiltered', weight '12 oz (340g)', description 'Pure wildflower honey harvested from sustainable apiaries in the Pacific Northwest. No additives or preservatives.', and small te

Evaluator (Qwen 3 VL)

The word "Unfiltered" is misspelled as "Unflterted" (missing 'i'). All other text matches the intended content exactly. The misspelling is clearly visible and incorrect. VERDICT: FAIL
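The Writing accuracy figures for fal/krea/v2-medium-turbo can be reproduced from the twelve verdicts listed above. The sketch below is a worked check, assuming each percentage is the share of PASS verdicts rounded to a whole number; the rounding rule and the `pass_rate` helper are assumptions, since the page does not state how the figures are computed.

```python
def pass_rate(verdicts):
    """Share of PASS verdicts as a whole-number percentage (assumed rounding)."""
    passed = sum(v == "PASS" for v in verdicts)
    return round(100 * passed / len(verdicts))

# Verdicts for fal/krea/v2-medium-turbo, in the order shown on this page
writing_accuracy = {
    "Easy": ["PASS", "FAIL", "PASS"],
    "Medium": ["PASS", "PASS", "FAIL"],
    "Hard": ["PASS", "PASS", "PASS"],
    "Extreme": ["PASS", "FAIL", "FAIL"],
}

per_level = {level: pass_rate(v) for level, v in writing_accuracy.items()}
overall = pass_rate([v for level in writing_accuracy.values() for v in level])

print(per_level)
print(overall)
```

With three prompts per level, each level can only land on 0%, 33%, 67%, or 100%, so a single verdict moves a level by a third. That is worth keeping in mind when comparing per-level numbers.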

Spatial Reasoning 86% vs 75%

Attributes Binding 100% vs 89%

Attributes Binding compares whether fal/krea/v2-medium-turbo or fal/bria/fast better keeps each requested color, material, or property attached to the correct object.

Easy 100% vs 100%

fal/krea/v2-medium-turbo: PASS

A red ball and a blue cube sitting side by side on a white surface

Evaluator (Qwen 3.5 122B)

The image shows two objects on a white background: 1. Left object: Red sphere (ball) — correct color and shape. 2. Right object: Blue cube — correct color and shape. Both are side by side, as requested. No attribute swaps observed. Shadows are present but not specified in prompt — acceptable. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A green apple and an orange pumpkin on a wooden cutting board

Evaluator (Qwen 3.5 122B)

The image shows a green apple and an orange pumpkin on a wooden cutting board. The apple is round, green, and has a small stem. The pumpkin is round, orange, and has a curved stem. Both objects are correctly colored and shaped as described. No attributes are swapped. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A yellow taxi and a black limousine parked next to each other on a street

Evaluator (Qwen 3.5 122B)

The image shows a yellow taxi on the left and a black limousine on the right, parked side by side on a street. The taxi is correctly colored yellow with a roof light labeled "CXIT." The limousine is correctly colored black and elongated, consistent with its type. All attributes (color, size, shape) are correctly bound to the respective objects with no swaps. VERDICT: PASS

Medium 100% vs 67%

fal/krea/v2-medium-turbo: PASS

A large red circle, a small blue triangle, and a medium green star arranged on a white canvas

Evaluator (Qwen 3.5 122B)

The image contains: - A large red circle (correct color, size, shape) - A small blue triangle (correct color, size, shape) - A medium green star (correct color, size, shape) All attributes are correctly bound to their respective objects with no swaps. The arrangement on a white canvas is also accurate. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A tall thin white candle, a short fat red candle, and a medium spiral yellow candle on a shelf

Evaluator (Qwen 3.5 122B)

The image shows three candles on a shelf: 1. Left: Tall, thin, white candle — matches description. 2. Center: Medium height, spiral-shaped, yellow candle — matches description. 3. Right: Short, fat, red candle — matches description. All attributes (color, size, shape) are correctly bound to each object with no swaps. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A big brown horse, a small white rabbit, and a medium-sized black cat standing in a meadow

Evaluator (Qwen 3.5 122B)

The image shows a big brown horse on the left, a small white rabbit in the center foreground, and a medium-sized black cat sitting (not standing) on the right. All color and size attributes are correctly bound to their respective objects. The cat’s posture is slightly off (“standing” vs. “sitting”), but this is minor. Core object-attribute binding is accurate. VERDICT: PASS

Hard 100% vs 100%

fal/krea/v2-medium-turbo: PASS

A pink fire truck, a blue banana, and a green sun in a surreal landscape

Evaluator (Qwen 3.5 122B)

The image shows a pink fire truck with a ladder and red siren, correctly colored. A large blue banana is present, matching the requested color and shape. The sun in the background is green, as specified. All objects are correctly bound to their attributes: pink fire truck, blue banana, green sun. No swaps or errors observed. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A purple carrot, a red tree with red leaves, and a white watermelon sliced open

Evaluator (Qwen 3.5 122B)

The image shows: - A purple carrot with green leaves — correct color and shape. - A brown tree trunk with red leaves — “red tree” is ambiguous; trunk is brown, not red. Leaves are red — correct. - A sliced watermelon with white flesh and black seeds — correct, though rind is green (not specified in prompt). “Red tree” likely refers to leaves, but trunk isn’t red — minor mismatch. Core attributes mostly correct. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

An orange elephant, a yellow penguin, and a silver strawberry arranged on a table

Evaluator (Qwen 3.5 122B)

The image shows three objects: an orange elephant, a yellow penguin, and a silver strawberry. All colors match the prompt exactly. The shapes are stylized but clearly represent the requested animals and fruit. No attributes are swapped — each object retains its correct color and form. Size is relative and not specified in the prompt, so no issue there. VERDICT: PASS

Compositionality 78% vs 100%

Compositionality compares whether fal/krea/v2-medium-turbo or fal/bria/fast better combines the requested subjects, actions, props, and scene details into one coherent image.

Easy 67% vs 100%

fal/krea/v2-medium-turbo: PASS

A tabby cat wearing oversized red sunglasses

Evaluator (Qwen 3.5 122B)

The image shows a tabby cat with distinct stripes and markings. It is wearing oversized red sunglasses that cover its eyes. The sunglasses are clearly visible and fit the description of being oversized and red. The tabby pattern on the cat's fur is also evident. All described elements are present and coherently combined in the image. VERDICT: PASS

fal/krea/v2-medium-turbo: FAIL

A golden retriever wearing a tiny cowboy hat

Evaluator (Qwen 3.5 122B)

The image contains: - Golden retriever: Present and clearly visible. - Cowboy hat: Present, worn by the dog. - "Tiny" descriptor: The hat is proportionally large on the dog’s head, not tiny. While the core elements (dog + hat) are present and coherently combined, the “tiny” qualifier is inaccurate — the hat appears normal-sized relative to the dog’s head, not miniature. This misalignment with the prompt’s specific descriptor affects fidelity. VERDICT: FAIL

fal/krea/v2-medium-turbo: PASS

A penguin holding a colorful umbrella

Evaluator (Qwen 3.5 122B)

The image contains all required elements: - A penguin: Present, cartoon-style, black and white with orange beak/feet. - Holding: Present, the penguin’s flipper grips the umbrella handle. - Colorful umbrella: Present, rainbow-colored panels (red, yellow, green, blue, purple). All elements are coherently combined in a single, logical scene. No contradictions or missing components. VERDICT: PASS

Medium 67% vs 100%

fal/krea/v2-medium-turbo: PASS

A medieval knight in full armor sitting at a desk typing on a modern laptop

Evaluator (Qwen 3.5 122B)

The image shows: - Medieval knight in full armor: Present (helmet, chainmail, plate armor, gauntlets) - Sitting at a desk: Present (wooden desk with papers) - Typing on modern laptop: Present (silver laptop being typed on) - Coherent combination: Yes (anachronistic but visually consistent) All elements are present and coherently combined despite the unusual concept. VERDICT: PASS

fal/krea/v2-medium-turbo: FAIL

A Roman gladiator taking a selfie with a smartphone in the Colosseum

Evaluator (Qwen 3.5 122B)

Analysis of elements: 1. Roman gladiator: Present (helmet, armor, red tunic). 2. Selfie: Present (arm extended, camera angle, smiling at lens). 3. Smartphone: NOT VISIBLE (the device is not shown, though implied by the pose). 4. Colosseum: Present (background architecture matches). While the scene is coherent, the specific object "smartphone" is missing from the visual frame. The prompt asks for the smartphone to be present. VERDICT: FAIL

fal/krea/v2-medium-turbo: PASS

An astronaut in a spacesuit riding a bicycle through a park

Evaluator (Qwen 3.5 122B)

The image shows an astronaut in a full white spacesuit with helmet, riding a black bicycle with a front basket. The setting is a park with trees, grass, and fallen leaves on a dirt path. All prompt elements — astronaut, spacesuit, bicycle, park — are clearly present and coherently combined. No contradictions or missing components. VERDICT: PASS

Hard 100% vs 100%

fal/krea/v2-medium-turbo: PASS

An octopus wearing a top hat and monocle playing chess on the surface of the moon with Earth visible in the background

Evaluator (Qwen 3.5 122B)

The image contains all described elements: - Octopus: Present, blue-green with textured skin. - Top hat: Present, black, worn on the octopus’s head. - Monocle: Present, gold-rimmed, attached to a chain, over one eye. - Chess game: Present, board with black and white pieces, octopus tentacles interacting with pieces. - Moon surface: Present, gray, cratered terrain. - Earth in background: Present, visible in upper right, showing continents and oceans. All elements are coherently combined in a sur

fal/krea/v2-medium-turbo: PASS

A dinosaur in a business suit giving a PowerPoint presentation in a modern office while drinking coffee

Evaluator (Qwen 3.5 122B)

Let’s examine each element from the prompt: 1. **Dinosaur** – Present: A green cartoon dinosaur is clearly visible. 2. **In a business suit** – Present: The dinosaur wears a dark suit, white shirt, and red tie. 3. **Giving a PowerPoint presentation** – Present: Standing before a screen displaying “PovenFort” with bar charts; gesturing as if presenting. 4. **In a modern office** – Present: Background shows office windows, ceiling lights, desks, laptop, and documents — consistent with a modern of

fal/krea/v2-medium-turbo: PASS

A robot dressed as a pirate sailing a paper boat on a lake of lava while juggling three kittens

Evaluator (Qwen 3.5 122B)

The image contains all described elements: - Robot dressed as a pirate: Present (wearing pirate hat with skull emblem) - Paper boat: Present (origami-style boat) - Lake of lava: Present (red-orange swirling liquid resembling lava) - Three kittens: Present (three white cats, one in air, two on robot’s hands) - Juggling action: Coherently combined (robot holding cats in juggling pose) All elements are present and coherently integrated. VERDICT: PASS

Counting 78% vs 44%

Counting compares whether fal/krea/v2-medium-turbo or fal/bria/fast better creates the exact number of requested objects without additions or omissions.

Easy 100% vs 67%

fal/krea/v2-medium-turbo: PASS

Exactly 3 red apples arranged on a wooden table

Evaluator (Qwen 3.5 122B)

The image displays three red apples placed on a light wooden surface. The apples are arranged in a slight arc, with the middle one slightly forward. All three are clearly visible and match the description of “red apples.” The background is a wooden table with visible grain. The count exactly matches the prompt’s request for “exactly 3 red apples.” VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

Exactly 5 yellow rubber ducks sitting in a row on a bathroom shelf

Evaluator (Qwen 3.5 122B)

The image shows five yellow rubber ducks arranged in a row on a reflective bathroom shelf, with white tiled walls in the background. All ducks are identical in color and design, facing forward or slightly angled. The count matches exactly: 5 ducks, all yellow, on a shelf. No extra or missing objects. Lighting and shadows are consistent with natural indoor lighting. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

Exactly 4 white candles standing on a dark surface

Evaluator (Qwen 3.5 122B)

The image displays exactly four white cylindrical candles arranged in a row on a dark, textured surface. Each candle has a visible wick and is unlit. The count matches the requested quantity precisely. VERDICT: PASS

Medium 100% vs 0%

fal/krea/v2-medium-turbo: PASS

2 orange cats and 3 black dogs sitting together on a grassy lawn

Evaluator (Qwen 3.5 122B)

The image shows two orange tabby cats in the front row and three black dogs (likely Labrador Retrievers) positioned behind and beside them. All animals are sitting on a grassy lawn. The count of 2 orange cats and 3 black dogs exactly matches the prompt’s request. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

4 red roses and 2 white daisies in a glass vase on a table

Evaluator (Qwen 3.5 122B)

The image shows 4 red roses and 2 white daisies arranged in a clear glass vase on a light-colored surface (likely a table). The counts match the prompt exactly: 4 roses, 2 daisies, 1 vase, on a table. All requested elements are present and correctly counted. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

1 blue butterfly and 3 yellow butterflies resting on a garden fence

Evaluator (Qwen 3.5 122B)

The image shows 1 blue butterfly and 3 yellow butterflies resting on a wooden garden fence. The count matches the prompt exactly: one blue, three yellow. All are positioned on the fence as described. No extra or missing butterflies. VERDICT: PASS

Hard 33% vs 67%

fal/krea/v2-medium-turbo: FAIL

Exactly 9 goldfish swimming in a large round fishbowl

Evaluator (Qwen 3.5 122B)

The image shows a round fishbowl on a white stand with multiple goldfish inside. Counting the visible fish: there are 10 goldfish clearly visible (including one partially obscured on the left). The prompt requested exactly 9 goldfish. Since the count is 10, not 9, it does not match. VERDICT: FAIL

fal/krea/v2-medium-turbo: PASS

Exactly 11 colorful balloons floating against a clear blue sky

Evaluator (Qwen 3.5 122B)

I count the balloons in the image: 1. Large red (top center) 2. Green (left of top red) 3. White (right of top red) 4. Yellow (behind white and blue) 5. Light blue (center, below top red) 6. Pink (left side, middle) 7. Large yellow (bottom left) 8. Pink (bottom center-left) 9. Red (bottom center, partially hidden) 10. Large blue (center right) 11. Red (far right) Total: 11 balloons. All are colorful and floating against a clear blue sky. The count exactly matches the request. VERDICT: PASS

fal/krea/v2-medium-turbo: FAIL

A fruit bowl containing exactly 3 bananas, 4 oranges, and 2 apples

Not generated: provider error during generation (HTTP 500).
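The fruit bowl entry has no image because generation failed, yet it is labelled FAIL. The sketch below checks which treatment of a missing image reproduces the Counting figures shown for fal/krea/v2-medium-turbo (Hard 33%, overall 78%). The `pass_rate` helper, the use of `None` for a missing image, and the rounding rule are assumptions for illustration; the page does not document the scoring rule.

```python
def pass_rate(outcomes, count_missing_as_fail=True):
    """Whole-number pass rate; None marks an image that was never generated."""
    if not count_missing_as_fail:
        outcomes = [o for o in outcomes if o is not None]
    passed = sum(o == "PASS" for o in outcomes)
    return round(100 * passed / len(outcomes))

# Counting outcomes for fal/krea/v2-medium-turbo, in page order
counting = {
    "Easy": ["PASS", "PASS", "PASS"],
    "Medium": ["PASS", "PASS", "PASS"],
    "Hard": ["FAIL", "PASS", None],  # None: provider error (HTTP 500)
}

hard_rate = pass_rate(counting["Hard"])
all_outcomes = [o for level in counting.values() for o in level]
overall_rate = pass_rate(all_outcomes)
hard_rate_excluding = pass_rate(counting["Hard"], count_missing_as_fail=False)

print(hard_rate, overall_rate, hard_rate_excluding)
```

Counting the missing image as a failure matches the 33% and 78% shown on this page, while excluding it would give 50% for Hard. This suggests provider errors lower a model's pass rate here even though no image was judged.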

Negation 89% vs 78%

Negation compares whether fal/krea/v2-medium-turbo or fal/bria/fast better respects what should be absent from the image.

Easy 100% vs 100%

fal/krea/v2-medium-turbo: PASS

A tropical beach scene with no people

Evaluator (Qwen 3.5 122B)

The image depicts a tropical beach scene with palm trees, ocean, and sky. There are no people visible anywhere in the image — no figures on the sand, in the water, or under the trees. The scene is entirely devoid of human presence, which aligns with the prompt’s exclusion criterion. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A cat lying on a couch without a collar

Evaluator (Qwen 3.5 122B)

The image shows a light-colored cat with tabby markings lying on a dark textured surface, likely a couch. The cat’s neck is clearly visible and unobstructed — there is no collar present. The prompt explicitly excluded “a collar,” and this element is indeed absent. All other described elements (cat, lying, couch) are consistent with the image. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A classroom with desks and a chalkboard but no students

Evaluator (Qwen 3.5 122B)

The image shows an empty classroom with multiple wooden desks and chairs arranged in rows. A large black chalkboard is mounted on the wall at the front. There are no students visible anywhere in the scene — the room is completely unoccupied. The excluded element (students) is truly absent. VERDICT: PASS

Medium 100% vs 67%

fal/krea/v2-medium-turbo: PASS

A sandy beach with palm trees, seashells, and a beach umbrella but no water or ocean visible

Evaluator (Qwen 3.5 122B)

The image shows a sandy beach scene with two palm trees on the left, a red-and-white striped beach umbrella on the right, and scattered seashells and starfish in the foreground. The background is a clear blue sky. There is no water, ocean, or sea visible anywhere in the image — the horizon line is simply where sand meets sky. The excluded element (water/ocean) is fully absent. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

An office workspace with a desk, chair, bookshelf, and lamp but no computers or screens

Evaluator (Qwen 3.5 122B)

The image shows a clean, minimalist office workspace with a white desk, black office chair, tall bookshelf filled with books, and a silver adjustable desk lamp. There is a closed black notebook on the desk, a pen holder, and a small black object (possibly a decorative sphere). No computers, monitors, tablets, or any type of screen are visible anywhere in the scene. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A kitchen with pots, pans, a stove, and a cutting board but no food

Evaluator (Qwen 3.5 122B)

The image shows a clean, modern kitchen scene with a stainless steel pot on the stove, a frying pan on the counter, a cutting board with a knife, and no visible food items. The excluded element — “food” — is indeed absent. All other elements (pots, pans, stove, cutting board) are present as described in the prompt’s inclusion list. VERDICT: PASS

Hard 67% vs 67%

fal/krea/v2-medium-turbo: PASS

A pepperoni pizza with tomato sauce, basil, and crust but absolutely no cheese

Evaluator (Qwen 3.5 122B)

The image shows a pizza with a visible crust, red tomato sauce base, circular pepperoni slices, and green basil strips. There is no melted or solid cheese layer present on the pizza. The surface appears to be primarily sauce and toppings without any dairy-based cheese covering. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A car body with windows, doors, and headlights but no wheels

Evaluator (Qwen 3.5 122B)

The image shows a black car body viewed from the front. It includes windows (windshield and side windows), doors (implied by the body shape), and headlight areas (dark recesses). Crucially, there are no wheels visible — the car appears to be floating or mounted without any wheel components. The excluded element (wheels) is truly absent. VERDICT: PASS

fal/krea/v2-medium-turbo: FAIL

A human face with eyes, nose, and mouth but no eyebrows

Evaluator (Qwen 3.5 122B)

The image shows a painted human face with clearly defined eyes, nose, and mouth. However, above each eye, there are distinct dark brown shapes that represent eyebrows — they are visible as arched forms above the eyelids, rendered in the same impasto style as the rest of the face. Since the prompt explicitly excluded eyebrows, and they are present, this violates the constraint. VERDICT: FAIL

Relative Position 92% vs 92%

Relative Position compares whether fal/krea/v2-medium-turbo or fal/bria/fast more faithfully places objects in the requested spatial relationship.

Easy 67% vs 100%

fal/krea/v2-medium-turbo: FAIL

A cat sitting on top of a cardboard box

Evaluator (Qwen 3.5 122B)

The image shows a cat inside a cardboard box, not on top of it. The cat is nestled within the box’s interior, with its body contained by the box walls. The prompt specified “on top of,” which is spatially incorrect. Therefore, the positional relationship described in the prompt does not match the image. VERDICT: FAIL

fal/krea/v2-medium-turbo: PASS

A red ball underneath a wooden chair

Evaluator (Qwen 3.5 122B)

The image shows a wooden chair with a woven seat. Beneath the chair, resting on the floor between the legs, is a red ball. The spatial relationship described — “a red ball underneath a wooden chair” — is accurately depicted. The ball is clearly positioned below the seat and within the footprint of the chair’s legs. All positional descriptors align with the visual content. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A coffee mug next to an open book on a desk

Evaluator (Qwen 3.5 122B)

The image shows a speckled coffee mug positioned to the left of an open book, both resting on a flat surface consistent with a desk. The spatial relationship “mug next to book” is accurately depicted — the mug is adjacent and slightly in front of the book’s left page. No conflicting positional descriptors (e.g., above, under) are present or violated. All described relationships hold true. VERDICT: PASS

Medium 100% vs 100%

fal/krea/v2-medium-turbo: PASS

A bird perched on a branch, the branch extending from a tree next to a park bench

Evaluator (Qwen 3.5 122B)

The image shows a bird perched on a branch that extends diagonally from the right side of the frame. In the blurred background, to the left and below the branch, is a green park bench. The spatial relationships described — bird “on” branch, branch extending from tree (implied by foliage and structure), and bench positioned “next to” the scene (though not directly adjacent in 3D space) — are visually consistent with the prompt’s intent. All key positional descriptors are reasonably accurate. VER

fal/krea/v2-medium-turbo: PASS

A cat sitting on a pillow, the pillow placed on a wooden chair

Evaluator (Qwen 3.5 122B)

The image shows a cat sitting directly on a cushioned pillow. The pillow is placed on the seat of a wooden chair with a curved backrest. All spatial relationships described in the prompt — “cat on pillow,” “pillow on chair” — are accurately represented. No positional errors are present. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A candle on top of a stack of books, the books next to a potted plant

Evaluator (Qwen 3.5 122B)

The image shows a lit candle placed directly on top of a stack of three books. The books are arranged vertically, with the candle centered on the top book. To the left and slightly behind the stack is a potted plant in a light-colored pot. All specified spatial relationships — “candle on top of books” and “books next to potted plant” — are accurately depicted. VERDICT: PASS

Hard 100% vs 67%

fal/krea/v2-medium-turbo: PASS

Five chess pieces on a board: a king in the center, a queen to its left, a bishop to its right, a rook above the king, and a knight below the king

Evaluator (Qwen 3.5 122B)

The image shows five white chess pieces on a black-and-white checkered board. The king is centered. To its left is the queen. To its right is the bishop. Above the king (in the upper part of the image) is the rook. Below the king (in the lower part) is the knight. All spatial relationships described in the prompt are accurately represented in the image. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A desk scene: laptop in the center, coffee mug to the left, phone to the right, lamp above, and notebook below

Evaluator (Qwen 3.5 122B)

The image shows a desk scene with a laptop centered. A coffee mug is to the left of the laptop, and a phone is to the right — both correct. The lamp is positioned above the laptop, which matches the prompt. The notebook is placed below the laptop, also matching the description. All spatial relationships (left, right, above, below) are accurately represented as specified. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

Five animals in a field: a horse on the far left, a cow to the right of the horse, a sheep in the center, a goat to the right of the sheep, and a dog on the far right

Evaluator (Qwen 3.5 122B)

The image shows five animals arranged horizontally in a field. From left to right: a black horse, a brown and white cow, a white sheep, a brown goat, and a tan dog. The spatial relationships described in the prompt — horse on far left, cow to its right, sheep in center, goat to right of sheep, dog on far right — are all accurately represented in the image. VERDICT: PASS

Extreme 100% vs 100%

fal/krea/v2-medium-turbo: PASS

A landscape with a red barn in the foreground at large scale, a windmill in the midground at medium scale, and snow-capped mountains in the background at small scale, all with consistent depth perspective

Evaluator (Qwen 3.5 122B)

The image shows a red barn in the foreground on the right, a windmill in the midground to the left of the barn, and snow-capped mountains in the background. The spatial arrangement matches the description: barn (foreground, large), windmill (midground, medium), mountains (background, small). Depth perspective is consistent. All positional relationships are correctly represented. VERDICT: PASS

fal/krea/v2-medium-turbo: PASS

A beach scene with a sandcastle in the foreground, a surfer on the waves in the midground, and a cruise ship on the horizon in the background, sizes decreasing with distance

Evaluator (Qwen 3.5 122B)

The image correctly shows a sandcastle in the foreground, a surfer on waves in the midground, and a cruise ship on the horizon in the background. The sizes decrease with distance as specified. All positional relationships (foreground, midground, background) are accurate. No incorrect spatial descriptors like “left,” “right,” “above,” or “below” are misapplied. The composition matches the prompt’s spatial requirements. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A city park with a park bench and pigeons in the foreground, a fountain in the midground, and skyscrapers in the background, each layer progressively smaller

Evaluator (Qwen 3.5 122B)

The image correctly shows a park bench and pigeons in the foreground, a fountain in the midground, and skyscrapers in the background. The spatial layering is accurate: foreground elements are largest and closest, midground (fountain) is centered and slightly farther, and background skyscrapers recede into distance. All described positional relationships align with the prompt. No incorrect on/under/left/right/above/below placements observed. VERDICT: PASS

Scale & Proportions 78%vs44%

Scale & Proportions compares whether fal/krea/v2-medium-turbo or fal/bria/fast better keeps requested subjects and objects at believable relative sizes.

Easy 100%vs0%

fal/krea/v2-medium-turboPASS

A person standing next to a yellow school bus, showing realistic relative scale

Evaluator (Qwen 3.5 122B)

The person is positioned in the foreground, appearing smaller than the bus, which is consistent with real-world scale. The bus’s height and length relative to the person are accurate — a typical school bus is about 3.5 meters tall and 10+ meters long, while the boy appears to be around 1.6–1.7 meters. No distortion or unnatural scaling is evident. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A ladybug sitting on a large green leaf, shown at realistic proportions

Evaluator (Qwen 3.5 122B)

The ladybug appears appropriately sized relative to the leaf it is resting on. The leaf’s texture, veins, and curvature are consistent with a real plant, and the insect’s scale matches typical ladybug-to-leaf proportions in nature. Lighting and shadows also support realistic spatial relationships. No gross distortions or size inconsistencies are evident. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A child holding a basketball, both at correct relative sizes

Evaluator (Qwen 3.5 122B)

The child’s hands are holding the basketball, and the ball appears appropriately sized relative to the child’s torso and arms — consistent with a real-world scenario where a young child holds a standard-sized basketball. The proportions of the ball to the child’s body and hands are visually plausible and not distorted. No anatomical or scale inconsistencies are evident. VERDICT: PASS

Medium 100%vs100%

fal/krea/v2-medium-turboPASS

A tiny person standing on the rim of a coffee cup, looking down into it

Evaluator (Qwen 3.5 122B)

The image shows a tiny person standing on the rim of a large coffee cup, looking down into it. The scale relationship is consistent: the person is appropriately small relative to the cup, and the cup’s proportions (rim, handle, depth) are realistic for a coffee mug. The reflection in the coffee also matches the person’s orientation and size. All spatial relationships align with the prompt. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A giant tabby cat walking between city skyscrapers like a kaiju

Evaluator (Qwen 3.5 122B)

The image depicts a giant tabby cat towering over city buildings, consistent with the "kaiju" description. The cat’s paws are larger than nearby cars and buildings, and its head reaches skyward between skyscrapers. Proportions are intentionally exaggerated to match the prompt’s surreal scale. No internal inconsistencies in relative sizing are visible. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A miniature house sitting on the palm of someone's hand

Evaluator (Qwen 3.5 122B)

The image shows a miniature house resting on an open human palm. The house is appropriately scaled to fit comfortably within the hand, with fingers curled slightly around it for support. The proportions between the house and the hand are realistic and consistent with the prompt’s description of a “miniature house sitting on the palm.” No size distortions or inconsistencies are present. VERDICT: PASS

Hard 33%vs33%

fal/krea/v2-medium-turboPASS

A group photo of an elephant, a horse, a dog, a cat, and a mouse, all standing in a line with correct real-world proportional sizes

Evaluator (Qwen 3.5 122B)

The image shows five animals in a line: elephant, horse, dog, cat, and mouse. Their relative sizes are generally correct — the elephant is largest, followed by the horse, then dog, cat, and mouse smallest. However, the horse appears slightly too small relative to the elephant (elephant’s head nearly reaches horse’s shoulder), and the dog/cat are disproportionately large compared to the mouse. While mostly accurate, minor scaling inconsistencies exist. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

A dinner table scene with a plate, fork, knife, wine glass, salt shaker, and peppercorn, all at realistic relative sizes

Evaluator (Qwen 3.5 122B)

The image shows a table setting with a plate, fork, knife, wine glass, salt shaker, and a dark textured object (likely a peppercorn or decorative item). However, the “peppercorn” is disproportionately large — nearly the size of the salt shaker and comparable to the knife’s blade width — which is unrealistic. A real peppercorn would be tiny relative to these items. This violates the prompt’s requirement for “realistic relative sizes.” VERDICT: FAIL

fal/krea/v2-medium-turboFAIL

A farm scene with a barn, tractor, farmer, cow, chicken, and a fence post, all at correct proportional scale

Evaluator (Qwen 3.5 122B)

The image shows a farm scene with a barn, red tractor, farmer, cow, chicken, and fence post. However, proportions are inconsistent: the tractor is disproportionately large compared to the barn and farmer; the cow appears too small relative to the tractor; the chicken is oddly sized for its distance; and the fence post looms unrealistically large in the foreground. These scale errors violate the prompt’s requirement for correct proportional relationships. VERDICT: FAIL
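The tier and subcategory percentages for fal/krea/v2-medium-turbo are consistent with a plain count of PASS labels over prompts: 3 of 3 Easy, 3 of 3 Medium, and 1 of 3 Hard gives 7 of 9, or 78%. A minimal sketch of that arithmetic follows. It assumes simple rounding to the nearest whole percent; the `pass_rate` helper and the data layout are illustrative and not part of the benchmark's own tooling.

```python
# Verdicts for fal/krea/v2-medium-turbo in Scale & Proportions,
# transcribed from the labels listed above (True = PASS, False = FAIL).
verdicts = {
    "Easy": [True, True, True],
    "Medium": [True, True, True],
    "Hard": [True, False, False],
}


def pass_rate(results):
    """Share of PASS verdicts, rounded to the nearest whole percent."""
    return round(100 * sum(results) / len(results))


# Per-tier rates, then the rate over all nine prompts pooled together.
per_tier = {tier: pass_rate(results) for tier, results in verdicts.items()}
overall = pass_rate([v for results in verdicts.values() for v in results])

print(per_tier)  # {'Easy': 100, 'Medium': 100, 'Hard': 33}
print(overall)   # 78
```

The pooled figure is computed over individual prompts, not as an average of tier percentages. The two agree here only because every tier has the same number of prompts.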

Human realism 81%vs79%

Faces & Expressions 92%vs92%

Faces & Expressions compares whether fal/krea/v2-medium-turbo or fal/bria/fast creates more realistic faces with the requested age, emotion, and expression cues.

Easy 100%vs100%

fal/krea/v2-medium-turboPASS

A close-up portrait of a young woman with a warm natural smile

Evaluator (Qwen 3 VL)

The face shows near-symmetry with natural features: freckles, blue eyes, and a warm smile. The smile is clearly joyful and authentic. Slight asymmetry in the eyes and mouth is present but minimal. No major distortions. The emotion is unmistakably happy and natural. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A portrait of a middle-aged man with a calm, serious expression

Evaluator (Qwen 3 VL)

Face is symmetric with natural features: eyes, nose, mouth, and mustache are well-proportioned. Expression conveys calm seriousness with subtle lip tension and focused gaze. No visible distortions or anatomical anomalies. Lighting and texture are realistic. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A headshot of a teenager with a relaxed, neutral face

Evaluator (Qwen 3 VL)

The face is symmetric with natural features: eyes, nose, and mouth are proportionate and centered. The expression is neutral and relaxed, with no strong emotion. Slight blurring on the right eye and minor asymmetry in the mouth are present but not distracting. Overall, the image meets the prompt’s requirements for a teenager with a neutral, relaxed expression. VERDICT: PASS

Medium 67%vs100%

fal/krea/v2-medium-turboFAIL

A close-up of a person's face showing clear disgust, nose wrinkled and lip curled

Evaluator (Qwen 3 VL)

Face is asymmetric with distorted features: left eye appears larger, nose and lips misaligned, and skin texture unnaturally uniform. Disgust is implied via wrinkled nose and curled lip, but anatomical inaccuracies obscure clarity. Emotion is partially readable but compromised by distortions. VERDICT: FAIL

fal/krea/v2-medium-turboPASS

A portrait capturing genuine surprise — raised eyebrows, wide eyes, slightly open mouth

Evaluator (Qwen 3 VL)

Face is symmetric with natural features. Eyes are wide, eyebrows raised, mouth slightly open — clearly conveying surprise. No visible distortions. Lighting and detail are realistic. The expression matches the prompt precisely. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A face showing quiet melancholy — downcast eyes, slightly drawn mouth, distant gaze

Evaluator (Qwen 3 VL)

Face is symmetric with natural, correctly rendered features. Eyes are downcast, lips slightly drawn, and gaze appears distant, conveying quiet melancholy. No significant distortions. Lighting and texture enhance the somber mood. All prompt elements are clearly and accurately depicted. VERDICT: PASS

Hard 100%vs100%

fal/krea/v2-medium-turboPASS

An elderly woman in her 80s with deep wrinkles and white hair, smiling warmly with kind eyes

Evaluator (Qwen 3 VL)

The face is symmetric with natural, aged features: deep wrinkles, white hair, and warm, kind eyes. The smile is clearly readable and conveys warmth. Slight stylization is present but doesn’t distort key elements. All prompt requirements are met. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A 5-year-old child with round cheeks and large eyes, crying with tears streaming down their face

Evaluator (Qwen 3 VL)

The child’s face is symmetric with natural features: round cheeks, large eyes, and tears streaming down. The crying expression is clearly readable, with furrowed brows and a downturned mouth. No significant distortions are present. Features align well with the prompt. VERDICT: PASS

fal/krea/v2-medium-turboPASS

An elderly man in his 70s with age spots and thinning gray hair, looking skeptically over reading glasses

Evaluator (Qwen 3 VL)

Face is largely symmetric with realistic elderly features: age spots, thinning gray hair, and reading glasses. Skeptical expression is clearly readable via furrowed brow and downturned mouth. Minor distortions include slightly uneven earlobe and subtle asymmetry in glasses frame. Overall, the prompt is well-fulfilled. VERDICT: PASS

Extreme 100%vs67%

fal/krea/v2-medium-turboPASS

A group of three friends at a table: one laughing with head tilted back, one looking shocked with wide eyes, and one rolling their eyes with a smirk

Evaluator (Qwen 3 VL)

Faces are symmetric with correct features. Emotions are clearly readable: laughing (head back), shocked (wide eyes), and smirking with rolled eyes. No distortions. All three expressions match the prompt accurately. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A family photo with four people: a father beaming with pride, a mother smiling gently, a teenager looking bored, and a child grinning excitedly

Evaluator (Qwen 3 VL)

Faces are symmetric and feature-accurate for cartoon style. Emotions are clearly readable: father beams, mother smiles, teenager looks bored, child grins excitedly. No distortions. All four characters match the prompt’s emotional descriptions. VERDICT: PASS

fal/krea/v2-medium-turboPASS

Three coworkers in a meeting: one looking frustrated with furrowed brows, one nodding with a pleased expression, and one looking confused with a tilted head

Evaluator (Qwen 3 VL)

Faces are symmetric with clear, stylized features. Left woman shows frustration (furrowed brows, tense mouth). Center man has a pleased expression (smile, relaxed eyes). Right boy shows confusion (tilted head, wide eyes). All emotions are clearly readable and match the prompt. No distortions present. VERDICT: PASS

Full Body 92%vs83%

Full Body compares whether fal/krea/v2-medium-turbo or fal/bria/fast renders full-body people with more natural anatomy, clothing, pose, and proportions.

Easy 100%vs100%

fal/krea/v2-medium-turboPASS

A woman standing upright in a relaxed pose, arms at her sides, wearing jeans and a white t-shirt

Evaluator (Qwen 3 VL)

The woman stands upright with arms at her sides, wearing jeans and a white t-shirt as described. Limb count and proportions are anatomically correct. The pose is relaxed and recognizable. No significant distortions are visible. All elements align with the prompt. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A man standing straight facing the camera, hands in pockets, normal proportions

Evaluator (Qwen 3 VL)

The man stands straight, facing forward, with hands in pockets. Limb count and proportions are anatomically correct. Pose is recognizable and matches the prompt. No distortions are visible. All elements align with the requested description. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A person standing in a neutral pose in a park, full body visible from head to feet

Evaluator (Qwen 3 VL)

The person’s limbs are correctly counted and proportioned. Pose is neutral, standing upright with hands clasped in front. Anatomical structure is accurate. The park setting and full-body framing are consistent with the prompt. No significant distortions are visible. VERDICT: PASS

Medium 100%vs100%

fal/krea/v2-medium-turboPASS

A woman mid-stride running through a park, hair and clothes showing motion

Evaluator (Qwen 3 VL)

The woman’s limbs and proportions are anatomically correct. Her pose—mid-stride with arms swinging—clearly depicts running. Motion blur effectively conveys speed in hair and clothing. No distortions in limb count or proportions. The action is unmistakably recognizable as running through a park. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A man captured mid-jump over a puddle, body in the air with bent knees

Evaluator (Qwen 3 VL)

The man’s limbs are correctly counted and proportioned. His bent-knee mid-air pose is anatomically plausible for jumping. The action is clearly recognizable as leaping over a puddle. No significant distortions are visible. The reflection and motion blur support the dynamic pose. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A dancer in a dynamic spin, dress flowing outward with one leg extended

Evaluator (Qwen 3 VL)

The dancer’s limbs are intact and proportionate despite motion blur. The pose—spinning with one leg extended—is recognizable and anatomically plausible. Motion blur obscures fine details but doesn’t distort limb count or proportions. The dress flows outward as described. Core anatomical integrity and action recognition are preserved. VERDICT: PASS

Hard 100%vs67%

fal/krea/v2-medium-turboPASS

A person in a yoga tree pose — standing on one leg with the other foot pressed to the inner thigh, arms raised overhead

Evaluator (Qwen 3 VL)

The figure shows two legs, correct proportions, and the tree pose: one foot on the inner thigh, arms raised. The pose is recognizable despite heavy blur and low detail. No limb count errors or gross anatomical distortions are visible. The core action is accurately depicted. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A rock climber clinging to a cliff face with arms and legs extended to different holds

Evaluator (Qwen 3 VL)

The climber has two arms and two legs, with proportions and joint angles consistent with human anatomy. The pose clearly shows arms and legs extended to different holds on the cliff face, matching the prompt. No significant distortions are visible. The action is recognizable and accurately depicted. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A cellist seated with correct posture, legs around the instrument, bow arm extended

Evaluator (Qwen 3 VL)

The cellist’s posture is anatomically plausible: legs encircle the instrument, bow arm extends forward, and hands position the bow and fingerboard correctly. Limb count and proportions appear accurate. The pose is recognizable as standard cello playing. No significant distortions are visible. VERDICT: PASS

Extreme 67%vs67%

fal/krea/v2-medium-turboPASS

Two soccer players mid-tackle, one sliding and the other jumping over with the ball at their feet

Evaluator (Qwen 3 VL)

Limb counts and proportions are anatomically correct. The sliding player’s body posture and the jumping player’s mid-air pose are recognizable and dynamic. The ball is at the sliding player’s feet, matching the prompt. No significant distortions are visible. The action is clearly depicted as a mid-tackle scenario. VERDICT: PASS

fal/krea/v2-medium-turboPASS

Two gymnasts performing a synchronized handstand, side by side with identical body alignment

Evaluator (Qwen 3 VL)

Both gymnasts show full bodies with correct limb count and proportions. Their synchronized handstands are anatomically plausible, with legs extended and arms supporting weight. Pose is recognizable as a handstand with mirrored symmetry. Slight blending at the midsection suggests minor AI artifact, but core anatomy and action are accurate. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

Two tango dancers in a close embrace, the man dipping the woman backward with intertwined legs

Not generated

Model refused to generate this prompt (HTTP 422).
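The tango prompt has no evaluator text because the model refused to generate it (HTTP 422), yet it still carries a FAIL label and the Extreme tier reads 67%. That is consistent with refusals being scored as failures, although the page does not state the rule. The sketch below shows one way labels could be derived under that assumption. The `label_for` helper and the regular expression are hypothetical, and the evaluator strings are abbreviated from the responses above.

```python
import re

# Matches the closing verdict that each evaluator response above ends with.
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\b")


def label_for(evaluator_text):
    """Return 'PASS' or 'FAIL' for one prompt, or None if no verdict is found.

    Assumption: a refused generation (no image, so no evaluator text) is
    scored as FAIL. The page does not state this rule explicitly.
    """
    if evaluator_text is None:
        return "FAIL"
    found = VERDICT_RE.findall(evaluator_text)
    # Use the last match in case the reasoning quotes the word earlier.
    return found[-1] if found else None


# Full Body, Extreme tier: two judged images and one refusal (HTTP 422).
extreme = [
    "The action is clearly depicted as a mid-tackle scenario. VERDICT: PASS",
    "Core anatomy and action are accurate. VERDICT: PASS",
    None,  # tango prompt: not generated
]
labels = [label_for(text) for text in extreme]
rate = round(100 * labels.count("PASS") / len(labels))

print(labels, rate)  # ['PASS', 'PASS', 'FAIL'] 67
```

Returning `None` for text with no recognizable verdict keeps cut-off evaluator responses visible as unknowns, so they are not silently counted either way.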

Hands 50%vs58%

Hands compares whether fal/krea/v2-medium-turbo or fal/bria/fast draws more anatomically plausible hands, fingers, and gestures.

Easy 33%vs33%

fal/krea/v2-medium-turboFAIL

A person standing with both hands visible at their sides, fingers relaxed and naturally spread

Evaluator (Qwen 3.5 122B)

The left hand (viewer’s left) shows 5 fingers, all anatomically plausible. The right hand (viewer’s right) also shows 5 fingers, but the thumb appears abnormally short and positioned unusually close to the index finger — suggesting a structural anomaly. While finger count is correct, the gesture lacks natural relaxation due to the thumb’s unnatural placement. This violates anatomical correctness. VERDICT: FAIL

fal/krea/v2-medium-turboPASS

A woman holding her hands out in front of her palms-up, all ten fingers clearly visible

Evaluator (Qwen 3.5 122B)

The image shows a woman with both hands extended forward, palms facing up. Each hand clearly displays five fingers — left hand: thumb, index, middle, ring, pinky; right hand: same. All ten fingers are visible and anatomically correct in positioning and proportion. No anomalies detected. Gesture matches prompt exactly. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

A man with arms crossed, both hands resting on opposite upper arms with fingers visible

Evaluator (Qwen 3.5 122B)

The left hand (on the right side of image) shows 5 fingers clearly. The right hand (on left side of image) also shows 5 fingers. However, the right hand’s thumb is positioned unnaturally — it appears to emerge from the wrist area rather than the base of the palm, and the finger alignment is slightly distorted. This breaks anatomical correctness. VERDICT: FAIL

Medium 33%vs67%

fal/krea/v2-medium-turboFAIL

A person holding a white ceramic coffee mug by the handle with their right hand

Evaluator (Qwen 3.5 122B)

The image shows a left hand (not right) holding the mug handle. The thumb is on top, index finger curled around front, and three fingers visible behind — totaling 5 fingers. However, the grip is anatomically awkward: the pinky and ring finger appear fused or misaligned, and the middle finger’s position is unnatural. Also, the prompt specifies “right hand,” but this is clearly a left hand. Core criteria failed. VERDICT: FAIL

fal/krea/v2-medium-turboFAIL

A woman writing in a notebook with a pen gripped correctly in her right hand

Evaluator (Qwen 3.5 122B)

The visible right hand gripping the pen shows four fingers clearly (thumb, index, middle, and ring finger), with the pinky finger either hidden or not fully rendered. The grip appears anatomically plausible for writing. However, since only four fingers are distinctly visible on the active hand — and no left hand is shown to verify five fingers — the requirement of “exactly 5 per hand” cannot be confirmed. Anomaly: incomplete finger count on visible hand. VERDICT: FAIL

fal/krea/v2-medium-turboPASS

A person playing an acoustic guitar with left hand on the fretboard and right hand strumming

Evaluator (Qwen 3.5 122B)

The left hand (on the fretboard) shows 5 fingers with natural positioning and grip. The right hand (strumming area) is blurred but appears to show 5 fingers in a plausible strumming position. No obvious anatomical anomalies or extra/missing fingers are visible. Both hands appear correctly formed for the action described. VERDICT: PASS

Hard 67%vs100%

fal/krea/v2-medium-turboPASS

A person making a peace sign with their right hand, clearly showing two fingers up and three curled

Evaluator (Qwen 3.5 122B)

The image shows a single right hand making a peace sign. Two fingers (index and middle) are extended upward, while the thumb, ring, and pinky fingers are curled inward — totaling five fingers. The anatomy appears natural: joints, skin texture, and nail shapes are consistent with a real human hand. No extra or missing fingers; gesture is anatomically correct. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A hand giving a thumbs-up with four fingers curled and thumb extended upward

Evaluator (Qwen 3.5 122B)

The image shows one hand giving a thumbs-up gesture. The thumb is extended upward, and the other four fingers are curled into a fist — anatomically correct for this gesture. No anomalies or extra/missing fingers are visible. Since only one hand is shown and it has exactly 5 fingers (thumb + 4 curled), the core criteria is met. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

A person counting to three on their fingers, with index, middle, and ring fingers extended

Evaluator (Qwen 3.5 122B)

The image shows one hand making a gesture. The thumb and pinky are curled inward, while the index, middle, and ring fingers are extended — matching the prompt’s description of “counting to three.” However, the ring finger is not fully extended; it’s bent at the middle joint and pressed against the thumb, which contradicts the instruction that all three fingers (index, middle, ring) should be extended. This is an anatomical inconsistency in the gesture. VERDICT: FAIL

Extreme 67%vs33%

fal/krea/v2-medium-turboPASS

Two people shaking hands in a professional setting, all fingers of both hands anatomically correct

Evaluator (Qwen 3.5 122B)

The handshake shows two hands clasped. The left hand (dark sleeve) has 4 visible fingers and a thumb — anatomically correct. The right hand (gray sleeve) also shows 4 fingers and a thumb gripping the other hand — also anatomically correct. No extra or missing fingers; grip is natural and professional. All visible digits are properly formed and positioned. VERDICT: PASS

fal/krea/v2-medium-turboPASS

Two people doing a high-five, palms meeting with all fingers extended on both hands

Evaluator (Qwen 3.5 122B)

The image shows two silhouetted hands pressed together in a prayer-like or high-five gesture. Each hand clearly displays 5 fingers — thumbs and four fingers — aligned symmetrically. The gesture is anatomically plausible for a high-five or namaste, with no visible distortions or extra/missing digits. Silhouette obscures fine detail but does not contradict the prompt’s requirements. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

One person handing a small gift box to another person, both sets of hands gripping the box correctly

Evaluator (Qwen 3.5 122B)

The image shows two pairs of hands exchanging a green gift box. The left pair (giver) has 5 fingers visible on the top hand and 4 on the bottom; one finger is obscured or missing. The right pair (receiver) has 5 fingers on the top hand, but the bottom hand shows only 3 clearly visible fingers; the other two are either missing or malformed. Additionally, the grip appears unnatural: fingers overlap oddly, and some knuckles/fingertips look distorted or fused. Anatomical correctness fails. VERDICT: FAIL

Multi-Subject 100%vs83%

Multi-Subject compares whether fal/krea/v2-medium-turbo or fal/bria/fast better keeps multiple people distinct with the requested attributes, outfits, and positions.

Easy 100%vs100%

fal/krea/v2-medium-turboPASS

Two women standing side by side: one with short blonde hair wearing a red jacket, the other with long black hair wearing a blue dress

Evaluator (Qwen 3 VL)

Left woman: short blonde hair, wearing red jacket. Right woman: long black hair, wearing blue dress. Both visually distinct and match prompt descriptions precisely. No missing or mismatched elements. VERDICT: PASS

fal/krea/v2-medium-turboPASS

Two men at a cafe: one with a beard and glasses wearing a green sweater, the other clean-shaven in a black hoodie

Evaluator (Qwen 3 VL)

Left man: beard, glasses, green sweater — matches description. Right man: clean-shaven, black hoodie — matches description. Both are visually distinct and clearly identifiable. All prompt features are accurately rendered. VERDICT: PASS

fal/krea/v2-medium-turboPASS

Two friends posing: one tall with curly red hair in denim overalls, the other short with straight brown hair in a yellow sundress

Evaluator (Qwen 3 VL)

Tall person: curly red hair, denim overalls, taller stature. Short person: straight brown hair, yellow sundress. Both visually distinct and match descriptions. All features accurately rendered. VERDICT: PASS

Medium 100%vs67%

fal/krea/v2-medium-turboPASS

Four people at a dinner table: a bald man in a suit, a woman with silver hair in a red blouse, a young man with dreadlocks in a denim jacket, and a woman with a hijab in a green dress

Evaluator (Qwen 3 VL)

Bald man in suit: present, left foreground. Silver-haired woman in red blouse: present, behind him. Young man with dreadlocks in denim jacket: present, center foreground. Woman in green hijab and dress: present, right foreground. All are visually distinct and match descriptions precisely. VERDICT: PASS

fal/krea/v2-medium-turboPASS

Three musicians on stage: a tall woman with short pink hair playing bass, a man with a beard and bandana on drums, and a slim man with glasses playing keyboard

Evaluator (Qwen 3 VL)

Tall woman: pink short hair, playing bass (center). Man on drums: beard, bandana, holding sticks (right). Slim man: glasses, playing keyboard (left). All three are visually distinct and match described features. VERDICT: PASS

fal/krea/v2-medium-turboPASS

Three hikers on a trail: a man with a backpack and sunhat, a woman with braided hair and hiking poles, and a teenager with a baseball cap and water bottle

Evaluator (Qwen 3 VL)

Three hikers are present. Teenager (left): baseball cap, water bottle, backpack. Woman (center): braided hair, hiking poles, backpack. Man (back): sunhat, backpack. All features match descriptions and are visually distinct. VERDICT: PASS

Truthfulness 78%vs63%

Photorealism 100%vs100%

Photorealism compares whether fal/krea/v2-medium-turbo or fal/bria/fast makes materials, surfaces, and real-world objects look more visually plausible and distinct.

Medium 100%vs100%

fal/krea/v2-medium-turboPASS

A still life on a wooden table featuring a clear glass of water, a polished silver spoon, a folded linen napkin, and a ripe red apple — each material rendered with accurate texture and light behavior

Evaluator (Qwen 3.6 27B)

The image demonstrates excellent material rendering. The glass shows accurate refraction and transparency, while the spoon exhibits realistic metallic reflections. The apple’s skin texture is detailed with natural subsurface scattering, and the linen napkin displays proper fabric folds and softness. The wooden table surface has convincing grain and wear. Lighting interactions are physically consistent across all objects. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A workspace desk with a brushed aluminum laptop, a leather-bound notebook, a ceramic coffee mug, and a glass paperweight — all materials visually distinct and realistic

Evaluator (Qwen 3.6 27B)

The image successfully renders the requested materials with high fidelity. The brushed aluminum laptop shows appropriate metallic sheen and texture. The leather notebook displays realistic grain, stitching, and thickness. The ceramic mug features a distinct glaze and pattern, while the glass paperweight exhibits correct refraction, transparency, and reflections. All materials are visually distinct and realistic. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A kitchen counter with a stainless steel pot, a wooden cutting board, a glass bottle of olive oil, and a cotton dish towel — each surface reflecting light correctly

Evaluator (Qwen 3.6 27B)

The image successfully renders the distinct material properties requested. The stainless steel pot exhibits sharp, accurate reflections of the surrounding objects. The glass bottle shows appropriate translucency and refraction of the oil within. The wooden cutting board displays natural grain and texture, while the cotton towel has a convincing soft, fibrous appearance. Light interacts realistically with each surface, creating believable highlights and shadows. VERDICT: PASS

Physics & Reflections 92%vs75%

Physics & Reflections compares whether fal/krea/v2-medium-turbo or fal/bria/fast better follows physical cues such as shadows, reflections, gravity, transparency, and contact points.

Easy 67%vs67%

fal/krea/v2-medium-turboPASS

A red sphere on a white surface with a single shadow cast to the right, consistent with a light source from the upper left

Evaluator (Qwen 3.6 27B)

The shadow is cast to the right and slightly downward, which is consistent with a light source from the upper left. The shadow’s soft edges and gradual fade suggest a diffused light source, which is physically plausible. The sphere’s highlight on the upper left further confirms the lighting direction. No distortions or color anomalies are present. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A coffee mug on a sunny windowsill casting a sharp shadow to the left

Evaluator (Qwen 3.6 27B)

The light source is clearly coming from the window on the right, casting a distinct shadow to the left of the mug. The shadow's direction and sharpness are consistent with direct sunlight. The warm color temperature and the soft bokeh in the background further enhance the physical plausibility of the scene. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

A person standing outdoors at midday with a short shadow directly below them

Evaluator (Qwen 3.6 27B)

The shadow is long and cast to the side, indicating a low sun angle (morning or late afternoon), not midday. At midday, shadows are short and directly beneath the subject. The shadow’s direction and length contradict the prompt’s “midday with a short shadow directly below them” requirement. VERDICT: FAIL

Medium 100%vs67%

fal/krea/v2-medium-turboPASS

A woman standing in front of a full-length mirror, her reflection accurately showing the back of her outfit and reversed orientation

Evaluator (Qwen 3.6 27B)

The reflection accurately mirrors the woman’s posture, clothing details (including the crisscross back design), and orientation. The mirror’s edge is visible, and the reflection aligns with expected physics—no distortion or color mismatch. Shadows are consistent with ambient lighting. The reversed orientation of the reflection is correct for a mirror. All elements support physical plausibility. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A cat sitting on a polished marble floor with its reflection visible below

Evaluator (Qwen 3.6 27B)

The reflection is correctly oriented vertically below the cat, matching its posture. The distortion is appropriate for a polished floor surface, showing slight blurring and elongation. The color and brightness of the reflection are consistent with the lighting conditions, appearing slightly darker than the actual cat. The direction of the light source seems to align with the shadows cast by the cat. Overall, the shadows and reflections are physically plausible. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A vase of flowers on a glass table with a clear reflection visible on the table surface

Evaluator (Qwen 3.6 27B)

The reflection on the glass table is physically plausible. It correctly mirrors the vase and flowers with appropriate vertical inversion and slight distortion consistent with a reflective surface. The dark water line in the vase is accurately reflected, and the overall orientation matches the object above. Shadows are soft and consistent with ambient lighting. No directional or color inconsistencies are present. VERDICT: PASS

Hard 100%vs100%

fal/krea/v2-medium-turboPASS

A row of trees reflected in a calm lake at sunset, with the reflection slightly darker and gently rippled

Evaluator (Qwen 3.6 27B)

The reflections are vertically inverted, matching the trees' orientation. The color is consistent with the sunset sky, appearing slightly darker as expected in water. The gentle ripples distort the reflection realistically without breaking the image structure. The direction and alignment are physically accurate for a calm lake surface. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A city skyline reflected in a rain puddle on a street, with mild distortion and color shift

Evaluator (Qwen 3.6 27B)

The reflections are vertically inverted, matching the skyline above. The distortion is consistent with a shallow puddle on rough asphalt, showing appropriate blurring and waviness. The color shift is plausible, with the reflection appearing slightly darker and cooler due to the water surface and ambient lighting conditions. The orientation and direction align correctly with the light sources and building positions. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A mountain range reflected in a still alpine lake, with the reflection inverted and subtly muted in color

Evaluator (Qwen 3.6 27B)

The reflection is physically plausible. It is correctly inverted vertically, matching the mountain range's orientation. The color is subtly muted and darker than the actual mountains, consistent with light absorption and scattering in water. The surface shows slight ripples, creating realistic minor distortions rather than a perfect mirror image. The horizon line is straight, and the reflection aligns perfectly with the base of the mountains. VERDICT: PASS

Extreme 100%vs67%

fal/krea/v2-medium-turboPASS

A clear glass sphere sitting on a mirror surface near a pool of water, showing refraction through the sphere, reflection on the mirror, and water reflection simultaneously

Evaluator (Qwen 3.6 27B)

The image displays a glass sphere on a reflective surface with a puddle. The reflection of the sphere is directly beneath it, correctly oriented and distorted by the water's surface tension. The refraction through the sphere inverts the background scene, which is physically accurate for a convex lens. The lighting highlights and shadows are consistent with a single light source. The interaction between the sphere, the mirror-like surface, and the water puddle is rendered with high fidelity to op

fal/krea/v2-medium-turboPASS

A chrome sculpture in a room with a glass floor and large windows, showing multiple reflections and environmental mapping on every surface

Evaluator (Qwen 3.6 27B)

The reflections on the chrome sculpture are physically plausible, showing accurate environmental mapping of the room’s windows and light sources. The glass floor beneath the sculpture displays realistic reflections of the sculpture itself, with proper orientation and distortion. Shadows are consistent with the lighting direction from the windows. The overall interplay of light, reflections, and shadows aligns well with real-world physics. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A crystal wine glass on a polished marble counter next to a window overlooking a lake — showing refraction in the glass, reflection on the marble, and the lake's own reflective surface

Evaluator (Qwen 3.6 27B)

The reflections and shadows are physically plausible. The wine glass casts a sharp reflection on the polished marble, correctly mirroring its shape and position. The light source from the window creates consistent highlights on the glass rim and stem, with corresponding shadows cast toward the interior. The lake’s surface shows realistic specular highlights from the sun, and the overall lighting direction is coherent across all reflective surfaces. VERDICT: PASS

World Knowledge 58%vs42%

World Knowledge compares whether fal/krea/v2-medium-turbo or fal/bria/fast depicts recognizable places, landmarks, flags, and known objects with more accurate real-world details.

Easy 67%vs67%

fal/krea/v2-medium-turboPASS

The Eiffel Tower in Paris on a clear day, architecturally accurate with its iron lattice structure and three levels

Evaluator (Qwen 3.6 27B)

The image depicts the Eiffel Tower with high architectural accuracy, correctly showing the iron lattice structure, the four arched legs, and the three distinct levels (two lower platforms and the top observation deck). The perspective and details align with the real-world landmark. The sky is clear with some clouds, fitting the prompt. No significant inaccuracies are present. VERDICT: PASS

fal/krea/v2-medium-turboPASS

The Taj Mahal in Agra seen from the reflecting pool, with accurate dome, minarets, and symmetry

Evaluator (Qwen 3.6 27B)

The image depicts the Taj Mahal with high architectural accuracy, including the central dome, four minarets, and symmetrical layout. The reflection in the pool is clear and well-rendered. Minor details like the chhatris and arched entrances are correctly placed. The symmetry and perspective align with real-world views from the reflecting pool. No significant distortions or inaccuracies are present. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

The Statue of Liberty on her island pedestal, with correct torch, crown, tablet, and green patina

Evaluator (Qwen 3.6 27B)

The Statue of Liberty is recognizable and features the correct green patina, crown, and tablet. However, the torch is factually inaccurate; it is depicted as a solid golden sphere with a spout, resembling a teapot, rather than the actual flame design. Additionally, the pedestal architecture is simplified and lacks the specific structural details of the real monument. VERDICT: FAIL

Medium 33%vs33%

fal/krea/v2-medium-turboFAIL

A traditional Japanese shamisen instrument with accurate three-stringed design, wooden body, and bachi plectrum

Evaluator (Qwen 3.6 27B)

The instrument depicted is not a shamisen. A shamisen has a long neck, a square body covered in skin (usually dog or cat), and three strings. This image shows a different instrument with a short neck, a rounded body covered in snakeskin, and four strings. The plectrum (bachi) is present, but the overall structure is fundamentally incorrect for a shamisen. VERDICT: FAIL

fal/krea/v2-medium-turboFAIL

A medieval astrolabe with accurate rete, tympan, and engraved degree markings on brass

Evaluator (Qwen 3.6 27B)

The image depicts a brass instrument with concentric rings, radial spokes, and engraved markings, consistent with the general appearance of an astrolabe. However, the specific components like the rete and tympan are not clearly distinguishable or accurately rendered; the markings appear somewhat generic and lack the precise, intricate detail expected of a medieval astrolabe. The overall aesthetic is plausible but lacks factual accuracy in its technical details. VERDICT: FAIL

fal/krea/v2-medium-turboPASS

A Japanese kintsugi bowl — cracked ceramic repaired with visible gold lacquer lines

Evaluator (Qwen 3.6 27B)

The image depicts a ceramic bowl with a crackled glaze pattern, meticulously repaired with gold lacquer lines, which is the hallmark of the Japanese art of kintsugi. The subject is clearly recognizable as a kintsugi bowl. The visual details, including the texture of the ceramic, the irregularity of the cracks, and the metallic sheen of the gold repair, are realistic and factually accurate to the craft. There are no architectural or factual inaccuracies. VERDICT: PASS

Hard 33%vs0%

fal/krea/v2-medium-turboPASS

A detailed diagram of the water cycle showing evaporation from the ocean, condensation forming clouds, precipitation as rain, and collection in rivers flowing back to the sea

Evaluator (Qwen 3.6 27B)

The image is a stylized, flat-design illustration of the water cycle. It clearly depicts evaporation (rising vapor), condensation (clouds), precipitation (raindrops), and collection (river flowing to the sea). While not a photorealistic or highly detailed scientific diagram, it accurately represents the core components and flow of the water cycle as requested. The subject is recognizable and factually accurate in its simplified form. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

A cross-section showing tectonic plates at a convergent boundary with subduction, a trench, volcanic arc, and layered mantle

Evaluator (Qwen 3.6 27B)

The image clearly depicts a subduction zone with a trench, volcanic arc, and mantle layers, making the subject recognizable. However, it is stylized as a cartoon rather than a realistic scientific diagram. While the core elements are present, the artistic style and simplified representation may not meet the criteria for factual/architectural accuracy expected in a real-world scientific context. VERDICT: FAIL

fal/krea/v2-medium-turboFAIL

An anatomical illustration of the human heart showing four chambers, valves, aorta, pulmonary arteries, and veins with correct placement

Evaluator (Qwen 3.6 27B)

The image is recognizable as a heart illustration but fails on factual accuracy. The major vessels are anatomically incorrect; the aorta is depicted as a straight vertical tube rather than an arch, and the pulmonary arteries/veins are misshapen and misplaced. The internal chambers and valves are distorted and lack the correct structural relationships found in human anatomy. The coronary arteries appear as random branching lines rather than following the epicardial surface correctly. VERDICT: FA

Extreme 100%vs67%

fal/krea/v2-medium-turboPASS

A Morpho menelaus butterfly with accurate iridescent blue dorsal wings, brown ventral wing pattern with eyespots, and correct body morphology

Evaluator (Qwen 3.6 27B)

The image depicts a butterfly with the characteristic iridescent blue dorsal wings and brown ventral patterns of a Morpho species. The wing venation, body morphology, and coloration are highly realistic and consistent with a Morpho menelaus. The details, such as the eyespots and wing edges, are accurately rendered. The subject is clearly recognizable and factually accurate. VERDICT: PASS

fal/krea/v2-medium-turboPASS

The Hagia Sophia interior showing the massive central dome, pendentives, semi-domes, and Islamic calligraphy medallions alongside Byzantine mosaics

Evaluator (Qwen 3.6 27B)

The image depicts the Hagia Sophia interior with recognizable features: the central dome, pendentives, semi-domes, and Islamic calligraphy medallions. However, the Byzantine mosaics are inaccurately rendered as stylized, almost cartoonish figures rather than realistic mosaics. The architectural details are generally correct but lack precision in some areas. Despite these issues, the core subject is recognizable. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A specimen of bismuth crystal showing accurate iridescent oxide layers, geometric hopper crystal formation, and characteristic staircase structure

Evaluator (Qwen 3.6 27B)

The image accurately depicts a bismuth crystal with its characteristic geometric hopper formation and staircase structure. The iridescent oxide layers are vividly rendered, displaying the expected rainbow-like colors due to thin-film interference. The crystalline geometry is precise, with sharp edges and facets typical of bismuth. No significant inaccuracies are observed in the representation of the mineral’s physical properties. VERDICT: PASS
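
The per-tier and category percentages shown for fal/krea/v2-medium-turbo in World Knowledge appear to follow from the twelve verdicts listed above. The sketch below reproduces them under two assumptions that the page does not state: every evaluation carries equal weight, and percentages are rounded to the nearest whole number. The helper `pass_rate` is a hypothetical name.

```python
# Verdicts transcribed from the World Knowledge results for
# fal/krea/v2-medium-turbo (True = PASS, False = FAIL).
verdicts = {
    "Easy": [True, True, False],
    "Medium": [False, False, True],
    "Hard": [True, False, False],
    "Extreme": [True, True, True],
}

def pass_rate(results: list[bool]) -> int:
    """Percentage of PASS verdicts, rounded to the nearest whole number."""
    return round(100 * sum(results) / len(results))

# Per-tier rates, then the rate over all twelve evaluations pooled together.
tier_rates = {tier: pass_rate(r) for tier, r in verdicts.items()}
overall = pass_rate([v for r in verdicts.values() for v in r])
print(tier_rates, overall)
```

With these inputs the tier rates come out as 67, 33, 33, and 100, and the pooled rate as 58, matching the figures displayed for this model.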

Professional Studio 93%vs93%

Camera & Lighting 92%vs100%

Camera & Lighting compares whether fal/krea/v2-medium-turbo or fal/bria/fast more faithfully follows lens, angle, focus, exposure, and lighting instructions.

Easy 100%vs100%

fal/krea/v2-medium-turboPASS

A wide-angle shot of a modern living room interior showing expansive space with slight barrel distortion at the edges

Evaluator (Gemma 4 26B)

The image features a wide-angle perspective that successfully conveys an expansive, modern living room. The camera angle captures a large portion of the room, including the unique architectural ceiling and multiple seating areas. There is a subtle hint of barrel distortion at the edges, which enhances the wide-angle effect. The lighting is sophisticated, utilizing recessed ceiling lights and a floor lamp to create a warm, layered atmosphere. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A close-up macro shot of a single dewdrop on a rose petal, shallow depth of field

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. It features a close-up macro shot of a single, prominent dewdrop resting on a pink rose petal. The depth of field is extremely shallow, with the dewdrop and the immediate petal surface in sharp focus while the rest of the rose and the green background are softly blurred. The lighting is soft and natural, highlighting the transparency and refraction within the water droplet. VERDICT: PASS

fal/krea/v2-medium-turboPASS

An aerial bird's-eye view looking straight down at a traffic roundabout with cars and trees

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. It features a true aerial bird's-eye view looking directly down at a traffic roundabout. The perspective is flat and top-down, with no tilt. The lighting pattern is consistent with a nighttime scene, featuring streetlights and car headlights that cast realistic shadows and light pools on the asphalt. The depth of field is consistent with a high-altitude aerial shot. VERDICT: PASS

Medium 100%vs100%

fal/krea/v2-medium-turboPASS

A portrait lit with classic Rembrandt lighting — triangle of light on the shadowed cheek, moody atmosphere

Evaluator (Gemma 4 26B)

The image successfully demonstrates classic Rembrandt lighting. There is a distinct, small triangle of light visible on the shadowed cheek (the subject's right cheek), which is the defining characteristic of this lighting style. The atmosphere is dark and moody with a shallow depth of field, focusing sharply on the facial features while the background remains dark and indistinct. The lighting pattern and mood perfectly match the prompt's requirements. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A landscape photograph bathed in golden hour light — long warm shadows, orange sky, soft glow

Evaluator (Gemma 4 26B)

The image perfectly captures the requested elements. The sky features a warm orange gradient characteristic of golden hour. The lighting creates long, soft shadows across the rolling hills and grass, and the entire scene is bathed in a warm, golden glow. The perspective provides a wide landscape view with a natural depth of field. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A high-key portrait of a person against a bright white background, minimal shadows, airy and luminous

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. It features a high-key portrait with a bright, white, seamless background. The lighting is soft and even, creating a luminous and airy feel with minimal shadows on the subject's face and clothing. The depth of field is appropriate for a portrait, keeping the subject in sharp focus against the bright backdrop. VERDICT: PASS

Hard 67%vs100%

fal/krea/v2-medium-turboPASS

A portrait shot at 85mm f/1.4 with creamy bokeh background, backlit with warm rim light separating the subject from a blurred garden

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. The perspective is a classic portrait shot. The depth of field is shallow, creating a creamy bokeh background with a blurred garden. Most importantly, the lighting features a distinct warm rim light along the subject's hair and shoulder, effectively separating her from the background as requested. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A street photograph with shallow depth of field as if shot at 50mm f/1.8, subject sharp with soft city lights in the background

Evaluator (Gemma 4 26B)

The image successfully matches the prompt's requirements. The subject's face is sharp and in focus, while the background exhibits a shallow depth of field with significant bokeh. The background features soft, out-of-focus city lights (bokeh circles) consistent with a wide aperture like f/1.8. The perspective and lighting patterns align perfectly with the description of a 50mm street photograph. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

A food photograph shot from above at 35mm f/2.8 with natural window light from the left, slight vignette, warm color temperature

Evaluator (Gemma 4 26B)

The image fails to meet several key criteria. The perspective is a high-angle shot, but it is not a true "top-down" (flat lay) shot as requested. The lighting appears to come from the top/front rather than clearly from the left. Additionally, the depth of field is quite deep, lacking the characteristic bokeh of an f/2.8 aperture, and there is no noticeable vignette or distinct warm color temperature shift. VERDICT: FAIL

Extreme 100%vs100%

fal/krea/v2-medium-turboPASS

A tilt-shift photograph of a city street making real-life look like a miniature model — selective focus strip, saturated colors

Evaluator (Gemma 4 26B)

The image successfully implements the tilt-shift effect. There is a clear selective focus strip where the red bus and the white car are in sharp focus, while the foreground (pedestrians/street) and the background (buildings/sky) are heavily blurred. This creates the intended miniature model aesthetic. The colors are also noticeably saturated, and the depth of field accurately mimics the requested tilt-shift photography style. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A cinematic frame with blue and orange anamorphic lens flare streaking horizontally across a dark moody scene

Evaluator (Gemma 4 26B)

The image features a dark, moody scene with a strong light source. The light creates a prominent horizontal streak that includes both blue and orange/warm tones, accurately reflecting the "blue and orange anamorphic lens flare" described. The light streaks horizontally across the frame, and the overall atmosphere is cinematic with a shallow depth of field and atmospheric haze. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A portrait with dramatic split lighting — half the face brightly lit, the other half in deep shadow, high contrast chiaroscuro

Evaluator (Gemma 4 26B)

The image perfectly matches the prompt. It features a close-up portrait with dramatic split lighting. One side of the face is brightly lit, while the other side is cast in deep, heavy shadow, creating a high-contrast chiaroscuro effect. The lighting pattern is sharp and follows the vertical division of the face as requested. VERDICT: PASS

Color Precision 100%vs92%

Color Precision compares whether fal/krea/v2-medium-turbo or fal/bria/fast renders requested colors more clearly and consistently across subjects, products, and backgrounds.

Easy 100%vs100%

fal/krea/v2-medium-turboPASS

A bright red sports car parked on a gray asphalt road — the red is vivid and unambiguous

Evaluator (Gemma 4 26B)

The image features a bright, vivid red sports car parked on a gray asphalt road. The red color of the car is intense and unambiguous, exactly as described in the prompt. The road is a consistent gray color. All color requirements are met perfectly. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A woman wearing a royal blue evening dress against a neutral background

Evaluator (Gemma 4 26B)

The woman is wearing a vibrant, deep blue evening dress that perfectly matches the description of "royal blue." The background is a light gray, which qualifies as a neutral background. All color requirements specified in the prompt are accurately represented in the image. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A field of pure sunflower yellow flowers under a clear blue sky

Evaluator (Gemma 4 26B)

The image features a field of bright, sunflower-yellow flowers. The petals are a vibrant, saturated yellow, and the centers are a dark brownish-yellow. Above the flowers, there is a clear, light blue sky with a few wispy white clouds. The colors perfectly match the prompt's description of "pure sunflower yellow flowers" and a "clear blue sky." VERDICT: PASS

Medium 100%vs100%

fal/krea/v2-medium-turboPASS

A bedroom wall painted in #FF6B35 (burnt orange) with white furniture and natural daylight

Evaluator (Gemma 4 26B)

The image features a prominent burnt orange wall that closely aligns with the hex code #FF6B35. The furniture (bed frame and bedding) is white, and the room is filled with bright, natural daylight coming through the window. All color requirements specified in the prompt are accurately represented. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A forest scene where the foliage is precisely #2D5F2D (dark muted green) with brown tree trunks

Evaluator (Gemma 4 26B)

The image features a forest scene with vertical lines representing tree trunks and a background representing foliage. The foliage color is a dark, muted green that closely aligns with the specified hex code #2D5F2D. The tree trunks are a dark brown color, matching the prompt's description. The colors are consistent with the requested values. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A room with walls in #4A90D9 (medium blue) with warm wooden floor and white trim

Evaluator (Gemma 4 26B)

The image features walls in a medium blue color that closely aligns with the requested #4A90D9. The floor is a warm wooden color, and the trim around the door and baseboards is white, matching the prompt's description of "warm wooden floor and white trim." All color requirements are met. VERDICT: PASS
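
The evaluator judges hex-code matches visually ("closely aligns"). A reader inspecting an image directly could quantify the match. The sketch below is one possible check and not the benchmark's method: it converts a hex code to RGB and measures plain Euclidean distance in sRGB, which is only a rough proxy for perceived difference. The `sampled` pixel value is hypothetical, and both function names are invented here.

```python
def hex_to_rgb(hex_code: str) -> tuple[int, int, int]:
    """Convert '#RRGGBB' to an (R, G, B) tuple of 0-255 integers."""
    h = hex_code.lstrip("#")
    if len(h) != 6:
        raise ValueError(f"expected #RRGGBB, got {hex_code!r}")
    return tuple(int(h[i:i + 2], 16) for i in (0, 2, 4))

def rgb_distance(a, b) -> float:
    """Euclidean distance in sRGB (0 = identical, about 441.7 = black vs white)."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

target = hex_to_rgb("#4A90D9")   # medium blue from the prompt
sampled = (70, 140, 210)         # hypothetical pixel read from a generated image
print(target, round(rgb_distance(target, sampled), 1))
```

Any pass threshold on this distance would be a judgment call. Lighting and shading in a rendered scene also shift pixel values away from the flat target color, so sampling several points is more informative than sampling one.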

Hard 100%vs67%

fal/krea/v2-medium-turboPASS

A minimalist room with walls in #F5E6CC (warm cream), a sofa in #2C3E50 (dark navy), and accent pillows in #E74C3C (bright red)

Evaluator (Gemma 4 26B)

The image accurately reflects the requested colors. The walls are a warm cream/off-white color, the sofa is a dark navy blue, and the accent pillows are a bright red. All three color-coded elements match the descriptions and the intended hex values provided in the prompt. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A product flat-lay on a #1A1A2E (deep dark blue) background, with a #F0C27F (soft gold) watch and #FFFFFF (pure white) earbuds

Evaluator (Gemma 4 26B)

The image accurately reflects the requested colors. The background is a deep, dark navy blue consistent with #1A1A2E. The watch features a soft gold metallic strap and case, matching the #F0C27F description. The earbuds and their charging case are a clean, pure white (#FFFFFF). All color requirements from the prompt are met. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A graphic design composition with background #FAFAFA (near-white), main shape #FF4500 (orange-red), and accent lines #333333 (dark charcoal)

Evaluator (Gemma 4 26B)

The image accurately reflects the requested colors. The background is a near-white/very light gray, consistent with #FAFAFA. The large, wavy main shape is a vibrant orange-red, matching the description of #FF4500. The thin accent line is a dark charcoal/near-black, consistent with #333333. All three specified colors are present and correctly applied to the designated elements. VERDICT: PASS

Extreme 100%vs100%

fal/krea/v2-medium-turboPASS

A smooth gradient on a wall going from #FF0000 (red) on the left to #0000FF (blue) on the right, transitioning evenly through purple

Evaluator (Gemma 4 26B)

The image shows a smooth gradient transitioning from a bright red on the left to a blue on the right, passing through a purple middle section. The colors align with the requested hex values: the left side is a vibrant red (#FF0000), the right side is a clear blue (#0000FF), and the transition through purple is even and smooth as specified. VERDICT: PASS
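
For reference, an even red-to-blue transition can be computed directly. The sketch below assumes simple linear interpolation in sRGB, which is one common way to build such a gradient; the prompt does not specify a color space. The helper `lerp_hex` is a hypothetical name.

```python
def lerp_hex(start: str, end: str, t: float) -> str:
    """Linearly interpolate two '#RRGGBB' colors in sRGB; t runs from 0 to 1."""
    a = [int(start[i:i + 2], 16) for i in (1, 3, 5)]
    b = [int(end[i:i + 2], 16) for i in (1, 3, 5)]
    # Round half up so the midpoint of 0 and 255 becomes 128.
    mixed = [int(x + (y - x) * t + 0.5) for x, y in zip(a, b)]
    return "#{:02X}{:02X}{:02X}".format(*mixed)

# Five evenly spaced stops from left (red) to right (blue).
stops = [lerp_hex("#FF0000", "#0000FF", i / 4) for i in range(5)]
print(stops)
# ['#FF0000', '#BF0040', '#800080', '#4000BF', '#0000FF']
```

The middle stop, #800080, is a purple, consistent with the prompt's "transitioning evenly through purple".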

fal/krea/v2-medium-turboPASS

A sunset sky gradient on a flat surface from #FF6B35 (orange) at the bottom to #1B1464 (deep indigo) at the top

Evaluator (Gemma 4 26B)

The image displays a vertical gradient. The top is a very dark, deep indigo/navy color, which aligns with #1B1464. The bottom features a vibrant orange-red hue, which aligns with the described #FF6B35 (orange). The transition between the deep indigo and the orange is smooth and follows the requested color scheme accurately. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A horizontal banner with a gradient from #00C9FF (cyan) on the left to #92FE9D (mint green) on the right

Evaluator (Gemma 4 26B)

The image features a horizontal banner with a gradient. On the left side, the color is a bright cyan, which aligns well with the specified #00C9FF. On the right side, the color transitions into a light, pale mint green, which matches the description of #92FE9D. The gradient transition is smooth and accurately follows the color instructions provided in the prompt. VERDICT: PASS

Photorealism 67%vs67%

Photorealism compares whether fal/krea/v2-medium-turbo or fal/bria/fast produces more polished product and studio imagery with realistic materials, lighting, shadows, and finish.

Easy 67%vs67%

fal/krea/v2-medium-turboPASS

A single red sneaker on a clean white background with soft studio lighting and gentle shadow — product photography style

Evaluator (Gemma 4 26B)

The image successfully meets the prompt's requirements. The lighting is soft and professional, creating a natural shadow on the clean white background. The textures of the red fabric, the white laces, and the rubber sole are sharp and realistic. There are no obvious AI artifacts or distortions in the geometry of the shoe or the laces. It looks like a high-quality, studio-shot product photograph. VERDICT: PASS

fal/krea/v2-medium-turboFAIL

A clear glass perfume bottle on a reflective black surface with studio lighting, no background distractions

Evaluator (Gemma 4 26B)

The image contains significant AI artifacts that prevent it from looking like a real photograph. The geometry of the glass bottle is inconsistent and warped, particularly where the cap meets the sprayer. The internal reflections and refractions are physically impossible and nonsensical, with light paths that do not align with the bottle's structure. The reflection on the surface is also distorted and lacks the structural integrity of a real physical object. VERDICT: FAIL

fal/krea/v2-medium-turboPASS

A ripe green apple on a seamless white background with soft side lighting and subtle shadow — catalog style

Evaluator (Gemma 4 26B)

The image successfully captures a studio-quality product shot. The lighting is soft and natural, creating a realistic subtle shadow on the seamless white background. The texture of the green apple, including the slight color variations and the stem, looks highly realistic and lacks obvious AI artifacts or distortions. It perfectly matches the "catalog style" requested in the prompt. VERDICT: PASS

Graphical design 58%vs54%

Data Visualisation 0%vs33%

Data Visualisation compares whether fal/krea/v2-medium-turbo or fal/bria/fast produces charts with more correct labels, proportions, colors, and readable values.

Easy 0%vs33%

fal/krea/v2-medium-turboFAIL

A bar chart showing monthly sales for January through June with values 12K, 18K, 15K, 22K, 19K, 25K — labeled axes, color-coded bars, and a title 'Monthly Sales 2025'

Evaluator (Qwen 3 VL)

The chart correctly displays a bar graph titled "Monthly Sales 2025" with labeled axes and color-coded bars for January–June. Values match the prompt: 12K, 18K, 15K, 22K, 19K, 25K. However, the Y-axis labels are misaligned and inconsistent (e.g., "20K" appears twice), making it visually confusing. The core data and chart type are accurate, but axis labeling errors compromise readability. VERDICT: FAIL

fal/krea/v2-medium-turboFAIL

A pie chart showing market share: Company A 35%, Company B 25%, Company C 20%, Others 20% — each slice a different color with labels and percentages

Evaluator (Qwen 3 VL)

The chart is a pie chart with correct colors and labels, but contains critical errors: “Company A” is incorrectly labeled as 25% in one slice and 35% in another, violating the prompt’s 35% requirement. “Company C” is correctly labeled 20%, but “Others” is also 20%. The chart type matches, but data accuracy fails. VERDICT: FAIL
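
The proportions a correct rendering of this prompt should show can be worked out from the stated shares. The sketch below is illustrative only; `slice_angles` is a hypothetical helper, not something the benchmark uses. It checks that the shares sum to 100 and converts each one to a slice angle.

```python
# Market shares as given in the prompt, in percent.
shares = {"Company A": 35, "Company B": 25, "Company C": 20, "Others": 20}

def slice_angles(percentages: dict[str, float]) -> dict[str, float]:
    """Convert percentage shares to pie-slice angles in degrees."""
    total = sum(percentages.values())
    if abs(total - 100) > 1e-9:
        raise ValueError(f"shares sum to {total}, expected 100")
    return {label: 360 * pct / 100 for label, pct in percentages.items()}

angles = slice_angles(shares)
print(angles)
# {'Company A': 126.0, 'Company B': 90.0, 'Company C': 72.0, 'Others': 72.0}
```

A slice set in which "Company A" appears as both 25% and 35%, as the evaluator describes, would fail the sum check once both labels are counted.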

fal/krea/v2-medium-turboFAIL

A simple line graph showing daily website visitors over one week (Mon-Sun): 1200, 1500, 1100, 1800, 2000, 1700, 1400 — with grid lines, axis labels, and title 'Weekly Traffic'

Evaluator (Qwen 3 VL)

The chart shows a line graph titled "Weekly Traffic" but contains critical errors: axis labels are misaligned (e.g., "Tudy" instead of "Monday"), and y-axis values are incorrect (e.g., "5000" repeated instead of proper scaling). Data points are labeled incorrectly (e.g., "110" instead of "1100"). The chart type is correct, but the data and labels are fundamentally wrong and unreadable. VERDICT: FAIL

Layout & Design 67%vs33%

Layout & Design compares whether fal/krea/v2-medium-turbo or fal/bria/fast produces the cleaner requested composition with the right hierarchy and required elements.

Easy 100%vs67%

fal/krea/v2-medium-turboPASS

A movie night poster with a large popcorn image in the center, the title 'MOVIE NIGHT' in bold at the top, and the tagline 'Every Friday at 8 PM' at the bottom

Evaluator (Qwen 3 VL)

The poster features a large popcorn image centered, with "MOVIE NIGHT" in bold gold text dominating the top half. The tagline "Every Friday at 8 PM." is visible below, though slightly smaller and angled. Text is readable and correctly placed per the prompt’s layout. Visual hierarchy is clear, with title prioritized over tagline. All required elements are present. VERDICT: PASS

fal/krea/v2-medium-turboPASS

An event poster featuring a guitar illustration, the headline 'LIVE MUSIC FEST' in large text, and 'Tickets available now' as a tagline below

Evaluator (Qwen 3 VL)

The poster includes a stylized guitar illustration, the headline "LIVE MUSIC FEST" in large, distressed text, and the tagline "Tickets available now" below it. Text is readable and correctly placed with proper visual hierarchy: headline dominates, tagline is secondary, and guitar is central. Layout matches prompt requirements. VERDICT: PASS

fal/krea/v2-medium-turboPASS

A book club poster with an open book illustration, the title 'JOIN THE BOOK CLUB' prominently displayed, and 'Meets every Wednesday' as the tagline

Evaluator (Qwen 3 VL)

The image displays "JOIN THE BOOK CLUB" prominently at the top in large, bold text. Below it, "Meets every Wednesday." is clearly visible in smaller font. An open book illustration is centered beneath the text. Background elements include faint bookshelves and coffee cups. All required text is readable and correctly placed with proper visual hierarchy. VERDICT: PASS

Medium 33%vs0%

fal/krea/v2-medium-turboFAIL

A landing page design with a hero header image at the top, a headline and subtitle in the middle, a 'Sign Up' call-to-action button, and a footer with three social media icons

Evaluator (Qwen 3 VL)

The image shows a hero header with mountains, a headline ("Helchodp we siod"), a subtitle ("Sign Up"), and a red "Sign Up" button. Social media icons (Facebook, Instagram, Discord) are at the bottom. However, the headline text is gibberish and unreadable, violating the requirement for readable text. The layout structure is present but text fails core criteria. VERDICT: FAIL

fal/krea/v2-medium-turboPASS

A newsletter layout with a banner image header, a two-column body section with text on the left and an image on the right, a highlighted quote block, and a footer with unsubscribe link

Evaluator (Qwen 3 VL)

The image shows a banner header, two-column layout (text left, image right), a highlighted quote block with quotation marks, and a footer with “Unubsscribe” and “Contact us.” Text is placeholder lines, not readable content. The layout structure matches the prompt, though text readability is absent. Core layout elements are correctly positioned. VERDICT: PASS

fal/krea/v2-medium-turbo FAIL

A restaurant menu layout with a logo header, three category sections (appetizers, mains, desserts) each with items and prices, and a footer with address and hours

Evaluator (Qwen 3 VL)

The image shows a menu with a logo header, three category sections (Appetizers, Main courses, Desserts), and a footer with address/hours. However, all text is garbled and unreadable, violating the requirement for readable, correctly placed text. The layout structure is present but functionally useless due to illegible content. VERDICT: FAIL

Hard 67% vs 33%

fal/krea/v2-medium-turbo PASS

A magazine double-page spread with a left page containing a large hero photo, and a right page with a two-column article, a pull quote, a sidebar with author bio, and a page number

Evaluator (Qwen 3 VL)

The left page features a large hero photo as required. The right page has a two-column article layout, a pull quote, and a sidebar with author bio. Text is readable and correctly placed. All described layout zones are present with appropriate visual hierarchy. VERDICT: PASS

fal/krea/v2-medium-turbo PASS

A mobile app UI mockup showing a dashboard with a top navigation bar, a search field, a 2x2 grid of feature cards with icons, a list of recent activity items, and a bottom tab bar

Evaluator (Qwen 3 VL)

The image shows a mobile app UI with a top navigation bar (hamburger menu, search, profile), a 2x2 grid of feature cards with icons, a recent activity list with placeholder text, and a bottom tab bar. All layout zones are present with correct visual hierarchy. No readable text is visible, but placeholder text is appropriately placed for activity items. VERDICT: PASS

fal/krea/v2-medium-turbo FAIL

A fashion magazine spread with a full-bleed left-page photo, right page with three-column layout, headline, body copy, a smaller inset image, and styled page numbers

Evaluator (Qwen 3 VL)

Left page has full-bleed photo as required. Right page shows three-column layout, headline, body copy, and inset image. Page numbers are present. However, all text is gibberish and unreadable, violating the requirement for readable text. Layout structure is correct, but content fails core criteria. VERDICT: FAIL
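Each evaluator note above closes with a `VERDICT: PASS` or `VERDICT: FAIL` marker. The following is a minimal sketch of how that marker could be read back out of a note; the function name `extract_verdict` and the regular expression are illustrative assumptions, not part of the benchmark's own tooling. A note that has been cut off before its marker yields `None` rather than a guessed label.

```python
import re
from typing import Optional

# Assumed pattern: the note ends with "VERDICT: PASS" or "VERDICT: FAIL".
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\s*$")


def extract_verdict(note: str) -> Optional[str]:
    """Return "PASS" or "FAIL" if the note ends with a verdict marker, else None."""
    match = VERDICT_RE.search(note.strip())
    return match.group(1) if match else None


# Excerpts of evaluator notes quoted from this page.
complete_note = (
    "Layout structure is correct, but content fails core criteria. VERDICT: FAIL"
)
cut_off_note = "Core layout elements are correctly positioned."

fail_label = extract_verdict(complete_note)   # marker present
missing_label = extract_verdict(cut_off_note)  # no marker, so no label
```

Returning `None` for a note without a marker keeps a missing verdict distinct from a FAIL, which matters because the page asks readers to treat the labels as machine-assisted assessments.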

Style Diversity 67% vs 75%

Style Diversity compares whether fal/krea/v2-medium-turbo or fal/bria/fast better shifts between requested visual styles like oil painting, pixel art, watercolor, and technical illustration.

Easy 67% vs 67%

fal/krea/v2-medium-turbo PASS

A golden retriever sitting in a garden, painted in the style of an oil painting with visible brushstrokes and rich texture

Evaluator (Qwen 3.5 122B)

The image clearly exhibits an oil painting style with visible, expressive brushstrokes throughout the composition. The texture is rich, particularly in the dog’s fur and the surrounding foliage, where thick impasto-like applications are evident. Color blending and light handling mimic traditional oil techniques. The garden setting and golden retriever subject are rendered with painterly abstraction rather than photorealism, fully aligning with the requested aesthetic. VERDICT: PASS

fal/krea/v2-medium-turbo PASS

A golden retriever sitting in a garden, rendered in Japanese anime style with flat colors and large expressive eyes

Evaluator (Qwen 3.5 122B)

The image clearly exhibits the requested Japanese anime style. Key characteristics are present: the golden retriever has large, expressive eyes with distinct highlights, simplified facial features, and flat color shading with minimal gradients. The line work is clean and outlines are visible, consistent with anime aesthetics. The background garden is rendered in a complementary illustrative style that supports the overall anime look. VERDICT: PASS

fal/krea/v2-medium-turbo FAIL

A golden retriever sitting in a garden, rendered in pixel art style with a 32x32 grid and limited color palette

Evaluator (Qwen 3.5 122B)

The image depicts a golden retriever in a garden setting, matching the subject matter. However, the requested style specifies a "32x32 grid" — meaning the entire image should be composed of exactly 32 pixels wide and 32 pixels tall. The actual image is significantly larger (approximately 64x64 or more), with each “pixel” being a block of multiple real pixels. While it mimics pixel art visually, it does not adhere to the strict 32x32 resolution constraint. Thus, the key technical characteristic i

Medium 100% vs 100%

fal/krea/v2-medium-turbo PASS

A city street scene in the style of Art Deco — geometric shapes, gold and black palette, ornamental symmetry

Evaluator (Qwen 3.5 122B)

The image strongly exhibits Art Deco characteristics: geometric building forms, stylized human figures, and a dominant gold-and-black palette with red accents. Ornamental borders frame the scene symmetrically. The car and streetlamp are rendered in a simplified, graphic manner consistent with 1920s–30s design. All requested visual traits are clearly present and well-executed. VERDICT: PASS

fal/krea/v2-medium-turbo PASS

A woman with a parasol in the style of Impressionism — soft brushstrokes, dappled light, pastel palette

Evaluator (Qwen 3.5 122B)

The image exhibits clear Impressionist characteristics: soft, visible brushstrokes throughout; dappled sunlight filtering through trees; and a pastel-dominated palette with greens, pinks, and whites. The woman’s dress and the background foliage are rendered with loose, painterly technique typical of the style. However, she holds a modern-looking cup with a straw — not a parasol — which contradicts the prompt’s subject. Despite this, the *style* itself is accurately executed. VERDICT: PASS

fal/krea/v2-medium-turbo PASS

A portrait of a man in Pop Art style — bold outlines, Ben-Day dots, flat vivid primary colors

Evaluator (Qwen 3.5 122B)

The image clearly exhibits Pop Art characteristics: bold black outlines define the figure, Ben-Day dots are used extensively for shading and texture (visible on skin, hat, and background), and vivid primary/secondary colors dominate (pink background, blue hat, yellow tie). The stylized, graphic quality matches the requested aesthetic precisely. All key visual elements of the specified style are present and well-executed. VERDICT: PASS

Hard 33% vs 67%

fal/krea/v2-medium-turbo FAIL

A forest rendered in photorealistic watercolor — the detail and lighting of a photo combined with visible watercolor bleeds and paper texture

Evaluator (Qwen 3.5 122B)

The image displays a watercolor-style forest with visible bleeds, washes, and paper texture. However, it lacks the “photorealistic” detail and lighting specified — trees are abstracted, shadows and highlights are not photo-accurate, and forms are loosely defined. While watercolor characteristics are present, the photorealism component is missing. Core criteria not fully met. VERDICT: FAIL

fal/krea/v2-medium-turbo PASS

A woman in a futuristic city rendered in cyberpunk Art Nouveau — neon colors and tech elements combined with flowing organic lines and floral borders

Evaluator (Qwen 3.5 122B)

The image successfully combines cyberpunk and Art Nouveau styles. The futuristic city with neon lights and tech elements (headphones, glowing cube) fulfills the cyberpunk aspect. The flowing organic lines, floral borders, and ornate frame clearly represent the Art Nouveau style. The color palette uses vibrant neon colors as requested. All key visual characteristics are present and well-integrated. VERDICT: PASS

fal/krea/v2-medium-turbo FAIL

A medieval castle depicted in Japanese ukiyo-e woodblock print style combined with photorealistic sky and lighting

Evaluator (Qwen 3.5 122B)

The image displays a medieval castle in a stylized woodblock print aesthetic reminiscent of ukiyo-e, with bold outlines, flat color areas, and textured paper effects. However, the sky and lighting are not photorealistic — they’re rendered in the same illustrative, graphic style as the castle, with stylized clouds and sunbeams. The prompt specifically requested photorealistic sky/lighting combined with ukiyo-e style, which is not achieved here. VERDICT: FAIL

Extreme 67% vs 67%

fal/krea/v2-medium-turbo PASS

An interior scene rendered in soft brutalism — raw concrete textures combined with rounded pastel furniture and warm diffused light

Evaluator (Qwen 3.5 122B)

The image clearly exhibits soft brutalism with raw concrete walls, ceiling, and structural elements. Rounded pastel furniture — including the lime-green and cream sofa, ottoman, and curved partitions — is prominently featured. Warm diffused light enters through the window, casting gentle shadows and enhancing the serene mood. All key visual characteristics of the requested style are present and well-executed. VERDICT: PASS

fal/krea/v2-medium-turbo PASS

A still life painted in anxious minimalism — sparse composition with uncomfortable negative space, muted colors, and subtle visual tension

Evaluator (Qwen 3.5 122B)

The image exhibits a sparse composition with significant negative space, particularly in the upper and lower regions. The color palette is muted, featuring soft yellows, browns, and purples. The brushwork is loose and expressive, creating a sense of visual tension and unease rather than a polished, realistic look. The subject matter (abstract blocks) is minimal. These elements align well with the "anxious minimalism" description. VERDICT: PASS

fal/krea/v2-medium-turbo FAIL

A cityscape rendered in nostalgic futurism — retro-futuristic 1960s space-age aesthetics with a melancholic warm-toned patina

Evaluator (Qwen 3.5 122B)

The image displays a cityscape with a warm, sepia-toned patina and visible film grain/scratches, which aligns with the "melancholic warm-toned patina" requirement. However, the architecture is generic modernist skyscrapers, lacking the specific "retro-futuristic 1960s space-age" elements (such as atomic shapes, domes, or distinct mid-century sci-fi motifs). The style is nostalgic but not specifically retro-futuristic. VERDICT: FAIL
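The tier and category percentages for fal/krea/v2-medium-turbo in Style Diversity can be reproduced from the twelve PASS/FAIL labels listed above. The sketch below assumes that a rate is the share of PASS verdicts rounded to the nearest whole percent, and that the category figure pools all samples; the page does not state its exact formula, so both choices are assumptions, and `pass_rate` is a hypothetical helper name.

```python
from typing import Dict, List

# Verdict labels for fal/krea/v2-medium-turbo, copied from the Style Diversity
# entries on this page, in the order they appear within each tier.
STYLE_DIVERSITY: Dict[str, List[str]] = {
    "Easy": ["PASS", "PASS", "FAIL"],
    "Medium": ["PASS", "PASS", "PASS"],
    "Hard": ["FAIL", "PASS", "FAIL"],
    "Extreme": ["PASS", "PASS", "FAIL"],
}


def pass_rate(verdicts: List[str]) -> int:
    """Share of PASS verdicts as a whole percent (assumed rounding rule)."""
    if not verdicts:
        return 0
    return round(100 * verdicts.count("PASS") / len(verdicts))


# Per-tier rates: one entry per difficulty level.
tier_rates = {tier: pass_rate(labels) for tier, labels in STYLE_DIVERSITY.items()}

# Category rate: pool every sample across tiers (assumed aggregation).
all_labels = [label for labels in STYLE_DIVERSITY.values() for label in labels]
category_rate = pass_rate(all_labels)
```

Under these assumptions the tiers come out at 67, 100, 33, and 67 percent, and the pooled category rate is 8 of 12, or 67 percent, which agrees with the figures shown for this model.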