ImageBench

vs

192 evaluations across 6 categories

Pass Rate 76% vs 96%
Avg Latency 4.1s vs 45.3s

Text Rendering 40% vs 100%
Spatial Reasoning 75% vs 97%
Human realism 95% vs 98%
Truthfulness 67% vs 89%
Professional Studio 89% vs 93%
Graphical design 58% vs 100%
Latency 59% vs 0%

Text Rendering 40% vs 100%

Typography Style 67% vs 100%
Easy 67% vs 100%
bfl/flux-2-klein-9b: FAIL
The word 'CHAPTER ONE' typed on aged paper with a vintage typewriter font, complete with slightly uneven ink

Evaluator

The text “CHAPTER ONE” is rendered in a font that strongly resembles a vintage typewriter style — blocky, slightly uneven, with visible ink smudges and imperfections that mimic the look of old typewriter output. The paper background is aged, with creases, stains, and a yellowed texture, matching the “aged paper” description. The ink appears slightly uneven and worn, consistent with the prompt’s request. However, there is a spelling error: the word “CHAPTER” has an apostrophe inserted between th

bfl/flux-2-klein-9b: PASS
The phrase 'TODAY'S SPECIAL' written in white chalk on a dark green chalkboard with chalk dust

Evaluator

The text “TODAY’S SPECIAL” is rendered in a style that convincingly mimics white chalk on a dark green chalkboard. The letters have a granular, textured appearance consistent with chalk dust, and there are visible specks of chalk dust scattered around and within the letters, especially to the right. The surface shows faint smudges and erasure marks, enhancing the realism of a used chalkboard. The spelling is correct: “TODAY’S SPECIAL” (with proper apostrophe). VERDICT: PASS

bfl/flux-2-klein-9b: PASS
The word 'OPEN' displayed on a red LED sign with glowing pixel-style letters

Evaluator

The text “OPEN” is rendered in a pixel-style composed of individual glowing red LED dots, which convincingly matches the described medium of a red LED sign. The letters are clearly formed with the characteristic blocky, dot-matrix appearance typical of LED displays. The spelling is correct: “O-P-E-N”. VERDICT: PASS
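Each complete evaluator response above closes with `VERDICT: PASS` or `VERDICT: FAIL`, while responses cut off by the page stop mid-word. The sketch below shows one way such a verdict could be pulled out of the free-text response. The function name and the choice to return `None` for a truncated response are assumptions for illustration, not the benchmark's actual code.

```python
import re
from typing import Optional

# Assumed format: the evaluator ends its response with "VERDICT: PASS" or "VERDICT: FAIL".
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\b")


def extract_verdict(evaluator_text: str) -> Optional[str]:
    """Return "PASS" or "FAIL", or None when no complete verdict is present."""
    matches = VERDICT_RE.findall(evaluator_text)
    # Take the last match in case the word VERDICT is quoted earlier in the text.
    return matches[-1] if matches else None
```

A truncated response such as one ending in `VERDI` yields `None`, so the verdict would have to come from the PASS/FAIL badge shown next to the model name instead.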

Writing accuracy 33% vs 100%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A single large word 'HELLO' centered on a white background

Evaluator

The word displayed is "HELLO", which is spelled correctly and fully readable. All five letters — H, E, L, L, O — are present, clear, and unambiguous. There are no misspellings, missing letters, or illegible parts. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
The word 'ADVENTURE' in big bold letters centered on a plain cream background

Evaluator

The word "ADVENTURE" is spelled correctly and is fully readable. All letters are clearly visible and properly formed. There are no misspellings, missing words, or illegible parts. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
The word 'SERENITY' written large and centered on a solid light blue background

Evaluator

The word "SERENITY" is spelled correctly and is fully readable. All letters are clearly visible and properly formed against the solid light blue background. There are no misspellings, missing words, or illegible parts. VERDICT: PASS

Medium 33% vs 100%
bfl/flux-2-klein-9b: PASS
The sentence 'The quick brown fox jumps high' in clean black text on a white background

Evaluator

All words are spelled correctly and fully readable: - "The" - "quick" - "brown" - "fox" - "jumps" - "high" There are no misspellings, missing words, or illegible parts. VERDICT: PASS

bfl/flux-2-klein-9b: FAIL
'Stars shine brightest in the dark night' displayed on a clean off-white background

Evaluator

The word "brightest" is misspelled. In the image, it appears as "brigh!st", where the "i" is replaced by an exclamation mark "!" and the "t" is missing. This is a clear error in spelling. Additionally, the word "dark night" is correctly spelled, and all other words ("Stars", "shine", "in", "the", "dark", "night") are spelled correctly. However, because the intended phrase is "Stars shine brightest in the dark night" and the word "brightest" is misspelled, the text is not fully accurate. VERDI

bfl/flux-2-klein-9b: FAIL
'Every sunset brings a new dawn ahead' written neatly on a pale yellow background

Evaluator

The text in the image contains several errors compared to the intended phrase: - "sunset" is misspelled as "surnset" - "brings" is misspelled as "brimgs" - "dawn" is misspelled as "nown" All words are fully visible and readable, but they are not spelled correctly. VERDICT: FAIL
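The writing-accuracy check comes down to comparing the intended phrase with the text that appears in the image. A minimal sketch of that comparison follows, assuming the rendered text has already been transcribed to a string. The `rendered` value is a hypothetical transcription reconstructed from the three errors the evaluator lists, and the function name is illustrative.

```python
from typing import List, Tuple


def misspelled_words(intended: str, rendered: str) -> List[Tuple[str, str]]:
    """Pair words by position and return the (intended, rendered) pairs that differ."""
    intended_words = intended.split()
    rendered_words = rendered.split()
    # A positional comparison is only valid when both strings have the same number
    # of words; inserted or dropped words would need a proper alignment step.
    if len(intended_words) != len(rendered_words):
        raise ValueError("word counts differ; positional comparison is not valid")
    return [(a, b) for a, b in zip(intended_words, rendered_words) if a != b]


intended = "Every sunset brings a new dawn ahead"
# Hypothetical transcription, rebuilt from the errors reported by the evaluator.
rendered = "Every surnset brimgs a new nown ahead"
mismatches = misspelled_words(intended, rendered)
verdict = "PASS" if not mismatches else "FAIL"
```

Under these assumptions `mismatches` holds the three word pairs named in the evaluation, which is enough to fail the prompt.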

Hard 0% vs 100%
bfl/flux-2-klein-9b: FAIL
A poster with headline 'SUMMER SALE' in large bold text, body text 'Up to 50% off all items this weekend only' in medium size, and fine print 'Terms and conditions apply. See store for details.' at the bottom in small text

Evaluator

The word "items" in the body text is misspelled as "iems". VERDICT: FAIL

bfl/flux-2-klein-9b: FAIL
A flyer with large title 'GRAND OPENING', subtitle 'Join us for food, music, and fun' in medium text, and small disclaimer 'RSVP required. Limited seating available.' at the bottom

Evaluator

The intended text was: - Large title: "GRAND OPENING" — ✅ Correct and fully readable. - Subtitle: "Join us for food, music, and fun" — ✅ Correct and fully readable. - Small disclaimer: "RSVP required. Limited seating available." — ✅ Correct and fully readable. However, in the image, the subtitle is duplicated incorrectly: - The second line reads: "Join us te food, music, aval fun" - "te" is a misspelling of "for" - "aval" is a misspelling of "and" These are errors in the actual image, not

bfl/flux-2-klein-9b: FAIL
An event banner with headline 'ANNUAL GALA 2025' at the top in large serif font, description 'An evening of fine dining and live jazz' in medium text, and footnote 'Black tie optional. Doors open at 6 PM.' in small text

Evaluator

The text in the image contains several errors: - “An evening df of fine” → “df” is not “of” (letters appear distorted or replaced). - “tnimingn and ld dive” → “tnimingn” is not “dining”, and “ld” is not “live”. - “jazz” is correctly spelled, but it’s on its own line, which is fine per layout, but the preceding line is corrupted. The bottom line “Black tie optional. Doors open at 6 PM.” is correct. VERDICT: FAIL

Extreme 0% vs 100%
bfl/flux-2-klein-9b: FAIL
A vintage movie poster for 'THE LAST VOYAGE' featuring a tagline 'One ship. One chance. No turning back.' with credits block reading 'Directed by James Harlow. Starring Elena Voss, Marcus Chen, and Sofia Reyes. Music by Thomas Grant. A Paramount Pict

Evaluator

The text in the image contains several errors compared to the intended text: 1. The tagline reads: "ONE ship. ONE CHANCLE:" — "CHANCLE" is misspelled; it should be "CHANCE". 2. The title is "THE VOY AGE" — the words "VOY" and "AGE" are split, but the intended title is "THE LAST VOYAGE". The word "LAST" is missing entirely. 3. In the credits block: - "MARRUS CHEN" — "Marcus Chen" is misspelled. - "SOFIA REYEEES." — "Sofia Reyes" is misspelled (extra 'E' at the end). - "A PARAIOUT PICTUR

bfl/flux-2-klein-9b: FAIL
A product label for 'GOLDEN HARVEST ORGANIC HONEY' with subtitle 'Raw & Unfiltered', weight '12 oz (340g)', description 'Pure wildflower honey harvested from sustainable apiaries in the Pacific Northwest. No additives or preservatives.', and small te

Evaluator

The text on the label contains several errors compared to the intended text: 1. "HAVESTOT" is misspelled — should be "HARVEST". 2. "Unflteted" is misspelled — should be "Unfiltered". 3. "honeyed" is misspelled — should be "honey". 4. "presertives" is misspelled — should be "preservatives". 5. "Best beere" is misspelled — should be "Best before". 6. The word "apiaries" is correctly spelled, but the phrase "honeyed from" is incorrect — it should be "honey from". 7. The weight "12 oz (340g)" is co

bfl/flux-2-klein-9b: FAIL
A magazine cover titled 'NATIONAL EXPLORER' with cover lines 'The Hidden Temples of Cambodia — page 34', 'Interview: Dr. Maria Santos on Climate Action', 'Top 10 Trails You Must Hike This Year', and a barcode area with 'Issue 247 | March 2025 | $6.99

Evaluator

The intended text was: - "A magazine cover titled 'NATIONAL EXPLORER' with cover lines 'The Hidden Temples of Cambodia — page 34', 'Interview: Dr. Maria Santos on Climate Action', 'Top 10 Trails You Must Hike This Year', and a barcode area with 'Issue 247 | March 2025 | $6.99'". Actual text on the image: - Title: "NATIONAL EXPLORER" — ✅ Correct. - "The Temples of Cambodia — page 34" — ✅ Correct. - "Interview: Dr. Maria Santos Action" — ❌ "Action" is not preceded by "on", and "Santos Action" is
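The Text Rendering percentages for bfl/flux-2-klein-9b can be checked against the individual verdicts listed above. The sketch below assumes that each reported figure is simply the share of PASS verdicts pooled over the evaluations in that group, rounded to a whole percent. That assumption reproduces the 67%, 33%, and 40% shown here, but the page does not state how the figures are computed.

```python
from typing import Dict, List, Tuple


def pass_rate(verdicts: List[str]) -> float:
    """Share of verdicts equal to "PASS", as a percentage."""
    return 100.0 * sum(v == "PASS" for v in verdicts) / len(verdicts)


# Verdicts for bfl/flux-2-klein-9b as listed above, keyed by (subcategory, difficulty).
results: Dict[Tuple[str, str], List[str]] = {
    ("Typography Style", "Easy"): ["FAIL", "PASS", "PASS"],
    ("Writing accuracy", "Easy"): ["PASS", "PASS", "PASS"],
    ("Writing accuracy", "Medium"): ["PASS", "FAIL", "FAIL"],
    ("Writing accuracy", "Hard"): ["FAIL", "FAIL", "FAIL"],
    ("Writing accuracy", "Extreme"): ["FAIL", "FAIL", "FAIL"],
}


def subcategory_rate(name: str) -> int:
    """Pooled pass rate over every difficulty level of one subcategory."""
    pooled = [v for (sub, _), group in results.items() if sub == name for v in group]
    return round(pass_rate(pooled))


typography = subcategory_rate("Typography Style")  # 2 of 3 pass
writing = subcategory_rate("Writing accuracy")     # 4 of 12 pass
overall = round(pass_rate([v for group in results.values() for v in group]))  # 6 of 15 pass
```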

Spatial Reasoning 75% vs 97%

Attributes Binding 89% vs 100%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A red ball and a blue cube sitting side by side on a white surface

Evaluator

The image contains the following objects and attributes: 1. **Object 1:** A red sphere (ball). 2. **Object 2:** A blue cube. 3. **Surface:** White background. The prompt requested "A red ball and a blue cube sitting side by side on a white surface". - The red ball is present. - The blue cube is present. - They are side by side on a white surface. - There are no attribute swaps (e.g., no blue ball or red cube). VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A green apple and an orange pumpkin on a wooden cutting board

Evaluator

**Object Analysis:** * **Apple:** Observed as green. It is round with a visible stem and shiny skin. * **Pumpkin:** Observed as orange. It is large, ribbed, and has a thick, curved stem. * **Surface:** Observed as a wooden cutting board. It is a thick block of wood with visible knife marks. **Attribute Binding Check:** * The prompt requested a "green apple". The image shows a green apple. (Correct) * The prompt requested an "orange pumpkin". The image shows an orange pumpkin. (Correct

bfl/flux-2-klein-9b: PASS
A yellow taxi and a black limousine parked next to each other on a street

Evaluator

The image shows a yellow taxi on the left and a black limousine on the right, positioned side-by-side on a city street. * **Object 1:** Yellow taxi. * **Observed Attributes:** Yellow color, taxi roof light/sign, taxi markings on the door. * **Object 2:** Black limousine. * **Observed Attributes:** Black color, extended length (limousine shape). The attributes (color and vehicle type) are correctly bound to the respective objects with no swaps. VERDICT: PASS

Medium 100% vs 100%
bfl/flux-2-klein-9b: PASS
A big brown horse, a small white rabbit, and a medium-sized black cat standing in a meadow

Evaluator

The image contains the following objects with the observed attributes: 1. **Horse:** Big, brown. (Matches "big brown horse") 2. **Rabbit:** Small, white. (Matches "small white rabbit") 3. **Cat:** Medium-sized, black. (Matches "medium-sized black cat") 4. **Setting:** A grassy field with wildflowers. (Matches "standing in a meadow") All attributes are correctly bound to the right objects with no swaps. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A large red circle, a small blue triangle, and a medium green star arranged on a white canvas

Evaluator

Based on the image provided: 1. **Large red circle:** There is a prominent, large red circle on the left side. 2. **Small blue triangle:** There is a tiny blue triangle located near the bottom right edge of the red circle. 3. **Medium green star:** There is a green five-pointed star to the right of the circle, which is smaller than the circle but larger than the triangle. 4. **White canvas:** The background is a textured white surface. All attributes (color, size, shape) are correctly bound

bfl/flux-2-klein-9b: PASS
A tall thin white candle, a short fat red candle, and a medium spiral yellow candle on a shelf

Evaluator

The image contains three candles on a wooden shelf, matching the description perfectly: 1. **Left Object:** A tall, thin, white taper candle. (Matches "tall thin white candle") 2. **Middle Object:** A short, fat, red pillar candle. (Matches "short fat red candle") 3. **Right Object:** A medium-height, yellow, spiral-shaped candle sitting in a small brass dish. (Matches "medium spiral yellow candle") All attributes are correctly bound to the correct objects with no swaps. VERDICT: PASS

Hard 67% vs 100%
bfl/flux-2-klein-9b: PASS
A pink fire truck, a blue banana, and a green sun in a surreal landscape

Evaluator

The image contains the following objects with the specified attributes: - **Fire Truck:** It is pink. - **Banana:** It is blue. - **Sun:** It is green. All attributes are correctly bound to their respective objects as requested in the prompt. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A purple carrot, a red tree with red leaves, and a white watermelon sliced open

Evaluator

The image contains the following objects with the observed attributes: 1. **Carrot:** It is purple with green stems, matching the "purple carrot" description. 2. **Tree:** It has a brown trunk and vibrant red leaves, matching the "red tree with red leaves" description. 3. **Watermelon:** It is sliced open to reveal white flesh with seeds, matching the "white watermelon sliced open" description. All attributes are correctly bound to their respective objects. VERDICT: PASS

bfl/flux-2-klein-9b: FAIL
An orange elephant, a yellow penguin, and a silver strawberry arranged on a table

Evaluator

Based on the visual evidence: 1. **Orange Elephant:** The object on the left is indeed an elephant and it is orange. This matches the prompt. 2. **Yellow Penguin:** The object in the middle is yellow, but it is shaped like a chick or a duckling, not a penguin. Penguins are distinctively black and white. While the color is correct, the object identity (shape/species) is incorrect. 3. **Silver Strawberry:** The object on the right is a strawberry and it has a silver/metallic finish. This match

Compositionality 89% vs 100%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A tabby cat wearing oversized red sunglasses

Evaluator

Based on the image provided, here is the evaluation of the prompt elements: * **Tabby cat:** Present. The cat has the distinct striped and spotted coat pattern of a tabby. * **Oversized red sunglasses:** Present. The cat is wearing large sunglasses with bright red frames. * **Wearing:** Present. The sunglasses are positioned correctly on the cat's face. All described elements are present and combined coherently. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A penguin holding a colorful umbrella

Evaluator

The image successfully depicts the prompt "A penguin holding a colorful umbrella". * **Penguin:** Present. A cute, cartoon-style penguin is the central figure. * **Holding:** Present. The penguin is grasping the handle of the umbrella with its flippers. * **Colorful umbrella:** Present. The umbrella features distinct panels of blue, red, yellow, and green. All described elements are present and coherently combined. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A golden retriever wearing a tiny cowboy hat

Evaluator

The image contains the following elements from the prompt: * **A golden retriever:** Present. The subject is clearly a Golden Retriever with characteristic golden fur and facial features. * **Wearing:** Present. The dog is wearing the accessory. * **A tiny cowboy hat:** Present. The dog is wearing a brown cowboy-style hat that is sized appropriately for the dog (effectively "tiny" compared to a human hat). The elements are combined coherently, creating a realistic and charming portrait. V

Medium 100% vs 100%
bfl/flux-2-klein-9b: PASS
A medieval knight in full armor sitting at a desk typing on a modern laptop

Evaluator

Based on the image provided, here is the evaluation of the prompt elements: * **Medieval knight:** Present. The figure is clearly a knight. * **In full armor:** Present. The figure is wearing a complete suit of plate armor, including a helmet, breastplate, and gauntlets. * **Sitting at a desk:** Present. The knight is seated on a wooden chair at a wooden table/desk. * **Typing:** Present. The knight's gauntleted hands are positioned on the keyboard as if typing. * **Modern laptop:** P

bfl/flux-2-klein-9b: PASS
A Roman gladiator taking a selfie with a smartphone in the Colosseum

Evaluator

The image successfully depicts the prompt's unusual concept. Here is the breakdown of elements: 1. **Roman gladiator:** Present. The central figure is wearing a helmet with a red crest, a metal breastplate, and a red tunic/cloak, consistent with a Roman gladiator or soldier costume. 2. **Taking a selfie:** Present. The figure is holding a phone up with an extended arm, looking at the screen, in a classic selfie pose. 3. **Smartphone:** Present. A modern black smartphone is clearly visible in

bfl/flux-2-klein-9b: PASS
An astronaut in a spacesuit riding a bicycle through a park

Evaluator

The image successfully depicts all the requested elements: 1. **Astronaut in a spacesuit:** Present. The figure is wearing a full white spacesuit with an American flag patch and a reflective helmet visor. 2. **Riding a bicycle:** Present. The astronaut is straddling a blue mountain bike, holding the handlebars, with feet on the pedals. 3. **Through a park:** Present. The setting is a sunny park with green grass, large trees, a paved path, and a bench in the background. 4. **Coherent combina

Hard 67% vs 100%
bfl/flux-2-klein-9b: PASS
An octopus wearing a top hat and monocle playing chess on the surface of the moon with Earth visible in the background

Evaluator

The image contains all the requested elements combined coherently: 1. **Octopus:** Present (a large, textured octopus is the central figure). 2. **Top hat:** Present (a black top hat is worn on the octopus's head). 3. **Monocle:** Present (a gold-rimmed monocle is worn over one eye). 4. **Playing chess:** Present (a chessboard with black and white pieces is set up in front of the octopus). 5. **Surface of the moon:** Present (the ground is grey, cratered, and rocky). 6. **Earth visible in

bfl/flux-2-klein-9b: PASS
A dinosaur in a business suit giving a PowerPoint presentation in a modern office while drinking coffee

Evaluator

Based on the visual evidence, here is the breakdown of the prompt elements: 1. **Dinosaur:** Present (A T-Rex head and body are clearly visible). 2. **Business suit:** Present (The dinosaur is wearing a grey suit jacket, white shirt, and blue tie). 3. **Giving a PowerPoint presentation:** Present (There is a large screen displaying a slide with charts and graphs, typical of a presentation). 4. **Modern office:** Present (The background features large windows with a city view, a conference t

bfl/flux-2-klein-9b: FAIL
A robot dressed as a pirate sailing a paper boat on a lake of lava while juggling three kittens

Evaluator

The image contains most elements of the prompt, but fails on the specific count and action regarding the kittens. Here is the breakdown: - **Robot:** Present (a skeletal robot figure). - **Dressed as a pirate:** Present (pirate hat, coat, sash). - **Sailing a paper boat:** Present (the boat is clearly a folded paper boat). - **On a lake of lava:** Present (the water is molten lava with fire and smoke). - **Juggling three kittens:** **FAIL**. There are only **two** kittens visible (one in each h

Counting 33% vs 89%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
Exactly 3 red apples arranged on a wooden table

Evaluator

The image shows exactly three red apples sitting on a wooden surface. The count matches the request perfectly. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
Exactly 4 white candles standing on a dark surface

Evaluator

The image shows exactly 4 white pillar candles arranged in a row on a black surface. * **White candles:** 4 * **Dark surface:** Yes (black background/surface) The count matches the request perfectly. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
Exactly 5 yellow rubber ducks sitting in a row on a bathroom shelf

Evaluator

Based on the image provided: 1. **Count:** I see exactly **5** yellow rubber ducks lined up in a row. 2. **Description:** They are yellow, appear to be rubber ducks, are sitting in a row, and are placed on a white shelf against a tiled wall (consistent with a bathroom setting). The count and description match the request perfectly. VERDICT: PASS

Medium 0% vs 100%
bfl/flux-2-klein-9b: FAIL
2 orange cats and 3 black dogs sitting together on a grassy lawn

Evaluator

Based on the image provided: * **Orange cats:** I see **1** orange cat. (The prompt requested 2). * **Black dogs:** I see **2** black dogs. (The prompt requested 3). The counts do not match the requested description. VERDICT: FAIL

bfl/flux-2-klein-9b: FAIL
4 red roses and 2 white daisies in a glass vase on a table

Evaluator

Based on a visual inspection of the image: **Red Roses:** I count approximately **6** red roses (one far left, one top left, one center-left, one front center, one top right, one far right). **White Daisies:** I count **4** white daisies (one top left, one front left, one center right, one bottom right). The prompt requested "4 red roses and 2 white daisies". The actual image contains more flowers than requested (6 roses and 4 daisies). VERDICT: FAIL

bfl/flux-2-klein-9b: FAIL
1 blue butterfly and 3 yellow butterflies resting on a garden fence

Evaluator

I see 1 blue butterfly and 1 yellow butterfly resting on the fence. The prompt requested 3 yellow butterflies. The count does not match. VERDICT: FAIL

Hard 0% vs 67%
bfl/flux-2-klein-9b: FAIL
Exactly 9 goldfish swimming in a large round fishbowl

Evaluator

I see 7 goldfish in the image (6 orange and 1 white). The prompt requested exactly 9. VERDICT: FAIL

bfl/flux-2-klein-9b: FAIL
A fruit bowl containing exactly 3 bananas, 4 oranges, and 2 apples

Evaluator

Based on the image provided: - **Bananas:** I see 2 distinct bananas in the bunch (possibly a third hidden, but only 2 are clearly visible). - **Oranges:** I see 4 oranges (three clearly in the front/center and one partially visible in the back). - **Apples:** I see only 1 red apple. The prompt requested "exactly 3 bananas, 4 oranges, and 2 apples". The image contains approximately 2 bananas and only 1 apple. Therefore, the count does not match. VERDICT: FAIL
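The counting evaluations compare the number of objects requested in the prompt with the number the evaluator finds in the image. A minimal sketch of that comparison for the fruit bowl prompt is shown below. The observed counts are the evaluator's reported values, and the function name and dictionary layout are assumptions for illustration.

```python
from typing import Dict, Tuple


def count_mismatches(
    requested: Dict[str, int], observed: Dict[str, int]
) -> Dict[str, Tuple[int, int]]:
    """Map each object whose count is off to a (requested, observed) pair."""
    return {
        name: (want, observed.get(name, 0))
        for name, want in requested.items()
        if observed.get(name, 0) != want
    }


requested = {"bananas": 3, "oranges": 4, "apples": 2}
# Counts as reported by the evaluator for this image.
observed = {"bananas": 2, "oranges": 4, "apples": 1}

mismatches = count_mismatches(requested, observed)
# "Exactly N" prompts pass only when every requested count matches.
verdict = "PASS" if not mismatches else "FAIL"
```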

bfl/flux-2-klein-9b: FAIL
Exactly 11 colorful balloons floating against a clear blue sky

Evaluator

Based on a careful count of the image: 1. Red balloon (upper left) 2. Green balloon (top left) 3. Yellow balloon (top center) 4. Blue balloon (top right) 5. Pink balloon (top right) 6. White balloon (center left, partially hidden) 7. Orange balloon (center) 8. Yellow balloon (center left) 9. Blue balloon (center right) 10. Yellow balloon (partially hidden behind the center right blue balloon) 11. Red/Orange balloon (bottom left) 12. Green balloon (partially hidden below the center clus

Negation 56% vs 89%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A cat lying on a couch without a collar

Evaluator

The image shows a tabby/tortoiseshell cat lying comfortably on a grey couch. Looking closely at the cat's neck area, there is no visible collar, strap, or tag. The fur around the neck is uninterrupted. The excluded element (a collar) is truly absent. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A tropical beach scene with no people

Evaluator

The image depicts a serene tropical beach scene. On the left, there is a line of tall palm trees with lush green fronds bordering a stretch of white sand. The sand is smooth with some faint footprints visible, but no human figures are present. The ocean is a clear turquoise blue with gentle white waves rolling onto the shore. The sky is bright blue with scattered white clouds. I have carefully scanned the entire image, including the water, the shoreline, and the vegetation, and there are absolu

bfl/flux-2-klein-9b: PASS
A classroom with desks and a chalkboard but no students

Evaluator

The image shows a classroom interior with rows of empty wooden desks and chairs facing a green chalkboard. There is a clock on the wall, windows on the left, and a teacher's desk at the front. I have carefully scanned the entire room, including the desks, the front area, and the background, and there are absolutely no people or students visible in the scene. The excluded element is completely absent. VERDICT: PASS

Medium 67% vs 100%
bfl/flux-2-klein-9b: FAIL
A sandy beach with palm trees, seashells, and a beach umbrella but no water or ocean visible

Evaluator

The image depicts a sandy beach scene in the foreground filled with seashells, palm trees leaning from the left, and beach umbrellas in the mid-ground. However, looking towards the horizon on the right side and center, there is a distinct blue strip of water visible where the sand meets the sky. This indicates the presence of the ocean or sea. Since the prompt explicitly requested "no water or ocean visible," and the ocean is clearly visible in the background, the exclusion criterion has not be

bfl/flux-2-klein-9b: PASS
An office workspace with a desk, chair, bookshelf, and lamp but no computers or screens

Evaluator

The image shows a clean office workspace featuring a large wooden desk, a black leather office chair, a tall wooden bookshelf filled with books, and a desk lamp. There is also a landline telephone on the desk. Upon careful inspection, there are absolutely no computers, laptops, monitors, or digital screens visible in the scene. The workspace is entirely analog. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A kitchen with pots, pans, a stove, and a cutting board but no food

Evaluator

The image shows a kitchen scene with a gas stove, several stainless steel pots and pans (some with lids, some open), and a wooden cutting board leaning against the tiled wall. Looking closely at the open pots and pans, they appear to be empty or possibly contain water, but there are no visible food items, ingredients, or prepared meals anywhere in the scene. The excluded element "food" is truly absent. VERDICT: PASS

Hard 0% vs 67%
bfl/flux-2-klein-9b: FAIL
A pepperoni pizza with tomato sauce, basil, and crust but absolutely no cheese

Evaluator

The image shows a pepperoni pizza with a visible layer of melted white cheese (likely mozzarella) covering the tomato sauce and underneath the pepperoni slices. The cheese has browned spots and a melted texture consistent with a baked pizza. Since the prompt explicitly requested "absolutely no cheese" and the image clearly contains cheese, the exclusion criteria has not been met. VERDICT: FAIL

bfl/flux-2-klein-9b: FAIL
A car body with windows, doors, and headlights but no wheels

Evaluator

The image displays a silver vintage car. It clearly features a car body with windows, doors, and headlights as requested. However, the prompt explicitly excluded wheels ("no wheels"). In this image, the car is clearly equipped with wheels; specifically, whitewall tires are visible on both the front and rear axles. Since the excluded element (wheels) is present, the image fails the negative constraint. VERDICT: FAIL

bfl/flux-2-klein-9b: FAIL
A human face with eyes, nose, and mouth but no eyebrows

Evaluator

The image shows a close-up of a human face. It clearly features eyes, a nose, and a mouth. However, contrary to the exclusion criteria ("no eyebrows"), the image prominently displays a pair of well-defined, brown eyebrows above the eyes. Since the excluded element is present, the image does not meet the specific constraint. VERDICT: FAIL
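The negation prompts carry two conditions: the requested elements must appear and the excluded element must not. The sketch below expresses that rule with set operations, assuming the evaluator's description has already been reduced to a set of detected element names. The function and the element sets are hypothetical.

```python
from typing import Set


def negation_verdict(required: Set[str], excluded: Set[str], detected: Set[str]) -> str:
    """PASS only if every required element is detected and no excluded one is."""
    missing = required - detected      # requested elements that are absent
    forbidden = excluded & detected    # excluded elements that are present
    return "PASS" if not missing and not forbidden else "FAIL"


# "A human face with eyes, nose, and mouth but no eyebrows": eyebrows were found.
face = negation_verdict(
    required={"eyes", "nose", "mouth"},
    excluded={"eyebrows"},
    detected={"eyes", "nose", "mouth", "eyebrows"},
)

# "A cat lying on a couch without a collar": no collar was found.
cat = negation_verdict(
    required={"cat", "couch"},
    excluded={"collar"},
    detected={"cat", "couch"},
)
```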

Relative Position 92% vs 100%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A cat sitting on top of a cardboard box

Evaluator

The image shows a tabby and white cat sitting directly on the upper surface of a brown cardboard box. The cat is positioned above the box, resting its weight on it. This perfectly matches the description "A cat sitting on top of a cardboard box". VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A red ball underneath a wooden chair

Evaluator

The image shows a wooden chair with a red ball positioned directly beneath its seat, resting on the floor between the chair legs. The spatial relationship described in the prompt ("underneath") is accurate. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A coffee mug next to an open book on a desk

Evaluator

The image shows a white coffee mug filled with dark liquid positioned to the right of an open book. Both objects are resting on a wooden desk surface. The book is open with pages visible, and a pair of glasses rests on top of the right page of the book. The spatial relationships described in the prompt ("next to", "on") are accurately represented. VERDICT: PASS

Medium 100% vs 100%
bfl/flux-2-klein-9b: PASS
A cat sitting on a pillow, the pillow placed on a wooden chair

Evaluator

The image shows a tabby and white cat sitting directly on top of a beige pillow. This pillow is resting on the seat of a wooden chair. The spatial arrangement perfectly matches the description: the cat is on the pillow, and the pillow is on the chair. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A bird perched on a branch, the branch extending from a tree next to a park bench

Evaluator

The image shows two birds perched on branches that extend from a large tree trunk on the right side. Below and slightly to the left of the main tree trunk, there is a park bench situated on a paved area. The spatial relationships described in the prompt are accurate: the birds are *on* the branches, the branches extend *from* the tree, and the tree is *next to* the park bench. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A candle on top of a stack of books, the books next to a potted plant

Evaluator

The image shows a lit candle sitting directly on top of a stack of five books. To the right of this stack, there is a potted plant in a terracotta pot. The spatial relationships described in the prompt ("A candle on top of a stack of books, the books next to a potted plant") are accurately reflected in the image. VERDICT: PASS

Hard 67%vs100%
bfl/flux-2-klein-9b: PASS
A desk scene: laptop in the center, coffee mug to the left, phone to the right, lamp above, and notebook below

Evaluator

The image shows a desk setup that perfectly matches the described spatial relationships:
- **Laptop:** Positioned centrally on the desk.
- **Coffee mug:** Located to the left of the laptop.
- **Phone:** Located to the right of the laptop.
- **Lamp:** Positioned above the desk area, illuminating the scene.
- **Notebook:** Placed below the laptop in the foreground.

All positional relationships are correct. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
Five animals in a field: a horse on the far left, a cow to the right of the horse, a sheep in the center, a goat to the right of the sheep, and a dog on the far right

Evaluator

The image displays a linear arrangement of five animals standing in a grassy field, facing the camera. From left to right, the order is:
1. A brown horse on the far left.
2. A brown and white cow immediately to the right of the horse.
3. A sheep in the center of the group.
4. A brown and white goat to the right of the sheep.
5. A yellow dog on the far right.

All described positional relationships (left, right, center, far left, far right) are accurate according to the visual evidence. VERD

bfl/flux-2-klein-9b: FAIL
Five chess pieces on a board: a king in the center, a queen to its left, a bishop to its right, a rook above the king, and a knight below the king

Evaluator

The image shows five chess pieces arranged on a board.
- **Center:** There is a light-colored King (identifiable by the cross on top).
- **Left:** To the left of the King is a light-colored Rook (identifiable by the castle-like top). The prompt claims this is a Queen.
- **Right:** To the right of the King is a dark-colored Knight (identifiable by the horse head). The prompt claims this is a Bishop.
- **Above (Back):** Behind the King is a light-colored Queen (identifiable by the crown).

Extreme 100%vs100%
bfl/flux-2-klein-9b: PASS
A beach scene with a sandcastle in the foreground, a surfer on the waves in the midground, and a cruise ship on the horizon in the background, sizes decreasing with distance

Evaluator

The image displays a clear depth progression matching the description.
- **Foreground:** A detailed sandcastle with multiple towers occupies the bottom of the frame.
- **Midground:** A surfer with a blue board is visible in the water, positioned behind the sandcastle.
- **Background:** A large cruise ship sits on the horizon line, furthest away.
- **Sizes:** The sandcastle appears largest due to proximity. The surfer is smaller. The cruise ship, while massive in reality, appears smaller than th

bfl/flux-2-klein-9b: PASS
A landscape with a red barn in the foreground at large scale, a windmill in the midground at medium scale, and snow-capped mountains in the background at small scale, all with consistent depth perspective

Evaluator

The image displays a clear depth progression consistent with the prompt.
- **Foreground:** A large red barn dominates the left side of the frame, appearing closest to the viewer.
- **Midground:** A windmill stands in the field to the right of the barn, further back than the barn but closer than the mountains.
- **Background:** Snow-capped mountains rise in the distance, spanning the horizon behind the field and structures.

The spatial relationships described (foreground barn, midground windmill

bfl/flux-2-klein-9b: PASS
A city park with a park bench and pigeons in the foreground, a fountain in the midground, and skyscrapers in the background, each layer progressively smaller

Evaluator

The image displays a clear depth progression matching the prompt's description.
- **Foreground:** A wooden park bench is situated in the lower left/center, with numerous pigeons scattered on the pavement and grass around it, and some perched on the bench itself.
- **Midground:** Beyond the grassy area, a large fountain with spraying water is clearly visible.
- **Background:** Behind the fountain and a line of trees, a skyline of tall skyscrapers rises up.

The spatial arrangement follows the pro
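
The percentages in the Relative Position headers can be reproduced from the twelve card verdicts listed above for bfl/flux-2-klein-9b. The sketch below assumes the subcategory score pools all prompts equally and rounds to the nearest whole percent; the page does not state its aggregation rule, and the name `pass_rate` is hypothetical.

```python
# Card verdicts for bfl/flux-2-klein-9b in the Relative Position subcategory,
# transcribed from the PASS/FAIL labels above.
verdicts = {
    "Easy": ["PASS", "PASS", "PASS"],
    "Medium": ["PASS", "PASS", "PASS"],
    "Hard": ["PASS", "PASS", "FAIL"],
    "Extreme": ["PASS", "PASS", "PASS"],
}

def pass_rate(results):
    """Share of PASS verdicts as a whole-number percentage (assumed rounding rule)."""
    passed = sum(r == "PASS" for r in results)
    return round(100 * passed / len(results))

# Per-difficulty rates, then the pooled rate over all twelve prompts.
per_tier = {tier: pass_rate(rs) for tier, rs in verdicts.items()}
overall = pass_rate([r for rs in verdicts.values() for r in rs])
```

Under these assumptions the Hard tier comes out at 67% (2 of 3) and the subcategory at 92% (11 of 12), matching the figures shown for this model.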

Scale & Proportions 89%vs100%
Easy 100%vs100%
bfl/flux-2-klein-9b: PASS
A ladybug sitting on a large green leaf, shown at realistic proportions

Evaluator

The image shows a ladybug resting on a green leaf. The ladybug is small relative to the leaf, occupying only a small portion of the visible surface area. This scale is consistent with real-world observations where a ladybug is a small insect and a leaf is significantly larger. The proportions are realistic. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A person standing next to a yellow school bus, showing realistic relative scale

Evaluator

The image shows a man standing next to a yellow school bus. The relative scale appears realistic:
- The height of the bus roof is significantly higher than the man's head, consistent with a real school bus (approx. 10-11 ft tall vs. a ~6 ft person).
- The size of the wheels relative to the man's legs is appropriate for a heavy vehicle.
- The length of the bus extending behind the man is consistent with a full-sized bus.
- The man's proportions relative to the bus door and windows are correct.

W

bfl/flux-2-klein-9b: PASS
A child holding a basketball, both at correct relative sizes

Evaluator

The image shows a young child holding a basketball. The size of the basketball relative to the child's hands and torso appears realistic. The ball is large enough to be a basketball but fits naturally in the child's grasp, consistent with how a youth-sized or standard basketball would look in the hands of a small child. There are no obvious distortions or impossible size relationships. VERDICT: PASS

Medium 100%vs100%
bfl/flux-2-klein-9b: PASS
A tiny person standing on the rim of a coffee cup, looking down into it

Evaluator

The image shows a standard-sized white coffee cup and saucer with a spoon. Standing on the rim of the cup is a miniature figurine of a person. The size relationship is exactly as described: the person is tiny relative to the cup, creating a surreal scale effect. The proportions of the cup, saucer, and spoon are realistic and consistent with a normal coffee setting, while the person is appropriately scaled down to be "tiny." VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A giant tabby cat walking between city skyscrapers like a kaiju

Evaluator

The image depicts a massive tabby cat striding down a city street. The scale is consistent with the "kaiju" description: the cat's head reaches up towards the mid-levels of the skyscrapers, its paws are larger than the cars driving on the road, and it towers over the pedestrians on the sidewalks. The perspective and relative sizes of the buildings, cars, and people all support the illusion that the cat is a giant monster walking through the city. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A miniature house sitting on the palm of someone's hand

Evaluator

The image shows a small, detailed model house resting comfortably in the open palm of a human hand. The scale is consistent with the description of a "miniature house," as it fits within the hand without being impossibly tiny or too large. The proportions of the fingers and palm relative to the house appear natural and realistic for this concept. VERDICT: PASS

Hard 67%vs100%
bfl/flux-2-klein-9b: FAIL
A group photo of an elephant, a horse, a dog, a cat, and a mouse, all standing in a line with correct real-world proportional sizes

Evaluator

The image shows an elephant, a horse, a dog, and two cats. The prompt requests an elephant, a horse, a dog, a cat, and a **mouse**. The mouse is completely missing from the image. Additionally, there are two cats instead of one. Regarding proportions, the standing cat in front of the dog appears disproportionately large (nearly reaching the dog's shoulder height), which is not consistent with real-world sizes. VERDICT: FAIL

bfl/flux-2-klein-9b: PASS
A dinner table scene with a plate, fork, knife, wine glass, salt shaker, and peppercorn, all at realistic relative sizes

Evaluator

The image displays a coherent table setting where the objects maintain realistic proportions relative to one another. The dinner plate is appropriately sized for the fork and knife placed beside it. The wine glass is tall enough relative to the plate, and the salt/pepper shakers are small condiment-sized items. The loose peppercorns are tiny specks, which is physically accurate. There are no obvious distortions where an object is too large or too small for its context. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A farm scene with a barn, tractor, farmer, cow, chicken, and a fence post, all at correct proportional scale

Evaluator

The image displays a farm scene where the relative sizes of the objects appear realistic and consistent with real-world proportions.
- **Barn:** The large red barn is appropriately sized as a major structure in the background.
- **Tractor:** The green tractor is large, consistent with agricultural machinery.
- **Farmer:** The man standing next to the tractor is of a realistic height relative to the vehicle (roughly waist/chest height to the main body).
- **Cow:** The cows are large anim

Human realism 95%vs98%

Faces & Expressions 92%vs100%
Easy 100%vs100%
bfl/flux-2-klein-9b: PASS
A portrait of a middle-aged man with a calm, serious expression

Evaluator

The face in the image is symmetric and features are anatomically correct — eyes, nose, mouth, and ears are proportionally aligned and natural. The man appears middle-aged, with visible signs of aging such as wrinkles, graying hair, and a beard. His expression is calm and serious, conveyed through neutral lips, direct gaze, and lack of smiling or emotive cues. There are no noticeable distortions — the facial structure is balanced, and lighting and composition support a realistic portrait. The emo

bfl/flux-2-klein-9b: PASS
A close-up portrait of a young woman with a warm natural smile

Evaluator

The face in the image is largely symmetrical with natural, correctly proportioned features. The eyes, eyebrows, nose, and mouth are well-aligned and appear anatomically accurate. The smile is warm and natural, with visible teeth and slight crinkling around the eyes, conveying genuine happiness. There are no obvious distortions or unnatural artifacts. The lighting and skin texture look realistic, and the emotion is clearly readable as joyful and approachable. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A headshot of a teenager with a relaxed, neutral face

Evaluator

The face in the image is symmetric and features are anatomically correct: eyes are evenly spaced and aligned, eyebrows are balanced, nose and mouth are proportionate, and ears are symmetrical. There are no noticeable distortions. The expression is neutral and relaxed, with no strong emotion conveyed — consistent with the prompt’s request for a “relaxed, neutral face.” The gaze is direct and steady, which is appropriate for a headshot. VERDICT: PASS

Medium 67%vs100%
bfl/flux-2-klein-9b: FAIL
A close-up of a person's face showing clear disgust, nose wrinkled and lip curled

Evaluator

The face is not symmetric — the left side (viewer’s left) appears slightly more compressed and distorted compared to the right, especially around the nose and mouth. The eyebrows are unevenly drawn together, and the eyes are asymmetrical in their squinting. The nose is wrinkled and the lips are curled downward, which aligns with the intended emotion of disgust. However, the lower lip and chin area show unnatural, exaggerated bulging and texture, which appears to be a rendering artifact or distor

bfl/flux-2-klein-9b: PASS
A portrait capturing genuine surprise — raised eyebrows, wide eyes, slightly open mouth

Evaluator

The face is largely symmetric with correctly rendered features: both eyes are wide and alert, eyebrows are raised, and the mouth is slightly open — all aligning with the prompt’s request for “genuine surprise.” The expression is clear and emotionally readable. There are no significant distortions; the facial structure, proportions, and details (eyelashes, skin texture, nose, lips) appear natural and well-defined. Minor artifacts like slight unevenness in the eyebrow arches or subtle asymmetry in

bfl/flux-2-klein-9b: PASS
A face showing quiet melancholy — downcast eyes, slightly drawn mouth, distant gaze

Evaluator

The face is largely symmetric with correctly rendered features: eyes, nose, mouth, and ears are proportionally aligned and anatomically plausible. There are minor natural asymmetries (e.g., slight unevenness in eyebrow shape or skin texture), but nothing that disrupts facial coherence or suggests AI distortion. The intended emotion — quiet melancholy — is clearly readable. The downcast gaze, slightly drawn mouth, and furrowed brow convey deep sadness or introspective sorrow. The lighting and te

Hard 100%vs100%
bfl/flux-2-klein-9b: PASS
An elderly woman in her 80s with deep wrinkles and white hair, smiling warmly with kind eyes

Evaluator

The face is largely symmetrical, with both eyes, eyebrows, and the smile appearing balanced. The features are anatomically plausible: the deep wrinkles around the eyes and mouth are consistent with an elderly person, and the white hair is natural-looking. The emotion is clearly readable — the warm, genuine smile and crinkled eyes convey kindness and joy. There are no significant distortions; the facial structure is coherent, and the lighting and texture are realistic. The image successfully capt

bfl/flux-2-klein-9b: PASS
A 5-year-old child with round cheeks and large eyes, crying with tears streaming down their face

Evaluator

The face in the image is largely symmetric with correctly rendered features: the eyes are large and proportionate, the cheeks appear round, and the tear tracks are clearly visible streaming down from each eye. The child’s expression — furrowed brow, downturned mouth, and visible tears — strongly conveys the emotion of crying. There are no significant distortions; the facial structure is anatomically plausible for a young child, and the details (like the reflection in the eyes and the droplet of

bfl/flux-2-klein-9b: PASS
An elderly man in his 70s with age spots and thinning gray hair, looking skeptically over reading glasses

Evaluator

The face is largely symmetric with natural, correct anatomical features. The elderly man’s facial structure is balanced, with no obvious distortions. Age spots are clearly visible across the forehead, cheeks, and chin, consistent with the prompt. Thinning gray hair is present, receding slightly at the temples. He wears reading glasses, and his expression — a skeptical, slightly downturned look with furrowed brows and a tight mouth — is clearly readable and matches the requested emotion. There ar

Extreme 100%vs100%
bfl/flux-2-klein-9b: PASS
A family photo with four people: a father beaming with pride, a mother smiling gently, a teenager looking bored, and a child grinning excitedly

Evaluator

The faces in the image are largely symmetrical with correctly rendered features — eyes, noses, mouths, and ears are proportionate and anatomically plausible. There are no obvious distortions or unnatural deformities. Regarding the intended emotions: - The father (left) is beaming with pride — his wide, toothy grin and crinkled eyes convey genuine joy. - The mother (right) is smiling gently — her soft, closed-mouth smile and relaxed expression match this description. - The teenager (front center

bfl/flux-2-klein-9b: PASS
A group of three friends at a table: one laughing with head tilted back, one looking shocked with wide eyes, and one rolling their eyes with a smirk

Evaluator

The image shows three distinct facial expressions that align with the prompt: - Left person: Laughing with head tilted back, eyes closed, mouth wide open — clearly conveying joy. - Center person: Wide eyes, open mouth, raised eyebrows — clearly conveying shock. - Right person: One eye winking, smirk, hand on cheek, and a sweat drop — clearly conveying a dismissive or sarcastic “rolling eyes” expression. All faces are symmetric and feature correctly rendered anatomy for the stylized manga/anime

bfl/flux-2-klein-9b: PASS
Three coworkers in a meeting: one looking frustrated with furrowed brows, one nodding with a pleased expression, and one looking confused with a tilted head

Evaluator

The image shows three individuals seated at a conference table, each displaying distinct facial expressions. - The person on the left has a furrowed brow and a clenched jaw, with hands clasped near the mouth — conveying frustration or concern. The facial symmetry and features are natural and correctly rendered. - The person in the center has a slight smile and is looking toward the right, with a relaxed posture — conveying a pleased or satisfied expression. Features are symmetrical and realisti

Full Body 100%vs100%
Easy 100%vs100%
bfl/flux-2-klein-9b: PASS
A man standing straight facing the camera, hands in pockets, normal proportions

Evaluator

The image shows a full-body shot of a man standing straight and facing the camera, with both hands in his pockets. His posture is upright and symmetrical, and his proportions appear normal — head, torso, arms, and legs are in proportion to each other without obvious exaggeration or distortion. He has two arms, two legs, and all limbs are present and correctly attached. The pose is clearly recognizable as “standing straight with hands in pockets,” a common and natural stance. There are no anatomi

bfl/flux-2-klein-9b: PASS
A woman standing upright in a relaxed pose, arms at her sides, wearing jeans and a white t-shirt

Evaluator

The image shows a full-body shot of a woman standing upright with her arms at her sides, wearing a white t-shirt and blue jeans — exactly matching the prompt. - Limb count: Correct — two arms, two legs, both visible and intact. - Proportions: Anatomically correct — head, torso, limbs are proportionally balanced and natural. - Pose: The woman is standing upright in a relaxed pose with arms at her sides — fully recognizable and matches the prompt. - No distortions: The figure is symmetrical, limb

bfl/flux-2-klein-9b: PASS
A person standing in a neutral pose in a park, full body visible from head to feet

Evaluator

The image shows a full-body view of a person standing in a park, as requested. The person is positioned in a neutral, upright stance, facing the camera, with arms hanging loosely at the sides and feet shoulder-width apart — a pose that is anatomically plausible and recognizable as a standard “neutral standing” pose. Limb count is correct: two arms, two legs, with all limbs attached to the torso and head in a natural configuration. Proportions appear anatomically accurate — head size relative to

Medium 100%vs100%
bfl/flux-2-klein-9b: PASS
A woman mid-stride running through a park, hair and clothes showing motion

Evaluator

The image shows a woman in mid-stride running through a park. Her hair is flowing backward, and her clothes (tank top and leggings) show dynamic motion consistent with running. Her pose is anatomically plausible: arms are bent and swinging naturally, legs are in a running gait with one foot off the ground, and her torso is slightly forward — all typical of running motion. Limb count is correct (two arms, two legs), and proportions appear natural and proportional to her body. There are no obvious

bfl/flux-2-klein-9b: PASS
A man captured mid-jump over a puddle, body in the air with bent knees

Evaluator

The image shows a man mid-air, jumping over a puddle. His body is captured with bent knees, arms slightly out for balance, and feet positioned above the water — consistent with the described action. Limb count is correct: two arms, two legs, and the torso. Proportions appear anatomically plausible — no obvious limb duplication or missing limbs. The pose is recognizable as a jump, and the reflection in the puddle reinforces the action. There are no significant distortions; the anatomy is coherent

bfl/flux-2-klein-9b: PASS
A dancer in a dynamic spin, dress flowing outward with one leg extended

Evaluator

The dancer in the image is captured mid-spin, with her dress flowing outward as described, and one leg extended in a classic ballet pose (likely a développé or attitude en l’air). She is balancing on one foot en pointe, with the other leg lifted and extended behind her. Her arms are gracefully extended outward, enhancing the sense of motion. Limb count is correct: two arms, two legs, two feet, and the torso and head are intact. Proportions are anatomically accurate — the body is slender and bal

Hard 100%vs100%
bfl/flux-2-klein-9b: PASS
A person in a yoga tree pose — standing on one leg with the other foot pressed to the inner thigh, arms raised overhead

Evaluator

The image shows a person from behind performing a yoga pose. The pose matches the description: standing on one leg, with the other foot pressed against the inner thigh, and arms raised overhead with palms together. The limb count is correct (two arms, two legs), and proportions appear anatomically plausible — no obvious limb duplication or gross distortion. The pose is clearly recognizable as the “Tree Pose” (Vrksasana), a standard yoga posture. The body alignment, though viewed from behind, is

bfl/flux-2-klein-9b: PASS
A rock climber clinging to a cliff face with arms and legs extended to different holds

Evaluator

The image shows a real photograph of a rock climber on a steep cliff face. The climber has two arms and two legs, with proportions consistent with a human body. The pose is anatomically plausible: one arm is extended upward gripping a hold, the other arm is extended outward gripping a chain or anchor point, and the legs are positioned to maintain balance and leverage on the rock. The climber’s body is angled appropriately for climbing, with visible muscle engagement and realistic joint articulat

bfl/flux-2-klein-9b: PASS
A cellist seated with correct posture, legs around the instrument, bow arm extended

Evaluator

The cellist in the image is seated with correct posture, and the cello is properly positioned between his legs, resting on a stand. His left hand holds the neck of the cello, and his right hand holds the bow, which is extended across the strings — consistent with the described action. The limb count is correct (two arms, two legs), and proportions appear anatomically accurate. The pose is recognizable as a standard cello-playing posture, though the bow is held slightly lower than typical for act

Extreme 100%vs100%
bfl/flux-2-klein-9b: PASS
Two gymnasts performing a synchronized handstand, side by side with identical body alignment

Evaluator

The image shows two gymnasts performing a synchronized handstand, side by side, with nearly identical body alignment — both legs are straight and pointed, torsos are parallel to the floor, and heads are aligned downward. Their limb count is correct (two arms, two legs per person), and proportions appear anatomically accurate: limbs are proportionally long relative to torsos, joints are positioned naturally, and there is no obvious limb duplication or missing limbs. The pose is clearly recogniza

bfl/flux-2-klein-9b: PASS
Two soccer players mid-tackle, one sliding and the other jumping over with the ball at their feet

Evaluator

The image shows two soccer players in a dynamic, realistic mid-tackle scenario. The player in the white jersey is sliding on the ground, with his body low to the turf, one leg extended toward the ball, and his arms out for balance. The player in the red jersey is airborne, jumping over the sliding player, with his body upright and his right foot near the ball. The ball is clearly visible at the feet of the sliding player, though it is not explicitly under his control — it is near his foot and be

bfl/flux-2-klein-9b: PASS
Two tango dancers in a close embrace, the man dipping the woman backward with intertwined legs

Evaluator

The image shows a real photograph of two tango dancers in a dynamic pose. The man is dipping the woman backward, and their legs are intertwined — her left leg is wrapped around his right thigh, and her right leg is extended back, lifted, and resting on his left thigh or hip area. The woman’s body is arched backward, supported by the man’s arms, and her head is tilted up toward him. Their facial expressions and posture convey the intimacy and intensity of tango. Anatomical correctness: - Limb co

Hands 92%vs92%
Easy 100%vs100%
bfl/flux-2-klein-9b: PASS
A person standing with both hands visible at their sides, fingers relaxed and naturally spread

Evaluator

Both hands are visible at the person’s sides, fingers relaxed and naturally spread, as per the prompt. Counting the fingers on each hand: - Left hand: Thumb, index, middle, ring, pinky — 5 fingers. - Right hand: Thumb, index, middle, ring, pinky — 5 fingers. Anatomically, the hands are correctly formed. The fingers are relaxed, slightly curled at the tips, which is natural for a standing pose with hands at the sides. There are no extra fingers, missing fingers, or anatomical anomalies. The thu

bfl/flux-2-klein-9b: PASS
A woman holding her hands out in front of her palms-up, all ten fingers clearly visible

Evaluator

Both hands are clearly visible, palms up, and all ten fingers are distinctly visible — five on each hand. The fingers are anatomically correct in number and arrangement. The gesture is natural and consistent with the prompt: hands held out in front of the body, palms up, fingers spread. No anomalies such as extra or missing fingers, fused digits, or unnatural deformities are present. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A man with arms crossed, both hands resting on opposite upper arms with fingers visible

Evaluator

The man in the image has his arms crossed, with both hands resting on opposite upper arms, and fingers are visible. Counting the fingers on each hand:
- The left hand (on the viewer’s right, resting on the left upper arm) clearly shows 5 fingers: thumb, index, middle, ring, and pinky.
- The right hand (on the viewer’s left, resting on the right upper arm) also clearly shows 5 fingers.

The grip and gesture are anatomically plausible. The hands are positioned naturally on the upper arms, with fi

Medium 100%vs100%
bfl/flux-2-klein-9b: PASS
A person holding a white ceramic coffee mug by the handle with their right hand

Evaluator

The visible hand in the image is holding the white ceramic coffee mug by its handle with the right hand, as prompted. Counting the fingers:
- Thumb: visible, curled around the handle.
- Index finger: visible, wrapping around the handle.
- Middle finger: visible, gripping the handle.
- Ring finger: visible, gripping the handle.
- Pinky finger: visible, wrapping around the handle.

All five fingers are clearly visible and accounted for. Anatomical correctness: The grip is natural and anatomicall

bfl/flux-2-klein-9b: PASS
A woman writing in a notebook with a pen gripped correctly in her right hand

Evaluator

The right hand, holding the pen, clearly shows five fingers: thumb, index, middle, ring, and pinky — all visible and anatomically positioned. The grip is a standard tripod grip, with the pen held between the thumb, index, and middle fingers, which is anatomically correct for writing. The left hand is resting on the notebook, also showing five fingers, though less prominently. No anomalies are visible — no extra or missing fingers, no unnatural joint angles, and the hands appear proportionate and

bfl/flux-2-klein-9b: PASS
A person playing an acoustic guitar with left hand on the fretboard and right hand strumming

Evaluator

Left hand (on fretboard): 5 fingers visible — thumb, index, middle, ring, pinky — all anatomically positioned to press strings. Right hand (strumming): 5 fingers visible — thumb, index, middle, ring, pinky — thumb is partially obscured but clearly present; fingers are positioned as if strumming or fretting near the soundhole. Anatomical correctness: - Left hand: Fingers are bent and pressing down on strings in a standard fingerstyle/fretting position. Thumb is wrapped behind the neck — ty

Hard 67%vs67%
bfl/flux-2-klein-9b: PASS
A person making a peace sign with their right hand, clearly showing two fingers up and three curled

Evaluator

The visible right hand in the image is making a peace sign: the index and middle fingers are extended upward, while the ring, pinky, and thumb are curled or folded inward. This is a standard “V” sign gesture. Counting the fingers: - Index finger: visible and extended - Middle finger: visible and extended - Ring finger: curled inward - Pinky finger: curled inward - Thumb: curled inward (partially visible, tucked under the other fingers) That’s five fingers — all present and accounted for. Anat

bfl/flux-2-klein-9b: PASS
A hand giving a thumbs-up with four fingers curled and thumb extended upward

Evaluator

The image shows a single hand performing a thumbs-up gesture. The thumb is extended upward, and the other four fingers are curled into a fist. Counting the fingers: There are clearly four fingers curled (index, middle, ring, and pinky) and one thumb extended — totaling five fingers per hand. The hand is shown in full, and no anomalies are visible in the finger count or structure. Anatomically, the gesture is correct: the thumb is extended upward, and the other four fingers are curled inward,

bfl/flux-2-klein-9b: FAIL
A person counting to three on their fingers, with index, middle, and ring fingers extended

Evaluator

The image shows a person holding up their left hand with the index, middle, and ring fingers extended — matching the prompt’s request to “count to three.” The thumb is also visible and slightly curled, and the pinky finger is not extended. That makes 3 extended fingers on the left hand, not 5. The right hand is partially visible, with the thumb and index finger extended, and the other fingers curled — not matching the “count to three” gesture. It does not show 5 fingers. Anatomically, the gest

Extreme 100% vs 100%
bfl/flux-2-klein-9b: PASS
Two people shaking hands in a professional setting, all fingers of both hands anatomically correct

Evaluator

Counting the fingers on the visible hands: - The hand on the left (in the navy suit) clearly shows 5 fingers: thumb, index, middle, ring, and pinky. - The hand on the right (in the gray suit) also clearly shows 5 fingers, though slightly obscured by the grip. The grip is a standard professional handshake: palms facing each other, fingers interlaced, thumbs touching or nearly touching. The posture is anatomically plausible — no fingers are fused, duplicated, or missing. The knuckles and finger

bfl/flux-2-klein-9b: PASS
Two people doing a high-five, palms meeting with all fingers extended on both hands

Evaluator

Each visible hand in the image clearly shows five fingers — thumb, index, middle, ring, and pinky — with no missing or extra digits. The gesture is a high-five, with palms meeting and all fingers extended, which matches the prompt. The anatomy is accurate: the hands are positioned naturally for a high-five, with no unnatural deformities, overlapping fingers, or anatomical anomalies. The fingers are aligned and proportionate, and the thumbs are visible and correctly oriented. VERDICT: PASS
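Every complete evaluator response in this listing ends with a "VERDICT: PASS" or "VERDICT: FAIL" marker. The sketch below shows one way such a marker could be read from the response text. It assumes, without confirmation from the page, that the PASS/FAIL badge is derived from that trailing marker; the function name `extract_verdict` is hypothetical.

```python
import re

# Matches a trailing "VERDICT: PASS" or "VERDICT: FAIL" marker.
# Assumption: the marker is the last thing in a complete evaluator response.
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\s*$")


def extract_verdict(evaluator_text):
    """Return "PASS" or "FAIL" if the text ends with a verdict marker, else None."""
    match = VERDICT_RE.search(evaluator_text.strip())
    return match.group(1) if match else None


complete = "The fingers are aligned and proportionate. VERDICT: PASS"
truncated = "The grip is natural and anatomicall"

print(extract_verdict(complete))   # PASS
print(extract_verdict(truncated))  # None
```

Responses that are cut off before the marker, as several excerpts on this page are, yield no verdict under this sketch, so the badge would have to come from the untruncated text.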

bfl/flux-2-klein-9b: PASS
One person handing a small gift box to another person, both sets of hands gripping the box correctly

Evaluator

Both hands in the image are clearly visible and show all five fingers per hand — thumb, index, middle, ring, and little finger — with no missing or extra digits. The grip is anatomically correct: each hand is holding the gift box with the thumb and fingers wrapped around the sides, supporting the box securely. The thumbs are positioned on opposite sides of the box, and the fingers are curled around the edges — a natural and realistic way to hold a small object during a gift exchange. There are

Multi-Subject 100% vs 100%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
Two women standing side by side: one with short blonde hair wearing a red jacket, the other with long black hair wearing a blue dress

Evaluator

Person 1 (left): - Hair: Short, blonde, styled in a neat, slightly wavy bob. - Clothing: Wearing a bright red jacket over a white and black patterned top. - Accessories: Gold necklace and hoop earrings visible. - Matches description: Yes — short blonde hair, red jacket. Person 2 (right): - Hair: Long, straight, black hair with soft waves. - Clothing: Wearing a sleeveless royal blue dress. - Accessories: Gold necklace with a pendant. - Matches description: Yes — long black hair, blue dress. Bot

bfl/flux-2-klein-9b: PASS
Two men at a cafe: one with a beard and glasses wearing a green sweater, the other clean-shaven in a black hoodie

Evaluator

Person 1 (left): - Has a full beard. - Wears black-framed glasses. - Wears a green ribbed sweater. - Visually distinct and matches description. Person 2 (right): - Clean-shaven. - Wears a black hoodie. - Visually distinct and matches description. All individuals are present, visually distinct, and match their described features. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
Two friends posing: one tall with curly red hair in denim overalls, the other short with straight brown hair in a yellow sundress

Evaluator

Person 1 (tall): - Hair: Curly, red (appears vibrant and distinctly curly) - Clothing: Denim overalls (worn over a light-colored tank top) - Height: Appears significantly taller than the other person, consistent with “tall” Person 2 (short): - Hair: Straight, brown (medium-length, straight-cut) - Clothing: Yellow sundress (strap-style, flowing, sleeveless) - Height: Clearly shorter than the first person, consistent with “short” Both individuals are visually distinct and match the described fea

Medium 100% vs 100%
bfl/flux-2-klein-9b: PASS
Four people at a dinner table: a bald man in a suit, a woman with silver hair in a red blouse, a young man with dreadlocks in a denim jacket, and a woman with a hijab in a green dress

Evaluator

- Bald man in a suit: Present. He is on the far left, bald, wearing a dark suit with a white shirt and red tie. - Woman with silver hair in a red blouse: Present. She is second from the left, has silver-gray hair, and is wearing a red blouse. - Young man with dreadlocks in a denim jacket: Present. He is third from the left, has dark dreadlocks, and is wearing a denim jacket over a white shirt. - Woman with a hijab in a green dress: Present. She is on the far right, wearing a dark gray hijab and

bfl/flux-2-klein-9b: PASS
Three musicians on stage: a tall woman with short pink hair playing bass, a man with a beard and bandana on drums, and a slim man with glasses playing keyboard

Evaluator

- Tall woman with short pink hair playing bass: Present. She has short, bright pink hair, is playing a bass guitar, and appears tall and slender. Her features match the description. - Man with a beard and bandana on drums: Present. He is seated behind the drum kit, has a full beard, and is wearing a patterned bandana on his head. His features match the description. - Slim man with glasses playing keyboard: Present. He is on the right, playing a keyboard, wearing glasses, and appears slim. His fe

bfl/flux-2-klein-9b: PASS
Three hikers on a trail: a man with a backpack and sunhat, a woman with braided hair and hiking poles, and a teenager with a baseball cap and water bottle

Evaluator

Person 1 (left): Man with a backpack and sunhat — Observed: Wearing a beige sunhat, carrying a large black backpack, smiling, has a beard, wearing a blue t-shirt and khaki shorts. Matches description. Person 2 (center): Woman with braided hair and hiking poles — Observed: Has long braided hair, wearing a purple shirt and black leggings, holding two hiking poles, carrying a blue backpack. Matches description. Person 3 (right): Teenager with a baseball cap and water bottle — Observed: Wearing a

Truthfulness 67% vs 89%

Photorealism 100% vs 100%
Medium 100% vs 100%
bfl/flux-2-klein-9b: PASS
A still life on a wooden table featuring a clear glass of water, a polished silver spoon, a folded linen napkin, and a ripe red apple — each material rendered with accurate texture and light behavior

Evaluator

The image successfully renders the requested materials with high fidelity. The glass exhibits correct refraction and transparency, distorting the wood grain behind it. The silver spoon shows sharp, accurate specular highlights and reflections. The apple has a realistic waxy skin texture with subtle subsurface scattering. The linen napkin displays a convincing woven fabric texture, and the wooden table provides a detailed, rustic background. All light behaviors and textures are physically accurat

bfl/flux-2-klein-9b: PASS
A kitchen counter with a stainless steel pot, a wooden cutting board, a glass bottle of olive oil, and a cotton dish towel — each surface reflecting light correctly

Evaluator

The image accurately renders the requested materials. The stainless steel pot shows sharp, realistic reflections and metallic sheen. The wooden cutting board displays natural grain and texture. The glass bottle exhibits correct translucency and refraction of the olive oil inside, with appropriate highlights. The cotton towel has a soft, woven texture with realistic folds and light absorption. All surfaces interact with light in a physically plausible manner. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A workspace desk with a brushed aluminum laptop, a leather-bound notebook, a ceramic coffee mug, and a glass paperweight — all materials visually distinct and realistic

Evaluator

The image successfully renders the requested materials with high fidelity. The brushed aluminum laptop shows appropriate metallic sheen and texture. The leather notebook displays realistic grain and stitching. The ceramic mug has a matte, speckled finish consistent with pottery. The glass paperweight exhibits correct refraction and transparency. The wooden desk surface has a natural grain pattern. All materials are visually distinct and realistic. VERDICT: PASS

Physics & Reflections 83% vs 83%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A red sphere on a white surface with a single shadow cast to the right, consistent with a light source from the upper left

Evaluator

The shadow is cast to the right and slightly downward, which is consistent with a light source positioned to the upper left. The shadow’s shape and softness are physically plausible for a sphere on a flat surface under directional lighting. There are no reflections, as expected on a matte white surface. Color and distortion are appropriate. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A coffee mug on a sunny windowsill casting a sharp shadow to the left

Evaluator

The lighting originates from the right (the window), casting a shadow to the left, which matches the prompt. The shadow is sharp, consistent with direct sunlight. The shape of the shadow accurately reflects the mug’s form, including the handle’s silhouette. There are no distortions or color anomalies in the shadow or reflections. The scene is physically plausible. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A person standing outdoors at midday with a short shadow directly below them

Evaluator

The shadow is cast directly beneath the subject, which is consistent with a midday sun position. The shadow’s shape and length are physically plausible for the lighting conditions, and there are no distortions or color anomalies. The direction and orientation align correctly with the light source. VERDICT: PASS

Medium 67% vs 67%
bfl/flux-2-klein-9b: PASS
A cat sitting on a polished marble floor with its reflection visible below

Evaluator

The reflection is physically plausible. It is correctly oriented (inverted vertically), matches the cat’s pose and markings, and shows appropriate distortion consistent with a glossy surface. The lighting and color tones in the reflection align with the real cat, and the reflection fades naturally toward the edges, as expected on polished marble. No directional or structural inconsistencies are present. VERDICT: PASS

bfl/flux-2-klein-9b: FAIL
A woman standing in front of a full-length mirror, her reflection accurately showing the back of her outfit and reversed orientation

Evaluator

The image displays two mirrors side-by-side. The reflection in the right mirror is physically plausible, showing the woman’s back as expected when facing a mirror. However, the left mirror shows an impossible reflection: it reflects the side of another mirror frame rather than the woman or the room correctly. This violates basic laws of optics and spatial consistency. Additionally, the woman appears to be reflected in both mirrors simultaneously from the same position, which is not geometrically

bfl/flux-2-klein-9b: PASS
A vase of flowers on a glass table with a clear reflection visible on the table surface

Evaluator

The reflection on the glass table is physically plausible. It correctly mirrors the vase and flowers with appropriate vertical inversion. The orientation matches the object above, and the slight blurring and reduced contrast are consistent with a semi-reflective glass surface. Colors in the reflection align with the original bouquet, and no directional inconsistencies or distortions are present. VERDICT: PASS

Hard 100% vs 100%
bfl/flux-2-klein-9b: PASS
A row of trees reflected in a calm lake at sunset, with the reflection slightly darker and gently rippled

Evaluator

The reflections are vertically inverted, matching the orientation of the trees. The gentle ripples distort the reflection realistically, and the colors match the sunset sky. The physics of the reflection are accurate. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A city skyline reflected in a rain puddle on a street, with mild distortion and color shift

Evaluator

The reflection is inverted vertically, which is physically correct for a puddle. The distortion mimics water ripples and surface tension, and the color shift (slightly cooler, muted tones) aligns with light reflecting off water. The surrounding wet pavement and raindrops enhance realism. No obvious violations of physics in orientation or behavior of the reflection. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A mountain range reflected in a still alpine lake, with the reflection inverted and subtly muted in color

Evaluator

The reflection is physically plausible. It is correctly inverted vertically, matching the mountain range above. The orientation aligns perfectly with the shoreline. The color is subtly muted and darker than the actual mountains, which is consistent with how water reflects light. The slight distortion near the foreground suggests gentle ripples, adding realism. VERDICT: PASS

Extreme 67% vs 67%
bfl/flux-2-klein-9b: PASS
A clear glass sphere sitting on a mirror surface near a pool of water, showing refraction through the sphere, reflection on the mirror, and water reflection simultaneously

Evaluator

The image depicts a glass sphere resting directly on the water surface. The reflection below the sphere is inverted and distorted by the water's ripples, which is physically accurate for a reflection on a liquid surface. The refraction through the sphere correctly inverts the background scene (trees and sky). The lighting and shadows are consistent with an outdoor setting. The physics of the reflection and refraction are handled well. VERDICT: PASS

bfl/flux-2-klein-9b: FAIL
A chrome sculpture in a room with a glass floor and large windows, showing multiple reflections and environmental mapping on every surface

Evaluator

The reflections on the chrome sculpture are physically plausible, showing distorted environmental mapping of the windows and room. The reflection on the glass floor is correctly oriented and matches the sculpture’s position. However, the shadowing is inconsistent: the sculpture casts almost no discernible shadow on the glass floor despite strong overhead lighting, and the reflections in the sculpture do not clearly correspond to the light sources visible in the ceiling. This lack of coherent sha

bfl/flux-2-klein-9b: PASS
A crystal wine glass on a polished marble counter next to a window overlooking a lake — showing refraction in the glass, reflection on the marble, and the lake's own reflective surface

Evaluator

The reflections and refractions are physically plausible. The marble counter shows a clear, slightly distorted reflection of the glass stem and base, consistent with a polished surface. The lake in the background reflects the sky and trees accurately. The glass itself exhibits realistic refraction, distorting the background view through its curved shape and cut patterns. Lighting direction is consistent across all elements. VERDICT: PASS

World Knowledge 42% vs 92%
Easy 67% vs 67%
bfl/flux-2-klein-9b: FAIL
The Taj Mahal in Agra seen from the reflecting pool, with accurate dome, minarets, and symmetry

Evaluator

The image depicts the Taj Mahal, but it is not architecturally accurate. The real Taj Mahal has four minarets at the corners of the main platform. This image shows six minarets (two on each side of the central structure plus the corner ones), which is a hallucination. Additionally, the side structures (chhatris) are exaggerated in height and number compared to the actual monument. VERDICT: FAIL

bfl/flux-2-klein-9b: PASS
The Eiffel Tower in Paris on a clear day, architecturally accurate with its iron lattice structure and three levels

Evaluator

The image depicts the Eiffel Tower with high architectural accuracy. The iron lattice structure, the three distinct levels (including the top observation deck and antenna), and the characteristic arches at the base are all correctly rendered. The perspective and surrounding environment (trees, buildings in the background) are consistent with a real-world view from the Champ de Mars. There are no obvious distortions or hallucinations in the structure itself. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
The Statue of Liberty on her island pedestal, with correct torch, crown, tablet, and green patina

Evaluator

The image depicts the Statue of Liberty with high fidelity. The green patina, the raised torch, the spiked crown, and the tablet held in the left arm are all correctly represented. The pedestal architecture and the surrounding island environment are also accurate. There are no visible distortions or factual errors in the depiction of the landmark. VERDICT: PASS

Medium 33% vs 100%
bfl/flux-2-klein-9b: FAIL
A traditional Japanese shamisen instrument with accurate three-stringed design, wooden body, and bachi plectrum

Evaluator

The image depicts a shamisen, but it contains significant inaccuracies. The body shape is incorrect (resembling a lute or cittern rather than the traditional square or round bag-like body). The bridge is placed incorrectly on the front face rather than resting on the skin. The tuning pegs are arranged in a Western-style headstock rather than the traditional side-mounted pegs. These errors make it factually inaccurate. VERDICT: FAIL

bfl/flux-2-klein-9b: PASS
A Japanese kintsugi bowl — cracked ceramic repaired with visible gold lacquer lines

Evaluator

The image accurately depicts a Japanese kintsugi bowl. The subject is a ceramic bowl with visible cracks repaired using gold lacquer, which is the defining characteristic of kintsugi. The aesthetic, materials, and style are factually correct and recognizable. VERDICT: PASS

bfl/flux-2-klein-9b: FAIL
A medieval astrolabe with accurate rete, tympan, and engraved degree markings on brass

Evaluator

The image depicts a brass instrument that resembles an astrolabe in general shape and material, but it fails to meet the criteria for factual accuracy. The engraved markings are nonsensical gibberish rather than accurate degree markings or recognizable constellations. The internal structure (rete and tympan) is geometrically incoherent and does not correspond to the functional design of a medieval astrolabe. Therefore, it is not factually accurate. VERDICT: FAIL

Hard 0% vs 100%
bfl/flux-2-klein-9b: FAIL
A detailed diagram of the water cycle showing evaporation from the ocean, condensation forming clouds, precipitation as rain, and collection in rivers flowing back to the sea

Evaluator

The diagram is not factually accurate. The labels contain severe spelling errors and gibberish (e.g., "Condessation formny," "Peqiidation," "Colledo," "Battles do Rvers"). Additionally, the arrows are logically incorrect; for instance, an arrow labeled "Evaporation" points downward into the ocean, which contradicts the physical process. These errors make the diagram misleading and inaccurate. VERDICT: FAIL

bfl/flux-2-klein-9b: FAIL
A cross-section showing tectonic plates at a convergent boundary with subduction, a trench, volcanic arc, and layered mantle

Evaluator

The image depicts a geological cross-section, but the labels are nonsensical gibberish (e.g., "Do Adenlervt", "Wodonvemrie Fraluliwy") rather than correct scientific terms. This makes the subject unrecognizable as an accurate educational diagram. Additionally, the geological structure is inaccurate: it shows a single plate splitting apart (divergent boundary) rather than two plates colliding and subducting (convergent boundary) as requested. VERDICT: FAIL

bfl/flux-2-klein-9b: FAIL
An anatomical illustration of the human heart showing four chambers, valves, aorta, pulmonary arteries, and veins with correct placement

Evaluator

The image depicts a human heart, but the labels are nonsensical gibberish (e.g., "Undoonm," "Stechariele," "Noamarlk") rather than correct anatomical terms. Additionally, the internal structures appear distorted and do not accurately represent the four chambers and valves as requested. The text is unreadable and factually incorrect. VERDICT: FAIL

Extreme 67% vs 100%
bfl/flux-2-klein-9b: FAIL
A Morpho menelaus butterfly with accurate iridescent blue dorsal wings, brown ventral wing pattern with eyespots, and correct body morphology

Evaluator

The image depicts a butterfly with dorsal wings that are blue and ventral wings that are brown with eyespots, which is characteristic of the Morpho genus. However, the prompt specifically requests a *Morpho menelaus*. The *Morpho menelaus* typically has a solid, vibrant iridescent blue on the dorsal side without the distinct dark brown borders seen here. The brown borders and specific wing shape shown in the image are more characteristic of the *Morpho peleides* (Blue Morpho) or a hybrid represe

bfl/flux-2-klein-9b: PASS
The Hagia Sophia interior showing the massive central dome, pendentives, semi-domes, and Islamic calligraphy medallions alongside Byzantine mosaics

Evaluator

The image accurately depicts the interior of the Hagia Sophia, correctly showing the massive central dome, the pendentives, the semi-domes, and the large Islamic calligraphy medallions hanging from the dome. The Byzantine mosaics are also visible on the walls and arches. The architectural elements and their arrangement are factually correct and recognizable. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A specimen of bismuth crystal showing accurate iridescent oxide layers, geometric hopper crystal formation, and characteristic staircase structure

Evaluator

The subject is clearly recognizable as a bismuth crystal, featuring the characteristic geometric hopper formation and iridescent oxide layers. The staircase structure is accurately depicted, and the colors align with real-world examples. There are no significant factual or architectural inaccuracies. VERDICT: PASS
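The World Knowledge percentages for bfl/flux-2-klein-9b can be reproduced from the twelve verdicts listed above. The sketch below assumes that each percentage is the share of passing prompts rounded to the nearest whole number; that assumption is consistent with the figures shown but is not stated by the page. The names `world_knowledge` and `pass_rate` are hypothetical, and the second model's verdicts are not listed here, so only the first figure of each pair is checked.

```python
# Verdicts for bfl/flux-2-klein-9b, transcribed from the World Knowledge
# entries above, in listing order within each difficulty tier.
world_knowledge = {
    "Easy": ["FAIL", "PASS", "PASS"],
    "Medium": ["FAIL", "PASS", "FAIL"],
    "Hard": ["FAIL", "FAIL", "FAIL"],
    "Extreme": ["FAIL", "PASS", "PASS"],
}


def pass_rate(verdicts):
    """Percentage of PASS verdicts, rounded to the nearest whole number (assumed rule)."""
    passed = sum(1 for v in verdicts if v == "PASS")
    return round(100 * passed / len(verdicts))


# Per-tier rates: Easy 67, Medium 33, Hard 0, Extreme 67.
tier_rates = {tier: pass_rate(v) for tier, v in world_knowledge.items()}

# Subcategory rate pools all prompts: 5 passes out of 12.
all_verdicts = [v for tier in world_knowledge.values() for v in tier]
overall = pass_rate(all_verdicts)

print(tier_rates)
print(overall)  # 42
```

Pooling the prompts rather than averaging the tier percentages gives the same result here because every tier contains three prompts.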

Professional Studio 89% vs 93%

Camera & Lighting 100% vs 100%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A wide-angle shot of a modern living room interior showing expansive space with slight barrel distortion at the edges

Evaluator

The image shows a modern living room interior with a very wide field of view. 1. **Perspective/Wide-angle/Barrel Distortion**: The image exhibits extreme barrel distortion, creating a circular "fisheye" effect where the edges of the frame are heavily curved and the periphery is blacked out. This matches the prompt's requirement for a wide-angle shot with slight barrel distortion (though the effect here is actually quite extreme/fisheye rather than "slight"). 2. **Depth of Field**: The entire r

bfl/flux-2-klein-9b: PASS
A close-up macro shot of a single dewdrop on a rose petal, shallow depth of field

Evaluator

The image depicts a close-up macro shot of a single, clear dewdrop resting on a red rose petal. 1. **Perspective/Shot Type**: The image is a macro shot, focusing closely on the dewdrop and the texture of the petal, which matches the prompt. 2. **Depth of Field**: The image features a very shallow depth of field. The dewdrop and the immediate area of the petal it rests on are in sharp focus, while the background and the edges of the petal are heavily blurred (bokeh effect), which perfectly matc

bfl/flux-2-klein-9b: PASS
An aerial bird's-eye view looking straight down at a traffic roundabout with cars and trees

Evaluator

The image depicts an aerial bird's-eye view looking straight down at a traffic roundabout. 1. **Perspective**: The perspective is a direct top-down (nadir) view, looking straight down at the roundabout, which matches the prompt's requirement for an "aerial bird's-eye view looking straight down." 2. **Depth of Field**: The entire scene is in sharp focus from the center of the roundabout to the edges of the frame, which is consistent with a high-altitude aerial shot. 3. **Lighting Pattern**: T

Medium 100% vs 100%
bfl/flux-2-klein-9b: PASS
A portrait lit with classic Rembrandt lighting — triangle of light on the shadowed cheek, moody atmosphere

Evaluator

The image depicts a portrait of a man with a clear lighting pattern. 1. **Lighting Pattern**: The image features a classic Rembrandt lighting setup. There is a distinct, small triangle of light visible on the shadowed cheek (the viewer's right side of the subject's face), which is the defining characteristic of Rembrandt lighting. 2. **Atmosphere**: The lighting creates a moody, high-contrast atmosphere with deep shadows, consistent with the prompt. 3. **Perspective and Depth of Field**: The p

bfl/flux-2-klein-9b: PASS
A landscape photograph bathed in golden hour light — long warm shadows, orange sky, soft glow

Evaluator

The image depicts a landscape during the golden hour. 1. **Lighting Pattern:** The sky is a vibrant orange/golden hue, and the light across the landscape is warm and low-angled. There are long, soft shadows cast across the foreground and middle ground, consistent with a low sun position. 2. **Perspective:** The image uses a wide-angle landscape perspective, showing a foreground field, a middle ground with trees/hedges, and a background of rolling hills/mountains. 3. **Depth of Field:** The ima

bfl/flux-2-klein-9b: PASS
A high-key portrait of a person against a bright white background, minimal shadows, airy and luminous

Evaluator

The image perfectly matches the prompt's specifications: 1. **High-key/Lighting:** The image is extremely bright with a high-key aesthetic. The lighting is soft and even, creating an "airy and luminous" feel. 2. **Background:** The background is a bright, seamless white, which is characteristic of high-key portraiture. 3. **Shadows:** There are minimal shadows on the subject's face, with only very subtle contouring, adhering to the "minimal shadows" requirement. 4. **Perspective/Depth of Fi

Hard 100% vs 100%
bfl/flux-2-klein-9b: PASS
A portrait shot at 85mm f/1.4 with creamy bokeh background, backlit with warm rim light separating the subject from a blurred garden

Evaluator

The image matches the specified camera and lighting settings: 1. **Perspective/Lens (85mm):** The image is a classic portrait shot with a shallow depth of field, consistent with an 85mm lens. The subject is the clear focus, and the background is significantly blurred. 2. **Depth of Field (f/1.4 with creamy bokeh):** The background features a very soft, "creamy" bokeh effect where the light and shapes in the garden are highly blurred, characteristic of a wide aperture like f/1.4. 3. **Lightin

bfl/flux-2-klein-9b: PASS
A street photograph with shallow depth of field as if shot at 50mm f/1.8, subject sharp with soft city lights in the background

Evaluator

The image successfully matches the technical specifications provided in the prompt: 1. **Depth of Field:** The image exhibits a very shallow depth of field. The subject (the woman) is sharp and in focus, while the background (the city street, buildings, and cars) is heavily blurred. This accurately simulates the effect of a wide aperture like f/1.8. 2. **Perspective/Lens Simulation:** The compression and the way the background elements are blurred suggest a standard focal length like 50mm, wh

bfl/flux-2-klein-9b: PASS
A food photograph shot from above at 35mm f/2.8 with natural window light from the left, slight vignette, warm color temperature

Evaluator

To evaluate the image against the prompt: 1. **Perspective (Shot from above):** The image is a top-down (flat lay) perspective, which matches the "shot from above" requirement. 2. **Depth of Field (35mm f/2.8):** The image shows a shallow depth of field. The center of the dish is sharp, while the edges of the bowl and the wooden surface show a soft blur/bokeh, consistent with an f/2.8 aperture. 3. **Lighting (Natural window light from the left):** There is a clear light source coming from th

Extreme 100% vs 100%
bfl/flux-2-klein-9b: PASS
A tilt-shift photograph of a city street making real-life look like a miniature model — selective focus strip, saturated colors

Evaluator

The image successfully implements the tilt-shift effect described in the prompt. 1. **Perspective/Tilt-Shift Effect**: The image uses a very shallow depth of field with a distinct "selective focus strip." The focus is sharpest in the middle ground (the area around the crosswalk and the yellow car), while the foreground and the background (the top of the buildings and the distant street) are heavily blurred. This creates the intended "miniature model" look. 2. **Saturated Colors**: The colors

bfl/flux-2-klein-9b: PASS
A cinematic frame with blue and orange anamorphic lens flare streaking horizontally across a dark moody scene

Evaluator

The image features a dark, moody scene with a prominent horizontal light streak across the center. The light streak transitions from a bright blue on the left to an orange/amber color on the right, directly matching the "blue and orange" color specification. The streak is a sharp, horizontal line, consistent with the "anamorphic lens flare streaking horizontally" description. The background is dark and out of focus, creating a cinematic depth of field. VERDICT: PASS
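Each complete evaluator response in this listing ends with a `VERDICT: PASS` or `VERDICT: FAIL` marker. As a minimal sketch, assuming that marker convention holds for every response, the verdict could be pulled out of the free text like this. The function name `extract_verdict` is hypothetical and not part of ImageBench.

```python
import re

# Assumption: the evaluator closes its response with "VERDICT: PASS" or "VERDICT: FAIL".
VERDICT_RE = re.compile(r"VERDICT:\s*(PASS|FAIL)\b")


def extract_verdict(evaluator_text):
    """Return 'PASS' or 'FAIL' from the last verdict marker, or None if there is none."""
    matches = VERDICT_RE.findall(evaluator_text)
    # Take the last match so an earlier quoted marker does not win over the final one.
    return matches[-1] if matches else None


print(extract_verdict("The background is dark and out of focus. VERDICT: PASS"))
```

Responses that are cut off before the marker return `None`, so truncated text is not silently counted as a pass or a fail.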

bfl/flux-2-klein-9b: PASS
A portrait with dramatic split lighting — half the face brightly lit, the other half in deep shadow, high contrast chiaroscuro

Evaluator

The image depicts a portrait of a man with a very clear and dramatic split lighting pattern. One side of his face (the viewer's left) is brightly illuminated, while the other side (the viewer's right) is cast in deep, heavy shadow, creating a high-contrast chiaroscuro effect. The lighting is sharp and well-defined, perfectly matching the description of "half the face brightly lit, the other half in deep shadow." The perspective is a standard close-up portrait, and the depth of field is shallow,

Color Precision 100% vs 92%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A bright red sports car parked on a gray asphalt road — the red is vivid and unambiguous

Evaluator

The image shows a sports car that is a bright, vivid, and unambiguous red. The color is consistent across the body of the car and matches the description of "bright red" perfectly. The background road is a gray asphalt road as specified. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A woman wearing a royal blue evening dress against a neutral background

Evaluator

The image shows a woman wearing a long evening dress in a vibrant, deep blue color. The color is a true royal blue, which matches the prompt's specification of "royal blue." The background is a solid, light beige/tan color, which qualifies as a "neutral background." VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A field of pure sunflower yellow flowers under a clear blue sky

Evaluator

The image depicts a field of sunflowers with bright yellow petals and dark centers. The sky above is a clear, solid blue. The colors in the image directly match the prompt's description of "pure sunflower yellow flowers" and a "clear blue sky." VERDICT: PASS

Medium 100% vs 67%
bfl/flux-2-klein-9b: PASS
A bedroom wall painted in #FF6B35 (burnt orange) with white furniture and natural daylight

Evaluator

The image shows a bedroom with a prominent accent wall. The color of the wall is a deep, saturated burnt orange. This matches the color described in the prompt as "#FF6B35 (burnt orange)". The furniture (bed frame, headboard, and nightstand) is white, and the lighting appears to be natural daylight coming from a window on the right. The colors in the image align perfectly with the specified hex code and color name. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A forest scene where the foliage is precisely #2D5F2D (dark muted green) with brown tree trunks

Evaluator

The prompt specifies a precise hex color for the foliage: **#2D5F2D** (a dark, muted green). Looking at the image: - The foliage consists of various shades of green. - The darkest green used in the canopy and the foreground bushes is a deep, dark forest green. - The hex code #2D5F2D is a dark, desaturated/muted green. The colors used in the foliage of the image closely align with this specific shade of dark, muted green. - The tree trunks are indeed brown as requested. The colors in the image

bfl/flux-2-klein-9b: PASS
A room with walls in #4A90D9 (medium blue) with warm wooden floor and white trim

Evaluator

The image shows a room with walls that are a medium blue color, consistent with the hex code #4A90D9. The floor is a warm-toned wood, and the trim (baseboards and crown molding) is white. The colors in the image match the prompt's requirements:
- Walls: Medium blue (#4A90D9)
- Floor: Warm wooden floor
- Trim: White

VERDICT: PASS
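The evaluator here appears to judge the hex match visually. For comparison, a numeric check could convert the requested hex code to RGB and compare it with a color sampled from the image. This is only a sketch under stated assumptions: the sampled value, the per-channel tolerance of 32, and the helper names are all hypothetical, and nothing in the listing indicates the benchmark works this way.

```python
def hex_to_rgb(code):
    """Convert a '#RRGGBB' string to an (r, g, b) tuple of ints in 0..255."""
    digits = code.lstrip("#")
    if len(digits) != 6:
        raise ValueError("expected a 6-digit hex color, got %r" % code)
    return tuple(int(digits[i:i + 2], 16) for i in (0, 2, 4))


def channel_distance(target, sampled):
    """Largest absolute per-channel difference between two hex colors."""
    return max(abs(a - b) for a, b in zip(hex_to_rgb(target), hex_to_rgb(sampled)))


def within_tolerance(target, sampled, tolerance=32):
    """Hypothetical pass rule: every channel lies within `tolerance` of the target."""
    return channel_distance(target, sampled) <= tolerance


print(hex_to_rgb("#4A90D9"))                    # (74, 144, 217)
print(within_tolerance("#4A90D9", "#5A90D9"))   # True: red channel differs by 16
```

A per-channel threshold in sRGB is a crude proxy for perceived difference; a perceptual color-difference metric would be a more defensible choice for a real check.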

Hard 100% vs 100%
bfl/flux-2-klein-9b: PASS
A minimalist room with walls in #F5E6CC (warm cream), a sofa in #2C3E50 (dark navy), and accent pillows in #E74C3C (bright red)

Evaluator

The image contains the following colors:
1. **Walls:** The walls are a warm cream color, which aligns with the requested **#F5E6CC (warm cream)**.
2. **Sofa:** The sofa is a dark navy blue, which aligns with the requested **#2C3E50 (dark navy)**.
3. **Pillows:** The accent pillows are a bright red, which aligns with the requested **#E74C3C (bright red)**.

All specified colors and hex values are accurately represented in the image. VERDICT: PASS

bfl/flux-2-klein-9b: PASS
A product flat-lay on a #1A1A2E (deep dark blue) background, with a #F0C27F (soft gold) watch and #FFFFFF (pure white) earbuds

Evaluator

The image contains the following elements: 1. **Background:** The background is a very dark, deep blue. Visually, it aligns well with the description of #1A1A2E (a deep, dark navy/midnight blue). 2. **Watch:** The watch has a soft gold/champagne-colored strap and a matching gold-toned face. This matches the description of #F0C27F (soft gold). 3. **Earbuds:** The earbuds are a bright, pure white, matching the description of #FFFFFF. All colors specified in the prompt are accurately represente

bfl/flux-2-klein-9b: PASS
A graphic design composition with background #FAFAFA (near-white), main shape #FF4500 (orange-red), and accent lines #333333 (dark charcoal)

Evaluator

The image contains the following colors: 1. **Background:** The background is a very light, off-white color. It appears to be a near-white shade consistent with #FAFAFA. 2. **Main Shape:** The large central shape is a vibrant orange-red. The color is a very close match to #FF4500 (OrangeRed). 3. **Accent Lines:** The two diagonal lines are a dark, near-black color. They are consistent with the dark charcoal description and the hex value #333333. The colors in the image accurately reflect the

Extreme 100% vs 100%
bfl/flux-2-klein-9b: PASS
A smooth gradient on a wall going from #FF0000 (red) on the left to #0000FF (blue) on the right, transitioning evenly through purple

Evaluator

The image shows a smooth gradient transitioning from a bright red on the left to a bright blue on the right. The transition passes through a purple/magenta hue in the center.
- The left side is a pure red, which corresponds to #FF0000.
- The right side is a pure blue, which corresponds to #0000FF.
- The transition is even and smooth as requested.

The colors and the gradient transition perfectly match the prompt's specifications. VERDICT: PASS
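To make "transitioning evenly through purple" concrete, the sketch below computes the color an ideal gradient would show at any point along it. It assumes straight linear interpolation of the sRGB channel values, which is one common reading of an even gradient; other color spaces give different midpoints. The helper names are hypothetical.

```python
def hex_to_rgb(code):
    """Convert a '#RRGGBB' string to an (r, g, b) tuple of ints in 0..255."""
    digits = code.lstrip("#")
    return tuple(int(digits[i:i + 2], 16) for i in (0, 2, 4))


def lerp_hex(start, end, t):
    """Color at fraction t (0.0 = start, 1.0 = end) of a linear sRGB gradient."""
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must be between 0 and 1")
    mixed = [
        int(a + (b - a) * t + 0.5)  # round half up; values are never negative here
        for a, b in zip(hex_to_rgb(start), hex_to_rgb(end))
    ]
    return "#{:02X}{:02X}{:02X}".format(*mixed)


print(lerp_hex("#FF0000", "#0000FF", 0.5))  # #800080, a mid purple
```

Under this assumption the midpoint of the red-to-blue gradient is `#800080`, which matches the purple/magenta center the evaluator describes.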

bfl/flux-2-klein-9b: PASS
A horizontal banner with a gradient from #00C9FF (cyan) on the left to #92FE9D (mint green) on the right

Evaluator

The image shows a horizontal banner with a gradient. 1. **Left side color:** The left side of the banner is a bright cyan color. This matches the description of **#00C9FF (cyan)**. 2. **Right side color:** The right side of the banner transitions into a light, pale green. This matches the description of **#92FE9D (mint green)**. 3. **Gradient direction:** The gradient flows horizontally from left to right as specified. The colors in the image accurately reflect the hex values and names prov

bfl/flux-2-klein-9b: PASS
A sunset sky gradient on a flat surface from #FF6B35 (orange) at the bottom to #1B1464 (deep indigo) at the top

Evaluator

The image shows a vertical color gradient. 1. **Bottom Color:** The bottom of the image is a bright, saturated orange. While it is close to the requested `#FF6B35`, it appears slightly more yellowish/golden than a pure orange, but it generally aligns with the spirit of the color name. 2. **Top Color:** The top of the image is a very dark, near-black indigo/navy. It matches the description of a deep indigo (`#1B1464`) effectively. 3. **Gradient Transition:** The transition from the bright or

Photorealism 0% vs 67%
Easy 0% vs 67%
bfl/flux-2-klein-9b: FAIL
A single red sneaker on a clean white background with soft studio lighting and gentle shadow — product photography style

Evaluator

The image depicts a red sneaker on a white background. Upon close inspection, there are several significant AI artifacts and structural errors that prevent it from passing as a real photograph: 1. **Structural Inconsistency:** The most glaring issue is the way the shoe is constructed. The "Nike" logo/branding on the tongue is a distorted, illegible shape that does not resemble actual text. 2. **Anatomical/Design Errors:** The area where the laces enter the shoe is nonsensical. The laces appea

bfl/flux-2-klein-9b: FAIL
A clear glass perfume bottle on a reflective black surface with studio lighting, no background distractions

Evaluator

The image fails to meet the criteria for a high-quality, realistic studio product shot due to several significant AI artifacts and physical inconsistencies: 1. **Impossible Geometry/Physics:** The most glaring issue is the straw (dip tube) inside the bottle. It is disconnected from the sprayer mechanism at the top and appears to float aimlessly in the center of the liquid. It does not connect to the neck of the bottle. 2. **Structural Inconsistency:** The glass bottle itself has inconsistent

bfl/flux-2-klein-9b: FAIL
A ripe green apple on a seamless white background with soft side lighting and subtle shadow — catalog style

Evaluator

The image depicts a single green apple on a white background. Upon close inspection, there are several significant AI artifacts and inconsistencies that prevent it from passing as a real studio-quality photograph: 1. **Stem Irregularity:** The stem is highly unnatural. It appears to be a dark, blunt, and somewhat shapeless object that does not realistically emerge from the fruit. The way it connects to the apple lacks the organic texture and structural integrity of a real stem. 2. **Surface T

Graphical design 58% vs 100%

Layout & Design 33% vs 100%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A movie night poster with a large popcorn image in the center, the title 'MOVIE NIGHT' in bold at the top, and the tagline 'Every Friday at 8 PM' at the bottom

Evaluator

The image features a large, centrally placed popcorn bucket, which matches the prompt’s description of a “large popcorn image in the center.” The title “MOVIE NIGHT” is prominently displayed at the top in bold, stylized lettering, and the tagline “Every Friday at 8 PM” is correctly positioned at the bottom. The visual hierarchy is clear: the popcorn is the focal point, the title is dominant at the top, and the tagline is legible at the bottom. All text is readable and correctly placed according

bfl/flux-2-klein-9b: PASS
An event poster featuring a guitar illustration, the headline 'LIVE MUSIC FEST' in large text, and 'Tickets available now' as a tagline below

Evaluator

The poster includes: - A guitar illustration (acoustic guitar, correctly placed diagonally across the text). - The headline “LIVE MUSIC FEST” in large, bold, stacked text — visually dominant and correctly formatted. - The tagline “Tickets available now” at the bottom — readable, correctly placed, and in a smaller font as expected. The visual hierarchy is clear: the headline is largest and most prominent, the guitar integrates with the text, and the tagline is clearly subordinate but legible. A

bfl/flux-2-klein-9b: PASS
A book club poster with an open book illustration, the title 'JOIN THE BOOK CLUB' prominently displayed, and 'Meets every Wednesday' as the tagline

Evaluator

All described layout zones/elements are present:
- An open book illustration is centrally located.
- The title “JOIN THE BOOK CLUB” is prominently displayed at the top, with clear visual hierarchy (larger font, bold styling, and color variation).
- The tagline “Meets every Wednesday” is correctly placed at the bottom in a smaller, readable font.

All text is readable and correctly positioned according to the prompt. VERDICT: PASS

Medium 0% vs 100%
bfl/flux-2-klein-9b: FAIL
A landing page design with a hero header image at the top, a headline and subtitle in the middle, a 'Sign Up' call-to-action button, and a footer with three social media icons

Evaluator

The image shows a landing page with the following elements: - Hero header image at the top: Present — a photo of a person holding a scale, with a navigation bar including a logo and menu items. - Headline and subtitle in the middle: Present — large bold text (“Thle Mabem acbeak Goartiny.”) and a smaller subtitle below it. - “Sign Up” call-to-action button: Present — a blue button centered below the subtitle. - Footer with three social media icons: Present — Facebook, a custom icon, and Twitter,

bfl/flux-2-klein-9b: FAIL
A newsletter layout with a banner image header, a two-column body section with text on the left and an image on the right, a highlighted quote block, and a footer with unsubscribe link

Evaluator

The image shows a newsletter layout with the following elements: - A banner image header with two people and stylized text (“ForsteuiClela” and “Vlotic Istantix kortocchef”) — this matches the requested header. - A two-column body section: text on the left, image on the right — this is visually present. - A highlighted quote block with quotation marks and attributed text — this is present on the right side, below the image. - A footer with an “Unsubscribe” link — this is present at the bottom.

bfl/flux-2-klein-9b: FAIL
A restaurant menu layout with a logo header, three category sections (appetizers, mains, desserts) each with items and prices, and a footer with address and hours

Evaluator

The image shows a restaurant menu with a logo header (a circular emblem with “Mats” and a teapot icon), three category sections (Appetizers, Mains, Desserts), and a footer with address and hours. However, the text within all sections — including item names, prices, and footer information — is rendered in a garbled, nonsensical script that is not readable. The visual hierarchy is present (header, three columns, footer), but the content is illegible, which violates the requirement that “text is re

Hard 0% vs 100%
bfl/flux-2-klein-9b: FAIL
A mobile app UI mockup showing a dashboard with a top navigation bar, a search field, a 2x2 grid of feature cards with icons, a list of recent activity items, and a bottom tab bar

Evaluator

The image displays two mobile app UI mockups that largely match the described layout: both include a top navigation bar (with status icons and user profile), a search field, a 2x2 grid of feature cards with icons, a “Recent Activity” list, and a bottom tab bar. The visual hierarchy is consistent with standard mobile app design — top to bottom: navigation, search, feature cards, activity list, tab bar. However, the text is not readable. All labels — including app names (“Foath”, “Nesido”), featu

bfl/flux-2-klein-9b: FAIL
A magazine double-page spread with a left page containing a large hero photo, and a right page with a two-column article, a pull quote, a sidebar with author bio, and a page number

Evaluator

The image shows a magazine double-page spread. The left page features a large hero photo of a man, which matches the prompt. The right page contains a two-column article layout, a pull quote in a distinct font and block, and a sidebar with an author bio and photo. Page numbers (44 and 13) are visible at the top corners. However, all text is gibberish — it appears to be placeholder or randomly generated text, not readable content. While the layout structure and visual hierarchy are correctly impl

bfl/flux-2-klein-9b: FAIL
A fashion magazine spread with a full-bleed left-page photo, right page with three-column layout, headline, body copy, a smaller inset image, and styled page numbers

Evaluator

The image shows a magazine spread with a full-bleed photo on the left page and a three-column layout on the right page, which matches the prompt’s description. A headline is present at the top of the right page, and body copy fills the three columns beneath it. A smaller inset image is also visible in the lower-left portion of the right page. Page numbers (90 and 25) are placed at the bottom of each respective page. However, the text is not readable — it appears to be gibberish or placeholder t

Data Visualisation 0% vs 100%
Easy 0% vs 100%
bfl/flux-2-klein-9b: FAIL
A pie chart showing market share: Company A 35%, Company B 25%, Company C 20%, Others 20% — each slice a different color with labels and percentages

Evaluator

The requested pie chart should show: - Company A: 35% - Company B: 25% - Company C: 20% - Others: 20% In the provided image: - Company A is labeled 35% — correct. - Company B is labeled 25% — correct. - Company C is labeled 20% — correct. - “Others” is labeled 20% — correct. However, the chart contains **incorrect percentages** in two slices: - One slice is labeled “218%” — this is impossible for market share (total exceeds 100%). - Another slice is labeled “200%” — also impossible. Additiona

bfl/flux-2-klein-9b: FAIL
A bar chart showing monthly sales for January through June with values 12K, 18K, 15K, 22K, 19K, 25K — labeled axes, color-coded bars, and a title 'Monthly Sales 2025'

Evaluator

The requested bar chart is missing several critical elements and contains significant errors: 1. **Chart Type Mismatch**: The prompt requested a bar chart showing monthly sales for January through June. The image shows a stacked bar chart, which is not the same as a simple bar chart. 2. **Incorrect Month Labels**: The x-axis labels are not January through June. Instead, they show “January”, then “15K”, “23K”, “198K”, and “Jul” — which is not a valid sequence for months 1–6. 3. **Incorrect Y-A

bfl/flux-2-klein-9b: FAIL
A simple line graph showing daily website visitors over one week (Mon-Sun): 1200, 1500, 1100, 1800, 2000, 1700, 1400 — with grid lines, axis labels, and title 'Weekly Traffic'

Evaluator

The requested visualization is a line graph showing daily website visitors from Monday to Sunday with specific values: 1200, 1500, 1100, 1800, 2000, 1700, 1400. In the provided image: - The title “Weekly Traffic” is correct. - The chart type is a line graph, as requested. - Grid lines are present. - However, the x-axis labels (“Wen-Sun”) are incorrect — they should represent Mon-Sun, and “Wen” is not a standard abbreviation for Monday (it’s typically “Mon” or “Mon-Sun” for the full range). The

Style Diversity 92% vs 100%
Easy 100% vs 100%
bfl/flux-2-klein-9b: PASS
A golden retriever sitting in a garden, rendered in Japanese anime style with flat colors and large expressive eyes

Evaluator

The image depicts a golden retriever sitting in a garden, which matches the subject matter of the prompt. The style is cartoonish and stylized, with large, expressive eyes — a hallmark of Japanese anime aesthetics. The coloring is largely flat with minimal shading, consistent with the requested “flat colors” characteristic. The outlines are clean and bold, and the overall composition is simplified and illustrative, aligning with anime conventions. However, the image lacks some key elements typi

bfl/flux-2-klein-9b: PASS
A golden retriever sitting in a garden, rendered in pixel art style with a 32x32 grid and limited color palette

Evaluator

The image clearly depicts a golden retriever sitting in a garden, rendered in a pixel art style. The visual composition is built on a visible 32x32 grid, with each element — the dog, flowers, trees, and sky — composed of distinct square pixels. The color palette is limited, using a small number of colors per element (e.g., shades of green, blue, pink, brown, and yellow) to create the scene. The outlines are sharp and blocky, consistent with classic pixel art. All key characteristics of the reque

bfl/flux-2-klein-9b: PASS
A golden retriever sitting in a garden, painted in the style of an oil painting with visible brushstrokes and rich texture

Evaluator

The image depicts a golden retriever sitting in a garden, surrounded by flowers and foliage, with a wooden fence in the background. The visual style is clearly that of an oil painting: there are visible brushstrokes, especially in the fur of the dog, the grass, and the leaves. The texture is rich and painterly, with layered colors and blending that mimics traditional oil painting techniques. The lighting and composition also evoke a classical, impressionistic feel. All key characteristics reque

Medium 100% vs 100%
bfl/flux-2-klein-9b: PASS
A city street scene in the style of Art Deco — geometric shapes, gold and black palette, ornamental symmetry

Evaluator

The image clearly embodies the requested Art Deco style. It features: - **Geometric shapes**: Angular buildings, stepped facades, zigzag patterns on the street, and stylized signage. - **Gold and black palette**: The entire scene is rendered in a monochromatic gold-on-black scheme, emphasizing luxury and contrast. - **Ornamental symmetry**: The composition is highly symmetrical, with buildings mirroring each other down the street, and decorative motifs centered and repeated. All key visual cha

bfl/flux-2-klein-9b: PASS
A woman with a parasol in the style of Impressionism — soft brushstrokes, dappled light, pastel palette

Evaluator

The image depicts a woman holding a parasol in a sunlit garden, rendered with visible, textured brushwork that suggests soft, blended strokes. The palette is dominated by pastels — pinks, blues, greens, and creams — evoking the gentle, luminous quality characteristic of Impressionism. Dappled light is evident through the filtered foliage and the way light plays across the parasol and the woman’s dress. While the composition is somewhat idealized and the figure’s features are rendered with clarit

bfl/flux-2-klein-9b: PASS
A portrait of a man in Pop Art style — bold outlines, Ben-Day dots, flat vivid primary colors

Evaluator

The image clearly embodies the requested Pop Art style. It features:
- Bold, black outlines defining all shapes and features.
- Ben-Day dots (halftone patterns) used for shading and color transitions across the face, hair, and background.
- Flat, vivid primary colors (yellow, red, green, blue, purple) with high contrast and saturation.

All key visual characteristics from the prompt are present and unmistakable. VERDICT: PASS
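Each difficulty tier in this listing shows three prompts, and the first percentage in each tier header appears consistent with the share of PASS verdicts for bfl/flux-2-klein-9b (three of three for the Medium tier above). A minimal sketch of that arithmetic, assuming the rate is the rounded percentage of PASS verdicts; the function name `pass_rate` is hypothetical.

```python
def pass_rate(verdicts):
    """Percentage of 'PASS' entries in a list of verdict strings, rounded to an integer."""
    if not verdicts:
        raise ValueError("need at least one verdict")
    passed = sum(1 for v in verdicts if v == "PASS")
    return round(100 * passed / len(verdicts))


print(pass_rate(["PASS", "PASS", "PASS"]))  # 100
print(pass_rate(["PASS", "PASS", "FAIL"]))  # 67
```

The same rule applied across a whole subcategory would give 92 for eleven passes out of twelve and 33 for three out of nine.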

Hard 67% vs 100%
bfl/flux-2-klein-9b: PASS
A forest rendered in photorealistic watercolor — the detail and lighting of a photo combined with visible watercolor bleeds and paper texture

Evaluator

The image depicts a forest scene rendered in a watercolor style, with visible paper texture and watercolor bleeds—especially along the edges and in the background. The lighting is soft and atmospheric, with a sense of depth and realism in the tree trunks, foliage, and path, resembling photographic lighting and composition. However, the “photorealistic” aspect is somewhat ambiguous: while the scene has photographic depth and lighting, the textures (tree bark, leaves, ground) are stylized and pain

bfl/flux-2-klein-9b: PASS
A woman in a futuristic city rendered in cyberpunk Art Nouveau — neon colors and tech elements combined with flowing organic lines and floral borders

Evaluator

The image successfully merges cyberpunk and Art Nouveau aesthetics as requested. Key characteristics are clearly present: - **Cyberpunk elements**: Neon signage, futuristic cityscape, rain-slicked streets, tech-enhanced clothing, and a dystopian urban environment. - **Art Nouveau elements**: The ornate, flowing floral border framing the image, curvilinear decorative motifs, and stylized organic forms in the architecture and framing. - **Neon colors**: Dominant purples, pinks, and blues with glo

bfl/flux-2-klein-9b: FAIL
A medieval castle depicted in Japanese ukiyo-e woodblock print style combined with photorealistic sky and lighting

Evaluator

The image successfully incorporates the key visual characteristics of Japanese ukiyo-e woodblock prints: flat areas of color, bold outlines, stylized clouds and mountains, and the inclusion of Japanese text in the margins and blocks — all hallmarks of the style. The castle itself is rendered with the architectural precision and decorative flair typical of ukiyo-e depictions of historical structures. However, the prompt also requested “photorealistic sky and lighting.” While the sky does show a

Extreme 100% vs 100%
bfl/flux-2-klein-9b: PASS
An interior scene rendered in soft brutalism — raw concrete textures combined with rounded pastel furniture and warm diffused light

Evaluator

The image clearly embodies the requested “soft brutalism” style. Key visual characteristics are present: - **Raw concrete textures**: The walls, ceiling, and floor are all rendered in exposed, unpolished concrete with visible aggregate and seams — a hallmark of brutalism. - **Rounded pastel furniture**: The furniture — including the sofas, armchairs, and coffee tables — is distinctly rounded, organic in form, and upholstered in soft pastel tones (pinks, mint, beige). - **Warm diffused light**:

bfl/flux-2-klein-9b: PASS
A still life painted in anxious minimalism — sparse composition with uncomfortable negative space, muted colors, and subtle visual tension

Evaluator

The image presents a still life with a solitary glass bottle on a weathered wooden surface against a large, textured, muted gray wall. The composition is sparse, with the bottle positioned off-center to the right, leaving significant negative space — a key element of “anxious minimalism.” The color palette is subdued, dominated by grays, browns, and the transparent glass, fitting “muted colors.” The brushwork is visible and textured, suggesting a painted medium, and the overall atmosphere feels

bfl/flux-2-klein-9b: PASS
A cityscape rendered in nostalgic futurism — retro-futuristic 1960s space-age aesthetics with a melancholic warm-toned patina

Evaluator

The image clearly embodies the requested “nostalgic futurism — retro-futuristic 1960s space-age aesthetics with a melancholic warm-toned patina.” Key visual characteristics are present: - **Retro-futurism**: The architecture features sleek, optimistic 1960s sci-fi design — domed buildings, spires, flying saucers, and elevated walkways — all hallmarks of mid-century speculative design. - **Warm-toned patina**: The entire scene is bathed in a sepia-toned, golden-hour glow with soft, faded edges a