FLUX.2 [max]

Black Forest Labs' top-tier image model - highest quality, best prompt adherence, grounded generation with web context, and professional-grade editing consistency at $0.07 per megapixel.

FLUX.2 [max] is the best image model Black Forest Labs makes. It ranks #4 on LM Arena with a score of 1168 - the highest of any model from an independent lab, trailing only three proprietary offerings from far larger companies. It delivers the strongest prompt adherence, the highest editing consistency, and a feature no other FLUX.2 variant has: grounded generation with real-time web context.

TL;DR

  • LM Arena rank #4 (score 1168) - highest quality in the FLUX.2 family, top independent lab model
  • Exclusive feature: grounded generation incorporating real-time web information
  • Highest editing consistency for character, object, style, and background preservation
  • $0.07/MP output, $0.03/MP input - the premium tier
  • 6-10 second generation, up to 4MP resolution
  • API-only, enterprise fine-tuning available

Key Specifications

| Specification | Details |
| --- | --- |
| Provider | Black Forest Labs |
| Model Family | FLUX.2 |
| Parameters | 32B (flow transformer) + 24B (Mistral-3 VLM) |
| Architecture | Rectified flow transformer + Mistral-3 24B VLM |
| VAE | Retrained from scratch |
| Max Resolution | 4MP (2048x2048) |
| Generation Speed | 6-10 seconds |
| LM Arena Rank | #4 (score 1168) |
| Multi-Reference | Up to 10 images |
| Grounded Generation | Yes (real-time web context) |
| Output Pricing | $0.07/MP (first MP), $0.03/MP additional |
| Input Pricing | $0.03/MP (reference images) |
| Release Date | November 25, 2025 |
| License | Proprietary (API access) |
| Open Weights | No |
| Enterprise | Custom fine-tuning and dedicated infrastructure |

Benchmark Performance

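| Metric | FLUX.2 Max | FLUX.2 Flex | FLUX.2 Pro | FLUX.2 Dev | Nano Banana 2 |
| --- | --- | --- | --- | --- | --- |
| LM Arena Rank | #4 | #5 | #7 | #9 | N/A |
| LM Arena Score | 1168 | 1157 | 1153 | 1149 | N/A |
| Generation Speed | 6-10s | 3-6s | 4-8s | 2-4s | 4-6s |
| Price/Image (1MP) | $0.07 | $0.06 | $0.03 | $0.012 | ~$0.067 |
| Grounded Gen | Yes | No | No | No | Yes |
| Editing Consistency | Highest | Standard | High | Standard | High |
| Open Weights | No | No | No | Yes (NC) | No |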

The 19-point Elo gap between Max (1168) and Dev (1149) is perceptible but not dramatic in most use cases. Where Max distinguishes itself is consistency: across 100 generations from the same prompt, Max produces less variance in quality, fewer artifacts, and more reliable text rendering than any other FLUX.2 variant. That consistency premium is what justifies paying roughly 2.3x the Pro price ($0.07 vs. $0.03 per 1MP image) in professional workflows.
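
To put the 19-point gap in perspective, a rating difference can be converted into an expected head-to-head preference rate. The sketch below assumes Arena scores follow the standard Elo logistic curve with a 400-point scale; LM Arena's exact rating methodology may differ, and `expected_win_rate` is an illustrative helper rather than part of any library.

```python
def expected_win_rate(score_a: float, score_b: float) -> float:
    """Expected share of pairwise comparisons won by A over B.

    Assumption: scores are on the standard Elo scale, where a
    400-point lead corresponds to 10:1 odds.
    """
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / 400.0))


# Scores taken from the benchmark table above.
max_vs_dev = expected_win_rate(1168, 1149)
max_vs_pro = expected_win_rate(1168, 1153)

print(f"Max vs Dev: {max_vs_dev:.3f}")
print(f"Max vs Pro: {max_vs_pro:.3f}")
```

Under that assumption, Max would be preferred over Dev in roughly 53% of pairwise comparisons, which fits the description of a gap that is perceptible but not dramatic.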

Key Capabilities

Grounded Generation

Exclusive to Max within the FLUX.2 family. The model can incorporate real-time web information into generated images - current product photos, trending visual styles, recent events. This makes it well suited to marketing teams that need visuals referencing current cultural moments or trending products without manual reference image curation.

Professional Editing Consistency

The highest editing consistency in the family. When modifying a character's clothing, background, or pose across multiple generations, Max maintains facial features, proportions, and visual identity more reliably than Pro or Flex. This matters for:

  • Product photography variations (same product, different angles and settings)
  • Character-consistent marketing campaigns
  • Brand asset generation with strict style guidelines

Typography and Text

Complex typography, infographics, memes, and UI mockups with legible fine text work reliably in production. While Flex specializes in typography, Max delivers comparable text quality alongside its broader strengths in photorealism and editing consistency.

Photorealism

Enhanced texture synthesis for skin, materials, and fabrics. Physically accurate lighting and shadow behavior. These qualities make Max the choice for product photography, architectural visualization, and any workflow where the output needs to look indistinguishable from a photograph.

Pricing and Availability

| Access Point | Pricing | Status |
| --- | --- | --- |
| BFL API | $0.07/MP output, $0.03/MP input | Available |
| BFL Playground | Free trial | Available |
| WaveSpeed AI | $0.07/image | Available |
| Replicate | Per-second pricing | Available |
| OpenRouter | Variable | Available |
| Enterprise | Custom pricing + dedicated infra | Available |

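Output is billed at $0.07 for the first megapixel and $0.03 for each additional megapixel, so a standard 1MP image costs $0.07 and a 4MP image (2048x2048) costs $0.16 ($0.07 + 3 x $0.03). Each reference image adds $0.03 per input megapixel, which brings a 4MP output with a single 1MP reference image to $0.19. For high-volume enterprise use, Black Forest Labs offers custom pricing with dedicated infrastructure and fine-tuning options.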
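
The pricing arithmetic can be sketched as a small estimator. This is a minimal illustration under the tiered reading of the specification table ($0.07 for the first output megapixel, $0.03 for each additional one, $0.03 per megapixel of reference input). The function name `estimate_cost`, the handling of outputs below 1MP, and the absence of megapixel rounding are assumptions; actual billing may round or tier differently.

```python
from typing import Sequence

# Rates from the specification table (USD per megapixel).
FIRST_OUTPUT_MP_RATE = 0.07
ADDITIONAL_OUTPUT_MP_RATE = 0.03
INPUT_MP_RATE = 0.03


def estimate_cost(output_mp: float, reference_mps: Sequence[float] = ()) -> float:
    """Estimate the cost in USD of one generation.

    Assumptions: the first output megapixel is always billed in full,
    and fractional megapixels are billed proportionally.
    """
    if output_mp <= 0:
        raise ValueError("output_mp must be positive")
    if any(mp <= 0 for mp in reference_mps):
        raise ValueError("reference image sizes must be positive")

    extra_mp = max(output_mp - 1.0, 0.0)
    output_cost = FIRST_OUTPUT_MP_RATE + extra_mp * ADDITIONAL_OUTPUT_MP_RATE
    input_cost = sum(reference_mps) * INPUT_MP_RATE
    return round(output_cost + input_cost, 4)


print(estimate_cost(1))          # 1MP output, no references
print(estimate_cost(4))          # 4MP output, no references
print(estimate_cost(4, [1.0]))   # 4MP output plus one 1MP reference image
```

Under these assumptions the three calls come to $0.07, $0.16, and $0.19 respectively, so reference images can account for a noticeable share of the bill when several are attached.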

Strengths

  • Highest quality image generation from an independent lab (LM Arena #4)
  • Grounded generation with real-time web context - exclusive to Max
  • Best editing consistency for character and style preservation
  • Professional-grade photorealism suitable for commercial photography
  • 4MP native output for print-quality resolution
  • Enterprise fine-tuning and dedicated infrastructure options
  • Full multi-reference support (up to 10 images)

Weaknesses

  • $0.07/MP is the most expensive FLUX.2 option - 2.3x Pro's pricing
  • 6-10 second generation is the slowest in the family
  • API-only - no local deployment or open weights
  • No developer-controlled step/guidance parameters (unlike Flex)
  • Premium pricing hard to justify for non-professional use cases
  • Quality advantage over Pro is measurable but marginal for many workflows
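
The cost and speed trade-off against Pro can be made concrete for a batch job. The sketch below uses the per-image prices and generation-time ranges from the benchmark table, and assumes 1MP outputs, no reference images, and strictly sequential requests; `batch_estimate` and the `MODELS` dictionary are illustrative names, not part of any API.

```python
# Per-image price (USD, 1MP) and generation time range (seconds),
# copied from the benchmark table.
MODELS = {
    "max": {"price_per_image": 0.07, "seconds": (6, 10)},
    "pro": {"price_per_image": 0.03, "seconds": (4, 8)},
}


def batch_estimate(model: str, n_images: int) -> dict:
    """Estimate total cost and sequential wall-clock time for a batch."""
    spec = MODELS[model]
    fastest, slowest = spec["seconds"]
    return {
        "cost_usd": round(spec["price_per_image"] * n_images, 2),
        "min_minutes": fastest * n_images / 60,
        "max_minutes": slowest * n_images / 60,
    }


for name in MODELS:
    est = batch_estimate(name, 1000)
    print(
        f"{name}: ${est['cost_usd']:.2f}, "
        f"{est['min_minutes']:.0f}-{est['max_minutes']:.0f} min"
    )
```

For 1,000 images this works out to about $70 for Max versus $30 for Pro, with Max also taking longer end to end when requests are not parallelized.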

✓ Last verified March 14, 2026

About the author

AI Benchmarks & Tools Analyst

James is a software engineer turned tech writer who spent six years building backend systems at a fintech startup in Chicago before pivoting to full-time analysis of AI tools and infrastructure.