GENERATIVE · IMAGE · GEMINI 3.1~1.4s
GENERATIVE · IMAGE · GEMINI 3.1

Nano Banana 2.

Pro-level image generation.Flash-fast.

State-of-the-art image generation and editing with real-world knowledge, precise text rendering, and 5-character consistency. Top of Infer's image leaderboard.

AVG LATENCY · ~1.4s
STARTING AT · $0.039 / IMG
TRY IT NOWCmd/Ctrl + Enter to generate
Cinematic still: a young Indian classical dancer mid-spiral in saffron silk at a temple courtyard.
Cinematic still: a young Indian classical dancer mid-spiral in saffron silk, golden-hour temple courtyard.
Editorial portrait of an elderly Mongolian eagle hunter in winter furs.
Editorial close-up portrait of a Mongolian eagle hunter in winter furs, golden eagle on his arm.
Modernist concert poster reading 'COSMIC RADIO · LIVE AT MIDNIGHT'.
Modernist concert poster: 'COSMIC RADIO · LIVE AT MIDNIGHT' in oversized condensed sans-serif.
Editorial scientific infographic showing Earth's interior cross-section with labeled concentric layers.
Editorial infographic of Earth's interior in cross-section, labeled layers, National Geographic quality.Generated by Nano Banana 2 · Google
LIVE OUTPUT

Where Nano Banana 2 shines.

Editorial scientific infographic showing Earth's interior cross-section with labeled concentric layers.

Educational infographics

Multi-panel diagrams, isometric cutaways, and labeled illustrations grounded in real-world knowledge — without the hallucinated geography of generic T2I models.

EXAMPLEThe Layers of the Earth, depicted as a clean, isometric cutaway illustration with labels and texture variations.

Three recurring characters at golden hour in a Tuscan vineyard — sommelier, chef, and apprentice.

Character-consistent storytelling

Up to 5 characters held consistent across a scene sequence. Ship visual narrative in one model call rather than stitching LoRAs.

EXAMPLEThree recurring characters at golden hour in a Tuscan vineyard — sommelier, chef, apprentice — each face held consistent across the frame.

Studio product photo of a luxury 'Happy Birthday' greeting card on cool grey marble.

Marketing & ad creative

Posters, greeting cards, and brand assets where typography needs to render perfectly the first time. Ideogram-grade text fidelity at half the latency.

EXAMPLEA studio product photo of a luxury 'Happy Birthday' greeting card on cool grey marble, with yellow stippled retro labyrinthine lettering.

A Tokyo coffee-shop chalkboard menu written in Japanese with hand-drawn pastry illustrations.

Localized content

Translate text inside images into 30+ languages while preserving layout, font weight, and design grammar. Single-call localization for any market.

EXAMPLEA coffee-shop menu chalkboard in Tokyo, written in Japanese. Hand-drawn illustrations of pastries on the side. Warm wood texture background.

Studio product photo of a matte black ceramic coffee cup on polished concrete with soft daylight and faint steam.

Product photography mockups

Studio-grade product shots with controllable lighting, surface, and composition. Replace 80% of catalog shoots before retouch.

EXAMPLEStudio product photo of a matte black ceramic coffee cup on a polished concrete surface, soft daylight from the left, faint steam rising, shallow depth of field.

Cinematic still of a woman in a scarlet traditional dress spinning in a golden grassland.

Editorial illustration

Magazine-worthy conceptual illustrations with deliberate composition, strong color theory, and editorial restraint.

EXAMPLECinematic still: a young woman in a vibrant scarlet traditional Indian dress with gold floral embroidery, mid-spin in a sun-drenched golden grassland.

Generated with Nano Banana 2.

A live cross-section of the model's range — portraits, products, typography, illustration, fashion, cinematic. Hover any tile to pause and read its prompt.

Editorial close-up portrait of an elderly Mongolian eagle hunter in winter furs, golden eagle on his outstretched arm.
Editorial close-up portrait of an elderly Mongolian eagle hunter in winter furs, weathered face lit by the low golden sun, a golden eagle on his arm.
Studio product photo of a rose-gold Swiss mechanical watch on wet slate.
Rose-gold Swiss mechanical watch on a wet slate slab, single backlight catching the sapphire crystal.
A Tokyo ramen shop chalkboard menu in vibrant hand-painted Japanese kanji.
A Tokyo ramen shop chalkboard menu in vibrant hand-painted Japanese kanji, illustrated steaming bowls along the side.
Cinematic still of a young Indian classical dancer mid-spiral in saffron silk at a golden-hour temple courtyard.
Cinematic still: young Indian classical dancer mid-spiral in saffron silk, motion blur trailing, golden-hour temple courtyard.
Hyper-detailed isometric infographic of a coral reef ecosystem with labeled marine species.
Isometric infographic of a coral reef ecosystem, labeled species — parrotfish, clownfish, giant clam, staghorn coral.
Vintage 1960s Japanese ramen poster with oversized retro brushstroke kanji.
Vintage 1960s Japanese ramen poster, oversized retro brushstroke kanji, illustrated bowl, Showa-era lithograph.
Surreal oil painting of a colossal clock-tower made of vintage typewriters above an amber cloud sea.
Surreal oil painting: colossal clock-tower made of vintage typewriters floating over an amber sea of clouds.
High-fashion model in an architectural couture gown of folded white origami paper on a black mirror runway.
High-fashion editorial: model in an origami-paper couture gown on a glossy black mirror runway, harsh strobe rim-light.
Studio product photo of a luxury notebook on charcoal felt with gold-leaf 'STILL · WATERS' debossed on the cover.
Luxury hardcover notebook on charcoal felt, debossed gold-leaf cover reading 'STILL · WATERS' in clean serif.
Cinematic photograph of a bustling Marrakech souk at golden hour, vibrant spice piles and weavers.
Marrakech souk at golden hour, pyramids of saffron and turmeric, weavers at backlit looms, patterned tile.
Two recurring characters at a sunlit Parisian sidewalk café — sommelier and chef held consistent across the frame.
Two recurring characters at a Parisian sidewalk café — silver-haired sommelier and young chef, held identical across the shot.
Hyper-detailed scientific cross-section illustration of a beating human heart, anatomically labeled.
Anatomical cross-section of a beating human heart, labeled — aorta, left ventricle, coronary artery, pulmonary vein.
Modernist concert poster reading 'COSMIC RADIO · LIVE AT MIDNIGHT' with synthesizer waveforms.
Modernist concert poster: 'COSMIC RADIO · LIVE AT MIDNIGHT' in oversized condensed sans-serif, abstract synth waveforms.
Photorealistic studio portrait of an elderly Japanese tea master cradling a matcha chawan.
Studio portrait of an elderly Japanese tea master cradling a delicate matcha chawan, north window light, deep blacks.

By the numbers.

#3On Infer's image generation1262 Elo · Arena EloView leaderboard →
1065Editing Elo
~1150Visual quality Elo
~3× fasterLatency vs GPT Image 1.5
GPT Image 1.5~3× faster at comparable quality, with stronger infographic factuality.AA latency: 1.4s vs ~5s
FLUX 1.1 [pro]Higher T2I Arena Elo (1067 vs ~990) and superior multilingual text rendering.
Seedream 4.0Better world-knowledge grounding for educational content.
$0.039/ image

Pay only for successful generations. No idle, no minimums, no per-seat. Volume discounts kick in at 10K req/mo.

VS NATIVESame price as Google's direct API — but with one Reka key, batched billing, and zero rate-limit headaches.
VS SELF-HOSTClosed weights. Self-hosting isn't an option — Reka's the shortest path to production.

Things teams ask.

Q.01How is Nano Banana 2 different from GPT Image 1.5?
Nano Banana 2 is roughly 3× faster (1.4s vs 5s typical) and ranks higher on the AA T2I Arena (1067 Elo vs ~1050). GPT Image 1.5 still leads on photorealistic portraits and complex multi-element scenes; Nano Banana 2 leads on infographics, multilingual text, and throughput-sensitive workloads.
Q.02Does it support image editing as well as generation?
Yes. The same endpoint accepts an input image and an instruction, with the model achieving a 1065 editing Elo. For specialist editing — masks, multi-step chains, identity preservation — pair it with FLUX.1 Kontext for best results.
Q.03Can I use it for character-consistent storytelling?
Up to 5 distinct characters can be held consistent across a scene sequence in a single call. For longer narratives, anchor on a reference image and re-issue with the prior output as input — the model remains stable across 10+ frames in our internal benchmarks.
Q.04What text rendering languages are supported?
30+ languages render legibly, including non-Latin scripts (CJK, Arabic, Devanagari). Multilingual text adherence is the model's strongest single capability and a key differentiator vs FLUX, Imagen, and Seedream.
Q.05What's the typical latency at 1024×1024?
Median is ~1.4s. P95 sits at ~2.6s. Cold starts are amortized — Infer pre-warms capacity, so you should not see startup penalties under normal load.
Q.06Are outputs watermarked?
Yes. All Nano Banana 2 outputs carry SynthID — a Google-developed invisible watermark. SynthID is robust to crops, filters, and re-encodes, and is designed to support provenance verification.
Q.07Can I use the outputs commercially?
Yes. Infer passes through Google's commercial terms for Nano Banana 2 outputs.

Ship with Nano Banana 2.

One key. One bill. One SDK shape — across 100+ models. Free credits on signup, no card required.