Google’s Nano Banana (Gemini 2.5 Flash Image) is a powerful, recently released tool for generating realistic 3D-figurine style images from photos. Its strengths lie in preserving likeness, allowing detailed control via prompts, and making the process accessible to non-experts. While it doesn’t (yet) produce manipulable 3D meshes, the image outputs are impressive and widely shareable. By picking good input photos, writing descriptive prompts (including style, lighting, props), and iterating, you can get very realistic results. In this article you will find how to create 3D Models from Photos using Google Nano Banana.
What is Google Nano Banana?
“Nano Banana” is the nickname for Google’s Gemini 2.5 Flash Image model, developed by Google DeepMind. It was released around August 2025 as part of an update to Google Gemini’s image generation & editing capabilities.
This model combines image editing and generation with advanced prompt understanding. Its headline features include:
- Subject consistency: When you upload a photo of a person or a pet, and then edit it (change outfit, background, style, etc.), the model preserves the facial features, body shape, pet features, so that the subject still looks recognizably like them.
- Multi-image blending & editing: Ability to combine more than one image, or blend in styles/textures from one image into another.
- Natural language prompts for edits: You describe in plain language what changes you want (e.g. “place me in a fantasy forest,” “turn pet into plush toy,” etc.).
- Visible + invisible watermarking: To indicate when images are AI-generated. Google uses visible watermarks and invisible SynthID watermarks.
The “3D figurine” or miniature collectible figurine style has become extremely popular via this model. People are transforming portraits (selfies, pets, etc.) into tiny, highly detailed 3D figures (on clear acrylic bases, with packaging mockups, etc.).
How It Works Under the Hood
To understand how to generate realistic 3D models (figurines) from pictures, it’s useful to know some of the internal mechanisms and constraints.
- Model architecture & training
Gemini 2.5 Flash Image (Nano Banana) is trained on large image datasets, including many examples of objects in different lighting, perspectives, styles. The training emphasizes consistency when editing: if the same subject is edited, features are preserved. Google DeepMind has specifically improved the ability to maintain likeness. - Input & prompt processing
- You can upload a photo (single subject, clearly visible) to serve as a basis. The better the photo (clear lighting, good resolution, minimal obstructions), the better results.
- You provide a prompt in natural language that describes what you want: the style (realistic, cartoon, plush, etc.), the props (base, background, packaging), pose, lighting, scale, etc.
- Figurine / 3D look generation
While it’s not necessarily a true full 3D model (i.e. with meshes you can rotate arbitrarily), the visual rendering mimics a collectible figurine – detailed shading, appropriately scaled base, photorealistic textures, etc. This is a “3D‐figurine style image” rather than full 3D object with full manipulable geometry. The key is visual realism, lighting, materials, shadows, reflections. - Rendering & edits
You can perform multi-turn edits. For example: upload, generate base figurine; edit background; change props; adjust lighting; mix styles. The model retains consistency across these steps. - Constraints & limitations
- Extremely complex geometric changes or impossible physics may not render perfectly.
- Fine detail (e.g. extremely small fabric texture, micro-shadows) may degrade.
- The output is an image, not a manipulatable 3D file (typically).
- Some tools may enforce watermarks / SynthID to comply with ethical guidelines.
Steps to Create a 3D Figurine Image Using Google Nano Banana
Follow these steps to turn your photo into a realistic figurine-style model:
Step 1: Open the Gemini App or Google AI Studio
- Download or open the Gemini app (available on Android/iOS) or use Google AI Studio on desktop.
- Ensure the app is updated to the latest version that supports the Nano Banana (Gemini 2.5 Flash Image) model.
Step 2: Upload Your Photo
- Tap on the image generation tool.
- Upload a clear, well-lit photo of the subject (yourself, a friend, or a pet).
- Choose a picture where the subject is not blocked by clutter — the clearer the subject, the better the figurine result.
Step 3: Select the Editing / Creation Mode
- Choose “Edit with AI” or “Create from photo” (naming may vary in the app).
- This lets the AI recognize the subject and keep their likeness during transformation.
Step 4: Enter Your Prompt
- Type a detailed prompt describing the figurine you want.
- Be specific about:
- Scale / style (1/6 scale, realistic, anime, plush, etc.)
- Base / pedestal (wood, acrylic, futuristic stand)
- Materials / finish (matte plastic, ceramic, plush, metallic, glossy)
- Lighting / environment (studio light, neon, spotlight, soft pastel background)
- Example: “Create a 1/7 scale hyper-realistic figurine of the person in this photo, on a clear acrylic base, with soft studio lighting and matte plastic finish.”
Step 5: Generate the Image
- Click Generate.
- Wait a few seconds — the AI will create one or multiple variations.
Step 6: Refine with Edits (Optional)
- If the result isn’t perfect, you can:
- Re-prompt (e.g. “make it shinier”, “add a packaging box”).
- Change background / lighting.
- Add props or accessories.
- The AI will keep the same subject likeness while applying edits.
Step 7: Save & Share
- Download the final image to your device.
- Be aware that the output will contain visible and invisible watermarks (SynthID) to indicate it’s AI-generated.
- Share responsibly on social media, blogs, or creative projects.
Why It’s Viral & Useful
- The Nano Banana style is visually appealing: mini-figurine / collectible aesthetic is fun, shareable.
- Accessibility: you don’t have to be an expert in 3D modeling; the tools are embedded in Gemini / Google AI Studio, prompts are natural language.
- Creativity & personalization: people use it for self-portraits, pets, fantasy scenarios, realistic collectibles, etc.
- Social shareability: because the results look polished and interesting, they spread fast on social media.
Best Practices for Prompts
To get realistic, high-quality results, your prompt matters as much as the photo. Here are key prompt-writing tips:
- Be specific about style and finish
E.g. “realistic”, “hyper-realistic”, “matte finish”, “glossy”, “plastic toy”, etc. - Mention scale and base/stand
Helps model understand context. - Include lighting & environment cues
Studio lighting, soft shadows, indoor room, natural light etc. - Define pose and accessories
If you want dynamic stance, arms-out, props, clothing, etc. - Avoid ambiguity
Phrases like “modern style” are vague; better: “minimalist studio backdrop with clean white walls”. - Iterate
Use multi-turn editing: start with a base, refine.
Example Prompts to Create a Realistic 3D Figurine Model
These prompts assume you will upload a photo (person or pet) and then use a prompt like one of these. Feel free to adjust according to what you want.
- “Create a 1/7 scale hyper-realistic figurine of the person in the photo, standing on a round transparent acrylic base under soft studio lighting, with a subtle wood grain desk background, wearing modern casual clothes, matte finish plastic surface, with a toy packaging box beside it printed with matching artwork.”
- “Turn the pet in this picture into a plush toy collectible figurine, oversized head, fuzzy fabric texture, pastel colors, soft diffuse lighting, sitting on a neutral cream pedestal, with a clear protective dome over it.”
- “Generate a fantasy warrior action-figure style model from this portrait: dynamic pose holding sword, cape flowing, armor metallic with polished reflections, rugged base with stones and moss, dramatic lighting with shadows, packaging design reminiscent of collectible action figures.”
- “Make a photo-real miniature character model 3D figurine, life-like skin texture, realistic hair, casual urban clothing, standing on a desk with a computer screen in background showing 3D modelling software viewport, base transparent, studio lighting.”
“Turn this image into a collectible figure of a sci-fi astronaut, with glossy helmet, reflective metallic suit, floating in a moon base diorama, acrylic circular base, cold moonlight lighting, high detail textures.”
Create a 1/7 scale commercialized figurine of the characters in the picture, in a realistic style, in a real environment. The figurine is placed on a computer desk. The figurine has a round transparent acrylic base, with no text on the base. The content on the computer screen is a 3D modeling process of this figurine. Next to the computer screen is a toy packaging box, designed in a style reminiscent of high-quality collectible figures, printed with original artwork. The packaging features two-dimensional flat illustrations.