Pickachu AI Image Editor: White-Label for Realtors, Interior Designers, and Brands

Pickachu AI Image Editor White-Label

We built Pickachu as an AI image editor. Then realtors started using it. Then interior designers. Then brands we never expected. What began as a single product became an engine other businesses rebrand and deploy under their own names.

This article is a transparent field report on what that engine does, who uses it, and how the white-label economics actually work — with real numbers, not marketing claims.

The Pivot: From Product to Platform

We launched Pickachu as our own consumer AI image editor on Telegram. Gemini 3 Pro under the hood, voice commands, up to 14 reference images, credit-based monetization. It worked. Users came. Revenue followed.

Then something we did not plan for happened: business owners started asking for their own version.

A realtor wanted a bot his clients could use to visualize renovated apartments before a walk-through. An interior designer wanted her own branded tool for generating mood boards from client photos. An e-commerce brand wanted product shots composited on new backgrounds without hiring a photographer.

The framing was wrong. "Telegram image editor" was too narrow. What we actually built was an AI visual assistant that adapts to any industry that touches images — and every industry now touches images.

Who Is Using It

Realtors — Property Visualization

Send a photo of a bare apartment. Describe the desired style: "modern minimalist, warm wood floors, large sofa." Receive a staged version in seconds. Buyers see potential. Sellers close faster.

Before our client integrated this, each listing required either physical staging (expensive) or professional rendering (slow). Now it is a 15-second operation inside Telegram, branded as part of his agency's service.

Interior Designers — Mood Boards and Concept Shots

Designers send reference photos, describe the client's taste, and receive concept variations. Multi-reference support is critical here — they can combine elements from 5 different Pinterest inspirations into a single coherent composition.

What used to take hours in Photoshop is a voice message and a 10-second wait.

E-Commerce Brands — Product Shots Without a Studio

Product on a white background is a starting point. Product on a marble counter with soft morning light is a sale. Brands use the bot to generate lifestyle shots, seasonal variants, and campaign compositions — without shipping products to a photographer.

The Common Thread

Every industry that depends on visual communication can use this. The model is the same. The use cases are not. What changes is the system prompt, the branding, the pricing, and the client base. The AI engine stays constant underneath.

The Product: What Your Clients Actually Get

When a white-label partner deploys this, their end users interact with a clean Telegram bot that carries the partner's brand. Here is what that experience looks like.

The Interaction Flow

  1. Send a photo — or up to 14 photos as references
  2. Describe the edit — by text or voice message
  3. The bot processes — Gemini 3 Pro generates the result
  4. Receive the output — preview + original (lossless PNG/JPG)
  5. Take action — retry with different prompt, download, or start new

The entire interface is a single Telegram message that updates in place. No chat spam. Clean, decisive UX — carrying whatever brand our partner chose to put on it.

Why Gemini 3 Pro?

We chose Gemini 3 Pro after evaluating the alternatives:

Model Quality Speed Cost Our verdict
DALL-E 3 Good Slow $$$ Too expensive for consumer pricing
Stable Diffusion Variable Fast $ Quality inconsistent
Midjourney Excellent Medium $$ No API for Telegram integration
Gemini 3 Pro Excellent Medium $$ Best quality/cost/API balance

Partners inherit this choice. They do not evaluate models, negotiate API contracts, or monitor pricing changes. They get the result.

Voice Commands: Why Every Industry Loves Them

Text prompts work for simple edits. For the kind of descriptions a realtor or designer actually needs, voice is superior:

"Take the living room from photo one, replace the sofa with the one from photo two, add warm afternoon light coming through the windows, and remove the clutter on the coffee table."

Typing that takes 30 seconds. Speaking it takes 5. Voice commands doubled engagement per user in our deployments — and were particularly popular with non-technical end users who would never type a detailed prompt but will happily speak one.

Multi-Reference Support — Where It Becomes a Moat

Most image editing bots accept one image and one prompt. Our engine accepts up to 14 reference images — the technical capability that unlocks the real-world use cases partners care about:

One image plus one prompt is a toy. Fourteen references is a workflow tool.

The Economics: What Partners Actually Earn

This is where white-label gets real. Every partner sets their own pricing, aimed at their own market. The engine beneath is the same. Here is the unit economics we learned running it ourselves and deploying for partners.

Credits Over Subscriptions — The Lesson We Paid For

Our first iteration used monthly subscriptions. Churn after month one: 70%. Image editing is sporadic — realtors spend heavy weeks closing listings, then quiet weeks. Subscriptions create guilt during quiet weeks. Guilt becomes cancellation.

We switched to credit packs. Repurchase rate within 60 days: 85%. Partners inherit this model by default and save themselves the same mistake.

Reference Pricing (What We Run)

Plan Credits Price Per image Markup over API cost
Starter 10 $2.49 $0.50 3.6x
Creator 25 $9.99 $0.40 2.9x
Pro 55 $19.99 $0.36 2.6x

Partners in higher-value verticals charge more. A realtor who closes a deal partly because AI staging won the buyer over does not flinch at $50 for 50 images. A designer billing a client $5,000 for concepts sees the bot as infrastructure. Pricing power scales with the vertical you serve.

Unit Economics

Metric Value
API cost per generation $0.13-0.14
Average revenue per generation (our tiers) $0.40
Gross margin per generation ~65%
Infrastructure cost ~$25/month
Break-even point ~100 generations/month

At 500 generations/month, monthly gross profit is ~$130. At 2,000: ~$520. Every additional generation is 65% margin. Partners charging premium pricing for vertical-specific use cases see margins of 75-85%.

Technical Decisions

Single-Message UI

Standard Telegram bots send a new message for every interaction. We imposed order through a single-message architecture:

Cost Tracking per Request

Every generation logs its cost to the database:

request_id | user_id | model | prompt | status | cost_usd | created_at

This gives us:

Localization

Full Russian and English support. Not just translated strings — the entire UX adapts: button labels, error messages, help text, onboarding flow. Adding languages is modular — new languages require only a new locale file, no code changes.

What We Learned From Real Deployments

1. The Use Cases We Never Predicted

We built for content creators. The buyers who came were realtors, designers, and brand owners. The lesson: once the engine works, stop trying to predict the market. Let buyers tell you what they need it for, and make the system flexible enough to reframe.

2. Voice Input > Text Input — Especially for Non-Technical Users

We expected 80% text, 20% voice. Reality across deployments: roughly 55% voice, 45% text. Realtors walking between properties speak. Designers dictating between calls speak. The bot meets users where they actually are — not at a keyboard.

3. Credit System > Subscription for Sporadic Use

Initial prototype used subscriptions. Churn after first month: 70%. Switching to credits: 85% repurchase within 60 days. Every partner inherits this model and skips the mistake.

4. The Moat Is Not the Model — It Is the Wrapper Around It

Any competitor can call the same Gemini API. The moat is multi-reference handling, voice-to-prompt pipeline, single-message UX, cost tracking, and content moderation. Each one is a small piece. Together, they are the reason a realtor picks the branded bot over opening a browser and typing into ChatGPT.

The White-Label Offering

If you serve an industry that touches visual content — real estate, interior design, e-commerce, marketing, events, fashion, or anything adjacent — you can run this engine under your brand.

What You Get

What You Bring

Who This Is For

White-label deployment starts from $1,500 plus monthly hosting (can run on your infrastructure for zero ongoing fees to us).

Frequently Asked Questions

How quickly can you deploy a white-label version?
Branded deployment with custom system prompt and pricing tiers typically takes 5-10 working days from kickoff. Industry-specific fine-tuning adds 3-5 days.

Who owns the users and revenue?
You do. The bot runs under your brand, users sign up to you, payments go to your account. We are infrastructure, not a middleman.

How long does a generation take?
5-15 seconds depending on complexity and server load.

What's the maximum image resolution?
Input: any resolution (auto-compressed for API). Output: up to the model's maximum (typically 1024×1024 or higher for Gemini 3 Pro).

Can I customize the AI's behavior for my industry?
Yes. The system prompt is fully configurable. Realtor deployments receive property-specific instructions. Designer deployments receive mood-board-specific instructions. Each vertical gets tuning for its use case.

How do you handle abuse?
Content moderation via Gemini's built-in safety filters plus custom rules. Usage rate limiting prevents API abuse. User blocking is available in the admin dashboard.

What happens if I outgrow the setup?
Source code is yours. You can scale on your own infrastructure, hire engineers to extend it, or retain us for continued development. No lock-in.

See also:

Run this under your brand

Describe your industry and audience on Telegram — we'll scope a white-label deployment for you.

Write on Telegram

or book a 30-min call