Pickachu AI Image Editor: White-Label for Realtors, Interior Designers, and Brands
We built Pickachu as an AI image editor. Then realtors started using it. Then interior designers. Then brands we never expected. What began as a single product became an engine other businesses rebrand and deploy under their own names.
This article is a transparent field report on what that engine does, who uses it, and how the white-label economics actually work — with real numbers, not marketing claims.
The Pivot: From Product to Platform
We launched Pickachu as our own consumer AI image editor on Telegram. Gemini 3 Pro under the hood, voice commands, up to 14 reference images, credit-based monetization. It worked. Users came. Revenue followed.
Then something we did not plan for happened: business owners started asking for their own version.
A realtor wanted a bot his clients could use to visualize renovated apartments before a walk-through. An interior designer wanted her own branded tool for generating mood boards from client photos. An e-commerce brand wanted product shots composited on new backgrounds without hiring a photographer.
The framing was wrong. "Telegram image editor" was too narrow. What we actually built was an AI visual assistant that adapts to any industry that touches images — and every industry now touches images.
Who Is Using It
Realtors — Property Visualization
Send a photo of a bare apartment. Describe the desired style: "modern minimalist, warm wood floors, large sofa." Receive a staged version in seconds. Buyers see potential. Sellers close faster.
Before our client integrated this, each listing required either physical staging (expensive) or professional rendering (slow). Now it is a 15-second operation inside Telegram, branded as part of his agency's service.
Interior Designers — Mood Boards and Concept Shots
Designers send reference photos, describe the client's taste, and receive concept variations. Multi-reference support is critical here — they can combine elements from 5 different Pinterest inspirations into a single coherent composition.
What used to take hours in Photoshop is a voice message and a 10-second wait.
E-Commerce Brands — Product Shots Without a Studio
Product on a white background is a starting point. Product on a marble counter with soft morning light is a sale. Brands use the bot to generate lifestyle shots, seasonal variants, and campaign compositions — without shipping products to a photographer.
The Common Thread
Every industry that depends on visual communication can use this. The model is the same. The use cases are not. What changes is the system prompt, the branding, the pricing, and the client base. The AI engine stays constant underneath.
The Product: What Your Clients Actually Get
When a white-label partner deploys this, their end users interact with a clean Telegram bot that carries the partner's brand. Here is what that experience looks like.
The Interaction Flow
- Send a photo — or up to 14 photos as references
- Describe the edit — by text or voice message
- The bot processes — Gemini 3 Pro generates the result
- Receive the output — preview + original (lossless PNG/JPG)
- Take action — retry with different prompt, download, or start new
The entire interface is a single Telegram message that updates in place. No chat spam. Clean, decisive UX — carrying whatever brand our partner chose to put on it.
Why Gemini 3 Pro?
We chose Gemini 3 Pro after evaluating the alternatives:
| Model | Quality | Speed | Cost | Our verdict |
|---|---|---|---|---|
| DALL-E 3 | Good | Slow | $$$ | Too expensive for consumer pricing |
| Stable Diffusion | Variable | Fast | $ | Quality inconsistent |
| Midjourney | Excellent | Medium | $$ | No API for Telegram integration |
| Gemini 3 Pro | Excellent | Medium | $$ | Best quality/cost/API balance |
Partners inherit this choice. They do not evaluate models, negotiate API contracts, or monitor pricing changes. They get the result.
Voice Commands: Why Every Industry Loves Them
Text prompts work for simple edits. For the kind of descriptions a realtor or designer actually needs, voice is superior:
"Take the living room from photo one, replace the sofa with the one from photo two, add warm afternoon light coming through the windows, and remove the clutter on the coffee table."
Typing that takes 30 seconds. Speaking it takes 5. Voice commands doubled engagement per user in our deployments — and were particularly popular with non-technical end users who would never type a detailed prompt but will happily speak one.
Multi-Reference Support — Where It Becomes a Moat
Most image editing bots accept one image and one prompt. Our engine accepts up to 14 reference images — the technical capability that unlocks the real-world use cases partners care about:
- Interior designers: "Combine the color palette from these 3 mood boards with the furniture style from these 2 references"
- Realtors: "Stage this empty apartment using the style from these 5 reference interiors"
- E-commerce: "Place my product on the backgrounds from these 4 lifestyle photos"
One image plus one prompt is a toy. Fourteen references is a workflow tool.
The Economics: What Partners Actually Earn
This is where white-label gets real. Every partner sets their own pricing, aimed at their own market. The engine beneath is the same. Here is the unit economics we learned running it ourselves and deploying for partners.
Credits Over Subscriptions — The Lesson We Paid For
Our first iteration used monthly subscriptions. Churn after month one: 70%. Image editing is sporadic — realtors spend heavy weeks closing listings, then quiet weeks. Subscriptions create guilt during quiet weeks. Guilt becomes cancellation.
We switched to credit packs. Repurchase rate within 60 days: 85%. Partners inherit this model by default and save themselves the same mistake.
Reference Pricing (What We Run)
| Plan | Credits | Price | Per image | Markup over API cost |
|---|---|---|---|---|
| Starter | 10 | $2.49 | $0.50 | 3.6x |
| Creator | 25 | $9.99 | $0.40 | 2.9x |
| Pro | 55 | $19.99 | $0.36 | 2.6x |
Partners in higher-value verticals charge more. A realtor who closes a deal partly because AI staging won the buyer over does not flinch at $50 for 50 images. A designer billing a client $5,000 for concepts sees the bot as infrastructure. Pricing power scales with the vertical you serve.
Unit Economics
| Metric | Value |
|---|---|
| API cost per generation | $0.13-0.14 |
| Average revenue per generation (our tiers) | $0.40 |
| Gross margin per generation | ~65% |
| Infrastructure cost | ~$25/month |
| Break-even point | ~100 generations/month |
At 500 generations/month, monthly gross profit is ~$130. At 2,000: ~$520. Every additional generation is 65% margin. Partners charging premium pricing for vertical-specific use cases see margins of 75-85%.
Technical Decisions
Single-Message UI
Standard Telegram bots send a new message for every interaction. We imposed order through a single-message architecture:
- One message shows the current state (uploaded images, prompt, status)
- Every action updates that same message instead of sending a new one
- Users always see a clean, current command post
Cost Tracking per Request
Every generation logs its cost to the database:
request_id | user_id | model | prompt | status | cost_usd | created_at
This gives us:
- Real-time unit economics monitoring
- Per-user profitability analysis
- Cost anomaly detection (if API pricing changes)
- Data for pricing optimization
Localization
Full Russian and English support. Not just translated strings — the entire UX adapts: button labels, error messages, help text, onboarding flow. Adding languages is modular — new languages require only a new locale file, no code changes.
What We Learned From Real Deployments
1. The Use Cases We Never Predicted
We built for content creators. The buyers who came were realtors, designers, and brand owners. The lesson: once the engine works, stop trying to predict the market. Let buyers tell you what they need it for, and make the system flexible enough to reframe.
2. Voice Input > Text Input — Especially for Non-Technical Users
We expected 80% text, 20% voice. Reality across deployments: roughly 55% voice, 45% text. Realtors walking between properties speak. Designers dictating between calls speak. The bot meets users where they actually are — not at a keyboard.
3. Credit System > Subscription for Sporadic Use
Initial prototype used subscriptions. Churn after first month: 70%. Switching to credits: 85% repurchase within 60 days. Every partner inherits this model and skips the mistake.
4. The Moat Is Not the Model — It Is the Wrapper Around It
Any competitor can call the same Gemini API. The moat is multi-reference handling, voice-to-prompt pipeline, single-message UX, cost tracking, and content moderation. Each one is a small piece. Together, they are the reason a realtor picks the branded bot over opening a browser and typing into ChatGPT.
The White-Label Offering
If you serve an industry that touches visual content — real estate, interior design, e-commerce, marketing, events, fashion, or anything adjacent — you can run this engine under your brand.
What You Get
- Your brand, your bot name — your clients never see ours
- Industry-tuned system prompt — the AI behaves like it was built for your vertical
- Custom pricing tiers — price for your market, not ours
- Admin dashboard — user analytics, revenue, cost tracking per request
- Full source code and documentation — no vendor lock-in
- Deployment on your infrastructure or ours — your choice
- 30 days of support included — maintenance retainer available after
What You Bring
- A vertical — an industry and an audience that needs visual AI
- A brand — the name your clients already trust
- Distribution — the channel to reach your users (your agency, your community, your client list)
Who This Is For
- Real estate agencies adding AI staging to their service
- Interior designers offering instant concept generation to clients
- E-commerce brands replacing studio shoots with AI compositions
- Marketing agencies adding a new productized service
- Fractional CTOs and consultancies adding AI to their offerings
White-label deployment starts from $1,500 plus monthly hosting (can run on your infrastructure for zero ongoing fees to us).
Frequently Asked Questions
How quickly can you deploy a white-label version?
Branded deployment with custom system prompt and pricing tiers typically takes 5-10 working days from kickoff. Industry-specific fine-tuning adds 3-5 days.
Who owns the users and revenue?
You do. The bot runs under your brand, users sign up to you, payments go to your account. We are infrastructure, not a middleman.
How long does a generation take?
5-15 seconds depending on complexity and server load.
What's the maximum image resolution?
Input: any resolution (auto-compressed for API). Output: up to the model's maximum (typically 1024×1024 or higher for Gemini 3 Pro).
Can I customize the AI's behavior for my industry?
Yes. The system prompt is fully configurable. Realtor deployments receive property-specific instructions. Designer deployments receive mood-board-specific instructions. Each vertical gets tuning for its use case.
How do you handle abuse?
Content moderation via Gemini's built-in safety filters plus custom rules. Usage rate limiting prevents API abuse. User blocking is available in the admin dashboard.
What happens if I outgrow the setup?
Source code is yours. You can scale on your own infrastructure, hire engineers to extend it, or retain us for continued development. No lock-in.
See also:
- Cream Mic: AI Desktop Assistant That Listens, Sees, and Answers
- ResumeQuick: AI Resume Builder in 12 Languages
- What Is NeCL? AI Engineering for SaaS
Run this under your brand
Describe your industry and audience on Telegram — we'll scope a white-label deployment for you.
Write on Telegramor book a 30-min call