Conversation intelligence
Our AI listens to every conversation on your floor and surfaces the sale that almost happened — the customer, the item, the rupee value — while staff can still bring it back.
Backed by
The Monday-morning view
The one you can fix in 15 minutes: Priya at Virar West.
Incriminating evidence · What to do tonight
Coach Priya Sharma on offering alternatives before her 10:00 am shift. One walk-in today cost ₹3,700.
Priya handled a walk-in at 11:08 where the customer asked for 4 medicines back-to-back. No alternatives were offered for any of them, and the customer walked out.
Web dashboard · live in our Mumbai retail deployment
What customers say
We were optimising for design. Ostronaut showed us that parents on our showroom floor are asking about safety certifications and after-sales support before they ever ask about colour. We rewrote our customer-conversation playbook in a week.
Every support call used to die in a ticket. Ostronaut turned them into a feature backlog — and surfaced a reseller-intelligence layer we didn’t know we had. Nobody on our team had to retag anything.
Every customer-facing team has thousands of human-to-human conversations every week, and almost none of them reach the rest of the business. Ostronaut turns those conversations into the intelligence, insights, and analytics your business is currently missing.
Production scale · last 30 days
8,400+ hours of real customer audio · Hindi · Marathi · English · code-mixed · live across our Mumbai retail deployment.
Hardware
No app install, no behaviour change from staff. The hardware does the work — ambient capture, always on.
All three feed our own Indic AI stack — purpose-built for speaker identification, voice fingerprinting, and code-switching across Marathi, Hindi, and Hinglish. Our models find revenue leaks, compliance gaps, and coaching moments — then turn each one into a personalised practice rep for the staff member who lived it.
Capture → AI → Training. One loop.
Think Gong for the physical retail floor.
One Mumbai pharmacy counter, twenty-four seconds at a time. Raw audio, then the same clip after our pipeline, then the business events that fall out — each anchored to the exact line that triggered it. The fourth clip is a different counter on a different day, the kind our own audio classifier tags ‘unusable’ — listen to what our transcription stack still pulls out.
Open retail counter. Multiple speakers, code-mixed Hindi · Marathi · English, ambient footfall — the noise floor every off-the-shelf transcription chokes on.
The same clip, after our pipeline. Conversation reconstructed from the signal. Staff and customer separated turn by turn. No enrolment, no manual labelling.
Every turn evaluated against our retail conversation taxonomy. Coachable moments surface with the line that triggered them. Confidence-gated — uncertain detections never reach the inbox.
A different conversation. Our own audio-quality classifier flagged this clip ‘unusable’. Our transcription stack came back with seventy-one turns at ninety-five-percent average confidence — pure Devanagari, no transliteration, no language toggle.
What we built
Six stages, one loop. Each conversation flows from raw audio at the counter to a coachable moment in someone’s queue — without anybody typing anything. The depth of what’s underneath each stage is the reason it actually works in a real Mumbai pharmacy at 3pm. Read the deeper take →
Always-on capture across the counter. No app, no login, no behaviour change from staff. Audio is uploaded the moment the device sees a network.
Continuous audio is sliced into discrete customer-staff conversations by pause and proximity — not by uniform clock windows.
Devanagari is preserved. Multilingual code-switching inside a single utterance is handled in one pass. No language toggle.
Each turn is attributed to the right person. Staff are recognised across conversations and shifts. Customers stay anonymous.
Stockout uncoached, bounce unhandled, substitution offered, prescription check missed. Coachable moments, with evidence quoted from the transcript.
Every event becomes a remediation step for the right staff member. The same loop measures whether next month’s conversations got better.
Running across teams building some of the largest healthcare, D2C, and B2B operations in the region.
Sales intelligence across their hospital network — who closed, who deflected, why.
D2C customer conversation capture — every inbound call turned into product and CX signal.
Full retail + voice loop across 10 pharmacies — staff coaching from real customer conversations.
B2B sales calls captured + summarised, every staff conversation reviewable.
Customer support calls turned into product signal + reseller intelligence.
Founders
Four-time founder (OYO Flagship hypergrowth, Tushky · 500 Startups 2013, Pragmatic Leaders · YC W21) and ex-NSDC under India’s Skill India Mission — 10,000+ professionals coached on the practice loop since 2018. That evidence base sits under Ostronaut’s coaching layer. Read the full lineage →
Where this is heading
The customer, the item, the rupee value at risk — surfaced before they’re back in their car.
A message draft ready to send the moment that stock arrives. One tap from the manager.
What customers are asking for that your inventory misses — sorted by rupees, every morning.