A walk through the things I’ve built: what the problem was, what I did, and what it actually moved. Most of the AI/AR work has video, so where I can, I’ll just show you.
Comify: CTO & Co-founder
2025 – present. Intelligent, omnichannel customer-communication infrastructure for large consumer brands.
I co-founded Comify on a simple frustration: every brand spends enormous effort delivering messages and almost none deciding whether a message should be sent at all. We’re building the layer that decides what to say, to whom, and when, then delivers it at scale, cheaply.
- Architected a multi-agent communication platform handling 50M+ messages a day (push, WhatsApp) for brands including Lenskart and Cars24.
- Designed a serverless backend of 50+ production AWS Lambda functions that orchestrates and delivers messages at scale at minimal cost per message.
- Built a product-photoshoot image & video generation tool on vision LLMs and generative video models (Gemini, Claude, Higgsfield) that turns product images into on-brand ad creatives in bulk from style guidelines, cutting catalogue creative cost ~30%.
- Designed self-learning agents, including a copywriting agent that researches a brand’s domain, applies psychological triggers to generate templates, then continuously optimizes them against live click-through.
- Built a clickstream recommendation system (recently-viewed, popular, demography-based) that lifted revenue 60–80% on targeted workflows versus untargeted sends.
- Hired and now lead a cross-functional team across frontend, backend, AI, and data engineering.
Lenskart: AI & AR Research Lead
2022 – 2025. Built and led the AI / AR / Data Science team (grew 1 → 12), partnering directly with the co-founders and senior product leadership on the roadmap.
I joined to start an AI/AR team from scratch and spent three years turning research demos into features millions of people actually used.
- Delivered best-in-class AR virtual try-on across Android, iOS, and web, with a ~9% lift in online revenue since launch.
- Built real-time eyeglass removal on the live AR feed at 12+ FPS on an iPhone 12, so existing spectacle wearers could see themselves in new frames.
- Launched contact-lens try-on for the Aqualens sub-brand, with a 20%+ sales lift in markets like the UAE.
- Architected a GenAI product-photoshoot pipeline (Stable Diffusion + ComfyUI) that raised model-photoshoot coverage from 73% → 100% in one month.
- Rebuilt the 3D try-on asset pipeline with Blender automation and custom QA apps: 3D coverage 65% → 92% in three months, asset dev time down ~30%.
- Shipped, in a single week, a real-time 2D try-on that replaced a third-party vendor and saved ~$0.8M/year in licensing.
Dynamic True Fit: animated glasses tracking on a live feed.
Real-time frame segmentation and removal, running on-device at the edge.
AI eyeglass generator: product variations rendered for try-on.
MyOperator (formerly VoiceTree): Founding Engineer → Director of Technology
2012 – 2022. Founding engineer at one of India’s fastest-growing cloud-telephony companies; grew from the first customer to 10,000+ paying customers.
Ten years, two products built from scratch, and the architecture education of a lifetime.
- Owned end-to-end architecture for a high-availability platform spanning 100+ servers at peak, sustaining a 99.9% uptime SLA.
- Created CODAC, a phone-number-verification product for e-commerce that peaked at 10M+ calls/day and generated ~$12M a year with a 3-person team. Clients included Lenskart, Snapdeal, and Myntra. Stack: distributed architecture, PHP → Python, MySQL, Redis/Memcached, 10+ telephony servers.
- Designed multi-server, load-balanced, fault-tolerant clusters with custom tooling for number distribution across servers.
- Led early cloud-telephony AI research: raw-audio emotion/anger recognition, audio keyword detection, voicebots, TTS/ASR foundations, and neural-net anomaly detection on time-series data.
- Led and grew a 30+ person engineering org with the lowest attrition of any department.
Selected experiments & demos
The stuff I build on weekends. Some of it became real products; most of it just taught me something.
Digital Twin: fully automated talking-avatar news videos, end to end.
SnapStitch: apparel catalogue photoshoots generated on-model.
A few more, by link rather than embed:
- AI news automation: heatwave segment · markets segment · shorts version
- Apparel virtual try-on: demo
- Catalog photoshoots for t-shirts: demo
- Contact-lens try-on: demo
- Real-time frame removal, take two: demo
And a few without a camera pointed at them:
- Videofarm: an API-driven layout engine that generates video programmatically; the engine under most of the automation above.
- Agentic development system built on Claude Code: autonomous, AI-driven software development I use on my own projects.
- Trained models: audio anomaly detection, keyword and anger detection in speech, and a custom text-to-speech voice.
- Home security on a Raspberry Pi: on-device detection, no cloud round-trip.
Want the full history? My résumé is here, and the About page has the narrative version. Or just email me.