Sourabh Sharma

Senior Machine Learning Engineer (LLMs, ML Infra, Vision)

6 years building production ML systems in e-commerce and B2B SaaS: pricing engines, vision pipelines, and LLM agents. I own the full stack—from problem framing to A/B-tested deployment—shipping ML that drives measurable business impact.

Areas of Interest

LLMs
Agentic
GenAI
Computer Vision
Full Stack

Key Skills

Machine Learning & AI

PyTorch
TensorFlow
SageMaker
Vertex AI
Kubeflow
Weights & Biases
MLflow
DVC
ONNX
Optuna
Hydra
PyTorch Lightning
scikit-learn
DeepSpeed
XGBoost

Cloud & Infrastructure

Docker
AWS
Google Cloud
Cloudflare
Kubernetes
Terraform
Airflow
Prefect
OpenTelemetry

Data Engineering & Analytics

Dask
Spark
Apache Kafka
dbt
PostgreSQL
Redis
Elasticsearch
Apache Superset
BigQuery
Google Analytics
Ray
Apache Beam

Web Development & Design

React
Next.js
Tailwind CSS
FastAPI
Flask
Hono
Playwright
Selenium
Zustand
Zod
Drizzle ORM
Deno
tRPC

Programming & Development

Python
TypeScript
Bash
Go

Work Experience

Aidaptive by JarvisML

Cupertino, USA

AI-driven personalization platform focusing on hospitality and e-commerce.

Senior ML Engineer

Sep '22 – Present
Dynamic Pricing System - Twiddy VRM
  • Built demand-driven pricing engine (R² 0.75) for $175M+ annual revenue (57K+ bookings, 1,240 properties); identified 8-10% revenue optimization potential validated through backtest.

  • Modeled demand and price elasticity with XGBoost (Optuna-tuned, 5-fold CV) over 67 engineered features (GA4 demand signals, image-derived attributes, seasonal/geo elasticity, occupancy and booking lags), with features materialized in BigQuery and calibrated outputs.

Python
XGBoost
Optuna
MLflow
BigQuery
Image Intelligence & Personalization System - Cross-Client
  • Deployed ML-based image reranking across 8.4K+ listings (49.4M impressions), achieving +172% CTR lift and 62% win rate on underperforming properties through visual attribute scoring.

  • Engineered hybrid attribute extraction: trained vision classifiers on 11M+ web-crawled images achieving macro-F1 0.9 on 100+ hierarchical attributes, augmented with Gemini Vision and Claude zero-shot labeling; serving 200K+ images for 8 enterprise clients.

  • Extended Flagr in Go for real-time inference (sub-100ms), deployed on Kubernetes with monitoring and telemetry.

Gemini Vision
Claude API
PyTorch
Go
Kubernetes
Vertex AI
Flagr
A/B Testing
Zero-shot Learning
ML Infrastructure Library (ml-lib)
  • Designed and shipped a company-wide ML infrastructure library and model registry (Vertex AI + BigQuery + internal services) that became the default backend for new ML features across vision and LLM workloads.

Python
Vertex AI
BigQuery
Model Registry
Agentic NL-to-SQL System (Ask A Metric)
  • Architected NL-to-SQL system using OpenAI Agents SDK, enabling non-technical stakeholders to query business metrics in natural language, with self-serve analytics across 4 data domains and 59 hospitality clients ($5B+ revenue data).

  • Developed FastMCP server exposing BigQuery operations as tools (schema introspection, guarded query execution) for agentic reasoning with Gemini (SQL generation) and GPT-4.1 (response synthesis).

  • Evolved from self-hosted 7B model (defog/sqlcoder) to production pipeline with semantic query catalog retrieval, syntax-aware CTE-based chunking, feedback-driven scoring, and tiered caching.

OpenAI Agents SDK
MCP
FastMCP
Gemini
RAG
Chunking
BigQuery
pgvector
FastAPI
Cloud Run
Model Routing
Query Rewriting
Guardrails
  • Developed a visual similarity recommender that increased RPI by 14% for select clients by surfacing higher-value alternative products.

  • Built an internal GenAI experimentation framework using LoRA-based fine-tuning (LLMs / SDXL) to generate ad copy and creatives with consistent characters.

Subway India

Bangalore, India

Quick-service restaurant franchise with 1,000+ stores across India.

Independent Contractor

contract
Jan '25 – Mar '25
  • Delivered enterprise support platform via WhatsApp for Subway India with 1,065 stores (81% adoption) across India, processing 11K tickets (362K+ messages) with 56% sub-2-minute resolution (51-min median) through decision tree routing across 13 departments, laying foundation for ML-powered ticket routing.

  • Architected platform serving both franchise support and employee innovation (Sankalp) with HR/committee governance through 4-tier RBAC, event-driven state machines, JWT/OTP/SSO auth, WhatsApp Cloud API integration, serverless deployment with PostgreSQL and Redis.

Next.js
TypeScript
PostgreSQL
Redis
WhatsApp API
Vercel

Aisle3

London, UK

Early-stage e-commerce aggregator for multi-merchant product discovery.

Founding ML Engineer

Nov '20 – Sep '22
Product Retrieval and Matching System
  • First ML hire; built multi-modal product matching with 200M-param Swin Transformer (DINO/SimCLR/MoCo contrastive pretraining) on 200GB+ image-text pairs, achieving P@1 0.91, Recall 0.96; with XGBoost + cross-encoder re-ranker combining vision, text, and attribute signals.

  • Designed serverless pipeline on AWS Step Functions, separating GPU/CPU workloads for cost-effective scaling.

  • Built annotation platform collecting 120K+ pairwise labels via active learning and HITL with embedding-based hard-negative sampling for contrastive model training.

PyTorch
Swin Transformers
Contrastive Learning
Hybrid Search
Faiss
Elasticsearch
AWS Step Functions
Active Learning
HITL
Online Vector Search Platform
  • Deployed Faiss + Elasticsearch for concurrent dense vector search, enabling product matching at scale (60GB+/day).

Faiss
Elasticsearch
Apache Airflow
Docker
Product Attribute Extraction Pipeline
  • Designed self-training data flywheel for color detection: rule-based (CIELAB/deltaCIEDE2000) → CNN (ConvNeXt) → ViT (SwinV2), each generation producing weak labels to bootstrap the next; U²Net segmentation for salient object detection.

  • Developed multi-headed attribute extractor (progressive training on shared backbone) for footwear attributes (pattern, material, ankle height), achieving F1 0.93; powered product filtering on the storefront.

SwinV2
ConvNeXt
U²Net
Weak Supervision
Self-Training
Multi-Task Learning
Progressive Training

Early Work(May '19 – Aug '20)

ML Engineer

@ FaceX
contract
5 mosBangalore, India

Startup providing face recognition and liveness detection solutions.

  • Built masked/peri-ocular face recognition for masked faces and a real-time face liveness SDK (client SDKs for mobile and backend services) used in Aadhaar e-KYC and fraud-prevention flows, handling 100K+ verifications/day.

Computer Vision
Face Recognition
Python
OpenCV

ML Engineer

@ Jumper.ai
contract
3 mosSingapore

Platform enabling social commerce through chatbots and automation.

  • Built NLP-based conversational AI flows for a social commerce chatbot platform by integrating backend APIs and a menu-digitization model for automated restaurant menu parsing, using ML-based intent and NER models.

NLP
Conversational AI
Python

Data Engineer

@ 73 Strings
contract
3 mosBangalore, India

Financial technology firm providing AI-driven valuation tools.

  • Developed ML- and rule-based crawlers and pipelines to scrape and structure company information, powering richer company profiles for downstream pipelines.

Python
Web Scraping
Data Pipelines

Intern

@ Gumption Labs
internship
2 mosBangalore, India

Startup focused on automated trading solutions for retail investors.

  • Experimented with genetic programming to explore trading rules and financial technical analysis using Selenium for automation.

Python
Genetic Programming
Selenium