AI Engineer · San Francisco · Available for Work

BuildingAIsystemsthatshiptoproduction — notdemos.

I'm the sole AI Engineer at Onpoint Insights, building a production medical coding pipeline grounded in CMS authoritative sources for a market projected to reach $8.4B by 2033. Before this I shipped multi-agent LLM systems, RAG assistants, and causal models on Azure Databricks and Snowflake.

01 — Selected Work

Four roles. One thread: shipping AI that holds up under real load.

01

AI Engineer

Onpoint Insights

San Francisco, CA
Jan 2026 — Present

As the sole AI Engineer on this project, I own end-to-end R&D for a production medical coding pipeline targeting a market projected to reach $8.4B by 2033. I avoided a flawed vanilla-LLM baseline — sub-70% coding accuracy — and instead designed a two-stage retrieve-then-rank architecture grounded in three authoritative CMS sources, with Medicaid and Medicare compliance rules baked in. The result is a 7.66% error rate against a 10–20% manual-coding industry baseline, in a sector where coding errors cost the healthcare industry over $1B annually.

0%
Error rate vs.
10–20% industry baseline
Accuracy over
single-pass LLM baselines
$0B
Target market
size by 2033
Python RAG FAISS GPT-4 CMS Data ICD-10 / CPT Retrieve-then-Rank

02

Data Scientist

Onpoint Insights

Andover, MA
Jan 2025 — Dec 2025

Shipped a multi-agent LLM pipeline on Azure Databricks for natural-language-to-SQL over 20K+ product records, containerized with Docker for reproducible runs across dev and staging. Built a RAG assistant over 20+ documents using transformer embeddings, FAISS vector indexing, and GPT-4 intent routing — 17% better answer accuracy than an embedding-only baseline. On the analytics side, A/B tests and causal regression quantified incremental campaign lift for a top-3 pharmaceutical client, enabling spend optimization that lifted ROI by 4.2%.

0%
Analyst effort
cut by NL-to-SQL pipeline
0%
Accuracy lift over
embedding-only baseline
0%
Campaign ROI lift
via causal modeling
Azure Databricks Docker FAISS GPT-4 RAG Multi-Agent LLM Power Automate A/B Testing Causal Inference

03

Data Science Intern

Onpoint Insights

Andover, MA
May 2024 — Aug 2024

Pulled 10M+ transaction records from Snowflake to engineer a hybrid recommendation system pairing Market Basket Analysis with Word2Vec embeddings, served via vector similarity search — drove a 6.7% lift in Average Order Size. Built an entity-resolution layer (NER plus embedding-driven cosine similarity) to reconcile partial company names from Azure SQL into canonical entities, and trained in-warehouse additive and multiplicative time-series forecasts via Snowpark to 12% MAPE for inventory and production planning.

0M+
Transaction records
processed from Snowflake
0%
Average Order Size
lift from hybrid recsys
0%
MAPE on in-warehouse
time-series forecasting
Snowflake Snowpark Word2Vec NER Azure SQL Market Basket Analysis SQL

04

Data Science Intern

Creative Galileo

Pune, India
Apr 2023 — Jul 2023

Built churn-prediction models on user behavioral data at a Series-A Ed-Tech with 10M+ app downloads, generating risk scores for retention initiatives via pandas, scikit-learn, and AWS SageMaker. A parallel telemetry analysis on AWS S3 surfaced latency bottlenecks in the funnel, driving targeted system optimizations that cut app load times 10% and payment-page churn 12%.

0M+
App downloads at
the Series-A platform
0%
App load time
reduction post-optimization
0%
Payment-page churn
reduction
Python pandas scikit-learn AWS SageMaker AWS S3 Churn Modeling

02 — Capabilities

A stack built for production, not slide decks.

AI & ML Engineering

Python·PyTorch·TensorFlow·Keras·scikit-learn·RAG·FAISS·Transformer Embeddings·LoRA / QLoRA·Multi-Agent LLM Pipelines·NER·Word2Vec·GPT-4

Data & Infrastructure

Snowflake·Snowpark·Azure Databricks·PySpark·Apache Hadoop·Docker·Git·GitHub Actions·MongoDB·MySQL·PostgreSQL·MS SQL Server·Oracle

Cloud Platforms

AWS S3·Redshift·Glue·Athena·SageMaker·QuickSight·Azure Kubernetes·Azure Blob Storage

Analytics & BI

Tableau·PowerBI·Alteryx·Mixpanel·A/B Testing·Causal Inference·Time-Series Forecasting·OCR·Power Automate

03 — About

From electronics in Pune to AI in San Francisco.

I started in electronics and telecommunication at Savitribai Phule Pune University, picked up honors in AI and machine learning, then crossed the Pacific for a master's in Business Analytics and AI at UT Dallas. I'm in San Francisco now, building the kind of AI systems that hold up under real production load — grounded in authoritative data, measured against honest baselines, and shipped.

  • The University of Texas at Dallas May 2025
    M.S., Business Analytics & Artificial Intelligence
    GPA · 3.5
  • Savitribai Phule Pune University May 2023
    B.E., Electronics & Telecommunication · Honors in AI / ML