ML Infrastructure & GenAI
Building LLM-powered tools at Airbnb — 30+ model integrations, PII pipelines, data labeling platforms serving ML teams org-wide.
Cloud & Platform Engineering
AWS, GCP, Docker, Kubernetes, Airflow, OpenShift — production systems at Airbnb, Shell, Southwest Airlines, and Eli Lilly.
Certifications & Recognition
AWS Solutions Architect, AWS Developer, GCP Data Engineer, Oracle Java & Database certified. Multiple Airbnb peer appreciations (2026).
hover or press a key
Years Experience
Rows/Run at Airbnb
LLM Models Integrated
Pageviews/Month (Shell)

Sr. Software Engineer — ML Infrastructure
Built BPI Virtual Analyst — a 5-step LLM wizard integrating 30+ models (GPT-4o, Claude, Gemini, Llama) used by ~55 analysts. Scaled from 600 → 10,000 rows/run. Built Presidio PII pipeline (30% faster). Led Redpen label export upgrade targeting 80% runtime reduction.
Python · Streamlit · Flask · Celery · Airflow · Labelbox · Presidio · AWS · OTEL

Sr. Software Engineer (via ThriveOn Solutions)
Built and maintained the Dose Management System (DMS) — full-stack healthcare portal for medication management. Java/Spring Boot backend, React frontend, deployed on OpenShift OCP across dev/QA/prod environments.
Java · Spring Boot · React · OpenShift · PostgreSQL · GitHub Actions

Sr. Software Engineer (via ThriveOn Solutions)
Architected deployment and testing automation pipelines. Containerized services with Docker + Kubernetes. Secure data management with Datadog monitoring. Statistical analysis and regression models on large datasets.
Python · Docker · Kubernetes · AWS · Datadog · Flask

Sr. Python Developer (via ThriveOn Solutions)
Built API service handling 17M pageviews/month at 94% cache efficiency. Cleared 200+ bottlenecks; app 5× faster after refactor. Improved NLP accuracy 86% → 94%. Deployed ML models on AWS SageMaker. Published at SPE ATCE Conference.
Python · PySpark · Azure Databricks · AWS SageMaker · MLFlow · Flask · Docker · Jenkins

Data Engineer
Built ERP analytics dashboard across 13 business units; boosted client activity by 20%. Automated PIP process — saved 600+ monthly work hours. Built real-time fraud detection pipeline using Kafka. Best Performer Q3 2018.
Python · Java · Oracle Cloud HCM · Kafka · ELK Stack · AWS · Flask · PostgreSQL

MS in Computer Science — GPA 3.69 · May 2021
New York University, Tandon School of Engineering, New York, NY
Courses: Data Structures, Big Data, Distributed Systems, Cloud Computing
|

B.Tech in Electronics & Communication Engineering — GPA 4.0 · May 2013
Jawaharlal Nehru Technological University, Andhra Pradesh, India
Awarded "Best Academic Project" for gesture-controlled Arduino robots coded in C
My work
Real-world projects showcasing skills across GenAI, ML infrastructure, full-stack development, and cloud platforms.
Large-scale internal tooling using Flask, Celery, Labelbox, and Redis. Engineered robust data pipelines with SQLAlchemy and Alembic for model evaluation workflows.
GenAI 5-step LLM wizard integrating 30+ models. Scaled to 10,000 rows/run for ~55 analysts. Presidio PII pipeline, AI clustering, Insight Miner.
Full-stack healthcare portal for medication management. Java/Spring Boot backend, React frontend, deployed on OpenShift OCP across clinical environments.
Computer Vision and NLP application built to assist visually impaired individuals. Winner at HackNYU.
ML reusable framework for subsurface applications. Refactored Jupyter → Python package (5× faster). Deployed on AWS SageMaker. Published at SPE ATCE Conference.
Contact
I'm currently open to new opportunities. Whether you have a question, a project idea, or just want to say hi — my inbox is always open.
Email directly
sailikhithcse@gmail.comLocation
San Francisco, CA